Journal of Social Sciences and Philosophy
logo-Journal of Social Sciences and Philosophy

Journal of Social Sciences and Philosophy
logo_m-Journal of Social Sciences and Philosophy

    Jump To中央區塊/Main Content :::
  • About JSSP
    • About JSSP
    • Indexed in
    • Subscription Information
    • Contact Us
  • Editorial Board
  • Notes to Contributors
    • Guidelines for Submitted Manuscripts
    • Publication Ethics Statement
    • Statement Regarding Artificial Intelligence (AI)
    • Review Process
    • Online Submission
  • Volumes and Issues
    • Forthcoming Papers
  • Annual Statistics
    • Submission Trends by Academic Discipline in 2014-2024
    • First Round Processing Time of Submissions in 2024
    • Top 10 Most Viewed Papers in 2024
    • Top 10 Most Downloaded Papers in 2024
  • RCHSS
中文
RCHSS
search
Journal of Social Sciences and Philosophy
  • Home
  • Volumes and Issues
  • 《Journal of Social Sciences and Philosophy》 Volume 27, Number 04
  • Facebook
  • line
  • email
  • Twitter
  • Print
2015 / December Volume 27 No.4
Text Mining for Social Studies: Meaning-based Document Classification and Its Problems

Number of Clicks:1970; Number of Abstract Downloads:0; Number of full PDF text Downloads:0;

發刊日期/Published Date
2015 / December
中英文篇名/Title
社會科學研究中的文字探勘應用:以文意為基礎的文件分類及其問題 
Text Mining for Social Studies: Meaning-based Document Classification and Its Problems
論文屬性/Type
研究論文 Research Article
作者/Author
陳世榮
Roger S. Chen
頁碼/Pagination
683-718
摘要/Abstract

隨著電子典藏技術的精進,文字探勘技術逐漸受到重視,本文以社會科學研究在文意區別上的需求,評估監督式機器學習對非結構、複雜文本的分類效果,並就所見問題提出分析與建議。本文從文字探勘與內容分析文意區別上的差異與共通性出發,繼而以新聞報導為分析資料,針就特定文件意向,遵循一般文字探勘程序,以支持向量機與簡易貝式分類器執行文件分類評估。分析結果指出,文字探勘對於複雜文意的判讀效果值得肯定,但經由共詞網絡分析也發現,文件的編撰風格將影響文件分類的效果。建議研究者在資料處理初期,應反覆評估研究目的、資料特性與分類器模型間的契合度。

Along with the growing development of electronic information storage, text mining has increasingly gained attention from scholars and practitioners across various disciplines. In response to the need for meaning differentiation in social studies, the study aims to evaluate supervised machine learning classifiers in terms of the performance of document classification. Setting out from the comparison between traditional content analysis and text mining, the evaluation follows a normal procedure of text mining and applies Support Vector Machine and Naïve Bayes classifiers on non-structural, complex social texts extracted from news media. The outcomes of the analysis validate that text mining manages classification well for documents with complex meaning. However, a further coword network analysis in the study finds that the editing style of data may affect classifiers’ performance. It is suggested that, in the early stage of data processing, greater care must be given to the fit between research problems, editing styles, and classifiers.

關鍵字/Keyword
文字探勘, 文意區別, 文件分類, 機器學習, 共詞網絡分析
text mining, meaning differentiation, document classification, machine learning, co-word network analysis
學科分類/Subject
政治學
Political Science
主題分類/Theme

DOI
檔案下載/Download
Abstract full PDF text
相關文章
  • Reverse Reforming the Land Taxation System in Taiwan
  • Comment
  • The Relation between Organizational Performance and Persistence
  • The Impact of Voluntary Export Restraints on the Competitive Exporting Firm
  • The Search for Organizational Gestalts

  • About JSSP
  • Editorial Board
  • Notes to Contributors
  • Volumes and Issues
  • Annual Statistics

Research Center for Humanities and Social Sciences, Academia Sinica, No. 128, Sec. 2, Academia Rd, Taipei 115, Taiwan

Tel: 886-2-27898156 Fax: 02-27898157 Email: issppub@sinica.edu.tw

© Copyright 2025. RCHSS Sinica All Rights Reserved.Privacy Policy & Security PolicyVersion:V1.1.3