首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:Application of an Improved CHI Feature Selection Algorithm
  • 本地全文:下载
  • 作者:Liang-jing Cai ; Shu Lv ; Kai-bo Shi
  • 期刊名称:Discrete Dynamics in Nature and Society
  • 印刷版ISSN:1026-0226
  • 电子版ISSN:1607-887X
  • 出版年度:2021
  • 卷号:2021
  • 页码:1-8
  • DOI:10.1155/2021/9963382
  • 出版社:Hindawi Publishing Corporation
  • 摘要:Text classification is the critical content of machine learning, and it is widely applied in information filtering, sentimental analysis, and text review. It is very important to improve the accuracy of classification results, and this is also the main research purpose of researchers in this field in recent years. Feature selection plays an important role in text classification, which has the functions of eliminating irrelevant features, reducing dimensionality, and improving classification accuracy. So, this paper studies the CHI feature selection algorithm, and the main work and innovations are as follows: firstly, this paper analyzed the CHI algorithm’s flaws, determined that the introduction of new parameters will be the improvement direction of the CHI algorithm, and thus proposed a new algorithm based on variance and coefficient of variation. Secondly, experiment to verify the effectiveness of the new algorithm. In terms of language, the experiment in this paper includes two text classification systems, which were Chinese and English. In terms of classifiers, two classifier algorithms were used, which included the KNN classifier and the Naive Bayes classifier. In terms of data types, two distribution types of data were used: balanced datasets and unbalanced datasets. Finally, experiment and result analysis. This paper has conducted 3 comparative experiments and analyzed the results of each experiment. The experimental results obtained are all significantly improved compared to the results before the improvement.
国家哲学社会科学文献中心版权所有