期刊名称:International Journal of Computer Science and Network Security
印刷版ISSN:1738-7906
出版年度:2009
卷号:9
期号:10
页码:227-230
出版社:International Journal of Computer Science and Network Security
摘要:Focused on the effect on classification of short text sparse features, propose a method extending the short text features. First, according to the theory of words co- occurrence model, the association rules between feature items of corpus are mined by FP-growth algorithm. Then, we search the rules in the set of association rules, which have the relationship with short text feature items, calculate the mutual information between the antecedent and subsequent of association rules, and estimate the degree of association between two features. Based on these work, we choose short text extension feature words and construct the collection of the short text features. Experiments show that the efficiency of short text classification is improved after extending the short text features.
关键词:Association Rules; Short Text; Text Feature; Extension