摘要:This paper contains an overview of basic formulations and approaches to text classification. This paper surveys the algorithms used in text categorization: handcrafted rules, decision trees, decision rules, on-line learning, linear classifier, Rocchio’s algorithm, k Nearest Neighbor (kNN), Support Vector Machines (SVM).
关键词:information retrieval, algorithms, machine learning, text classification.