期刊名称:International Journal of Computer Science and Network Solutions
印刷版ISSN:2345-3397
出版年度:2016
卷号:4
期号:3
页码:11-19
出版社:International Journal of Computer Science and Network Solutions
摘要:Text segmentation is a fundamental operation in the fields of natural language processing such as text summarization and information retrieval systems. The main purpose of this study is to examine and describe a new method of segmentation and to improve the former segmentation method of Persiantiling. All the methods discussed are suitable for linear texts and are categorized as unsupervised algorithms based on the word. After expressing how the methods work, they will be evaluated on 50 Persian samples and will be compared using standard evaluation criteria and a general conclusion will
关键词:natural language processing; text segmentation method; Persiantiling method; standard evaluation criteria; unsupervised algorithms; linear segmentation algorithms; Persian language processing