期刊名称:Journal of Advanced Computer Science & Technology
印刷版ISSN:2227-4332
电子版ISSN:2227-4332
出版年度:2015
卷号:4
期号:1
页码:53-59
DOI:10.14419/jacst.v4i1.4012
出版社:Science Publishing Corporation
摘要:Calculating article similarities enables users to find similar articles and documents in a collection of articles. Two similar documents are extremely helpful for text applications such as document-to-document similarity search, plagiarism checker, text mining for repetition, and text filtering. This paper proposes a new method for calculating the semantic similarities of articles. WordNet is used to find word semantic associations. The proposed technique first compares the similarity of each part two by two. The final results are then calculated based on weighted mean from different parts. Results are compared with human scores to find how it is close to Pearson’s correlation coefficient. The correlation coefficient above 87 percent is the result of the proposed system. The system works precisely in identifying the similarities.