Article Information

  • Title: A Framework for Plagiarism Detection in Arabic Documents
  • Authors: Imtiaz Hussain Khan; Muazzam Ahmed Siddiqui; Kamal Mansoor Jambi
  • Journal: Computer Science & Information Technology
  • Electronic ISSN: 2231-5403
  • Publication year: 2015
  • Volume: 5
  • Issue: 2
  • Pages: 01-09
  • DOI: 10.5121/csit.2015.50201
  • Publisher: Academy & Industry Research Collaboration Center (AIRCC)
  • Abstract: We are developing a web-based plagiarism detection system for written Arabic documents. This paper describes the proposed framework of that system. The framework comprises two main components, one global and the other local. The global component is heuristics-based: it constructs a set of representative queries from a potentially plagiarized document using several of the best-performing heuristics. These queries are then submitted to Google via Google's search API to retrieve candidate source documents from the Web. The local component carries out detailed similarity computations, combining different similarity computation techniques to determine which parts of the given document are plagiarized and from which of the retrieved source documents. Since this is an ongoing research project, the quality of the overall system has not yet been evaluated.
  • Keywords: Plagiarism Detection; Arabic NLP; Similarity Computation; Query Generation; Document Retrieval
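
The two-stage design described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the heuristics or the similarity measures, so the longest-sentence query heuristic, the word-trigram Jaccard score, the threshold, and every function name below are assumptions chosen for illustration. The retrieval step through Google's search API is left out, and candidate sources are passed in as a plain dictionary.

```python
import re


def split_sentences(text):
    # Naive splitter on Latin and Arabic sentence punctuation (an assumption;
    # a real system would use a proper Arabic sentence segmenter).
    parts = re.split(r"[.!?؟]+", text)
    return [p.strip() for p in parts if p.strip()]


def build_queries(text, max_queries=3):
    # Global component (hypothetical heuristic): pick the longest sentences,
    # measured in words, as representative search queries.
    sentences = split_sentences(text)
    ranked = sorted(sentences, key=lambda s: len(s.split()), reverse=True)
    return ranked[:max_queries]


def word_ngrams(text, n=3):
    # Set of word n-grams; empty if the text has fewer than n words.
    words = text.split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def jaccard(a, b):
    # Jaccard similarity of two sets; defined as 0.0 when either is empty.
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def find_matches(suspicious, sources, n=3, threshold=0.5):
    # Local component (assumed measure): compare every sentence of the
    # suspicious document with every sentence of each candidate source and
    # report the pairs whose n-gram overlap reaches the threshold.
    matches = []
    for sentence in split_sentences(suspicious):
        grams = word_ngrams(sentence, n)
        for name, source_text in sources.items():
            for src_sentence in split_sentences(source_text):
                score = jaccard(grams, word_ngrams(src_sentence, n))
                if score >= threshold:
                    matches.append((sentence, name, score))
    return matches
```

In a full pipeline, the output of `build_queries` would be sent to a search service and the retrieved pages would populate the `sources` dictionary passed to `find_matches`. Arabic text would also need normalization and stemming before comparison, which this sketch does not attempt.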