Publisher: Academy & Industry Research Collaboration Center (AIRCC)
Abstract: We are developing a web-based plagiarism detection system for written Arabic documents. This paper describes the proposed framework of the system. The framework comprises two main components, one global and one local. The global component is heuristics-based: a given, potentially plagiarized document is used to construct a set of representative queries using several best-performing heuristics. These queries are then submitted to Google via Google's search API to retrieve candidate source documents from the Web. The local component carries out detailed similarity computations, combining different similarity computation techniques, to determine which parts of the given document are plagiarized and from which of the retrieved source documents. Since this is an ongoing research project, the quality of the overall system has not yet been evaluated.
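
The two-component design described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not name the heuristics or the similarity measures used, so the query heuristic here (take the opening words of the longest sentences) and the local measure (Jaccard similarity over word trigrams) are assumptions chosen only for illustration, and the function names `build_queries` and `jaccard` are hypothetical. The retrieval step through Google's search API is omitted.

```python
import re


def tokenize(text):
    # \w matches Unicode word characters, so Arabic words are kept as tokens.
    return re.findall(r"\w+", text.lower())


def build_queries(document, max_queries=3, query_len=6):
    """Global step (assumed heuristic): use the opening words of the
    longest sentences as search queries."""
    # Split on Latin sentence punctuation and the Arabic question mark (U+061F).
    sentences = [s.strip() for s in re.split(r"[.!?\u061F]+", document) if s.strip()]
    # Rank sentences by token count, longest first (ties keep document order).
    ranked = sorted(sentences, key=lambda s: len(tokenize(s)), reverse=True)
    return [" ".join(tokenize(s)[:query_len]) for s in ranked[:max_queries]]


def shingles(text, n=3):
    # Set of overlapping word n-grams in the text.
    tokens = tokenize(text)
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def jaccard(passage, source, n=3):
    """Local step (assumed measure): Jaccard similarity of word n-gram sets."""
    a, b = shingles(passage, n), shingles(source, n)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)
```

In such a sketch, each query returned by `build_queries` would be sent to the search service, and every retrieved candidate would be compared passage by passage with `jaccard`; passages scoring above a chosen threshold would be flagged together with their source document.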