期刊名称:Issues in Informing Science and Information Technology
印刷版ISSN:1547-5840
电子版ISSN:1547-5867
出版年度:2007
卷号:4
页码:601-614
出版社:Informing Science Institute
摘要:In order to detect plagiarism, comparisons must be made between a target document (the suspect)
and reference documents. Numerous automated systems exist which check at the text-string level.
If the scope is kept constrained, as for example in within-cohort plagiarism checking, then performance
is very reasonable. On the other hand if one extends the focus to a very large corpus
such as the WWW then performance can be reduced to an impracticable level. The three case
studies presented in this paper give insight into the text-string comparators, whilst the third case
study considers the very new and promising conceptual analysis approach to plagiarism detection
which is now made achievable by the very computationally efficient Normalised Word Vector
algorithm. The paper concludes with a caution on the use of high-tech in the absence of hightouch.