Journal: International Journal of Web & Semantic Technology
Print ISSN: 0976-2280
Online ISSN: 0975-9026
Publication year: 2013
Volume: 4
Issue: 2
DOI: 10.5121/ijwest.2013.4205
Publisher: Academy & Industry Research Collaboration Center (AIRCC)
Abstract: The Semantic Web is the web of the future. The Resource Description Framework (RDF) is a language for representing resources on the World Wide Web. When these resources are queried, duplicate query results can occur. Existing techniques use hash index comparison to remove duplicate query results. The major drawback of this approach is that a slight change in formatting or word order changes the hash index, so the results are no longer considered duplicates even though they have the same contents. We present an algorithm for detecting and eliminating duplicate query results from the Semantic Web using hash index and page size comparisons. Experimental results show that the proposed technique removes duplicate query results efficiently, solves the problems of using a hash index for duplicate handling, and can be embedded in an existing SQL-based query system for the Semantic Web. Further research could add flexibility to existing SQL-based query systems for the Semantic Web so that they accommodate other duplicate detection techniques as well.
Keywords: Duplicate query results; Semantic web; Hash index; SQL-Based query system
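
The abstract names the two signals the algorithm combines (hash index and page size) but does not give the procedure itself. The following is only a minimal illustrative sketch, not the authors' algorithm. It assumes that a result counts as a duplicate when either its content hash or its page size in bytes matches a result already kept. `QueryResult`, `content_hash`, `page_size`, and `remove_duplicates` are hypothetical names introduced here.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class QueryResult:
    """Hypothetical container for one query result (not from the paper)."""
    uri: str
    content: str


def content_hash(text: str) -> str:
    # Exact-match signal: any change in formatting or word order changes it.
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def page_size(text: str) -> int:
    # Size in bytes: unchanged when the same words are merely reordered.
    return len(text.encode("utf-8"))


def remove_duplicates(results):
    """Keep the first result for each distinct hash and distinct page size.

    Assumption: equal page size is treated as evidence of duplication,
    which catches reordered content that a hash comparison alone misses.
    """
    seen_hashes = set()
    seen_sizes = set()
    unique = []
    for result in results:
        h = content_hash(result.content)
        s = page_size(result.content)
        if h in seen_hashes or s in seen_sizes:
            continue  # treated as a duplicate of an earlier result
        seen_hashes.add(h)
        seen_sizes.add(s)
        unique.append(result)
    return unique
```

In this sketch, two results containing the same words in a different order have different hashes but the same size, so the second is dropped. Size equality alone can also match unrelated pages of identical length, so a practical system would need a further check before discarding a result.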