Basic article information

  • Title: An Adaptive Updating Topic Specific Web Search System Using T-Graph
  • Author: Patel, Ahmed
  • Journal: Journal of Computer Science
  • Print ISSN: 1549-3636
  • Publication year: 2010
  • Volume: 6
  • Issue: 4
  • Pages: 450-456
  • DOI: 10.3844/jcssp.2010.450.456
  • Publisher: Science Publications
  • Abstract: Problem statement: The main goal of a Web crawler is to collect documents that are relevant to the topic in which the search engine specializes. Topic specific search systems typically use the whole document's content to predict the importance of an unvisited link. However, current research has shown that the content of the document an unvisited link points to depends mainly on the anchor text, which predicts it more accurately than the contents of the whole page. Approach: Between these two extremes, the Treasure Graph (T-Graph) was proposed as a more effective way to guide the Web crawler to topic specific documents. The crawler identifies the topic boundary around the unvisited link, compares that text with all the nodes of the T-Graph to obtain the matching node(s), and calculates the distance to the target documents, expressed as the number of documents that must be downloaded to reach them. Results: Web search systems based on this strategy allowed crawlers and robots to update their experience more rapidly and intelligently, and can also offer advantages in speed of access and presentation. Conclusion/Recommendations: Updating a robot's experience from the consequences of visiting a link, following the principles and usage of the T-Graph, can be deployed in intelligent-knowledge Web crawlers, as shown by the proposed novel Web search system architecture.
  • Keywords: Topic specific search engines; DDC; T-graph; web crawling; web robot
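
The approach in the abstract (take the text around an unvisited link, match it against the T-Graph nodes, and read off how many documents remain to be downloaded) can be illustrated with a minimal sketch. This is not the paper's implementation: the `TGraphNode` fields, the bag-of-words cosine similarity, the `threshold` value, and the function names are all assumptions made for illustration, and the abstract does not say how nodes are actually compared.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TGraphNode:
    # Hypothetical node layout: the abstract does not define the fields.
    text: str      # representative text stored at this node
    distance: int  # documents still to download to reach a target (0 = target)


def tokens(text: str) -> Counter:
    """Lowercased bag of words for a piece of text."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bags of words (assumed similarity measure)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def estimate_distance(
    link_context: str, graph: List[TGraphNode], threshold: float = 0.3
) -> Optional[int]:
    """Smallest distance among nodes that match the link context, else None."""
    ctx = tokens(link_context)
    matches = [n for n in graph if cosine(ctx, tokens(n.text)) >= threshold]
    return min((n.distance for n in matches), default=None)


# Toy graph with made-up node texts, for illustration only.
graph = [
    TGraphNode("focused web crawler topic specific search", 0),
    TGraphNode("search engine information retrieval", 1),
    TGraphNode("computer science research", 2),
]
near = estimate_distance(
    "survey of search engine and information retrieval methods", graph
)
far = estimate_distance("cooking recipes", graph)
```

In this sketch a crawler would fetch links with a smaller estimated distance first and treat `None` (no matching node) as lowest priority; that ordering policy is likewise an assumption rather than something stated in the record.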