首页    期刊浏览 2024年09月19日 星期四
登录注册

文章基本信息

  • 标题:Effective Performance of Information Retrieval on Web by Using Web Crawling
  • 本地全文:下载
  • 作者:Sk.AbdulNabi ; P. Premchand
  • 期刊名称:International Journal of Web & Semantic Technology
  • 印刷版ISSN:0976-2280
  • 电子版ISSN:0975-9026
  • 出版年度:2012
  • 卷号:3
  • 期号:2
  • DOI:10.5121/ijwest.2012.3205
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:World Wide Web consists of more than 50 billion pages online. It is highly dynamic [6] i.e. the web continuously introduces new capabilities and attracts many people. Due to this explosion in size, the effective information retrieval system or search engine can be used to access the information. In this paper we have proposed the EPOW (Effective Performance of WebCrawler) architecture. It is a software agent whose main objective is to minimize the overload of a user locating needed information. We have designed the web crawler by considering the parallelization policy. Since our EPOW crawler has a highly optimized system it can download a large number of pages per second while being robust against crashes. We have also proposed to use the data structure concepts for implementation of scheduler & circular Queue to improve the performance of our web crawler. (Abstract)
  • 关键词:EPOW; Effective Web Crawler; Circular Queue; Scheduler; Basic Crawler; Precision & Recall.
国家哲学社会科学文献中心版权所有