首页    期刊浏览 2024年09月20日 星期五
登录注册

文章基本信息

  • 标题:Effective Page Refresh Policies for Green Web Crawling
  • 本地全文:下载
  • 作者:Pallavi Ghodke ; Prof. P.S.Desai
  • 期刊名称:International Journal of Innovative Research in Science, Engineering and Technology
  • 印刷版ISSN:2347-6710
  • 电子版ISSN:2319-8753
  • 出版年度:2017
  • 卷号:6
  • 期号:6
  • 页码:10882
  • DOI:10.15680/IJIRSET.2017.0606137
  • 出版社:S&S Publications
  • 摘要:On web large no. of requests goes to server. Due to large no. of HTTP requests to web servers increasethe energy consumption and carbon footprint of the web servers and for that computational resources are used whileserving the requests. In existingsystem the problem of Web crawlers that maintain local copies of remote Web pagesfor Web search engines. In thiscontext, remote data sources (Websites) do not notify the copies (Web crawlers) of newchanges, so we need to periodically poll the sources to maintain the copies up-to-date.The proposed work is mainlymotivated by THEneed to manage updated web data. In this Web basesystem,it stores a significant portion of the webin local repository for websearching. This web search engines also maintain copies and/or indexes of Web page, andthey need to periodically visit the pages to maintain them up-to-date.Here page to be refreshed is selected based on ametric that considers the page’s staleness, its size, andthe greenness of the energy consumed at the web serverpremises.If page is updated then increase its greenness and not updated then increase its staleness. Itavoidsno. offrequently sending http request to server. Personalized profile is maintained for each user.
  • 关键词:Crawling; carbonfootprint; greenness; staleness; Web Dynamics;Webrepository.
国家哲学社会科学文献中心版权所有