Journal title: International Journal of Innovative Research in Science, Engineering and Technology
Print ISSN: 2347-6710
Electronic ISSN: 2319-8753
Publication year: 2017
Volume: 6
Issue: 6
Page: 10882
DOI: 10.15680/IJIRSET.2017.0606137
Publisher: S&S Publications
Abstract: On the Web, a large number of requests go to servers. This large volume of HTTP requests increases the energy consumption and carbon footprint of web servers, because computational resources are used while serving the requests. The existing system addresses the problem of Web crawlers that maintain local copies of remote Web pages for Web search engines. In this context, remote data sources (Web sites) do not notify the copies (Web crawlers) of new changes, so the sources must be polled periodically to keep the copies up to date. The proposed work is mainly motivated by the need to manage updated web data. The proposed Web-based system stores a significant portion of the Web in a local repository for web searching. Web search engines also maintain copies and/or indexes of Web pages, and they need to visit the pages periodically to keep them up to date. Here, the page to be refreshed is selected based on a metric that considers the page's staleness, its size, and the greenness of the energy consumed at the web server premises. If a page is found to be updated, its greenness is increased; if it is not updated, its staleness is increased. This reduces the number of HTTP requests frequently sent to the server. A personalized profile is maintained for each user.
Keywords: Crawling; carbon footprint; greenness; staleness; Web dynamics; Web repository.
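
The refresh selection described in the abstract can be illustrated with a minimal sketch. The abstract names the three inputs to the metric (staleness, size, greenness) and the update rule after a poll, but it does not give the formula itself. The weighted sum in `refresh_score`, the `Page` fields, the unit step of 1, and all function names below are therefore assumptions made for illustration, not the authors' method.

```python
from dataclasses import dataclass


@dataclass
class Page:
    """Locally stored copy of a remote page (hypothetical record layout)."""
    url: str
    size_kb: float
    staleness: float = 0.0
    greenness: float = 0.0


def refresh_score(page, w_stale=1.0, w_green=1.0, w_size=1.0):
    # Assumed weighting: the abstract only says the metric considers
    # staleness, size, and greenness. Here stale and "green" pages score
    # higher, and large pages are penalised by their size in MB.
    return (w_stale * page.staleness
            + w_green * page.greenness
            - w_size * page.size_kb / 1024)


def select_page(pages):
    """Return the page with the highest refresh score."""
    return max(pages, key=refresh_score)


def record_poll(page, was_updated):
    # Update rule as stated in the abstract: an updated page gains
    # greenness, an unchanged page gains staleness. The step of 1 is
    # an assumption.
    if was_updated:
        page.greenness += 1
    else:
        page.staleness += 1


pages = [
    Page("http://example.com/a", size_kb=100, staleness=3, greenness=1),
    Page("http://example.com/b", size_kb=100, staleness=1, greenness=1),
]
chosen = select_page(pages)
record_poll(chosen, was_updated=True)
```

Polling only the selected page, instead of every page on each cycle, is what would reduce the number of HTTP requests sent to the servers; how the weights should be set is left open here.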