期刊名称:International Journal of Electronics, Communication and Soft Computing Science and Engineering
印刷版ISSN:2277-9477
出版年度:2015
卷号:4
期号:Special 3
出版社:IJECSCSE
摘要:As Web is the large information repository, users findresources by following hypertext links. These links connect from onedocument to another. In the small systems where resources sharethe same fundamental classification, users can find resources easilyand in efficient manner. perhaps Web now encompasses millions ofsites with many different information, navigation is difficult.WebCrawler, is the efficient full-text search engine. It is a toolthat assists users in their Web surfing by automating the taskof link traversal, creating a searchable index of the web, andfulfilling searchers’ queries from the index. Conceptually thedistributed crawler harnesses the excess bandwidth and computingresources of clients to crawl the web. In this paper we are going toreview some basic concepts of distributed web crawling by usingmining.