期刊名称:International Journal of Advanced Research In Computer Science and Software Engineering
印刷版ISSN:2277-6451
电子版ISSN:2277-128X
出版年度:2013
卷号:3
期号:5
出版社:S.S. Mishra
摘要:Information searching has becoming one of the most important and popular activities on the Web. These search engines deal only with the surface Web, the set of Web pages directly accessible through hyperlinks, mostly ignoring the vast amount of information hidden behind forms, which is called the hidden Web. Web information is accessed today primarily relies on the search engines. Current search engines cannot make index to the pages which are generated automatically by the back ¨C end of the databases called invisible web or hidden web. The information is hidden behind HTML forms and it is only available on response to user's request. In this paper a system based on domain and keyword specific information extraction is described.
关键词:DSIM; Information Retrieval; Hidden web; Search engine; Fuzzy matching and Spiders.