期刊名称:International Journal on Computer Science and Engineering
印刷版ISSN:2229-5631
电子版ISSN:0975-3397
出版年度:2013
卷号:5
期号:4
页码:252-257
出版社:Engg Journals Publications
摘要:As the Deep web (or Hidden web) information is hidden behind the search query forms, this information can only be accessed by interacting with these forms. Therefore, development of automated system that interacts with the search forms and extracts the hidden web pages would be of great value to human users. To accomplish this task stated above, this paper proposes a novel method �Deep Webpage Classification and Extraction� which classifies the websites into appropriate domain, extracts their query interfaces and retrieves all result pages of deep websites using query building system.