文章基本信息

标题：Relevant Information Retrieval from Deep Web Page
本地全文：下载
作者：Nripendra Narayan Das ; Dr. Ela Kumar
期刊名称：International Journal of Research in Management, Science & Technology
印刷版ISSN：2321-3264
出版年度：2014
卷号：2
期号：3
出版社：Prannath Parnami Institute of Management & Technology, Hisar
摘要：There is large volume of information available to be mined from the World Wide Web. The information on the Web is contained in the form of structured and unstructured objects, which is known as data records. Such data records are important because essential information are available in these host pages, e.g., lists of products and there detail information. It is necessary to extract such data records to provide relevance information to user as per their requirements. Some of the approaches used to solve this problem are manual approach, supervised learning, and automatic techniques. The manual method is not suitable for large number of pages. It is a challenging work to retrieve appropriate and useful information from Web pages. Currently, many web retrieval systems called web wrappers, web crawler have been designed . In this paper, some existing techniques are examined, then our current work on web information extraction is presented
关键词：World Wide Web; Deep web; HTML; XHTML