首页    期刊浏览 2024年08月22日 星期四
登录注册

文章基本信息

  • 标题:Relevant Information Retrieval from Deep Web Page
  • 本地全文:下载
  • 作者:Nripendra Narayan Das ; Dr. Ela Kumar
  • 期刊名称:International Journal of Research in Management, Science & Technology
  • 印刷版ISSN:2321-3264
  • 出版年度:2014
  • 卷号:2
  • 期号:3
  • 出版社:Prannath Parnami Institute of Management & Technology, Hisar
  • 摘要:There is large volume of information available to be mined from the World Wide Web. The information on the Web is contained in the form of structured and unstructured objects, which is known as data records. Such data records are important because essential information are available in these host pages, e.g., lists of products and there detail information. It is necessary to extract such data records to provide relevance information to user as per their requirements. Some of the approaches used to solve this problem are manual approach, supervised learning, and automatic techniques. The manual method is not suitable for large number of pages. It is a challenging work to retrieve appropriate and useful information from Web pages. Currently, many web retrieval systems called web wrappers, web crawler have been designed . In this paper, some existing techniques are examined, then our current work on web information extraction is presented
  • 关键词:World Wide Web; Deep web; HTML; XHTML
国家哲学社会科学文献中心版权所有