期刊名称:International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
印刷版ISSN:2278-1323
出版年度:2015
卷号:4
期号:6
页码:3005-3009
出版社:Shri Pannalal Research Institute of Technolgy
摘要:For many search engines, data encoded inthe returned result pages come from the underlyingstructured databases i.e Deep web is a database based.Such type of search engines is often referred as Webdatabases (WDB). A web database contain a typical manysearch results records. Each SRR contain multiple dataunits which need to be label semantically for machineprocessable. Early applications require tremendoushuman efforts to annotate data units manually, whichseverely limit their scalability. Now we present anautomatic annotation approach which contains the dataunits on the web result page into a different groups suchthat same groups have the same semantic labels. Then thesix annotations are combined and predict the finalannotation label. The last is the wrapper generation, withthe help of wrapper generation we annotate the newresult page from the same web database. Our resultscontain precision and recall.
关键词:Data alignment; data annotation; web;database; wrapper generation