期刊名称:International Journal of Innovative Research in Science, Engineering and Technology
印刷版ISSN:2347-6710
电子版ISSN:2319-8753
出版年度:2015
卷号:4
期号:4
页码:2113
DOI:10.15680/IJIRSET.2015.0404058
出版社:S&S Publications
摘要:The internet give a great level of good knowledge which is usually formatted for its users, which make ittroublesome to extract relevant data from various sources. The WWW (World Wide Web) plays a major role as allkinds of information repository and has been so far very successful in broadcasting information to humans. For theencoded data units to be machine processable which is indispensible for many applications, like deep web datacollection and internet comparison shopping , they need to grouping and allot a meaningful labels. An automaticannotation approach, first align a data units on a result page into dissimilar groups, such that same group data have thesame meaning or semantics. For each group annotate it from different feature and collective the different annotations topredict a final annotation label.
关键词:Data alignment; Data annotation; Web databases; Wrapper generation.