Article Information

  • Title: Automatic Discovery of Lexical Patterns using Pattern Extraction Algorithm to Identify Personal Name Aliases with Entities
  • Authors: A. Muthusamy; A. Subramani
  • Journal: International Journal of Software Engineering and Its Applications
  • Print ISSN: 1738-9984
  • Publication year: 2015
  • Volume: 9
  • Issue: 12
  • Pages: 165-176
  • DOI: 10.14257/ijseia.2015.9.12.15
  • Publisher: SERSC
  • Abstract: Personal name aliases are highly significant in information retrieval: to retrieve complete information about a person from the web, a search must account for web pages that refer to the person by an alias, a nickname, or a real name. People search involving personal name aliases is growing rapidly. We propose a pattern generator that comprises two automatic components: a lexical pattern extraction algorithm and an attribute extraction algorithm. We use three data sets (known personal names, consisting of alias names, real names, and nicknames; professions; and location names of a person) as semi-structured training data to efficiently extract lexical patterns. The extracted patterns are ranked according to F-score. This conveys information related to alias names from the contingency table returned by a web search engine. The extracted lexical patterns (profession patterns and location name patterns) are often used to optimize the candidate personal name aliases with the attributes of a person available in the contingency table; non-frequent items are discarded from the contingency table. Next, we rank the candidate aliases in the contingency table: a graph mining ranking algorithm with various similarity measures is used, and co-occurrence statistics are computed to measure the strength of association between a name and a candidate alias.
  • Keywords: Semantic web; Information extraction; Text mining