Journal: International Journal of Innovative Research in Computer and Communication Engineering
Print ISSN: 2320-9798
Electronic ISSN: 2320-9801
Publication year: 2015
Volume: 3
Issue: 7
DOI: 10.15680/ijircce.2015.0307182 6588
Publisher: S&S Publications
Abstract: Finding both the personal name and the aliases of a person with a single query is generally a tedious process. This problem can be addressed by improving the search process. In this project, the set of patterns that describes the different ways in which aliases can be represented is first extracted. The name given as input is then compared with the extracted patterns to find the aliases of that name. The improvement made here is the introduction of a ranking methodology for the candidate name and aliases. The methodology ranks candidates based on lexical pattern frequency and page count. Since page count can sometimes be less efficient, the ranking in this project uses the number of times the same name is repeated in the dataset. If the name count returned for a given input name is high, then the displayed files are more related to the same
Keywords: name alias; ranking; web mining; pattern extraction
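
The ranking idea outlined in the abstract can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration, not the authors' implementation: the three lexical patterns, the function name `rank_aliases`, and the choice to sort by pattern frequency first and by name count second (the abstract does not say how the two signals are combined).

```python
import re
from collections import Counter

# A capitalized word sequence, used as a rough stand-in for a candidate alias.
ALIAS = r"([A-Z]\w*(?: [A-Z]\w*)*)"

# Hypothetical lexical patterns (assumed for illustration, not taken from the paper).
# "{name}" is replaced by the escaped input name.
PATTERNS = [
    r"{name},? also known as " + ALIAS,
    r"{name},? aka " + ALIAS,
    ALIAS + r",? better known as {name}",
]


def rank_aliases(name, documents):
    """Return (alias, pattern_frequency, name_count) tuples, best candidate first."""
    pattern_freq = Counter()
    for template in PATTERNS:
        regex = re.compile(template.format(name=re.escape(name)))
        for doc in documents:
            for match in regex.finditer(doc):
                pattern_freq[match.group(1)] += 1

    # Name count: how often each candidate alias occurs anywhere in the dataset.
    name_count = {
        alias: sum(doc.count(alias) for doc in documents)
        for alias in pattern_freq
    }

    # Assumed combination: pattern frequency first, name count as tie-breaker.
    return sorted(
        ((alias, pattern_freq[alias], name_count[alias]) for alias in pattern_freq),
        key=lambda item: (item[1], item[2]),
        reverse=True,
    )


if __name__ == "__main__":
    docs = [
        "Will Smith, also known as Fresh Prince, starred in the show.",
        "Fans say Will Smith aka Fresh Prince is back. Fresh Prince returns.",
        "Will Smith aka Big Willie toured in 1999.",
    ]
    for alias, freq, count in rank_aliases("Will Smith", docs):
        print(alias, freq, count)
```

In this toy dataset the candidate matched by more patterns and repeated more often in the documents is ranked first. A real system would need a far larger pattern set and a more careful notion of what counts as a name.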