Journal: International Journal of Soft Computing & Engineering
Electronic ISSN: 2231-2307
Publication year: 2013
Volume: 3
Issue: 2
Pages: 238-242
Publisher: International Journal of Soft Computing & Engineering
Abstract: This paper proposes an unsupervised method for extracting keywords from a single document. A fuzzy set-theoretic approach, fuzzy n-gram indexing, is used to extract n-gram keywords. N-gram keywords are observed to give better results than mono-gram keywords, although for some documents the most relevant keyword is a mono-gram. The approach requires neither a dictionary nor a thesaurus, and it does not depend on the size of the text document. The algorithm is efficient enough to dynamically determine mono-gram, bi-gram, and longer n-gram keywords for different documents.
Keywords: keyword extraction; n-gram collocation; fuzzy set; information retrieval; natural language processing