首页    期刊浏览 2024年11月24日 星期日
登录注册

文章基本信息

  • 标题:Automatic Keyword Extraction From Any Text Document Using N-gram Rigid Collocation
  • 本地全文:下载
  • 作者:Bidyut Das ; Subhajit Pal ; Suman Kr. Mondal
  • 期刊名称:International Journal of Soft Computing & Engineering
  • 电子版ISSN:2231-2307
  • 出版年度:2013
  • 卷号:3
  • 期号:2
  • 页码:238-242
  • 出版社:International Journal of Soft Computing & Engineering
  • 摘要:An unsupervised method for extracting keywords from a single document is proposed in this paper. A fuzzy set theoretic approach, fuzzy n-gram indexing, is used to extract n-gram keywords. It is noticed that n-gram keyword renders a better result as compared to mono-gram keyword, but for some documents the most relevant keyword is mono-gram. This paper focuses on a keyword extraction approach which neither requires a dictionary or thesaurus nor does it depend on the size of text document. The algorithm is efficient enough to dynamically determine the mono-gram, bi-gram as well as n-grams keywords for different documents.
  • 关键词:Keyword extraction; n-gram collocation;fuzzy set; information retrieval; natural language;processing.
国家哲学社会科学文献中心版权所有