首页    期刊浏览 2025年02月20日 星期四
登录注册

文章基本信息

  • 标题:Automatic Keywords Extraction for Punjabi Language
  • 作者:Vishal Gupta ; Gurpreet Singh Lehal
  • 期刊名称:International Journal of Computer Science Issues
  • 印刷版ISSN:1694-0784
  • 电子版ISSN:1694-0814
  • 出版年度:2011
  • 卷号:8
  • 期号:5
  • 出版社:IJCSI Press
  • 摘要:Automatic keywords extraction is the task to identify a small set of words, key phrases, keywords, or key segments from a document that can describe the meaning of the document. Keywords are useful tools as they give the shortest summary of the document. This paper concentrates on Automatic keywords extraction for Punjabi language text. It includes various phases like removing stop words, Identification of Punjabi nouns and noun stemming, Calculation of Term Frequency and Inverse Sentence Frequency (TF-ISF), Punjabi keywords as nouns with high TF-ISF score and title/headline feature for Punjabi text. The extracted keywords are very much helpful in automatic indexing, text summarization, information retrieval, classification, clustering, topic detection and tracking and web searches etc.
  • 关键词:Punjabi keywords extraction; Keywords; Key phrases; TF;ISF
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有