首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Automatic Keywords Extraction for Punjabi Language
  • 本地全文:下载
  • 作者:Vishal Gupta ; Gurpreet Singh Lehal
  • 期刊名称:International Journal of Computer Science Issues
  • 印刷版ISSN:1694-0784
  • 电子版ISSN:1694-0814
  • 出版年度:2011
  • 卷号:8
  • 期号:5
  • 出版社:IJCSI Press
  • 摘要:Automatic keywords extraction is the task to identify a small set of words, key phrases, keywords, or key segments from a document that can describe the meaning of the document. Keywords are useful tools as they give the shortest summary of the document. This paper concentrates on Automatic keywords extraction for Punjabi language text. It includes various phases like removing stop words, Identification of Punjabi nouns and noun stemming, Calculation of Term Frequency and Inverse Sentence Frequency (TF-ISF), Punjabi keywords as nouns with high TF-ISF score and title/headline feature for Punjabi text. The extracted keywords are very much helpful in automatic indexing, text summarization, information retrieval, classification, clustering, topic detection and tracking and web searches etc.
  • 关键词:Punjabi keywords extraction; Keywords; Key phrases; TF-ISF
国家哲学社会科学文献中心版权所有