首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Word Class Prediction of Ambiguous and Unknown Words of Punjabi Language Using Bi-gram Methods
  • 本地全文:下载
  • 作者:Sanyam Sood ; Vishal Arora ; Sanjeev Kumar Sharma
  • 期刊名称:International Journal of Computer Applications and Information Technology
  • 印刷版ISSN:2278-7720
  • 出版年度:2014
  • 卷号:7
  • 期号:2
  • 页码:152-156
  • 出版社:Mahadev Educational Society
  • 摘要:Ambiguous and unknown words are found in every language. Ambiguous words are the words having different meaning in different sentences depending upon the context of the sentence. Assigning the correct word class to these ambiguous words is the fundamental task in almost all the NLP applications. A lot of work has been done on this and a lot of work is still to be done. Many statistical and rule based techniques has been applied to assign the correct word class to the word having ambiguous word class. Most commonly used statistical techniques are HMM (Hidden Markov Model), SVM (Support Vector Machine), ME (Maximum Entropy), CRF (Conditional Random Field) and N-gram based techniques. In this research paper a bigram technique has been discussed to assign the correct word class to the ambiguous and unknown words of Punjabi language. A tag set proposed by TDIL has been used to assign the correct word class to the ambiguous and unknown words
  • 关键词:Ambiguous words; word class; Unknown words; Bi-gram technique; TDIL proposed Punjabi tag set
国家哲学社会科学文献中心版权所有