Article Information

  • Title: An Approach to Bodo Document Clustering
  • Authors: Abdul Hannan; Samiran Raj Boro; Jay Prakash Sarma
  • Journal: International Journal of Innovative Research in Science, Engineering and Technology
  • Print ISSN: 2347-6710
  • Electronic ISSN: 2319-8753
  • Year of publication: 2015
  • Volume: 4
  • Issue: 12
  • Page: 12683
  • DOI: 10.15680/IJIRSET.2015.0412069
  • Publisher: S&S Publications
  • Abstract: Finding a document in the huge collection spread across the internet is becoming a challenge. Like other languages, Bodo, which is widely used in the North Eastern states of India, is also contributing content to the electronic world. As text documents increase exponentially across the web, grouping similar documents for versatile applications faces challenges of high dimensionality, scalability, accuracy, and meaningful cluster labels. Document clustering has many uses in information retrieval and data mining. In this work, a practical approach to clustering Bodo documents is implemented in order to find a clustering technique suited specifically to this purpose.
  • Keywords: Document clustering; Bodo language; NLP; Data Mining; Stemmer