期刊名称:International Journal of Innovative Research in Science, Engineering and Technology
印刷版ISSN:2347-6710
电子版ISSN:2319-8753
出版年度:2015
卷号:4
期号:12
页码:12683
DOI:10.15680/IJIRSET.2015.0412069
出版社:S&S Publications
摘要:Searching a document from the huge collection all over the internet is becoming a challenge. Like otherlanguages Bodo language also providing content to the electronic world. Bodo is widely used in the North Easternstates of India. As text documents are increasing exponentially across the web, grouping similar documents for versatileapplications suffers from challenges in dealing with problems of high dimensionality, scalability, accuracy andmeaningful cluster label. Clustering of documents has many effective points in information retrieval and data miningfield. However, in this presentation a practical approach for clustering of Bodo documents are implemented in order tofind suitable clustering technique specifically for this purpose.
关键词:Document clustering; Bodo language; NLP; Data Mining; Stammer