期刊名称:International Journal of Computer Science Issues
印刷版ISSN:1694-0784
电子版ISSN:1694-0814
出版年度:2016
卷号:13
期号:3
出版社:IJCSI Press
摘要:With the fast development of networking, data storage, and the data collection capacity, Big Data is now rapidly expanding in all science and engineering domains, including physical, biological and biomedical sciences. This paper presents text mining and the ways used to categorize document structure techniques in big data. This subject poses a big challenge when it comes to guaranteeing the quality of extracted features in text documents to describe user interests or preferences due to large amounts of noise. This subject has many models and algorithms but still needs more to achieve best results for users, making this an open issue that needs more research.