期刊名称:International Journal of Computer and Information Technology
印刷版ISSN:2279-0764
出版年度:2018
卷号:7
期号:4
页码:163-169
出版社:International Journal of Computer and Information Technology
摘要:Latent Dirichlet Allocation (LDA) is a probabilistic
topic model that aims at organizing, visualizing, summarizing,
searching, predicting and understanding the content of any given
text data. The model enables users to discover themes in text,
annotate, organize and summarize documents. LDA inference
involves estimating the parameters and posterior distribution of a
formulated mathematical relationship. This paper investigates
topic modeling literature based on LDA and presents discoveries
and state of the art in the topic. Presented also are challenges and
popular tools. In conclusion, the paper identifies Gibbs sampling
as a popular inference mechanism and notes that the method is
limited for application in big data settings.