首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:Text Mining For Information Systems Researchers: An Annotated Topic Modeling Tutorial
  • 作者:Debortoli, Stefan ; Müller, Oliver ; Junglas, Iris
  • 期刊名称:Communications of the Association for Information Systems
  • 印刷版ISSN:1529-3181
  • 出版年度:2016
  • 卷号:39
  • 期号:1
  • 页码:7
  • 出版社:Association for Information Systems
  • 摘要:Analysts have estimated that more than 80 percent of today’s data is stored in unstructured form (e.g., text, audio, image, video)—much of it expressed in rich and ambiguous natural language. Traditionally, to analyze natural language, one has used qualitative data-analysis approaches, such as manual coding. Yet, the size of text data sets obtained from the Internet makes manual analysis virtually impossible. In this tutorial, we discuss the challenges encountered when applying automated text-mining techniques in information systems research. In particular, we showcase how to use probabilistic topic modeling via Latent Dirichlet allocation, an unsupervised text-mining technique, with a LASSO multinomial logistic regression to explain user satisfaction with an IT artifact by automatically analyzing more than 12,000 online customer reviews. For fellow information systems researchers, this tutorial provides guidance for conducting text-mining studies on their own and for evaluating the quality of others.
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有