首页    期刊浏览 2024年10月05日 星期六
登录注册

文章基本信息

  • 标题:A Decoupled Architecture for Scalability in Text Mining Applications
  • 本地全文:下载
  • 作者:Rafael A. Calvo
  • 期刊名称:Journal of Universal Computer Science
  • 印刷版ISSN:0948-6968
  • 出版年度:2013
  • 卷号:19
  • 期号:3
  • 页码:406
  • 出版社:Graz University of Technology and Know-Center
  • 摘要:Sophisticated Text Mining features such as visualization, summarization, and clustering are becoming increasingly common in software applications. In Text Mining, documents are processed using techniques from different areas which can be very expensive in computation cost. This poses a scalability challenge for real-life applications in which users behavior can not be entirely predicted. This paper proposes a decoupled architecture for document processing in Text Mining applications, that allows applications to be scalable for large corpora and real-time processing. It contributes a software architecture designed around these requirements and presents TML, a Text Mining Library that implements the architecture. An experimental evaluation on its scalability using a standard corpus is also presented, and empirical evidence on its performance as part of an automated feedback system for writing tasks used by real students
  • 关键词:automatic feedback; software architecture; text mining
国家哲学社会科学文献中心版权所有