首页    期刊浏览 2024年07月05日 星期五
登录注册

文章基本信息

  • 标题:Introducing off-diagonal elements to singular value matrix in probabilistic Latent Semantic Indexing
  • 本地全文:下载
  • 作者:Naoki Shibayama ; Hiroshi Nakagawa
  • 期刊名称:人工知能学会論文誌
  • 印刷版ISSN:1346-0714
  • 电子版ISSN:1346-8030
  • 出版年度:2011
  • 卷号:26
  • 期号:1
  • 页码:262-272
  • DOI:10.1527/tjsai.26.262
  • 出版社:The Japanese Society for Artificial Intelligence
  • 摘要:probabilistic Latent Semantic Indexing (pLSI) is a fundamental method for the analysis of text and related resources which is based on a simple statistical model. This method has high extendibility and scalability due to its simplicity. pLSI is also known as matrix factorization method such as Singular Value Decomposition(SVD) or Non-negative Matrix Factorization. Using pLSI, three matrices which include one diagonal matrix as SVD are achieved. The diagonal elements of this diagonal matrix represent singular values in SVD. However it is not entirely clear what the diagonal matrix of pLSI represents. Then it is also unclear whether the diagonalization constraint is necessary in pLSI.

    This question is the starting point of this paper. To make an answer for this question, we demonstrated that introducing off-diagonal elements to singular value matrix in pLSI is equal to permitting joint probability between different hidden variables. Although permitting joint probability in pLSI does not lose scalability and simplicity, our experiments demonstrated that this extension showed tolerance for over-learning and over-fitting problems.

  • 关键词:probabilistic latent semantic indexing ; singular value decomposition ; off-diagonal elements
国家哲学社会科学文献中心版权所有