首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:Mahalanobis Encodings for Visual Categorization
  • 本地全文:下载
  • 作者:Tomoki Matsuzawa ; Raissa Relator ; Wataru Takei
  • 期刊名称:Information and Media Technologies
  • 电子版ISSN:1881-0896
  • 出版年度:2015
  • 卷号:10
  • 期号:3
  • 页码:468-472
  • DOI:10.11185/imt.10.468
  • 出版社:Information and Media Technologies Editorial Board
  • 摘要:Nowadays, the design of the representation of images is one of the most crucial factors in the performance of visual categorization. A common pipeline employed in most of recent researches for obtaining an image representation consists of two steps: the encoding step and the pooling step. In this paper, we introduce the Mahalanobis metric to the two popular image patch encoding modules, Histogram Encoding and Fisher Encoding, that are used for Bag-of-Visual-Word method and Fisher Vector method, respectively. Moreover, for the proposed Fisher Vector method, a close-form approximation of Fisher Vector can be derived with the same assumption used in the original Fisher Vector, and the codebook is built without resorting to time-consuming EM (Expectation-Maximization) steps. Experimental evaluation of multi-class classification demonstrates the effectiveness of the proposed encoding methods.
  • 关键词:Bag-of-Visual-Word;Fisher Vector;Mahalanobis metric;visual categorization
国家哲学社会科学文献中心版权所有