
Article Information

  • Title: A Novel Low-Bit Quantization Strategy for Compressing Deep Neural Networks
  • Authors: Xin Long; XiangRong Zeng; Zongcheng Ben
  • Journal: Computational Intelligence and Neuroscience
  • Print ISSN: 1687-5265
  • Electronic ISSN: 1687-5273
  • Year: 2020
  • Volume: 2020
  • Pages: 1-7
  • DOI: 10.1155/2020/7839064
  • Publisher: Hindawi Publishing Corporation
  • Abstract:

    The growing sophistication of neural network models in recent years has sharply increased their memory consumption and computational cost, hindering deployment on ASICs, FPGAs, and mobile devices. Compressing and accelerating neural networks is therefore necessary. In this study, we introduce a novel strategy for training low-bit networks whose weights and activations are quantized to a few bits, and we address two corresponding fundamental issues. The first is approximating activations through low-bit discretization, which reduces the network's computational cost and dot-product memory. The second is specifying a weight quantization and update mechanism for discrete weights that avoids gradient mismatch. With quantized low-bit weights and activations, costly full-precision operations can be replaced by shift operations. We evaluate the proposed method on common datasets, and the results show that it can dramatically compress a neural network with only slight accuracy loss.
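The abstract names two generic ingredients without giving details: quantizing weights so that multiplications reduce to shifts (which implies power-of-two weight values) and discretizing activations to a few bits. Below is a minimal NumPy sketch of both under those assumptions; the function names, bit widths, and clipping ranges are illustrative, not the paper's exact scheme.

```python
import numpy as np

def quantize_weights_pow2(w, bits=4):
    """Map each weight to the nearest signed power of two.

    With power-of-two weights, the product w * x can be computed as a
    bit shift of x plus a sign flip, replacing the full-precision
    multiplication as the abstract describes.
    """
    max_exp = 0                              # assume magnitudes clipped to at most 2^0 = 1
    min_exp = max_exp - 2 ** (bits - 1) + 1  # smallest exponent the bit budget allows
    sign = np.where(w >= 0, 1.0, -1.0)
    mag = np.clip(np.abs(w), 2.0 ** min_exp, 2.0 ** max_exp)
    exp = np.round(np.log2(mag))             # nearest exponent in the log domain
    return sign * 2.0 ** exp

def quantize_activations(x, bits=2):
    """Uniformly discretize activations clipped to [0, 1] onto 2^bits levels."""
    steps = 2 ** bits - 1
    return np.round(np.clip(x, 0.0, 1.0) * steps) / steps
```

In training, the rounding step has zero gradient almost everywhere, so some surrogate is needed; a straight-through estimator (passing the gradient through the rounding unchanged) is a common choice, though the abstract does not say whether the paper's update mechanism for discrete weights works this way.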
