首页    期刊浏览 2024年07月08日 星期一
登录注册

文章基本信息

  • 标题:Dynamic Sign Language Recognition Based on CBAM with Autoencoder Time Series Neural Network
  • 本地全文:下载
  • 作者:Yanglai Huang ; Jing Huang ; Xiaoyue Wu
  • 期刊名称:Mobile Information Systems
  • 印刷版ISSN:1574-017X
  • 出版年度:2022
  • 卷号:2022
  • DOI:10.1155/2022/3247781
  • 语种:English
  • 出版社:Hindawi Publishing Corporation
  • 摘要:The CNN-LSTM network has a low generalization ability, and the backward relevance of actions is not strong. In this work, a convolutional self-encoding timing network with a fusion of attention mechanism, namely, convolutional block attention module (CBAM), is proposed. The model first designs a convolutional self-encoding network for pretraining to obtain feature vectors of smaller dimensions. Second, it uses the BN network to speed up the training process and enhance the network generalization ability. Then, we use the encoder part of the pretrained convolutional autoencoder, embed the attention mechanism to further focus on the weight of important parts in image features, and use Bi-LSTM to form a CNN-Bi-LSTM network. As compared with the traditional CNN-LSTM model, the proposed method continuously expands the training samples through the pretrained network to improve the generalization performance. The experimental results show that the method proposed in this paper effectively recognizes the sign language video. The recognition rate reaches 89.90%, which is higher as compared to other methods. These results verify the feasibility and effectiveness of the proposed method.
国家哲学社会科学文献中心版权所有