首页    期刊浏览 2024年07月08日 星期一
登录注册

文章基本信息

  • 标题:Dynamic Detection and Recognition of Objects Based on Sequential RGB Images
  • 本地全文:下载
  • 作者:Shuai Dong ; Zhihua Yang ; Wensheng Li
  • 期刊名称:Future Internet
  • 电子版ISSN:1999-5903
  • 出版年度:2021
  • 卷号:13
  • 期号:7
  • 页码:176
  • DOI:10.3390/fi13070176
  • 出版社:MDPI Publishing
  • 摘要:Conveyors are used commonly in industrial production lines and automated sorting systems. Many applications require fast, reliable, and dynamic detection and recognition for the objects on conveyors. Aiming at this goal, we design a framework that involves three subtasks: one-class instance segmentation (OCIS), multiobject tracking (MOT), and zero-shot fine-grained recognition of 3D objects (ZSFGR3D). A new level set map network (LSMNet) and a multiview redundancy-free feature network (MVRFFNet) are proposed for the first and third subtasks, respectively. The level set map (LSM) is used to annotate instances instead of the traditional multichannel binary mask, and each peak of the LSM represents one instance. Based on the LSM, LSMNet can adopt a pix2pix architecture to segment instances. MVRFFNet is a generalized zero-shot learning (GZSL) framework based on the Wasserstein generative adversarial network for 3D object recognition. Multi-view features of an object are combined into a compact registered feature. By treating the registered features as the category attribution in the GZSL setting, MVRFFNet learns a mapping function that maps original retrieve features into a new redundancy-free feature space. To validate the performance of the proposed methods, a segmentation dataset and a fine-grained classification dataset about objects on a conveyor are established. Experimental results on these datasets show that LSMNet can achieve a recalling accuracy close to the light instance segmentation framework You Only Look At CoefficienTs (YOLACT), while its computing speed on an NVIDIA GTX1660TI GPU is 80 fps, which is much faster than YOLACT’s 25 fps. Redundancy-free features generated by MVRFFNet perform much better than original features in the retrieval task.
  • 关键词:one-class instance segmentation; level set map; multiview feature; fine-grained recognition; generalized zero-shot learning one-class instance segmentation ; level set map ; multiview feature ; fine-grained recognition ; generalized zero-shot learning
国家哲学社会科学文献中心版权所有