Article Information

  • Title: Feature Generations Analysis of Lip Image Streams for Isolate Words Recognition
  • Authors: Yong-Ki Kim; Jong Gwan Lim; Sahngwoon Lee
  • Journal: International Journal of Multimedia and Ubiquitous Engineering
  • Print ISSN: 1975-0080
  • Year: 2015
  • Volume: 10
  • Issue: 10
  • Pages: 337-346
  • DOI:10.14257/ijmue.2015.10.10.33
  • Publisher: SERSC
  • Abstract: To overcome the decrease in the recognition rate of voice recognition in noisy environments, the implementation of Audio Visual Speech Recognition (AVSR), which combines voice and lip information, has been attempted since the 1990s. This study investigates how well various features extracted from lip image data discriminate between words, using dynamic time warping (DTW) as an objective function, with the aim of implementing a robust lip-reading system as the core process of AVSR. The features, taken from existing literature, are grid-based features (gray level, optical flow, and Sobel operator gradient) and various ratios of lip shapes calculated from coordinates. DTW was applied to each feature generation method using 180 pieces of data collected from ten study subjects who each uttered six isolated words three times, and the mean recognition rate was found to be up to 60.55%. The feature that showed the highest recognition rate was the combined vector of the width/height ratio of the outer lip and the height of the inner lip, while grid-based features were found to outperform coordinate-based features in the recognition rate of certain words.
  • Keywords: Lip-Reading System; AVSR; Image Processing; Isolated Words
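
The abstract describes comparing lip feature sequences with dynamic time warping (DTW). As a rough illustration only, the sketch below shows how DTW can score two variable-length sequences of per-frame feature vectors and how a nearest-template rule could pick a word label. The function names `dtw_distance` and `classify`, the Euclidean local cost, and the nearest-template decision rule are assumptions made for this example; the abstract does not state which variant the authors used.

```python
import math


def dtw_distance(a, b):
    """DTW distance between two sequences of per-frame feature vectors.

    Assumption: the local cost is the Euclidean distance between frames.
    """
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # a advances, b frame repeated
                cost[i][j - 1],      # b advances, a frame repeated
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def classify(query, templates):
    """Return the label of the template closest to `query` under DTW.

    `templates` is a list of (label, sequence) pairs (hypothetical layout).
    """
    return min(templates, key=lambda t: dtw_distance(query, t[1]))[0]


# Toy usage with made-up one-dimensional features, e.g. a lip width/height ratio
templates = [
    ("word_a", [[0.0], [1.0], [2.0]]),
    ("word_b", [[5.0], [5.0], [5.0]]),
]
label = classify([[0.0], [0.0], [1.0], [2.0]], templates)
```

Because DTW allows frames to be repeated during alignment, two utterances of the same word spoken at different speeds can still receive a small distance, which is why it suits isolated-word comparison with few training samples.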