文章基本信息

标题：Causal inference of asynchronous audiovisual speech
本地全文：下载
作者：Magnotti, John F ; Ma, Wei Ji ; Beauchamp, Michael S 等
期刊名称：Frontiers in Psychology
电子版ISSN：1664-1078
出版年度：2013
卷号：4
页码：1-10
DOI：10.3389/fpsyg.2013.00798
出版社：Frontiers Media
摘要：During speech perception, humans integrate auditory information from the voice with visual information from the face. This multisensory integration increases perceptual precision, but only if the two cues come from the same talker; this requirement has been largely ignored by current models of speech perception. We describe a generative model of multisensory speech perception that includes this critical step of determining the likelihood that the voice and face information have a common cause. A key feature of the model is that it is based on a principled analysis of how an observer should solve this causal inference problem using the asynchrony between two cues and the reliability of the cues. This allows the model to make predictions abut the behavior of subjects performing a synchrony judgment task, predictive power that does not exist in other approaches, such as post hoc fitting of Gaussian curves to behavioral data. We tested the model predictions against the performance of 37 subjects performing a synchrony judgment task viewing audiovisual speech under a variety of manipulations, including varying asynchronies, intelligibility, and visual cue reliability. The causal inference model outperformed the Gaussian model across two experiments, providing a better fit to the behavioral data with fewer parameters. Because the causal inference model is derived from a principled understanding of the task, model parameters are directly interpretable in terms of stimulus and subject properties.
关键词：causal inference; Synchrony judgments; Speech Perception; multisensory integration; Bayesian observer