首页    期刊浏览 2025年07月15日 星期二
登录注册

文章基本信息

  • 标题:Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research
  • 本地全文:下载
  • 作者:Fürer, Lukas ; Schenk, Nathalie ; Roth, Volker
  • 期刊名称:Frontiers in Psychology
  • 电子版ISSN:1664-1078
  • 出版年度:2020
  • 卷号:11
  • 页码:1-8
  • DOI:10.3389/fpsyg.2020.01726
  • 出版社:Frontiers Media
  • 摘要:Speaker diarization is the practice of determining who speaks when in audio recordings. Psychotherapy research often relies on labor intensive manual diarization. Unsupervised methods are available but yield higher error rates. We present a method for supervised speaker diarization based on random forests. It presents a compromise between commonly used labor-intensive manual coding and fully automated procedures. The method is validated using the EMRAI synthetic speech corpus with the goal to examine the feasibility of later use on naturalistic data and is made available. It yields low diarization error rates (M: 5.61%, STD: 2.19). Supervised speaker diarization is a promising method for psychotherapy research and similar fields.
  • 关键词:supervised speaker diarization; Psychotherapy process measure; Dyadic audio analysis; speech corpus; random forest
国家哲学社会科学文献中心版权所有