首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:Web Radio Automation for Audio Stream Management in the Era of Big Data
  • 本地全文:下载
  • 作者:Nikolaos Vryzas ; Nikolaos Tsipas ; Charalampos Dimoulas
  • 期刊名称:Information
  • 电子版ISSN:2078-2489
  • 出版年度:2020
  • 卷号:11
  • 期号:4
  • 页码:205-219
  • DOI:10.3390/info11040205
  • 出版社:MDPI Publishing
  • 摘要:Radio is evolving in a changing digital media ecosystem. Audio-on-demand has shaped the landscape of big unstructured audio data available online. In this paper, a framework for knowledge extraction is introduced, to improve discoverability and enrichment of the provided content. A web application for live radio production and streaming is developed. The application offers typical live mixing and broadcasting functionality, while performing real-time annotation as a background process by logging user operation events. For the needs of a typical radio station, a supervised speaker classification model is trained for the recognition of 24 known speakers. The model is based on a convolutional neural network (CNN) architecture. Since not all speakers are known in radio shows, a CNN-based speaker diarization method is also proposed. The trained model is used for the extraction of fixed-size identity d-vectors. Several clustering algorithms are evaluated, having the d-vectors as input. The supervised speaker recognition model for 24 speakers scores an accuracy of 88.34%, while unsupervised speaker diarization scores a maximum accuracy of 87.22%, as tested on an audio file with speech segments from three unknown speakers. The results are considered encouraging regarding the applicability of the proposed methodology.
  • 关键词:web radio; big data; web application; speaker recognition; speaker diarization; convolutional neural networks web radio ; big data ; web application ; speaker recognition ; speaker diarization ; convolutional neural networks
国家哲学社会科学文献中心版权所有