文章基本信息

标题：Continuous Bangla Speech Segmentation using Short-term Speech Features Extraction Approaches
本地全文：下载
作者：Md Mijanur Rahman ; Md. Al-Amin Bhuiyan
期刊名称：International Journal of Advanced Computer Science and Applications(IJACSA)
印刷版ISSN：2158-107X
电子版ISSN：2156-5570
出版年度：2012
卷号：3
期号：11
DOI：10.14569/IJACSA.2012.031121
出版社：Science and Information Society (SAI)
摘要：This paper presents simple and novel feature extraction approaches for segmenting continuous Bangla speech sentences into words/sub-words. These methods are based on two simple speech features, namely the time-domain features and the frequency-domain features. The time-domain features, such as short-time signal energy, short-time average zero crossing rate and the frequency-domain features, such as spectral centroid and spectral flux features are extracted in this research work. After the feature sequences are extracted, a simple dynamic thresholding criterion is applied in order to detect the word boundaries and label the entire speech sentence into a sequence of words/sub-words. All the algorithms used in this research are implemented in Matlab and the implemented automatic speech segmentation system achieved segmentation accuracy of 96%.
关键词：thesai; IJACSA; thesai.org; journal; IJACSA papers; Speech Segmentation; Features Extraction; Short-time Energy; Spectral Centroid; Dynamic Thresholding.