光电技术应用, 2019, 34 (6): 34, 网络出版: 2019-12-08  

基于MFCC和GFCC混合特征的语音情感识别研究

Research on Speech Emotion Recognition Based on Mixed Features of MFCC and GFCC
作者单位
中国刑事警察学院, 沈阳 110854
摘要
针对MFCC滤波器存在语音高频信号泄露的问题, 为避免基于MFCC特征对语音进行情感识别时存在有效情感特征丢失的局限性, 结合MFCC的高准确性和GFCC的强鲁棒性, 提出了基于MFCC与GFCC混合特征训练CNN对语音进行情感识别的方法, 有效提高了语音情感识别的准确率, 改善了CNN模型的识别性能。实验结果表明, 所设计的混合特征识别方法较传统识别方法识别率明显升高并达到了83%, 实现了语言情感识别准确率的有效提升。
Abstract
Aiming at the problem of voice high frequency signal leakage in Mel-scale frequency cepstral coefficients (MFCC) filter, in order to avoid the limitation of effective emotional feature loss when emotion recognition based on MFCC feature, combined with the high accuracy of MFCC and the strong robustness of GFCC, based on the hybrid feature of MFCC and GFCC, CNN is used to identify the emotion of speech, which improves the accuracy of speech emotion recognition and improves the recognition performance of CNN model. Experimental results show that the proposed hybrid feature recognition method has a significantly higher recognition rate than the traditional recognition method and reaches 83%, which achieves an effective improvement of the language emotion recognition accuracy.

郭卉, 姜囡, 任杰. 基于MFCC和GFCC混合特征的语音情感识别研究[J]. 光电技术应用, 2019, 34(6): 34. GUO Hui, JIANG Nan, REN Jie. Research on Speech Emotion Recognition Based on Mixed Features of MFCC and GFCC[J]. Electro-Optic Technology Application, 2019, 34(6): 34.

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!