Journal of Northeastern University (Natural Science) ›› 2024, Vol. 45 ›› Issue (1): 40-48. DOI: 10.12068/j.issn.1005-3026.2024.01.006

• Information & Control •

Speech Emotion Recognition Fusing Functional Paralanguage Proportion Coefficient

Ying SUN, Ya-ru ZHOU, Xue-ying ZHANG   

  1. College of Information and Computer, Taiyuan University of Technology, Taiyuan 030024, China. Corresponding author: ZHANG Xue-ying, E-mail: tyzhangxy@163.com
  • Received: 2022-07-22 Online: 2024-01-15 Published: 2024-04-02

Abstract:

Nonverbal vocalizations in speech, such as laughter, sighs, and sobs, are called functional paralanguage and play an important role in emotional expression. However, existing research has rarely considered the synergistic effect of multiple types of functional paralanguage within a single emotion. To address this issue, a speech emotion recognition system that integrates functional paralanguage proportion coefficients (FPPC) is proposed. First, FPPC features are extracted that reflect how often, and for how long, each type of functional paralanguage appears in emotional utterances. Then, an attention-based ensemble learning model is constructed that assigns different weights to its base classifiers and is trained on the FPPC features. Finally, an adaptive entropy weight decision fusion method fuses the output of traditional speech emotion recognition with that of FPPC-based emotion recognition. Experimental results show a 16.84% improvement in emotion recognition after integrating FPPC features, indicating that these features can effectively improve the overall recognition rate of the system.
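
The abstract does not give the exact definitions of the FPPC features or of the fusion rule, so the following Python sketch is only an illustration under stated assumptions. It assumes that a proportion coefficient is the share of paralanguage events of one type (by count) together with the share of utterance time that type occupies, and that the entropy weight fusion gives the classifier with the lower-entropy (more confident) posterior the larger weight. The function names, the event labels, and both formulas are hypothetical and are not taken from the paper.

```python
import math


def proportion_coefficients(segments, utterance_duration):
    """Assumed FPPC-style features for one utterance.

    segments: list of (label, start_sec, end_sec) paralanguage events.
    Returns {label: (count_share, duration_share)}.
    """
    totals = {}
    for label, start, end in segments:
        count, duration = totals.get(label, (0, 0.0))
        totals[label] = (count + 1, duration + (end - start))
    n_events = len(segments)
    return {
        label: (count / n_events, duration / utterance_duration)
        for label, (count, duration) in totals.items()
    }


def entropy(probs):
    """Shannon entropy (natural log) of a probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def entropy_weight_fusion(p_a, p_b):
    """Fuse two posterior vectors over the same emotion classes.

    Assumption: each classifier's weight is proportional to
    1 - H(p) / log(n_classes), so a sharper posterior counts more.
    """
    h_max = math.log(len(p_a))
    conf_a = max(0.0, 1.0 - entropy(p_a) / h_max)
    conf_b = max(0.0, 1.0 - entropy(p_b) / h_max)
    total = conf_a + conf_b
    if total == 0.0:
        # Both posteriors are uniform: fall back to equal weights.
        w_a = w_b = 0.5
    else:
        w_a, w_b = conf_a / total, conf_b / total
    return [w_a * a + w_b * b for a, b in zip(p_a, p_b)]


if __name__ == "__main__":
    events = [("laugh", 0.0, 1.0), ("sigh", 2.0, 2.5), ("laugh", 3.0, 4.0)]
    print(proportion_coefficients(events, utterance_duration=10.0))
    print(entropy_weight_fusion([0.9, 0.05, 0.05], [0.4, 0.3, 0.3]))
```

Because the weights are recomputed for every utterance from the posteriors themselves, the fusion adapts per sample rather than using one fixed weight per classifier, which is one plausible reading of "adaptive" in the abstract.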

Key words: speech emotion recognition, proportion coefficient, functional paralanguage, attention mechanism, adaptive entropy weight decision fusion

CLC Number: