A Speech Emotion Recognition Method Based on Lightweight Capsule Network

WANG Ying; GAO Sheng

doi:10.12178/1001-0548.2022086

WANG Ying, GAO Sheng. A Speech Emotion Recognition Method Based on Lightweight Capsule Network[J]. Journal of University of Electronic Science and Technology of China, 2023, 52(3): 423-429. DOI: 10.12178/1001-0548.2022086

Abstract

To address the large parameter counts, heavy computation, and slow training of current speech emotion recognition models, this paper proposes a lightweight network model suited to small datasets. The model is based on the capsule network; a depthwise separable convolution module replaces the original convolution layer in the capsule network to reduce the amount of computation. Transfer learning is used to extract general low-level image features, and the whole network is then fine-tuned on spectrograms to reduce the model's overfitting on small datasets. In the dynamic routing structure, vector similarity is computed with the cosine of the angle between vectors, which improves the performance of the dynamic routing algorithm. Experimental results show that the lightweight capsule network outperforms seven deep learning network models in both recognition rate and running speed.
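
The two ideas named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract gives no layer sizes or routing details, so the function names, the three routing iterations, the softmax over output capsules, and the squashing non-linearity are all assumptions borrowed from the standard capsule network formulation. The first two helpers compare weight counts for a standard convolution and a depthwise separable one; the routing function replaces the usual dot-product agreement with cosine similarity.

```python
import numpy as np


def conv_params(k, c_in, c_out):
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def separable_conv_params(k, c_in, c_out):
    """Weights in a depthwise k x k convolution plus a 1 x 1 pointwise convolution."""
    return k * k * c_in + c_in * c_out


def squash(s, eps=1e-8):
    """Capsule squashing: keeps the direction of s and maps its norm into [0, 1)."""
    sq = np.sum(s * s, axis=-1, keepdims=True)
    return (sq / (1.0 + sq)) * s / np.sqrt(sq + eps)


def cosine_routing(u_hat, n_iter=3, eps=1e-8):
    """Dynamic routing with cosine-similarity agreement (assumed formulation).

    u_hat: prediction vectors of shape (n_in, n_out, dim).
    Returns output capsules of shape (n_out, dim).
    """
    n_in, n_out, _ = u_hat.shape
    b = np.zeros((n_in, n_out))  # routing logits
    for _ in range(n_iter):
        # Coupling coefficients: softmax over output capsules for each input capsule.
        e = np.exp(b - b.max(axis=1, keepdims=True))
        c = e / e.sum(axis=1, keepdims=True)
        # Weighted sum of predictions, then squash.
        s = np.einsum("ij,ijd->jd", c, u_hat)
        v = squash(s)
        # Agreement: cosine of the angle between each prediction and its output capsule.
        dot = np.einsum("ijd,jd->ij", u_hat, v)
        norms = np.linalg.norm(u_hat, axis=-1) * np.linalg.norm(v, axis=-1)[None, :]
        b = b + dot / (norms + eps)
    return v
```

For a hypothetical 3 x 3 layer with 64 input and 128 output channels, `conv_params(3, 64, 128)` gives 73728 weights while `separable_conv_params(3, 64, 128)` gives 8768, which shows where the reduction in computation comes from. Because cosine similarity is bounded in [-1, 1], each routing iteration changes a logit by at most 1 regardless of the magnitude of the prediction vectors.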
