Speech Emotion Recognition Using CapsNet
Sukanya K S1, Leya Elizabeth Sunny2
1Sukanya K.S, Department of Computer Science, Mar Athanasius College of Engineering, Kothamangalam, India.
2Leya Elizabeth Sunny, Department of Computer Science, Mar Athanasius College of Engineering, Kothamangalam, India.
Manuscript received on 10 April 2019 | Revised Manuscript received on 17 April 2019 | Manuscript Published on 24 May 2019 | PP: 33-36 | Volume-8 Issue-6S3 April 2019 | Retrieval Number: F22070486S219/19©BEIESP
Open Access | Editorial and Publishing Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open-access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Extraction of emotion features is the key to emotion recognition from speech. Capsnet is an emerging neural network technology which gives better performance over convolution neural networks in feature extraction. This is the system which implement a speech emotion recognition system using Capsnet. With the gradual development of the new generation of man-machine interaction technology speech emotion recognition has attracted wide research attentions. In facing with the development trend of new technologies, speech interaction is go-ing to penetrate into thousands of households. Traditional machine learning method has achieved great progresses in speech emotion recognition. However, there are some problems: first, which features can reflect the differences between different emotions and the second, these artificially designed features rely highly on database and have low generalization ability. It takes long time to extract fea-tures from the speech. Deep learning can extract different layers of features from the original data through automatic learning. Capsule Network or Capsnet, is composed of a number of capsules in each layer as the name indicate. Each cap-sule is a group of neurons who work together to get a specific outcome for the capsule. Speech emotion recognition works based on the spectrogram constructed from the voice record. A spectrogram is the plot of the spectrum of frequencies of sound as they vary with time.
Keywords: Capsnet, Neural Networks, Speech Emotion Recognition.
Scope of the Article: Computer Science and Its Applications