Denoising of Speech Signal using Empirical Mode Decomposition and Kalman Filter
Nandhini A1, Bharath K P2, Mahalti Mohammed Sohail3, Rajesh Kumar M4
1Nandhini A, School of Electronics Engineering, VIT University, Vellore, India.
2Bharath K P, School of Electronics Engineering, VIT University, Vellore, India.
3Mahalti Mohammed Sohail, School of Electronics Engineering, VIT University, Vellore, India.
4Rajesh Kumar M*, School of Electronics Engineering, VIT University, Vellore, India.
Manuscript received on May 08, 2020. | Revised Manuscript received on May 15, 2020. | Manuscript published on June 10, 2020. | PP: 232-237 | Volume-9 Issue-8, June 2020. | Retrieval Number: 100.1/ijitee.H6313069820 | DOI: 10.35940/ijitee.H6313.069820
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Speech denoising is the process of removing the noise from the noise corrupted speech. The applications of speech denoising are used in speech enhancement, speech recognition and many more. In this work, a new approach is proposed to de-noise the speech which is corrupted from different noises, Empirical mode decomposition and the Kalman filter (EMD-KF) is used for speech denoising in the proposed work. The clean speech is corrupted by the noise with the different SNR’s, and further Empirical mode decomposition (EMD) is applied to the noise corrupted speech later the obtained resultant speech is passed through the Kalman filter (KF) which gives the denoised speech. The result shows that the mean squared error (MSE) values of EMD-KF are extremely less when compared to other methods like discrete wavelet transform (wavelet families like Daubechies and Symlet), empirical mode decomposition (EMD) and moving average filter followed by empirical mode decomposition (MA-EMD). As an application the proposed algorithm is used in the feature extraction for speech recognition. Mel frequency cepstral coefficient (MFCC) is performed on both the original speech and the denoised speech and found majority of the denoised speech features are similar to the original speech features and few denoised speech features are nearby to the original speech features.
Keywords: Empirical mode decomposition (EMD), Kalman filter (KF), Mel-frequency cepstral coefficient (MFCC), Speech denoising.
Scope of the Article: Signal and Speech Processing