Quality Evaluation of Speech Enhancement Algorithms for Normal and Hearing Loss Listeners
Hemangi Shinde1, A.M.Sapkal2, Aishwarya Phatak3
1Hemangi Shinde*, Departmrnt of Electronics & Telecommunication Engineering, Research Scholar, College of Engineering Pune, Pune, India, Faculty, AISSMS Institute of Information Technology, Pune, India.
2A.M.Sapkal, Departmrnt of Electronics & Telecommunication Engineering, College of Engineering Pune, Pune, India.
3Aishwarya Phatak, Department of Electronics & Telecommunication Engineering, AISSMS Institute of Information Technology, Pune, India.
Manuscript received on September 16, 2019. | Revised Manuscript received on 24 September, 2019. | Manuscript published on October 10, 2019. | PP: 7-12 | Volume-8 Issue-12, October 2019. | Retrieval Number: L24791081219/2019©BEIESP | DOI: 10.35940/ijitee.L2479.1081219
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: The subjective quality test of the enhanced speech from different enhancement algorithms for listeners with normal hearing (NH) capability as well as listeners with hearing impairment (HI) is reported. The subjective quality evaluation of speech enhancement methods in the literature survey is mostly done targeting NH listeners and fewer attempts are observed to subjectively evaluate for HI listeners. The algorithms evaluated are from four different classes: spectral subtraction class(SS), statistical model based class (minimum mean square error), subspace class(PKLT) and auditory class (ideal binary mask using STFT, ideal binary mask using gammatone filterbank and ideal binary mask using gammachirp filterbank). The algorithms are evaluated using four types of real world noises recorded in Indian scenarios namely cafeteria, traffic, station and train at -5, 0, 5 and 10 dB SNRs. The evaluation is being done as per ITU-T P.835 standard in terms of three parameters speech signal alone, background noise and overall quality. The noisy speech database developed in Indian regional language, Marathi, at four SNRs -5, 0, 5 and 10 dB is used for evaluation. Significant improvement is observed in ideal binary mask algorithm in terms of overall quality and signal distortion ratings for NH and HI listeners. The performance of minimum mean square error is also observed comparable with the ideal binary mask algorithm in some cases.
Keywords: Hearing Impaired, Ideal Binary Mask, Mean Opinion Score, Speech Enhancement.
Scope of the Article: Algorithm Engineering