Text to Speech Synthesis System for Punjabi language using Statistical Parametric Speech Synthesis Technique
Harsimarjeet Kaur1, Parminder Singh2
1Harsimarjeet Kaur, Department of Computer Science and Engineering, Guru Nanak Dev Engineering College, Ludhiana (Punjab), India.
2Parminder Singh, Department of Computer Science and Engineering, Guru Nanak Dev Engineering College, Ludhiana (Punjab), India.
Manuscript received on 20 August 2019 | Revised Manuscript received on 27 August 2019 | Manuscript Published on 26 August 2019 | PP: 268-272 | Volume-8 Issue-9S August 2019 | Retrieval Number: I10420789S19/19©BEIESP | DOI: 10.35940/ijitee.I1042.0789S19
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open-access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Statistical parametric speech synthesis (SPSS) has become one of the fastest-growing techniques in speech synthesis, and it overcomes several shortcomings of the traditional approaches. Compared with traditional synthesis techniques, SPSS offers more flexibility to change voice characteristics, supports multiple languages (multilingual synthesis), provides good coverage of the acoustic space, and is robust. It can generate high-quality speech from a small training database. Hidden Markov models (HMMs) and deep neural networks (DNNs) are the basic SPSS techniques; Gaussian mixture models and sinusoidal models also fall into this category. Features are extracted in two groups: spectral features, such as spectral bandwidth and spectral centroid, and excitation features, such as the fundamental frequency (F0). We use 722 Punjabi phonemes. Using Sound Forge software, we extracted 200 wave files related to those phonemes from a 1-hour pre-recorded wave file. For each phoneme, 28 features were extracted and saved in a database. A text-to-speech (TTS) system produces speech as output when given Punjabi text as input. Many TTS systems have already been developed for different Indian languages; the system we are building targets Punjabi only.
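As an illustration of the two feature groups named in the abstract, the following minimal sketch computes two spectral features (spectral centroid and spectral bandwidth) and one excitation feature (F0) for a single audio segment. It is not the authors' implementation: the abstract does not list the 28 features or the extraction tools, so the use of NumPy, the function names `spectral_features` and `estimate_f0`, the Hann window, the autocorrelation pitch estimate, and the 75 to 400 Hz search range are all assumptions made for this example.

```python
import numpy as np


def spectral_features(signal, sample_rate):
    """Return (centroid_hz, bandwidth_hz) for one mono segment.

    Hypothetical helper: the centroid is the magnitude-weighted mean
    frequency, the bandwidth is the weighted standard deviation around it.
    """
    windowed = signal * np.hanning(len(signal))  # reduce spectral leakage
    magnitude = np.abs(np.fft.rfft(windowed))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    total = magnitude.sum()
    if total == 0:  # silent segment: no meaningful spectrum
        return 0.0, 0.0
    weights = magnitude / total
    centroid = float(np.sum(freqs * weights))
    bandwidth = float(np.sqrt(np.sum(weights * (freqs - centroid) ** 2)))
    return centroid, bandwidth


def estimate_f0(signal, sample_rate, fmin=75.0, fmax=400.0):
    """Estimate F0 in Hz from the strongest autocorrelation peak.

    Assumed search range fmin..fmax; returns 0.0 if the segment is too
    short to cover that range.
    """
    centered = signal - np.mean(signal)
    # Keep only non-negative lags of the full autocorrelation.
    autocorr = np.correlate(centered, centered, mode="full")[len(centered) - 1:]
    lag_min = int(sample_rate / fmax)
    lag_max = min(int(sample_rate / fmin), len(autocorr) - 1)
    if lag_max <= lag_min:
        return 0.0
    best_lag = lag_min + int(np.argmax(autocorr[lag_min:lag_max + 1]))
    return sample_rate / best_lag


# Synthetic stand-in for a phoneme segment: a 200 Hz tone, 0.1 s at 16 kHz.
sample_rate = 16000
t = np.arange(int(0.1 * sample_rate)) / sample_rate
tone = np.sin(2 * np.pi * 200.0 * t)

centroid, bandwidth = spectral_features(tone, sample_rate)
f0 = estimate_f0(tone, sample_rate)
```

In practice each phoneme's wave file would be loaded in place of the synthetic tone, and the resulting values stored as one row of the feature database. Voiced and unvoiced phonemes would likely need separate handling, since an autocorrelation F0 estimate is only meaningful for voiced speech.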
Keywords: SPSS, TTS, Phonemes, HMM.
Scope of the Article: Automated Software Design and Synthesis