A Method for Automatic Vietnamese Speech Segmentation
Tuan Tran Anh1, Mong Nguyen Huu2, Khanh Nguyen Trong3
1Tuan Anh Tran, Department of Computer Science, Faculty of Information Technology, Industrial College, in Thanh Hoa, Vietnam, (84 4) 819801201.
2Dr. Mong Nguyen Huu, Department of Computer Science, Department of Information Technology, Military Technical Institute, in HaNoi, Vietnam, (84 4) 913002799.
3Dr. Khanh Nguyen Trong, Software Engineering, Posts and Telecommunications Institute of Technology, Hanoi, Vietnam, (84 4) 912314482., Sorbonne University, IRD, UMMISCO, JEAI WARM, F-93143, Bondy, France
Manuscript received on 21 August 2019. | Revised Manuscript received on 02 September 2019. | Manuscript published on 30 September 2019. | PP: 2887-2892 | Volume-8 Issue-11, September 2019. | Retrieval Number: K24270981119/2019©BEIESP | DOI: 10.35940/ijitee.K2427.0981119
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: In speech synthesis and recognition, the segmentation is an important step. The result of further steps depend completely on this process. There are several effective segmentation method in the literature, but for Vietnamese speech, researchers usually base on their experience to set the length while using sliding window. It causes an inefficient segmentation; and they need to try with the other value (length of voice). In this paper, we propose a method supporting in segmentation for Vietnamese speech and automatically determine the suitable length of voices and silent pause. We firstly estimate, by experimenting, the min and average length of a voice and a silent pause for Vietnamese speech in three main type speaking (slow, normal and fast). Then, based on these values, we start to segment the voice and pause by sliding window with proposed algorithm. Experiment results show that the proposed method can be used to effectively segment the Vietnamese speech.
Keywords: Segmentation, automatic speech segmentation, Vietnamese speech segmentation.
Scope of the Article: Signal and Speech Processing