Sitemap  |  Contact  |  Home  |  中文  |  CAS  |  Director’s Email
International Cooperation
Education & Training
Societies & Publications
Chinese Journal of Acoustics
 
 
  Location:Home>Chinese Journal of Acoustics
Acoustic analysis of the vowel system in Hotan dialect and its contribution to dialect recognition of Uyghur dialects(2020 No.1)
Author:
ArticleSource:
Update time: 2024/07/24
Viewed:
Close
Text Size: A A A
Print

Title: Acoustic analysis of the vowel system in Hotan dialect and its contribution to dialect recognition of Uyghur dialects


SUN Jie; WUSHOUER Silamu; REYIMAN Turson; ZHANG Jingjing;


Affiliation(s): College of Information Science and Engineering, Xinjiang University; Department of Physics, Changji University;


Abstract: Based on the actual needs of speech application research such as speech recognition and voiceprint recognition, the acoustic characteristics and recognition of Hotan dialect were studied for the first time. Firstly, the Hetian dialect voice was selected for artificial multi-level annotation, and the formant, duration and intensity of the vowel were analyzed to describe statistically the main pattern of Hetian dialect and the pronunciation characteristics of male and female. Then using the analysis of variance and nonparametric analysis to test the formant samples of the three dialects of Uygur language, the results show that there are significant differences in the formant distribution patterns of male vowels, female vowels and whole vowels in the three dialects. Finally, the GUM-UBM (Gaussian Mixture Model-Universal Background Model), DNN-UBM (Deep Neural Networks-Universal Background Model) and LSTM-UBM (Long Short Term Memory Network-Universal Background Model) Uyghur dialect recognition models are constructed respectively. Based on the Mel-frequency cepstrum coefficient and its combination with the formant frequency for the input feature extraction, the contrastive experiment of dialect i-vector distinctiveness is carried out. The experimental results show that the combined features of the formant coefficients can increase the recognition of the dialect, and the LSTM-UBM model can extract more discriminative dialects than the GMM-UBM and DNN-UBM.


 
Copyright ? 1996 - 2020 Institute of Acoustics, Chinese Academy of Sciences
No. 21 North 4th Ring Road, Haidian District, 100190 Beijing, China
E-mail: ioa@mail.ioa.ac.cn