Hidden Markov model and Persian speech recognition

Year of publication: 1402 (Solar Hijri calendar)
Document type: Journal article
Language: English

The full text of this article is available as a 9-page PDF.

Indexing date: 5 Shahrivar 1402

Abstract:

Speech recognition, the process of converting an audio signal into its equivalent text, has become one of the most important research topics. Although many studies have addressed speech recognition for many of the world's languages, comparatively few have been conducted for Persian, and more work in this area is needed. Persian is a rich language that can form many new words by adding a suffix or prefix to a root, and the success rate of speech recognition programs in this language is said to increase with the number of phonemes, so significant improvement is possible. This study therefore takes a practical approach to Persian speech recognition based on syllables, a unit between phonemes and words, implemented with the hidden Markov model. After the syllable utterances were obtained, multiple coefficients were calculated for all syllables. Suitable models were then created, and the success rate was calculated by testing the systems. System performance was measured with the error rate criterion. The results show that the word error rate for the hidden Markov model was 18.3%, and that post-processing improved system performance by approximately 16%.
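
The abstract reports a word error rate but does not state how it was computed. As a minimal sketch, assuming the standard definition (substitutions + deletions + insertions, divided by the number of reference words), the metric could be calculated as follows. The function name `word_error_rate` and the sample strings are hypothetical and are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word error rate under the standard edit-distance definition (an assumption here)."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


# Hypothetical example: one substitution and one deletion over four reference words.
print(word_error_rate("a b c d", "a x c"))
```

The same routine applies to syllable sequences if each syllable is written as a whitespace-separated token, which may be closer to the syllable-based setup the abstract describes.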

Keywords:

Hidden Markov Model, Persian Language, Speech Recognition, Syllable, Syllable Based Speech Recognition


Masoume Shafieian

Assistant Professor, Department of Technology and Media Engineering, IRIBU University, Tehran, Iran.