Hidden Markov model and Persian speech recognition

  • سال انتشار: 1402
  • محل انتشار: مجله آنالیز غیر خطی و کاربردها، دوره: 14، شماره: 1
  • کد COI اختصاصی: JR_IJNAA-14-1_242
  • زبان مقاله: انگلیسی
  • تعداد مشاهده: 260
دانلود فایل این مقاله

نویسندگان

Masoume Shafieian

Assistant Professor, Department of Technology and Media Engineering IRIBU University, Tehran, Iran.

چکیده

Nowadays, speech recognition, which simply refers to the process of converting an audio signal into its equivalent text, has become one of the most important research topics. Although many studies have been conducted in the field of speech recognition for many languages of the world, but can be said that no more study has been conducted in the Persian language and therefore it is necessary to conduct more studies in this field. Since Persian is a rich language that can create many new words by adding a suffix (prefix) to its main root, so it can be said that the success rate of voice recognition programs in this language has also increased with the increase in the number of phonemes and therefore can have a significant improvement. Therefore, in this study, a practical approach to Persian speech recognition based on syllables, which are a unit between phonemes and words, has been used and done by the hidden Markov model. After obtaining syllable utterances, multiple coefficients are calculated for all syllables. Finally, suitable models were created and the success rate was calculated by conducting tests for the systems. To measure the performance of the system, the error rate criterion was used. The results of this study show that the word error rate for the hidden Markov model was ۱۸.۳% and increased the system performance by approximately ۱۶% after post-processing.

کلیدواژه ها

Hidden Markov Model, Persian Language, Speech Recognition, Syllable, Syllable Based Speech Recognition

اطلاعات بیشتر در مورد COI

COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.

کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.