CONTEXT DEPENDENT MODELING IN CONTINUOUS SPEECH RECOGNITION BASED ON A PERSIAN PHONETIC DECISION TREE
محل انتشار: فصلنامه مهندسی برق مدرس، دوره: 3، شماره: 1
سال انتشار: 1382
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 113
فایل این مقاله در 14 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_MJEEMO-3-1_004
تاریخ نمایه سازی: 29 بهمن 1403
چکیده مقاله:
Context-dependent modeling is a well-known approach to increase modeling accuracy in continuous speech recognition. The most common way to implement this approach is via triphone modeling. Nevertheless, the large number of such models results in several problems in model training, whilst the robust training of such models is often hardly obtained. One approach to solve this problem is via parameter tying. In this paper, clustering has been carried out on HMM state parameters and the states allocated to any cluster are tied to decrease the overall number of system parameters and achieve robust training. Two types of groupings, one based on the final trained model set parameters and their inter-model distances and the other based on the training data and a decision tree, have been carried out. In the implementation of the later, a decision tree based on the acoustic properties of the Persian (Farsi) language and the phonetic similarities and differences has been designed. The results obtained have shown the usefulness of both the approaches. However, the second approach has the advantage of making the estimation of unseen model parameters possible.
کلیدواژه ها:
Context-Dependent Modeling ، Persian Continuous Speech Recognition ، Continuous Density Hidden Markov Models ، State tying ، Decision trees ، مدلسازی وابسته به متن ، بازشناسی گفتار پیوسته فارسی ، مدلهای مارکوف پنهان با چگالی پیوسته ، گره زدن حالتها ، درخت تصمیم گیری
نویسندگان
سید حسین شمس
Amirkabir university of technology
سید محمد احدی
Amirkabir university of technology