An Investigation on the Usage of Image Quality Assessment in Visual Speech Recognition

Amin Banitalebi; Maryam Moosaei; Gholam Ali Hossein-Zadeh

An Investigation on the Usage of Image Quality Assessment in Visual Speech Recognition

محل انتشار: ششمین کنفرانس ماشین بینایی و پردازش تصویر ایران

سال انتشار: 1389

نوع سند: مقاله کنفرانسی

زبان: انگلیسی

مشاهده: 1,957

فایل این مقاله در 5 صفحه با فرمت PDF قابل دریافت می باشد

دریافت فایل کامل مقاله

صدور گواهی نمایه سازی
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

https://civilica.com/doc/113503

شناسه ملی سند علمی:

ICMVIP06_070

تاریخ نمایه سازی: 20 فروردین 1390

چکیده مقاله:

Having a robust speech recognition scheme that can be relied upon in different environments is a strong requirement for modern systems. Previous works in field of lipreading mainly have used a level of segmentation at the beginning and then used the structure of the mouth, facial muscles of the speaker, some critical points on the lip, or the motion of these points for word recognition. In this paper we present a novel way of processing the video signal for lipreading application. We neither used segmentation level nor the extraction of important facial points. Instead, we’ve used HVS (human visual system) based image quality metrics, especially complex wavelet structural similarity (CW-SSIM) and visual information fidelity (VIF) as our similarity criterions. We used an intelligent frame by frame video comparison technique and we applied mentioned metrics in our approach. Experimental results showed that in comparison to other methods, this novel method can recognize the true letter among the letters of the utilized dictionary with an acceptable accuracy

کلیدواژه ها:

Lipreading ، Visual Information Fidelity ، Visual Word Recognition

نویسندگان

Amin Banitalebi

School of Electrical and Computer Engineering University of Tehran, Tehran ۱۴۳۹۵-۵۱۵, Iran

Maryam Moosaei

Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering

Gholam Ali Hossein-Zadeh

Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering