Temporal and Spatial Features for Visual Speech Recognition

Speech recognition from visual data is an important step towards communication when audio is not available. This paper considers several hand-crafted features, including HOG, MBH, DCT, LBP, MTC, and their combinations, for recognizing speech from a sequence of images. Several classifiers, including SVM, decision trees, the K-nearest neighbor algorithm, and the subspace K-nearest neighbor algorithm, were tested for feature evaluation. The application of PCA for dimensionality reduction was also considered. Two sets of tests were carried out: lip pose recognition and recognition of isolated words. The MIRACL-VC1 data set was used for evaluation. Self-dependent tests reached an accuracy of over 95%, while in the self-independent tests the maximum recognition accuracy was about 52%.
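As a rough illustration of the kind of pipeline the abstract describes (a spatial feature per frame, pooling over the image sequence, then a nearest-neighbor classifier), the following pure-Python sketch computes a basic 8-neighbor LBP histogram and classifies with K-nearest neighbors. It is not the authors' implementation: the radius-1 LBP variant, the averaging over frames, the Euclidean distance, and the toy images and labels (`flat`, `peak`, "closed", "open") are all assumptions made for the example.

```python
from collections import Counter
from math import sqrt

# Offsets of the 8 neighbours around a pixel (radius 1), in a fixed order.
NEIGHBOURS = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
              (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_code(img, r, c):
    """Basic 8-bit LBP code for the interior pixel at (r, c)."""
    center = img[r][c]
    code = 0
    for bit, (dr, dc) in enumerate(NEIGHBOURS):
        if img[r + dr][c + dc] >= center:
            code |= 1 << bit
    return code


def lbp_histogram(img):
    """Normalised 256-bin histogram of LBP codes over interior pixels."""
    hist = [0.0] * 256
    rows, cols = len(img), len(img[0])
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            hist[lbp_code(img, r, c)] += 1.0
    total = sum(hist)
    return [h / total for h in hist] if total else hist


def sequence_feature(frames):
    """Average the per-frame histograms (an assumed, crude temporal pooling)."""
    hists = [lbp_histogram(f) for f in frames]
    return [sum(col) / len(hists) for col in zip(*hists)]


def knn_predict(train, query, k=1):
    """train is a list of (feature, label); return the majority label of the k nearest."""
    def dist(a, b):
        return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    nearest = sorted(train, key=lambda item: dist(item[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


# Toy grayscale "mouth region" images with hypothetical labels.
flat = [[5] * 4 for _ in range(4)]
peak = [[0, 0, 0], [0, 9, 0], [0, 0, 0]]
train = [(sequence_feature([flat]), "closed"),
         (sequence_feature([peak]), "open")]
print(knn_predict(train, sequence_feature([flat, flat])))
```

In a realistic setting the frames would be cropped mouth regions from a data set such as MIRACL-VC1, and a dimensionality reduction step such as PCA would typically sit between feature extraction and classification.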

Keywords:

Speech recognition, temporal features, spatial features, dimensionality reduction, classification

Authors

Ali Jafari Sheshpoli

Cyberspace Research Institute, Shahid Beheshti University, Tehran, Iran

Ali Nadian-Ghomsheh

Cyberspace Research Institute, Shahid Beheshti University, Tehran, Iran

Permanent link to this paper:

https://civilica.com/doc/725449

National scientific document ID:

COMCONF05_473

Indexing date: 21 Ordibehesht 1397 (Solar Hijri calendar)

How to cite this paper:

To cite this paper in your own research, you can use the following entry in your reference list:

Jafari Sheshpoli, Ali and Nadian-Ghomsheh, Ali, 1396, Temporal and Spatial Features for Visual Speech Recognition, Fifth International Conference on Electrical and Computer Engineering with Emphasis on Indigenous Knowledge, Tehran, https://civilica.com/doc/725449

Within the text, wherever a statement or result from this paper is mentioned, the following details are given in parentheses after it.
First citation: (Jafari Sheshpoli, Ali; Nadian-Ghomsheh, Ali, 1396)
Second and later citations: (Jafari Sheshpoli; Nadian-Ghomsheh, 1396)

Scientometrics and ranking of the paper

The institution that produced this paper is described as follows:

Scientific ranking of Shahid Beheshti University

Institution type: Public university

Number of papers: 41,043
