Language Recognition By Convolutional Neural Networks

Ladan Khosravani pour; Ali Farrokhi

Language Recognition By Convolutional Neural Networks

محل انتشار: اولین کنفرانس بین المللی فناوری اطلاعات، مدیریت و کامپیوتر

سال انتشار: 1403

نوع سند: مقاله کنفرانسی

زبان: انگلیسی

مشاهده: 144

فایل این مقاله در 8 صفحه با فرمت PDF قابل دریافت می باشد

دریافت فایل کامل مقاله

صدور گواهی نمایه سازی
من نویسنده این مقاله هستم

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

هوش مصنوعی > هوش مصنوعی

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

https://civilica.com/doc/2083702

شناسه ملی سند علمی:

CONFIT01_0400

تاریخ نمایه سازی: 4 مهر 1403

چکیده مقاله:

Speech recognition representing a communication between computers and human as a sub field of computational linguistics or natural language processing has a long history. Automatic Speech Recognition (ASR), Text to Speech (TTS), speech to text, Continuous Speech Recognition (CSR), and interactive voice response systems are different approaches to solving problems in this area. The performance improvement is partially attributed to the ability of the Deep Neural Network (DNN) to model complex correlations in speech features. In this paper, unlike the use of conventional model for sequential data like voice that employs Recurrent Neural Network (RNNs) with the emergence of different architectures in deep networks and good performance of Conventional Neural Networks (CNNs) in image processing and feature extraction, the application of CNNs was developed in other domains. It was shown that prosodic features for Persian language could be extracted via CNNs for segmentation and labeling speech for short texts. By using ۱۲۸ and ۲۰۰ filters for CNN and special architectures, ۱۹.۴۶ error in detection rate and better time consumption than RNNs were obtained. In addition, CNN simplifies the learning procedure. Experimental results show that CNN networks can be a good feature extractor for speech recognition in various languages.

کلیدواژه ها:

Speech recognition ، Speech Segmentation ، Convolutional neural networks ، Persian Language ، Artificial intelligence

نویسندگان

Ladan Khosravani pour

Department of Electrical Engineering, South Tehran Branch, Islamic Azad University, Tehran, Iran

Ali Farrokhi

Department of Electrical Engineering, South Tehran Branch, Islamic Azad University, Tehran, Iran