Language Recognition By Convolutional NeuralNetworks

سال انتشار: 1402
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 128

فایل این مقاله در 7 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

CRIAL01_045

تاریخ نمایه سازی: 7 مرداد 1403

چکیده مقاله:

Speech recognition representing acommunication between computers and human as a sub fieldof computational linguistics or natural language processing hasa long history. Automatic Speech Recognition (ASR), Text toSpeech (TTS), speech to text, Continuous Speech Recognition(CSR), and interactive voice response systems are differentapproaches to solving problems in this area. The performanceimprovement is partially attributed to the ability of the DeepNeural Network (DNN) to model complex correlations inspeech features. In this paper, unlike the use of conventionalmodel for sequential data like voice that employs RecurrentNeural Network (RNNs) with the emergence of differentarchitectures in deep networks and good performance ofConventional Neural Networks (CNNs) in image processingand feature extraction, the application of CNNs was developedin other domains. It was shown that prosodic features forPersian language could be extracted via CNNs forsegmentation and labeling speech for short texts. By using ۱۲۸and ۲۰۰ filters for CNN and special architectures, ۱۹.۴۶ errorin detection rate and better time consumption than RNNs wereobtained. In addition, CNN simplifies the learning procedure.Experimental results show that CNN networks can be a goodfeature extractor for speech recognition in various languages.

نویسندگان

Ladan Khosravani pour

Department of Electrical EngineeringSouth Tehran Branch, Islamic AzadUniversityTehran, Iran

Ali Farrokhi

Department of Electrical EngineeringSouth Tehran Branch, Islamic AzadUniversityTehran, Iran