An End-to-End Deep Learning Model to Recognize Farsi Speech from Raw Input

Automatic speech recognition systems usually approach the task by dividing it into independent stages. First, they extract speech features; then they use an acoustic model to obtain phoneme probabilities, and from those probabilities they derive the sequence of recognized words. Recent advances, especially in applying deep neural networks to speech recognition, show that this division is not necessary and that a sequence of alphabet letters can be obtained directly from the raw signal. In this work, we implemented and tested an end-to-end convolutional neural network system with raw input for Farsi speech recognition and compared its performance to that of another system that uses MFCC features. We show that an end-to-end system with our configuration, which derives a series of phonemes from raw speech, works better for Farsi speech, as it does for English.
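
To make the idea of a raw-input front end concrete, the following is a minimal sketch of a single 1-D convolution layer applied directly to waveform samples, in place of a hand-crafted feature step such as MFCC extraction. It is an illustration only and not the configuration used in the paper: the function name `conv1d_relu`, the toy signal, the kernel values, and the stride are all assumptions chosen for readability. In a trained network the kernels would be learned rather than fixed.

```python
from typing import List, Sequence


def conv1d_relu(
    signal: Sequence[float],
    kernels: Sequence[Sequence[float]],
    stride: int,
) -> List[List[float]]:
    """Slide each kernel over the raw waveform and apply ReLU.

    Returns one feature vector per frame, with one value per kernel.
    (Hypothetical helper for illustration; not taken from the paper.)
    """
    width = len(kernels[0])
    if any(len(k) != width for k in kernels):
        raise ValueError("all kernels must have the same width")
    frames = []
    for start in range(0, len(signal) - width + 1, stride):
        window = signal[start:start + width]
        # Dot product of the window with each kernel, clipped at zero (ReLU).
        frames.append(
            [max(0.0, sum(w * x for w, x in zip(k, window))) for k in kernels]
        )
    return frames


# Toy raw waveform and two hand-picked kernels (assumed values).
signal = [0.0, 1.0, 0.0, -1.0, 0.0, 1.0, 0.0, -1.0]
kernels = [
    [1.0, 0.0, -1.0],  # responds to a falling slope
    [0.5, 0.5, 0.5],   # local average
]
features = conv1d_relu(signal, kernels, stride=1)
```

Each entry of `features` is a frame-level vector that later layers could map to phoneme or letter scores. An MFCC-based system would instead compute its frame-level features with a fixed signal-processing pipeline before any learning takes place.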

Authors

Sina Alisamir

Seyed Mohammad Ahadi

Sanaz Seyedin

This article is classified under the following subject areas:

Artificial Intelligence > Deep Learning

Permanent link to this article:

https://civilica.com/doc/842943

National scientific document ID:

SPIS04_028

Indexing date: 16 Ordibehesht 1398 (6 May 2019)

How to cite this article:

If you wish to cite this article in your own research, you can use the following entry in your references section:

Alisamir, Sina and Ahadi, Seyed Mohammad and Seyedin, Sanaz, 1397, An End-to-End Deep Learning Model to Recognize Farsi Speech from Raw Input, 4th Iranian Conference on Signal Processing and Intelligent Systems, Tehran, https://civilica.com/doc/842943

Within the text, wherever a statement or finding from this article is mentioned, the following details are given in parentheses after it.
First mention: (1397, Alisamir, Sina; Seyed Mohammad Ahadi and Sanaz Seyedin)
Second and subsequent mentions: (1397, Alisamir; Ahadi and Seyedin)