Extract features from Persian speech signal for emotion recognition

سال انتشار: 1400
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 244

فایل این مقاله در 6 صفحه با فرمت PDF قابل دریافت می باشد

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

ITCT12_106

تاریخ نمایه سازی: 7 شهریور 1400

چکیده مقاله:

Establishing verbal communication between man and machine, as well as understanding the human emotions by the machine and providing an appropriate response to it requires understanding the speech and emotions of the speaker, in this regard, research in the field of emotion recognition in speech signal is necessary. In this paper, an attempt is made to design and implement a system to determine and detect emotions of anger and happiness in the Persian speech signal. Research on recognizing some emotions has been done in most languages, but due to the difficulty of creating a speech database, so far little research has been done to identify emotions in Persian speech. In this article, due to the lack of a suitable database in Persian to detect emotions, at first, a database for moods of happiness and anger and neutral (without any emotion) in Persian, consisting of ۷۲۰ sentences was created. Then the frequency characteristics of speech signals obtained from Fourier transform such as maximum, minimum, median and mean as well as LPC coefficients were extracted. Then, the MLP neural network was used to detect emotions of happiness and anger with an average accuracy of ۸۷.۷۴%.

کلیدواژه ها:

نویسندگان

Seyed Mehdi Hoseini

Department of Computer Science, University of Mazandaran, Babolsar, Iran