Speech Acts Classification of Persian Language Texts Using Three Machine Leaming Methods
سال انتشار: 1388
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 174
فایل این مقاله در 7 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_ITRC-2-1_007
تاریخ نمایه سازی: 23 فروردین 1401
چکیده مقاله:
The objective of this paper is to design a system to classify Persian speech acts. The driving vision for this work is to provide inteUigent systems such as text to speech, machine translation, text summarization, etc. that are sensitive to the speech acts of the input texts and can pronounce the corresponding intonation correctly. Seven speech acts were considered and ۳ classification methods including (۱) Naive Bayes, (۲) K-Nearest Neighbors (KNN), and (۳) Tree learner were used. The performance of speech act classification was evaluated using these methods including ۱۰- Fold Cross-Validation, ۷۰-۳۰ Random Sampling and Area under ROC. KNN with an accuracy of ۷۲% was shown to be the best classifier for the classification of Persian speech acts. It was observed that the amount of labeled training data had an important role in the classification performance.
کلیدواژه ها:
نویسندگان
Mohammad Mehdi Homayounpour
Lab. for Intelligent Signal and Speech Proc. Department of Computer Engineering and IT Amirkabir University of Technology Tehran, Iran
Arezou Soltani Panah
Lab. for Intelligent Signal and Speech Proc. Department of Computer Engineering and IT Amirkabir University of Technology Tehran, Iran