Split and rephrase: Simple Syntactic Sentences for NLP applications
Year of publication: 1404
Document type: Journal article
Language: English
The full text of this article is available as a 7-page PDF
National scientific document ID:
JR_JICSE-2-1_011
Indexing date: 24 Ordibehesht 1404
Abstract:
Simplifying compound and complex sentences into simple sentences is crucial for machine understanding in natural language processing (NLP) tasks such as inference, machine translation, and information extraction, and this simplification improves the accuracy of those tasks. Our research is inspired by a text simplification method called "split and rephrase." We introduce a new sequence-to-sequence text generation model that transforms complex Persian sentences into simple ones based on the conjunction "and". By utilizing language models with millions or even billions of parameters, our approach facilitates a better understanding of text complexities and more accurate identification of breaking points. Our results show a BLEU score of 0.47 for the generated simple sentences, which are both grammatically correct and fluent.
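To give a sense of what a BLEU score measures, the following is a minimal sketch of sentence-level BLEU against a single reference: clipped n-gram precision for n = 1 to 4, combined by a geometric mean and multiplied by a brevity penalty. It is an illustration only. The abstract does not state which BLEU implementation, tokenization, or smoothing the authors used, and BLEU is usually reported at corpus level, so the function name `bleu`, the whitespace tokenization, and the English example sentences here are all assumptions.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, reference, max_n=4):
    """Unsmoothed sentence-level BLEU with one reference (illustrative sketch).

    Tokenization is a plain whitespace split, which is an assumption;
    real evaluations normally use a dedicated tokenizer.
    """
    cand, ref = candidate.split(), reference.split()
    if not cand:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts, ref_counts = ngrams(cand, n), ngrams(ref, n)
        # Clipped matches: a candidate n-gram is credited at most as many
        # times as it occurs in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        total = sum(cand_counts.values())
        if overlap == 0 or total == 0:
            # Without smoothing, one empty n-gram order makes the score zero.
            return 0.0
        log_precisions.append(math.log(overlap / total))
    # Brevity penalty: candidates shorter than the reference are penalized.
    bp = 1.0 if len(cand) > len(ref) else math.exp(1 - len(ref) / len(cand))
    return bp * math.exp(sum(log_precisions) / max_n)


if __name__ == "__main__":
    # Hypothetical sentences, not taken from the paper's data.
    print(bleu("the cat sat on the mat", "the cat sat on a mat"))
```

The score lies between 0 and 1, with 1 meaning the candidate matches the reference exactly under this tokenization.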
Authors
Alireza Talebpour
Computer Science and Engineering, Shahid Beheshti University, Tehran, Iran
Ghasem Darzi
Interdisciplinary Studies of Quran, Shahid Beheshti University, Tehran, Iran