Split and rephrase: Simple Syntactic Sentences for NLP applications

سال انتشار: 1404
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 183

فایل این مقاله در 7 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JICSE-2-1_011

تاریخ نمایه سازی: 24 اردیبهشت 1404

چکیده مقاله:

Abstract—In today's world, simplifying compound and complex sentences into simple sentences is crucial for enhancing machine understanding in various natural language processing (NLP) tasks, such as inference, machine translation, and information extraction. This simplification process improves accuracy. Consequently, our research is inspired by a text simplification method called "split and rephrase." We introduce a new sequence-to-sequence text generation model that transforms complex sentences into simple ones based on the conjunction "and" in Persian. By utilizing linguistic models with millions or even billions of parameters, our approach facilitates a better understanding of text complexities and more accurate identification of breaking points. Our results show an output accuracy of ۰.۴۷ in the BLEU score for the generated simple sentences, which are both grammatically correct and fluent. By utilizing linguistic models with millions or even billions of parameters, our approach facilitates a better understanding of text complexities and more accurate identification of breaking points. Our results show an output accuracy of ۰.۴۷ in the BLEU score for the generated simple sentences, which are both grammatically correct and fluent.

نویسندگان

Alireza Talebpour

Computer Science and Engineering Shahid Beheshti University Tehran, Iran

Ghasem Darzi

Interdisciplinary Studies of Quran Shahid Beheshti University Tehran, Iran