External Plagiarism Detection based on Human Behaviors in Producing Paraphrases of Sentences in English and Persian Languages

سال انتشار: 1398
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 536

فایل این مقاله در 16 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JADM-7-3_010

تاریخ نمایه سازی: 19 تیر 1398

چکیده مقاله:

With the advent of the internet and easy access to digital libraries, plagiarism has become a major issue. Applying search engines is one of the plagiarism detection techniques that converts plagiarism patterns to search queries. Generating suitable queries is the heart of this technique and existing methods suffer from lack of producing accurate queries, Precision and Speed of retrieved results. This research proposes a framework called ParaMaker. It generates accurate paraphrases of any sentence, similar to human behaviors and sends them to a search engine to find the plagiarism patterns. For English language, ParaMaker was examined against six known methods with standard PAN2014 datasets. Results showed an improvement of 34% in terms of Recall parameter while Precision and Speed parameters were maintained. In Persian language, statements of suspicious documents were examined compared to an exact search approach. ParaMaker showed an improvement of at least 42% while Precision and Speed were maintained.

کلیدواژه ها:

نویسندگان

A. Shojaie

Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Najafabad, Iran.| Big Data Research Center, Najafabad Branch, Islamic Azad University, Najafabad, Iran.

F. Safi-Esfahani

Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Najafabad, Iran.| Big Data Research Center, Najafabad Branch, Islamic Azad University, Najafabad, Iran.