External Plagiarism Detection based on Human Behaviors in Producing Paraphrases of Sentences in English and Persian Languages
محل انتشار: مجله هوش مصنوعی و داده کاوی، دوره: 7، شماره: 3
سال انتشار: 1398
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 536
فایل این مقاله در 16 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_JADM-7-3_010
تاریخ نمایه سازی: 19 تیر 1398
چکیده مقاله:
With the advent of the internet and easy access to digital libraries, plagiarism has become a major issue. Applying search engines is one of the plagiarism detection techniques that converts plagiarism patterns to search queries. Generating suitable queries is the heart of this technique and existing methods suffer from lack of producing accurate queries, Precision and Speed of retrieved results. This research proposes a framework called ParaMaker. It generates accurate paraphrases of any sentence, similar to human behaviors and sends them to a search engine to find the plagiarism patterns. For English language, ParaMaker was examined against six known methods with standard PAN2014 datasets. Results showed an improvement of 34% in terms of Recall parameter while Precision and Speed parameters were maintained. In Persian language, statements of suspicious documents were examined compared to an exact search approach. ParaMaker showed an improvement of at least 42% while Precision and Speed were maintained.
کلیدواژه ها:
Plagiarism detection ، External plagiarism detection ، Resource retrieval ، Producing paraphrases of sentence
نویسندگان
A. Shojaie
Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Najafabad, Iran.| Big Data Research Center, Najafabad Branch, Islamic Azad University, Najafabad, Iran.
F. Safi-Esfahani
Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Najafabad, Iran.| Big Data Research Center, Najafabad Branch, Islamic Azad University, Najafabad, Iran.