Discrepancies Detection in Arabic and English Documents
Publication year: 1394 (Solar Hijri)
Document type: Journal article
Language: English
The full text of this article is available as a 7-page PDF file.
National scientific document ID:
JR_ACSIJ-4-5_011
Indexing date: 7 Azar 1394
Abstract:
This paper analyzes and compares the results of methods for detecting discrepancies in English and Arabic documents, based on character n-gram profiles (the set of normalized character n-gram frequencies of a text). English and Arabic texts were analyzed with respect to a number of statistical characteristics. We cover some statistical differences between the two languages and apply several heuristics to measure the dissimilarity of text parts. For each text, the results can indicate whether it deserves closer attention, that is, whether its parts appear to have been written by the same author. We evaluate several Arabic and English documents and show which of their parts contain discrepancies and need further analysis for plagiarism detection. The analysis depends on parameters selected in experiments.
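As a rough illustration of the character n-gram profile defined in the abstract, the Python sketch below builds a profile and scores sliding windows of a document against the profile of the whole text. The abstract does not specify the dissimilarity heuristics or the parameter values, so the function names, the `dissimilarity` formula, and the defaults for `n`, `window`, and `step` are all assumptions made for this example, not the authors' method.

```python
from collections import Counter


def ngram_profile(text, n=3):
    """Map each character n-gram of `text` to its normalized frequency."""
    grams = [text[i:i + n] for i in range(len(text) - n + 1)]
    if not grams:
        return {}
    counts = Counter(grams)
    return {g: c / len(grams) for g, c in counts.items()}


def dissimilarity(part, whole):
    """Illustrative measure (an assumption, not taken from the paper).

    Averages the squared relative frequency difference over the n-grams
    of `part`. The result is 0.0 when the frequencies agree and 1.0 when
    none of the part's n-grams occur in `whole`.
    """
    if not part:
        return 0.0
    total = 0.0
    for gram, f_part in part.items():
        f_whole = whole.get(gram, 0.0)
        # f_part > 0 for every key, so the denominator is never zero.
        total += ((f_part - f_whole) / (f_part + f_whole)) ** 2
    return total / len(part)


def window_scores(text, n=3, window=200, step=100):
    """Score each sliding window of `text` against the whole document."""
    whole = ngram_profile(text, n)
    scores = []
    for start in range(0, max(len(text) - window, 0) + 1, step):
        part = ngram_profile(text[start:start + window], n)
        scores.append((start, dissimilarity(part, whole)))
    return scores
```

Windows whose score stands out from the rest of the document would be the candidates for closer inspection. Because Python strings are sequences of Unicode code points, the same functions accept English and Arabic text, although Arabic diacritics count as separate characters unless they are removed beforehand.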
Authors:
Abdulwahed Almarimi
Institute of Computer Science, Faculty of Science, P. J. Šafárik University in Košice, 04001 Košice, Slovakia
Gabriela Andrejkova
Institute of Computer Science, Faculty of Science, P. J. Šafárik University in Košice, 04001 Košice, Slovakia