Empirical Evaluation of Well-known Farsi OCR Engines on the IDPL-PFOD Dataset
سال انتشار: 1404
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 263
فایل این مقاله در 13 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_ECE-2-1_006
تاریخ نمایه سازی: 15 اردیبهشت 1404
چکیده مقاله:
Optical character recognition (OCR), also referred to as text recognition, extracts text from scanned documents and camera images. OCR has numerous applications in reading forms and cheques, converting archived documents to digital files, and reading books and papers. An accurate OCR system speeds up these processes by eliminating time-consuming manual tasks. Despite advancements, OCR remains challenging for languages like Farsi due to their complex structures and lack of diverse datasets for evaluation. IDPL-PFOD, a new synthetic Farsi dataset, addresses this gap by including various backgrounds, fonts, distortions, and blurs. This paper evaluates Tesseract and EasyOCR on IDPL-PFOD to highlight their limitations. Tesseract and EasyOCR achieve ۸۴.۴۱% and ۷۳.۲۸% overall accuracy, respectively. Tesseract outperforms EasyOCR in most cases but drops to ۶۷.۱۷% on textured backgrounds, while EasyOCR achieves ۷۵.۸%. Tesseract reaches its peak accuracy (۹۲.۴۵%) on plain backgrounds and shows robustness to Gaussian blur and slope distortion, with minimal accuracy drops. In contrast, EasyOCR performs poorly on salt-and-pepper noise (۴.۷۹%), and moderately on sinewave distortion and Gaussian blur (۶۱.۴۵% and ۶۳.۷۳%). These findings emphasize the need for training OCR engines on more diverse datasets to improve their performance under real-world conditions.
کلیدواژه ها:
IDPL-PFOD dataset ، optical character recognition (OCR) ، EasyOCR ، Tesseract ، OCR engines ، Farsi OCR
نویسندگان
Fatemeh-sadat Hosseini
Department of Electrical Engineering, Shahid Bahonar University of Kerman, Kerman, Iran
Elham Shabaninia
Department of Applied mathematics, Graduate University of Advanced Technology, Kerman, Iran
Hossein Nezamabadi-pour
Department of Electrical Engineering, Shahid Bahonar University of Kerman, Kerman, Iran