Script-Independent Handwritten Text line Segmentation Using Directional ۲D Filters
سال انتشار: 1399
نوع سند: مقاله ژورنالی
زبان: فارسی
مشاهده: 110
فایل این مقاله در 16 صفحه با فرمت PDF قابل دریافت می باشد
- صدور گواهی نمایه سازی
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_JSCIT-9-1_006
تاریخ نمایه سازی: 25 مهر 1403
چکیده مقاله:
Text line segmentation is an important stage of the optical character recognition (OCR) algorithms. To analyze and recognize a document, text lines have to be segmented accurately. Text line segmentation of handwritten documents is more difficult than that of machine-printed ones. Curved and multi-skewed text lines, overlapping text lines, and very small text lines are the main challenges. Most of the proposed approaches did not consider local features of text lines in a document image. In our proposed method, both global and local features are considered. The proposed method is based on using directional ۲D anisotropic filters. The parameters of our method are tuned based on a main global parameter which is computed for each document, separately. Hence, the proposed method is a dataset-independent method. A document is divided into several blocks for which some local characteristics are calculated. In each block, text regions are detected by using local characteristics such as the block skew. In order to estimate the skew of text regions in a block, a novel text block skew estimation algorithm is proposed in this paper. Experimental results show that the proposed method outperforms all the state-of-the-art methods on three standard datasets. Our final F-Measure are ۰.۵۴%, ۰.۰۳%, and ۰.۰۲% greater than the winner of ICDAR۲۰۱۳ text line segmentation contests on ICDAR۲۰۱۳, ICDAR۰۹, and HIT-MW datasets, respectively. The experiments proved that the proposed method can accurately segment text lines of complicated handwritings.
کلیدواژه ها:
نویسندگان
Majid Ziaratban
Department of Electrical Engineering, Faculty of Engineering, Golestan University, Gorgan, Iran.