CIVILICA We Respect the Science
(Specialized publisher of the country's conferences / Publication license number from the Ministry of Culture and Islamic Guidance: 8971)

A New Segmentation Method for Persian/Arabic OCR Based on Baseline Processing

Paper title: A New Segmentation Method for Persian/Arabic OCR Based on Baseline Processing
National paper ID: JR_MJEE-3-3_006
Published in 1388 (Solar Hijri calendar)
Author details:

Mahboubeh Shamsi - Islamic Azad University Bardsir
Reza Rasouli - Islamic Azad University Bardsir
Soudeh Shadravan - Islamic Azad University Bardsir

Abstract:
Segmentation is one of the most important stages in character recognition systems, because any segmentation mistake propagates to all later tasks, especially character recognition. The operation is more complex for Persian/Arabic writing than for Latin-script writing such as English, and it remains an area of ongoing research. The earlier algorithms on which the proposed algorithm is based show 85% accuracy. This paper presents a new, improved method derived from an analysis of the visual features of Persian/Arabic script. The proposed algorithm segments existing fonts with up to 98.5% accuracy, and reaches 100% in some cases. The remaining errors could be corrected by applying a good character recognition technique and a precise vocabulary.
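The abstract does not spell out the steps of the proposed method, so the following is only a minimal sketch of the general idea behind baseline-driven segmentation, not the authors' algorithm. It assumes a binarized text-line image given as a list of rows (1 = ink, 0 = background), takes the row with the most ink as the baseline, and marks as candidate cut points the columns whose ink lies entirely on the baseline, since connected Persian/Arabic letters usually join there. The function names, the `band` parameter, and the toy image are all hypothetical.

```python
def find_baseline(image):
    """Return the index of the row with the most ink pixels (1 = ink)."""
    row_sums = [sum(row) for row in image]
    return max(range(len(row_sums)), key=row_sums.__getitem__)


def candidate_cut_columns(image, baseline, band=0):
    """Return columns whose ink lies entirely within `band` rows of the baseline.

    Such columns contain only the connecting stroke, so they are
    plausible places to split one letter from the next.
    """
    cuts = []
    for x in range(len(image[0])):
        ink_rows = [y for y, row in enumerate(image) if row[x]]
        if ink_rows and all(abs(y - baseline) <= band for y in ink_rows):
            cuts.append(x)
    return cuts


# Toy line image (an assumption for illustration): two vertical
# strokes joined by a horizontal connecting stroke on row 2.
image = [
    [0, 1, 0, 0, 0, 1, 0],
    [0, 1, 0, 0, 0, 1, 0],
    [1, 1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0, 0],
]

baseline = find_baseline(image)
print(baseline, candidate_cut_columns(image, baseline))
```

A real system would also need to merge adjacent candidate columns into a single cut and handle dots, diacritics, and letters that dip below the baseline, which this sketch ignores.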

Keywords:
image processing, Persian OCR, Arabic OCR, segmentation, recognition, smoothing, baseline method

Paper page and full-text download: https://civilica.com/doc/1795458/