CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

The First Persian Context Sensitive Spell Checker

عنوان مقاله: The First Persian Context Sensitive Spell Checker
شناسه ملی مقاله: JR_ITRC-2-2_006
منتشر شده در در سال 1389
مشخصات نویسندگان مقاله:

Heshaam Faili - School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
Mohammad Azadnia - Information Technology Department Iran Telecom Research Center Tehran, Iran

خلاصه مقاله:
In this article an attempt to introduce the first Persian context sensitive spell checker, which tries to detect and correct the :eat-word spelling error of Persian text is presented. The proposed method is a statistical approach which uses Bayesian framework as its probabilistic model and also uses mutual information metric as a semantic relatedness measure between different Persian words. Our experiments on sample test data, shows that accuracy of correction method is about ۸۰% with respect to Fl-measure.

کلمات کلیدی:
Persian language, real-word error, Bayesian framework, mutual information

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1426615/