CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

A New Method for Stemming in Persian Language Considering Exceptions

عنوان مقاله: A New Method for Stemming in Persian Language Considering Exceptions
شناسه ملی مقاله: SASTECH05_162
منتشر شده در پنجمین کنفرانس بین المللی پیشرفت های علوم و تکنولوژی در سال 1390
مشخصات نویسندگان مقاله:

Somayye Estahbanati - Azad University Science and Research Branch Ahvaz, Iran
Reza Javidan - Islamic Azad University – Beyza Branch
Mashalla Abbasi Dezfooli - Islamic Azad University Science and Research Branch Ahvaz, Iran

خلاصه مقاله:
In this paper a new algorithm for stemming in Farsi language is presented. This stemmer is based on removing the suffixes and prefixes but a database is used to save the exceptions to decrease error rate. In the proposed method the speed of stemmer and also the percentage of errors are improved. The evaluation results on a small Farsi document collection show significant improvement in precision/recall

کلمات کلیدی:
Stemming, algorithm, Farsi language, Persian Language

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/157462/