A New Method for Stemming in Persian Language Considering Exceptions
عنوان مقاله: A New Method for Stemming in Persian Language Considering Exceptions
شناسه ملی مقاله: SASTECH05_162
منتشر شده در پنجمین کنفرانس بین المللی پیشرفت های علوم و تکنولوژی در سال 1390
شناسه ملی مقاله: SASTECH05_162
منتشر شده در پنجمین کنفرانس بین المللی پیشرفت های علوم و تکنولوژی در سال 1390
مشخصات نویسندگان مقاله:
Somayye Estahbanati - Azad University Science and Research Branch Ahvaz, Iran
Reza Javidan - Islamic Azad University – Beyza Branch
Mashalla Abbasi Dezfooli - Islamic Azad University Science and Research Branch Ahvaz, Iran
خلاصه مقاله:
Somayye Estahbanati - Azad University Science and Research Branch Ahvaz, Iran
Reza Javidan - Islamic Azad University – Beyza Branch
Mashalla Abbasi Dezfooli - Islamic Azad University Science and Research Branch Ahvaz, Iran
In this paper a new algorithm for stemming in Farsi language is presented. This stemmer is based on removing the suffixes and prefixes but a database is used to save the exceptions to decrease error rate. In the proposed method the speed of stemmer and also the percentage of errors are improved. The evaluation results on a small Farsi document collection show significant improvement in precision/recall
کلمات کلیدی: Stemming, algorithm, Farsi language, Persian Language
صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/157462/