CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Comparative Analysis of Link-based and Content-based Methods for Opinion Mining in Persian language

عنوان مقاله: Comparative Analysis of Link-based and Content-based Methods for Opinion Mining in Persian language
شناسه ملی مقاله: JR_IJWR-1-2_001
منتشر شده در شماره 2 دوره 1 فصل در سال 1397
مشخصات نویسندگان مقاله:

Niloofar Allahkaram - Islamic Azad University, Science and Research Branch
Alireza Yari - Iran telecom research center

خلاصه مقاله:
Twitter has provided a convenient platform to express feelings and opinions in different areas. Opinion mining in Twitter can be considered as studying the overall sentiment of a tweet. There are two general categories of sentiment analysis methods in the Persian language, linked-base methods and, content-based methods. In this study, we implement a new link-based method for improving opinion classification in the Persian language. To compare with the content-based method, we implement a content-based method using Naïve Bayes Method with two different weighting Methods: TF/IDF and Chi-Square. The TF/IDF method has good results in previous Persian language studies. The Chi-Square method has not been used in the Persian language researches, but the accuracy is fairly good in English. The results show that the improvement in the language-independent methods is remarkable and is in accordance with this research, the precision of the proposed algorithm for positive and negative comments was 98.87% and 97.87%, and the recall value for positive and negative comments was 99.24% and 96.84% respectively. The results also show that because of complexities in Persian syntax and lack of proper natural language processing tools in Persian, content-based algorithms operate poorly compared to English.

کلمات کلیدی:
Opinion mining, Content-Based, Link-Based, Twitter

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1013621/