Comparative Analysis of Link-based and Content-based Methods for Opinion Mining in Persian language

Year of publication: 1397 (Solar Hijri calendar)
Document type: Journal article
Language: English

The full text of this article is available as a 7-page PDF file.

National scientific document ID:

JR_IJWR-1-2_001

Indexing date: 21 Ordibehesht 1399

Abstract:

Twitter provides a convenient platform for expressing feelings and opinions in many areas. Opinion mining on Twitter can be regarded as the study of the overall sentiment of a tweet. Sentiment analysis methods for the Persian language fall into two general categories: link-based methods and content-based methods. In this study, we implement a new link-based method for improving opinion classification in the Persian language. For comparison, we also implement a content-based method using a Naïve Bayes classifier with two different weighting methods: TF/IDF and Chi-Square. The TF/IDF method has given good results in previous studies on Persian. The Chi-Square method has not been used in Persian-language research, but its accuracy on English is fairly good. The results show that the improvement from the language-independent method is remarkable: in this study, the precision of the proposed algorithm for positive and negative comments was 98.87% and 97.87%, and the recall for positive and negative comments was 99.24% and 96.84%, respectively. The results also show that, because of the complexity of Persian syntax and the lack of proper natural language processing tools for Persian, content-based algorithms perform poorly on Persian compared with English.
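As an illustration of two quantities named in the abstract, the following minimal sketch computes a Chi-Square score for a term against a class from a 2x2 document-count table, and per-class precision and recall from gold and predicted labels. It is not the authors' implementation: the function names, the count layout, and the toy labels are assumptions made for this example only.

```python
def chi_square(n11, n10, n01, n00):
    """Chi-Square statistic for a 2x2 term/class contingency table.

    Hypothetical count layout (an assumption of this sketch):
      n11: documents in the class that contain the term
      n10: documents outside the class that contain the term
      n01: documents in the class that lack the term
      n00: documents outside the class that lack the term
    """
    n = n11 + n10 + n01 + n00
    denom = (n11 + n01) * (n11 + n10) * (n10 + n00) * (n01 + n00)
    if denom == 0:
        # A row or column of the table is empty, so the term carries no signal.
        return 0.0
    return n * (n11 * n00 - n10 * n01) ** 2 / denom


def precision_recall(gold, predicted, label):
    """Precision and recall for one class label (e.g. "pos" or "neg")."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == label and p == label)
    fp = sum(1 for g, p in pairs if g != label and p == label)
    fn = sum(1 for g, p in pairs if g == label and p != label)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


if __name__ == "__main__":
    # Toy labels, invented for illustration; they are not data from the paper.
    gold = ["pos", "pos", "neg", "neg"]
    predicted = ["pos", "neg", "neg", "neg"]
    print(precision_recall(gold, predicted, "pos"))
    print(chi_square(10, 0, 0, 10))
```

A term with a high Chi-Square score is distributed unevenly across classes, which is why such scores are commonly used to weight or select features before training a classifier such as Naïve Bayes.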

Authors

Niloofar Allahkaram

Islamic Azad University, Science and Research Branch

Alireza Yari

Iran Telecom Research Center