CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Presenting a Text Mining Algorithm to Identify Emotion in Persian Corpus

عنوان مقاله: Presenting a Text Mining Algorithm to Identify Emotion in Persian Corpus
شناسه ملی مقاله: JR_JITM-10-2_006
منتشر شده در در سال 1397
مشخصات نویسندگان مقاله:

مسعود گرشاسبی - Research Instructor, Faculty of Iran Telecommunication Research Center, Tehran, Iran
آناهید رئیس روحانی - MSc, Software Engineering, Islamic Azad University, Karaj Branch, Tehran, Iran
محمدرضا کاباران زاده قدیم - Associate Prof. of Management, Islamic Azad University, Central Tehran Branch, Tehran, Iran

خلاصه مقاله:
The literature regarding Persian text mining indicates that most studies are conducted to detect polarity of opinions on social websites. The aim of this research is presenting an algorithm to identify emotion implemented in the text based on the following six main emotions of happiness, sadness, fear, anger, surprise and disgust. In this research, the emotion will be examined based on unsupervised lexicon method. Identifying emotions conveyed by the texts based on a single emotional word will produce low accuracy because the intervening boosters and negating words can influence the emotion of the text too. Therefore, the algorithm has been implemented in six approaches with different features. In the first approach, the algorithm is capable of detecting only one emotional word in a sentence, and then it improves to detect boosters and negating and stop word list as well. The results of running the algorithm on two domains of data showed that the more features used in the algorithm, the more accurate the algorithm becomes and that the most effective factor is part of speech.  

کلمات کلیدی:
Data Mining, Emotion analysis, Sentiment mining, Text mining, Web mining

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1400869/