Text Mining of a Classic Novel Using Machine Learning Techniques

سال انتشار: 1402
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 44

فایل این مقاله در 5 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

ICIORS16_354

تاریخ نمایه سازی: 2 اسفند 1402

چکیده مقاله:

Nowadays, there is an abundance of textual data available for analysis. Common applications of text analysis include sentiment analysis of user comments and differentiating between legitimate and spam emails. However, text mining for extracting insights from novels remains relatively rare. Since novels represent valuable resources, simplifying the process of comprehending novels can offer significant benefits. This paper focuses on the text of the renowned novel called "Anne of Green Gables". A variety of machine learning algorithms, including natural language processing techniques, are applied to discover valuable insights from the text. Our analysis encompasses identifying the most frequently occurring words and their associated parts of speech in this novel, utilizing Named Entity Recognition (NER) to detect proper nouns, employing data visualization to enhance understanding, and extracting a summary of the part of this novel. This study showcases the informative potential of employing machine learning techniques in the analysis of literary works.

کلیدواژه ها:

Text mining ، Machine learning ، Natural language processing (NLP) ، Visualization

نویسندگان

Mahshad Haghi

Department of Industrial & Systems Engineering, Tarbiat Modares University, Tehran, Iran