CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Emotion Recognition for Persian Speech Using Convolutional Neural Network and Support Vector Machine

عنوان مقاله: Emotion Recognition for Persian Speech Using Convolutional Neural Network and Support Vector Machine
شناسه ملی مقاله: JR_COAM-8-2_006
منتشر شده در در سال 1402
مشخصات نویسندگان مقاله:

Saeed Hashemi - ‎Department of Computer Engineering and Information Technology‎, ‎Payame Noor University (PNU)‎, ‎Tehran‎, ‎Iran
Saeed Ayat - ‎Department of Computer Engineering and Information Technology‎, ‎Payame Noor University (PNU)‎, ‎Tehran‎, ‎Iran

خلاصه مقاله:
The paper discusses the limitations of emotion recognition in Persian speech due to inefficient feature extraction and classification tools‎. ‎To address this‎, ‎we propose a new method for detecting hidden emotions in Persian speech with higher recognition accuracy‎. ‎The method involves four steps‎: ‎preprocessing‎, ‎feature description‎, ‎feature extraction‎, ‎and classification‎. ‎The input signal is normalized in the preprocessing step using single-channel vector conversion and signal resampling‎. ‎Feature descriptions are performed using Mel-Frequency Cepstral Coefficients and Spectro-Temporal Modulation techniques‎, ‎which produce separate feature matrices‎. ‎These matrices are then merged and used for feature extraction through a Convolutional Neural Network‎. ‎Finally‎, ‎a Support Vector Machine with a linear kernel function is used for emotion classification‎. ‎The proposed method is evaluated using the Sharif Emotional Speech dataset and achieves an average accuracy of ۸۰.۹% in classifying emotions in Persian speech‎.

کلمات کلیدی:
Emotion recognition in speech‎, ‎Mel-Frequency cepstral coefficients‎, ‎Convolutional neural network‎, ‎Support vector machine

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1844823/