Localization of Multiple Simultaneous Speakers by Combining the Information from Different Subbands
Published in: 21st Iranian Conference on Electrical Engineering
Year of publication: 1392 (Solar Hijri)
Document type: Conference paper
Language: English
The full text of this paper is available as a 6-page PDF.
National scientific document ID:
ICEE21_379
Indexing date: 27 Mordad 1392
Abstract:
Time Difference of Arrival (TDOA)-based algorithms are the main methods for speech source localization. One category of these methods is based on the Generalized Cross Correlation (GCC). These methods estimate the source location from the TDOA calculated between microphone signals. Their accuracy decreases as the amount of noise and reverberation increases. In this paper, we propose the use of subband processing for the localization of two simultaneous speech sources. While conventional methods treat the whole signal spectrum identically in the localization procedure, the proposed method takes advantage of the differences between the frequency bands of the mixed speech to localize multiple speakers. Specifically, the proposed method computes the GCC in different frequency bands and then combines the information from the subbands in a so-called smart manner. We discuss several approaches for combining the subbands. Performance evaluations under different environmental conditions demonstrate the superiority of the proposed method over the fullband GCC method. The proposed method considerably increases the accuracy of simultaneous speaker localization.
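
The idea of computing the GCC per frequency band and then merging the bands can be illustrated with a minimal sketch. The abstract does not specify the combination rule, so the code below assumes one of the simplest options: each subband's GCC-PHAT curve is normalized by its own peak and the curves are summed. The function names (`gcc_phat_bands`, `combine_bands`, `estimate_tdoa`), the band edges, and the sampling rate are hypothetical choices for illustration, not the authors' implementation.

```python
import numpy as np

def gcc_phat_bands(x1, x2, fs, band_edges_hz, max_lag):
    """One GCC-PHAT curve per subband, for lags -max_lag..max_lag (samples).

    Assumes max_lag >= 1. A positive lag means x2 is delayed relative to x1.
    """
    n = 2 * max(len(x1), len(x2))            # zero-pad against circular wrap-around
    X1 = np.fft.rfft(x1, n)
    X2 = np.fft.rfft(x2, n)
    cross = np.conj(X1) * X2                 # cross-power spectrum
    cross /= np.abs(cross) + 1e-12           # PHAT weighting: keep phase only
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    curves = []
    for lo, hi in zip(band_edges_hz[:-1], band_edges_hz[1:]):
        mask = (freqs >= lo) & (freqs < hi)  # ideal band selection (assumption)
        cc = np.fft.irfft(cross * mask, n)
        # Reorder so that index 0 corresponds to lag -max_lag
        curves.append(np.concatenate((cc[-max_lag:], cc[:max_lag + 1])))
    return np.array(curves)

def combine_bands(curves):
    """Assumed combination rule: peak-normalize each band, then sum."""
    peaks = np.max(np.abs(curves), axis=1, keepdims=True) + 1e-12
    return np.sum(curves / peaks, axis=0)

def estimate_tdoa(combined, max_lag):
    """Lag (in samples) of the highest peak of the combined curve."""
    return int(np.argmax(combined)) - max_lag

# Usage: one synthetic noise source, with x2 delayed by 5 samples relative to x1
rng = np.random.default_rng(0)
s = rng.standard_normal(4101)
x1, x2 = s[5:4101], s[:4096]
curves = gcc_phat_bands(x1, x2, fs=16000,
                        band_edges_hz=[0, 1000, 2000, 4000, 8000], max_lag=20)
print(estimate_tdoa(combine_bands(curves), max_lag=20))
```

For two simultaneous speakers, one would look for the two largest peaks rather than a single maximum, and a more selective rule could weight or choose bands according to which speaker dominates each band. Those refinements are left out here because the abstract does not describe them.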
Keywords:
Multi Source Localization – Subband Processing – Generalized Cross Correlation – PHAT filter – DOA
Authors
Ali Dehghan Firoozabadi
Speech Processing Research Lab (SPRL), Electrical and Computer Eng. Dept., Yazd University