Proposing a New Speech Enhancement Method based on Spectral Subtraction and Binary Masking With a Bank of 128 Gamma-tone Filters

  • سال انتشار: 1394
  • محل انتشار: مجله پیشرفت در تحقیقات کامپیوتری، دوره: 6، شماره: 3
  • کد COI اختصاصی: JR_JACR-6-3_007
  • زبان مقاله: انگلیسی
  • تعداد مشاهده: 583
دانلود فایل این مقاله

نویسندگان

Mozhgan Monjizadeh

M. Sc. Student, Department of Computer Engineering and Information Technology, Payame Noor University, Iran

Saeed Ayat

Associate Professor, Department of Computer Engineering and Information Technology, Payame Noor University, Iran

چکیده

In order to the enhancement of the quality of speech corrupted by additive noise, a speech enhancement method has been put forward based on the combination of spectral subtraction and binary masking. Spectral subtraction is a powerful method for removing noise from speech and binary masking provides essential elements to be used in monaural speech segregation. In the proposed combined method, first, spectral subtraction is used for reduction of noise in noisy speech and then binary masking is used for monaural speech segregation from musical noise introduced by spectral subtraction. The binary masking method, isolates the basic principle of target voice from other signals by using time frequency decomposition, energy masking and unites grouping. This masking is like what the human ear does in noisy environment. In our implementation for binary masking, an auditory filter (Gamma-tone) is divided into different frequency sub-bands. From these sub-band channels, channels 1,2,4,8,16,32,64,128 have been used from this bank of 128 Gamma-tone filter for implementing the binary masking. Evaluations show that the proposed combined method can improve the signal to noise ratio from 5 to 19 db for experimented signals and have better performance than binary masking or spectral subtraction in most situations, especially when noise and speech have not similar power spectrum.

کلیدواژه ها

speech enhancement, spectral subtraction, binary masking, Gamma-tone filter bank, musical noise

مقالات مرتبط جدید

اطلاعات بیشتر در مورد COI

COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.

کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.