A New VAD Algorithm using Sparse Representation and Updated Dictionary in Spectrogram Domain

Mohadeseh Eshaghi

A New VAD Algorithm using Sparse Representation and Updated Dictionary in Spectrogram Domain

محل انتشار: مجله سیستم های دینامیکی کاربردی و کنترل، دوره: 4، شماره: 1

سال انتشار: 1400

نوع سند: مقاله ژورنالی

زبان: انگلیسی

مشاهده: 235

فایل این مقاله در 11 صفحه با فرمت PDF قابل دریافت می باشد

دریافت فایل کامل مقاله

صدور گواهی نمایه سازی
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

https://civilica.com/doc/1772568

شناسه ملی سند علمی:

JR_JADSC-4-1_008

تاریخ نمایه سازی: 15 مهر 1402

چکیده مقاله:

This article proposes the new VAD (Voice Activity Detection) method was made using Spectrogram Domain (Spectro-Temporal Response Field) space based on sparse representation. Spectrogram Domain components have two dimensions of time and frequency. On the other hand, using sparse representation in learning dictionaries of speech and noise and updating dictionaries, causes better separation of speech and noise segments. In this algorithm, using auditory spectrogram and sparse representation, an updating dictionaries with different atom sizes and K-SVD (k-means clustering method) and NMF (non-negative matrix factorization) learning methods were constructed and the results indicate that this method works well. For example, the proposed VAD performance was obtained in SNRs greater than ۰dB is more than ۹۲.۷۱% and ۹۱.۲۱% in White noise and Car noise respectively, which shows the good performance of the proposed VAD compared to other methods. By comparing the NDS and MSC evaluation parameters with other methods, the results show better performance of the proposed method.

کلیدواژه ها:

Spectro-Temporal Response Field ، Voice Activity Detection (VAD) ، sparse representation ، updating dictionaries

نویسندگان

Mohadeseh Eshaghi

Department of Electrical Engineering, Nowshahr Branch, Islamic Azad University, Nowshahr, Iran

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :

R. Johny Elton, J. Mohanalin and P. Vasuki,“A novel voice ...
C.T. Hsieh, P.Y. Huang, T.W. Chen and Y. Chen,“Speech enhancement ...
G. Martin, A. Abeer, E. Dan and et al.,“All for ...
M. Kolbæk, Zh. Tan , S. Jensen and J. Jensen,“on ...
M. Eshaghi,F. Razzazi and A. Behrad,“A New VAD Algorithm using ...
M. Mirbagheri, N. Mesgarani, and Sh. Shamma,“Nonlinear filtering of spectro-temporal ...
N. Mesgarani, S. David, and S.A. Shamma, “Representation of phoneme ...
M. Eshaghi, F. Razzazi and A. Behrad,“A voice activity detection ...
W. Li, Y. Zhou, N. Poh, F. Zhou, and Q. ...
C. Mart´ınez, J. Goddardb, D. Milone, and H. Rufiner,“sparse spectro-temporal ...
M. Elad,“Sparse and redundant representations: from theory to applicationsin signal ...
R. Rubinstein, A. M. Bruckstein and M. Elad,“Dictionaries for sparserepresentation ...
M. Wei, Zh. Liu, X. Chen and H. Zhao,“Speech enhancement ...
K. Kreutz-Delgado, J.F. Murray, B.D. Rao, K. Engan, T. Lee ...
P. O. Hoyer,“Non-negative matrix factorization with sparseness con-straints,”The Journal of ...
M. Aharon, M. Elad, and A. Bruckstein,“K-svd: A algorithm for ...
R. Zdunek, and A. Cichocki,“Non-negative matrix factorization with quadratic programming,”Neural ...
G. H. Mohimani, M. Babaie-Zadeh and Ch. Jutten,“A fast approach ...
M. S. Lewicki and T. J. Sejnowski,“Learning overcomplete represen-tations,” Neural ...
Z. Jiang, G. Zhang, and L. S. Davis,“Submodular dictionary learn-ing ...
J.F. Gemmeke, H.V. Hamme, B. Cranen and L. Boves ,“Compressive ...
W. M. Fisher, G. R. Doddington, M. Goudie and M. ...
A. Varga, H. J. M. Steeneken, M. Tomlinson and D. ...
J. McLoughlin,“Super-Audible Voice Activity Detection,” IEEE Transactions on Speech and ...
P.K. Ghosh, A. Tsiartas and S. Narayanan,“Robust voice activity detection ...
J. Sohn, N. S. Kim and W. Sung,“A statistical model-based ...
A. Benyassine, E. Shlomot, H. Y. Su, D. Massaloux, C. ...
N. Mesgarani and Sh. Shamma,“Denoising in the Domain of Spectro-temporal ...
L. N. Tan, B. J. Borgstrom, and A. Alwan,“Voice activity ...
J. Ramirez, J. Segura, C. Benitez, A. Torre and A. ...
M. Yanna and A. Nishihara,“Efficient voice activity detection algorithm using ...
X.K Yang, L. He, D. Qu and W. Q.Zhang,“Voice activity ...

نمایش کامل مراجع