基于最大后驗相位估計的多帶譜減語音增強(qiáng)算法

李真; 吳文錦; 張勤; 任慧

doi:10.11999/JEIT161381

基于最大后驗相位估計的多帶譜減語音增強(qiáng)算法

doi: 10.11999/JEIT161381 cstr: 32379.14.JEIT161381

基金項目:

十二五國家科技支撐計劃重大項目(2012BAH38F00)

計量
- 文章訪問數(shù): 1358
- HTML全文瀏覽量: 217
- PDF下載量: 277
- 被引次數(shù): 0
出版歷程
- 收稿日期: 2016-12-21
- 修回日期: 2017-04-25
- 刊出日期: 2017-09-19

Multi-band Spectral Subtraction of Speech Enhancement Based on Maximum Posteriori Phase Estimation

Funds:

The National Science and Technology Planning?Project (2012BAH38F00)

摘要

摘要: 傳統(tǒng)語音增強(qiáng)算法中因為譜減法算法簡單易于實現(xiàn)而得到廣泛研究，譜減法的原理是將帶噪語音幅度與估計的噪聲幅度進(jìn)行相減，并疊加帶噪語音相位，進(jìn)而重構(gòu)增強(qiáng)語音譜。該方法在低信噪比下因為沒有進(jìn)行相位估計，會存在較大的估計誤差，并且因為對噪聲估計的不準(zhǔn)確，會產(chǎn)生音樂噪聲?；谧V減法的缺點該文提出一種基于最大后驗相位估計的多帶譜減法，其中多帶譜減法可減少音樂噪聲的影響，最大后驗方法估計純凈語音相位，可以減少在低信噪比時的估計誤差。實驗結(jié)果表明該方法在低信噪比時取得了較好的增強(qiáng)效果。
- 語音增強(qiáng) /
- 最大后驗相位估計 /
- 多帶譜減 /
- 低信噪比
Abstract: The spectral subtraction speech enhancement is extensively used due to its simplicity and easy to implement. The principle of this method is to subtract the estimated magnitude of the noise from the magnitude of the noisy signal, but the phase of the noisy signal is unchanged. This conventional method produces the estimating error because it exploits the noisy phase, especially in low SNR, and it produces musical noise because of the inaccuracy of the noise estimation. This paper proposes a multi-band spectral subtraction algorithm based on maximum posteriori phase estimation. Experimental results show that the proposed method can get better performance than the conventional method especially in low SNR.
- Speech enhancement /
- Maximum posteriori phase estimation /
- Multi-band spectral subtraction /
- Low SNR

HTML全文

參考文獻(xiàn)(14)

WJCICKI K, MILACIC M, STARK A, et al. Exploiting conjugate symmetry of the short-time fourier spectrum for speech enhancement[J]. IEEE Signal Processing Letters, 2008, 15: 461-464. doi: 10.1109/LSP.2008.923579.

WANG Jiaching, LIN Changhong, WANG Shufan, et al. Compressive Sensing-based speech enhancement[J]. IEEE Transactions on Audio, Speech and Language Processing, 2016, 24(11): 2122-2131. doi: 10.1109/TASLP.2016.2598306.

MOWLAEE P and KULMER J. Harmonic phase estimation in single-channel speech enhancement using phase decomposition and SNR information[J]. IEEE Transactions on Audio, Speech and Language Processing, 2015, 23(9): 1521-1532. doi: 10.1109/TASLP.2015.2439038.

KULMER J and MOWLAEE P. Phase estimation in single channel speech enhancement using phase decomposition[J]. IEEE Signal Processing Letters, 2015, 22(5): 598-602. doi: 10.1109/LSP.2014.2365040.

BOLLS F. Suppression of acoustic noise in speech using spectral subtraction [J]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1979, 27(2): 113-120. doi: 10.1109/TASSP.1979.1163209.

WIENER N. The Extrapolation, Interpolation, and Smoothing of Stationary Time Series With Engineering Applications[M]. Cambridge: Massachusetts, MIT, 1949: 81-101.

EPHRAIM Y and MALAH D. Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator[J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1984, 32(6): 1109-1121. doi: 10.1109/ TASSP.1984.1164453.

KAMATH S and LOIZOU P C. A multi-band spectral subtraction method for enhancing speech corrupted by colored noise[C]. IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, FL, USA, 2002: IV-4164-IV-4164.

VARY P. Noise Suppression by spectral magnitude estimation: Mechanism and theoretical limits[J]. Signal Processing, 1985, 8(4): 387-400. doi: 10.1016/0165-1684(85) 90002-7.

SAMUI S. Improved single channel phase-aware speech enhancement technique for low signal-to-noise ration signal[J]. IET Signal Processing, 2016, 10(6): 641-650. doi: 10.1049/ iet-spr.2015.0182.

KULMER J and MOWLAEE P. Harmonic phase estimation in single-channel speech enhancement using Von Mises distribution and prior SNR[C]. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, 2015: 5063-5067.

KAY S M. Fundamentals of Statistical Signal Processing, Volume I: Estimation Theory[M]. New Jersey: Prentice Hall PTR, 1993: 164-172.

TAAL C H, HENDRIKS R C, HEUSDENS R, et al. An algorithm for intelligibility prediction of time-frequency weighted noisy speech[J]. IEEE Transactions on Audio, Speech and Languages, 2011, 19(7): 2125-2136. doi: 10.1109/ TASL.2011.2114881.

GAICH A and MOWLAEE P. On speech quality estimation of phase-aware single-channel speech enhancement[C]. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP), Brisbane, Australia, 2015: 216-220.

相關(guān)文章

施引文獻(xiàn)

資源附件(0)

訪問統(tǒng)計