一種新的用于語(yǔ)音主觀質(zhì)量評(píng)價(jià)的譜失真參數(shù)
A NEW PARAMETER OF SPECTRAL DISTORTION FOR PREDICTING SUBJECTIVE QUALITY OF SPEECH
-
摘要: 該文分析和討論了各種語(yǔ)音主觀質(zhì)量評(píng)價(jià)的客觀方法,提出了一種考慮了人耳的屏蔽效應(yīng),且正比于人耳聽覺的Bark域譜失真參數(shù)PBSD(Perception-based Bark Spectral Distortion),用來(lái)映射語(yǔ)音的主觀MOS分值。實(shí)驗(yàn)表明,基于該參數(shù)及其主客觀映射關(guān)系,所得到的各類語(yǔ)音編譯碼系統(tǒng)的主觀MOS預(yù)測(cè)分,平均和最大預(yù)測(cè)偏差均較小。論文最后利用PBSD參數(shù)代替MSE參數(shù),設(shè)計(jì)的語(yǔ)音編碼系統(tǒng),改善了解碼語(yǔ)音的主觀聽覺質(zhì)量。
-
關(guān)鍵詞:
- 語(yǔ)音信號(hào); MOS分; 預(yù)測(cè)
Abstract: This paper analyses various objective measures for the prediction of subjective quality of speech. A new Perception-based Bark Spectral Distortion (PBSD) parameter is pre- sented, which takes the masking property of human ear into consideration, to predict Mean Opinion Score(MOS) of speech quality. Experiments prove that this map from objective mea-sure to subjective MOS based on the calculation of PBSD has rather small prediction error. The PBSD parameter is applied to designing new speech codec in place of MSE parameter and the subjective quality of decoded speeches is improved. -
S.R. Quackenbush, T. P. Barnwell, M. A. Clements, Objective Measures of Speech Quality, New York, U.S.A., Prentice Hall, 1988, 第 2 章. [2]丁瑾,鐘濤,胡健棟,語(yǔ)音質(zhì)量的一種新的評(píng)價(jià)方法,電子學(xué)報(bào),1997,25(4),6-9.[2]Shihua Wang, A. Sekey, A. Gersho, An objective measure for predicting subjective quality of speech coders, IEEE Journal on Selectted Areas in Communications, 1992, 10(5), 829-829.[3]M.M. Meky.[J].Tarek, N, Saadawi, A perceptually-based objective measure for speech coders using abductive network, ICASSP96, Atlanta, U.S.A.1996,:-[4]Nobuhiko Kitawaki.[J].et al, Artificial voice signal for objective quality evaluation of speech coding system, ICC89, Boston, MA, U.Y.A.1989,:-[5]J.D. Johnston, Transform coding of audio signals using perceptual noise, IEEE on Selected Areas in Communications, 1988, 6(2), 314-323.[6]L.E. Kinsler et al, Fundamental of Acoustics, New York, U.S.A.,,John Wiley Sons Inc., 1982, third edition, 246-278.[7]T W Parsons著,文成義等譯,語(yǔ)音處理,西安電子科技大學(xué)情報(bào)資料室,1989,3,49-114. -
計(jì)量
- 文章訪問數(shù): 2218
- HTML全文瀏覽量: 125
- PDF下載量: 966
- 被引次數(shù): 0