메뉴 건너뛰기




Volumn , Issue , 2014, Pages 5562-5566

Unsupervised non-parametric Bayesian modeling of non-stationary noise for model-based noise suppression

Author keywords

MMSE estimation; noise suppression; non parametric Bayesian model; unsupervised modeling

Indexed keywords

BAYESIAN NETWORKS; HIDDEN MARKOV MODELS; SIGNAL PROCESSING; SPEECH RECOGNITION; SPURIOUS SIGNAL NOISE;

EID: 84905215531     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854667     Document Type: Conference Paper
Times cited : (2)

References (30)
  • 1
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis for speech
    • April
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis for speech," J. Acoust. Soc. Am., vol. 87, no. 4, pp. 1738-1752, April 1990.
    • (1990) J. Acoust. Soc. Am , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 2
    • 33750352847 scopus 로고    scopus 로고
    • A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition
    • November
    • K. Ishizuka and T. Nakatani, "A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition," Speech Communication, vol. 48, no. 11, pp. 1447-1457, November 2006.
    • (2006) Speech Communication , vol.48 , Issue.11 , pp. 1447-1457
    • Ishizuka, K.1    Nakatani, T.2
  • 3
    • 0036298539 scopus 로고    scopus 로고
    • Non-linear transformations of the feature space for robust speech recognition
    • May
    • J. C. Segura, M.C. Benítez, A. de la Torre, A. M. Peinado, and A. Rubio, "Non-linear transformations of the feature space for robust speech recognition," in Proc. of ICASSP '02, May 2002, vol. I, pp. 401-404.
    • (2002) Proc. of ICASSP '02 , vol.1 , pp. 401-404
    • Segura, J.C.1    Benítez, M.C.2    De La Torre, A.3    Peinado, A.M.4    Rubio, A.5
  • 4
    • 0029375590 scopus 로고
    • Speaker adaptation using constrained estimation of Gaussian mixtures
    • September
    • V. Digalakis, D. Ritischev, and L. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. on SAP, vol. 3, no. 5, pp. 357-366, September 1995.
    • (1995) IEEE Trans. on SAP , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.1    Ritischev, D.2    Neumeyer, L.3
  • 5
    • 34547496746 scopus 로고    scopus 로고
    • Constrained structural maximum a posteriori linear regression for average-voice-based speech synthesis
    • September
    • Y. Nakano, M. Tachibana, J. Yamagishi, and T. Kobayashi, "Constrained structural maximum a posteriori linear regression for average-voice-based speech synthesis," in Proc. of Interspeech '06, September 2006, pp. 2286-2289.
    • (2006) Proc. of Interspeech '06 , pp. 2286-2289
    • Nakano, Y.1    Tachibana, M.2    Yamagishi, J.3    Kobayashi, T.4
  • 6
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • April
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. on ASSP, vol. 27, no. 2, pp. 113-120, April 1979.
    • (1979) IEEE Trans. on ASSP , vol.27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 7
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • December
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. on ASSP, vol. 32, pp. 1109-1121, December 1984.
    • (1984) IEEE Trans. on ASSP , vol.32 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 8
    • 0029725301 scopus 로고    scopus 로고
    • A vector Taylor series approach for environment-independent speech recognition
    • May
    • P. J. Moreno, B. Raj, and R. M. Stern, "A vector Taylor series approach for environment-independent speech recognition," in Proc. of ICASSP '96, May 1996, vol. II, pp. 733-736.
    • (1996) Proc. of ICASSP '96 , vol.2 , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 10
    • 70450204727 scopus 로고    scopus 로고
    • A study of mutual front-end processing method based on statistical model for noise robust speech recognition
    • September
    • M. Fujimoto, K. Ishizuka, and T. Nakatani, "A study of mutual front-end processing method based on statistical model for noise robust speech recognition," in Proc. of Interspeech '09, September 2009, pp. 1235-1238.
    • (2009) Proc. of Interspeech '09 , pp. 1235-1238
    • Fujimoto, M.1    Ishizuka, K.2    Nakatani, T.3
  • 11
    • 84865745040 scopus 로고    scopus 로고
    • A robust estimation method of noise mixture model for noise suppression
    • August
    • M. Fujimoto, S. Watanabe, and T. Nakatani, "A robust estimation method of noise mixture model for noise suppression," in Proc. of Interspeech '11, August 2011, pp. 697-700.
    • (2011) Proc. of Interspeech '11 , pp. 697-700
    • Fujimoto, M.1    Watanabe, S.2    Nakatani, T.3
  • 12
    • 84906251145 scopus 로고    scopus 로고
    • Model-based noise suppression using unsupervised estimation of hidden Markov model for non-stationary noise
    • August
    • M. Fujimoto and T. Nakatani, "Model-based noise suppression using unsupervised estimation of hidden Markov model for non-stationary noise," in Proc. of Interspeech '13, August 2013, pp. 2982-2986.
    • (2013) Proc. of Interspeech '13 , pp. 2982-2986
    • Fujimoto, M.1    Nakatani, T.2
  • 13
    • 84867606947 scopus 로고    scopus 로고
    • A reliable data selection for model-based noise suppression using unsupervised joint speaker adaptation and noise model estimation
    • August
    • M. Fujimoto and T. Nakatani, "A reliable data selection for model-based noise suppression using unsupervised joint speaker adaptation and noise model estimation," in Proc. of ICSPCC '12, August 2012, pp. 4713-4716.
    • (2012) Proc. of ICSPCC '12 , pp. 4713-4716
    • Fujimoto, M.1    Nakatani, T.2
  • 14
    • 84906262433 scopus 로고    scopus 로고
    • Speech enhancement based on deep denoising autoencoder
    • August
    • X. Lu, Y. Tsao, S. Matsuda, and C. Hori, "Speech enhancement based on deep denoising autoencoder," in Proc. of Interspeech '13, August 2013, pp. 436-440.
    • (2013) Proc. of Interspeech '13 , pp. 436-440
    • Lu, X.1    Tsao, Y.2    Matsuda, S.3    Hori, C.4
  • 15
    • 0030245128 scopus 로고    scopus 로고
    • Robust continuous speech recognition using parallel model combination
    • May
    • M. J. F. Gales and S. J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. on SAP, vol. 4, no. 5, pp. 352-359, May 1996.
    • (1996) IEEE Trans. on SAP , vol.4 , Issue.5 , pp. 352-359
    • Gales, M.J.F.1    Young, S.J.2
  • 16
    • 79951668781 scopus 로고    scopus 로고
    • Extended VTS for noiserobust speech recognition
    • May
    • R. C. van Dalen and M. J. F Gales, "Extended VTS for noiserobust speech recognition," IEEE Trans. on SAP, vol. 19, no. 4, pp. 733-743, May 2011.
    • (2011) IEEE Trans. on SAP , vol.19 , Issue.4 , pp. 733-743
    • Van Dalen, R.C.1    Gales, M.J.F.2
  • 17
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • April
    • C. L. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech and Language, vol. 9, no. 2, pp. 171-185, April 1995.
    • (1995) Computer Speech and Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.L.1    Woodland, P.C.2
  • 18
    • 0036461005 scopus 로고    scopus 로고
    • Structural maximum a posteriori linear regression for fast HMM adaptation
    • January
    • O. Siohan, T. Myrvoll, and C. Lee, "Structural maximum a posteriori linear regression for fast HMM adaptation," Computer Speech &Language, vol. 16, no. 1, pp. 5-24, January 2002.
    • (2002) Computer Speech &Language , vol.16 , Issue.1 , pp. 5-24
    • Siohan, O.1    Myrvoll, T.2    Lee, C.3
  • 19
    • 82455212515 scopus 로고    scopus 로고
    • Bayesian linear regression for hidden Markov model based on optimizing variational bounds
    • December
    • S. Watanabe, A. Nakamura, and B. H. Juang, "Bayesian linear regression for hidden Markov model based on optimizing variational bounds," in Proc. of MLSP '11, December 2011, pp. 1-6.
    • (2011) Proc. of MLSP '11 , pp. 1-6
    • Watanabe, S.1    Nakamura, A.2    Juang, B.H.3
  • 20
    • 0036291376 scopus 로고    scopus 로고
    • Uncertainty decoding with SPLICE for noise robust speech recognition
    • May
    • J. Droppo, A. Acero, and L. Deng, "Uncertainty decoding with SPLICE for noise robust speech recognition," in Proc. of ICASSP '02, May 2002, pp. 57-60.
    • (2002) Proc. of ICASSP '02 , pp. 57-60
    • Droppo, J.1    Acero, A.2    Deng, L.3
  • 21
    • 40249103761 scopus 로고    scopus 로고
    • Issues with uncertainty decoding for noise robust automatic speech recognition
    • April
    • H. Liao and M. J. F. Gales, "Issues with uncertainty decoding for noise robust automatic speech recognition," Speech Communication, vol. 50, no. 4, pp. 265-277, April 2008.
    • (2008) Speech Communication , vol.50 , Issue.4 , pp. 265-277
    • Liao, H.1    Gales, M.J.F.2
  • 22
    • 79959825393 scopus 로고    scopus 로고
    • A comparative study of noise estimation algorithms for VTS-based robust speech recognition
    • September
    • Y. Zhao and B. H. Juang, "A comparative study of noise estimation algorithms for VTS-based robust speech recognition," in Proc. of Interspeech '10, September 2010, pp. 2090-2093.
    • (2010) Proc. of Interspeech '10 , pp. 2090-2093
    • Zhao, Y.1    Juang, B.H.2
  • 23
    • 80051616110 scopus 로고    scopus 로고
    • Rapid joint speaker and noise compensation for robust speech recognition
    • May
    • K. K. Chin, H. Xu, M. J. F. Gales, C. Breslin, and K. Knill, "Rapid joint speaker and noise compensation for robust speech recognition," in Proc. of ICASSP '11, May 2011, pp. 5500-5503.
    • (2011) Proc. of ICASSP '11 , pp. 5500-5503
    • Chin, K.K.1    Xu, H.2    Gales, M.J.F.3    Breslin, C.4    Knill, K.5
  • 24
    • 80051617808 scopus 로고    scopus 로고
    • Speaker and noise factorisation on the AURORA4 task
    • May
    • Y.-Q. Wang and M. J. F. Gales, "Speaker and noise factorisation on the AURORA4 task," in Proc. of ICASSP '11, May 2011, pp. 4584-4587.
    • (2011) Proc. of ICASSP '11 , pp. 4584-4587
    • Wang, Y.-Q.1    Gales, M.J.F.2
  • 25
    • 79551500649 scopus 로고    scopus 로고
    • Asymptotic equivalence of Bayes cross validation and widely applicable information criterion in singular learning theory
    • December
    • S. Watanabe, "Asymptotic equivalence of Bayes cross validation and widely applicable information criterion in singular learning theory," Journal of Machine Learning Research, vol. 11, pp. 3571-3591, December 2010.
    • (2010) Journal of Machine Learning Research , vol.11 , pp. 3571-3591
    • Watanabe, S.1
  • 26
    • 0001120413 scopus 로고
    • A Bayesian analysis of some nonparametric problems
    • March
    • T. S. Ferguson, "A Bayesian analysis of some nonparametric problems," The Annals of Statistics, vol. 1, no. 2, pp. 209-230, March 1973.
    • (1973) The Annals of Statistics , vol.1 , Issue.2 , pp. 209-230
    • Ferguson, T.S.1
  • 27
    • 0038669544 scopus 로고    scopus 로고
    • The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy condition
    • September
    • H. G. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy condition," in Proc. ISCA ITRW ASR'00, September 2000, pp. 18-20.
    • (2000) Proc. ISCA ITRW ASR'00 , pp. 18-20
    • Hirsch, H.G.1    Pearce, D.2
  • 29
    • 45849093239 scopus 로고    scopus 로고
    • Efficient WFST-based one-pass decoding with on-The-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition
    • May
    • T. Hori, C. Hori, Y. Minami, and A. Nakamura, "Efficient WFST-based one-pass decoding with on-The-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition," IEEE Trans. on ASLP, vol. 15, no. 4, pp. 1352-1365, May 2007.
    • (2007) IEEE Trans. on ASLP , vol.15 , Issue.4 , pp. 1352-1365
    • Hori, T.1    Hori, C.2    Minami, Y.3    Nakamura, A.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.