메뉴 건너뛰기




Volumn 15, Issue 8, 2007, Pages 2431-2443

Noise condition-dependent training based on noise classification and SNR estimation

Author keywords

Condition dependent training; Noise classification; Robust speech recognition; Robustness to unknown noise; Signal to noise ratio (SNR) estimation

Indexed keywords

CONDITION-DEPENDENT TRAINING; NOISE CLASSIFICATION; ROBUST SPEECH RECOGNITION; ROBUSTNESS TO UNKNOWN NOISE; SIGNAL-TO-NOISE RATIO (SNR) ESTIMATION;

EID: 64349084660     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2007.906188     Document Type: Article
Times cited : (14)

References (27)
  • 2
    • 0023263708 scopus 로고
    • Multi-style training for robust isolated-word speech recognition
    • R. P. Lippmann, E. A. Martin, and D. B. Paul, "Multi-style training for robust isolated-word speech recognition," in Proc. ICASSP'87, 1987, pp. 705-708.
    • (1987) Proc. ICASSP'87 , pp. 705-708
    • Lippmann, R.P.1    Martin, E.A.2    Paul, D.B.3
  • 3
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments: A survey
    • Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, pp. 261-291, 1995.
    • (1995) Speech Commun , vol.16 , pp. 261-291
    • Gong, Y.1
  • 4
    • 0141702085 scopus 로고    scopus 로고
    • Environmental sniffing: Noise knowledge estimation for robust speech systems
    • Apr. 6-10
    • M. Akbacak and J. H. L. Hansen, "Environmental sniffing: Noise knowledge estimation for robust speech systems," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Apr. 6-10, 2003, vol. 2, pp. 113-116.
    • (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 113-116
    • Akbacak, M.1    Hansen, J.H.L.2
  • 5
    • 77955915142 scopus 로고    scopus 로고
    • Context awareness using environmental noise classification
    • L. Ma, D. J. Smith, and B. P. Milner, "Context awareness using environmental noise classification," in Proc. Eurospeech'03, 2003, pp. 2237-2240.
    • (2003) Proc. Eurospeech'03 , pp. 2237-2240
    • Ma, L.1    Smith, D.J.2    Milner, B.P.3
  • 6
    • 84897584394 scopus 로고    scopus 로고
    • Advances in acoustic noise tracking for robust in-vehicle speech systems
    • H. Abut, J. H. L. Hansen, and K. Takeda, Eds. New York: Springer, ch. 10, pp
    • M. Akbacak and J. H. L. Hansen, "Advances in acoustic noise tracking for robust in-vehicle speech systems," in Advances for In-Vehicle and Mobile Systems, H. Abut, J. H. L. Hansen, and K. Takeda, Eds. New York: Springer, 2007, ch. 10, pp. 109-122.
    • (2007) Advances for In-Vehicle and Mobile Systems , pp. 109-122
    • Akbacak, M.1    Hansen, J.H.L.2
  • 7
    • 0030638031 scopus 로고    scopus 로고
    • A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
    • J. G. Fiscus, "A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)," in Proc. IEEE Workshop Autom. Speech Recognition Understanding, 1997, pp. 347-354.
    • (1997) Proc. IEEE Workshop Autom. Speech Recognition Understanding , pp. 347-354
    • Fiscus, J.G.1
  • 8
    • 0141812649 scopus 로고
    • Speaker-independent spoken digit recognition in noisy environments using dynamic spectral features and neural networks
    • T. Kitamura, S. Ando, and E. Hayahara, "Speaker-independent spoken digit recognition in noisy environments using dynamic spectral features and neural networks," in Proc. Int. Conf. Speech Lang. Process., 1992, vol. 1, pp. 699-702.
    • (1992) Proc. Int. Conf. Speech Lang. Process , vol.1 , pp. 699-702
    • Kitamura, T.1    Ando, S.2    Hayahara, E.3
  • 9
    • 33947676384 scopus 로고    scopus 로고
    • Modeling variance variation in a variable parameter HMM framework for noise robust speech recognition
    • X. Cui and Y. Gong, "Modeling variance variation in a variable parameter HMM framework for noise robust speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2006, vol. 1, pp. 1117-1120.
    • (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 1117-1120
    • Cui, X.1    Gong, Y.2
  • 10
    • 4544334449 scopus 로고    scopus 로고
    • A tree-structured clustering method integrating noise and SNR for piecewise linear-transformation- based noise adaptation
    • Z. Zhang, T. Sugimura, and S. Furui, "A tree-structured clustering method integrating noise and SNR for piecewise linear-transformation- based noise adaptation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2004, vol. 1, pp. 981-984.
    • (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 981-984
    • Zhang, Z.1    Sugimura, T.2    Furui, S.3
  • 11
    • 85009126862 scopus 로고    scopus 로고
    • Evaluation of tree-structured piecewise- linear-transformation-based noise adaptation on Aurora 2 database
    • Jeju, Korea
    • Z. Zheng, T. Ohya, and S. Furui, "Evaluation of tree-structured piecewise- linear-transformation-based noise adaptation on Aurora 2 database," in Proc. Int. Conf. Spoken Lang. Process., Jeju, Korea, 2004, vol. 1, pp. 113-116.
    • (2004) Proc. Int. Conf. Spoken Lang. Process , vol.1 , pp. 113-116
    • Zheng, Z.1    Ohya, T.2    Furui, S.3
  • 12
    • 33745184458 scopus 로고    scopus 로고
    • Robust speech recognition based on noise and SNR classification-A multiple-model framework
    • Lisbon, Portugal, Sep
    • H. Xu, Z.-H. Tan, P. Dalsgaard, and B. Lindberg, "Robust speech recognition based on noise and SNR classification-A multiple-model framework," in Proc. Interspeech 2005, Lisbon, Portugal, Sep. 2005, pp. 977-980.
    • (2005) Proc. Interspeech 2005 , pp. 977-980
    • Xu, H.1    Tan, Z.-H.2    Dalsgaard, P.3    Lindberg, B.4
  • 13
    • 33947697214 scopus 로고    scopus 로고
    • Robust speech recognition from noise-type based feature compensation and model interpolation in a multiple model framework
    • Toulouse, France, May
    • H. Xu, Z.-H. Tan, P. Dalsgaard, and B. Lindberg, "Robust speech recognition from noise-type based feature compensation and model interpolation in a multiple model framework," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Toulouse, France, May 2006, pp. I-1141-I-1144.
    • (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
    • Xu, H.1    Tan, Z.-H.2    Dalsgaard, P.3    Lindberg, B.4
  • 14
    • 0032141206 scopus 로고    scopus 로고
    • Cepstral domain segmental feature vector normalization for noise robust speech recognition
    • O. Viikki and K. Laurila, "Cepstral domain segmental feature vector normalization for noise robust speech recognition," Speech Commun., vol. 25, pp. 133-147, 1998.
    • (1998) Speech Commun , vol.25 , pp. 133-147
    • Viikki, O.1    Laurila, K.2
  • 18
    • 65549153550 scopus 로고    scopus 로고
    • Speech recognition in noisy environments,
    • Ph.D. dissertation, Carnegie Mellon Univ, Pittsburgh, PA
    • P. Moreno, "Speech recognition in noisy environments," Ph.D. dissertation, Carnegie Mellon Univ., Pittsburgh, PA, 1996.
    • (1996)
    • Moreno, P.1
  • 19
    • 0038669544 scopus 로고    scopus 로고
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Paris, France, Sep. 18-20
    • H. G. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ISCA ITRW ASR2000 (Autom. Speech Recognition: Challenges for the Next Millennium), Paris, France, Sep. 18-20, 2000, pp. 18-20.
    • (2000) Proc. ISCA ITRW ASR2000 (Autom. Speech Recognition: Challenges for the Next Millennium) , pp. 18-20
    • Hirsch, H.G.1    Pearce, D.2
  • 20
    • 0003483593 scopus 로고
    • Eng. Dept. Speech Group and Entropic Research Lab. Inc, Cambridge Univ, Washington, DC
    • S. Young, "HTK: Hidden Markov Model Toolkit V1.5," Eng. Dept. Speech Group and Entropic Research Lab. Inc., Cambridge Univ., Washington, DC, 1993.
    • (1993) HTK: Hidden Markov Model Toolkit V1.5
    • Young, S.1
  • 22
    • 85009063707 scopus 로고    scopus 로고
    • Soft decisions in missing data techniques for robust automatic speech recognition
    • Beijing, China
    • J. Barker, L. Josifovski, M. Cooke, and P. Green, "Soft decisions in missing data techniques for robust automatic speech recognition," in Proc. ICSLP'00, Beijing, China, 2000, vol. 1, pp. 373-376.
    • (2000) Proc. ICSLP'00 , vol.1 , pp. 373-376
    • Barker, J.1    Josifovski, L.2    Cooke, M.3    Green, P.4
  • 23
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • Jun
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, no. 3, pp. 267-285, Jun. 2001.
    • (2001) Speech Commun , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 24
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • Jul
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, no. 4, pp. 504-512, Jul. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.4 , pp. 504-512
    • Martin, R.1
  • 25
    • 64349093568 scopus 로고    scopus 로고
    • Hypothesis Testing
    • Online, Available
    • E. W. Weisstein, "Hypothesis Testing," MathWorld, 2005 [Online]. Available: http://mathworld.wolfram.com/HypothesisTesting.html
    • (2005) MathWorld
    • Weisstein, E.W.1
  • 27
    • 85009242725 scopus 로고    scopus 로고
    • Evaluation of a noise-robust DSR front-end on Aurora databases
    • Denver, CO
    • D. Marcho et al., "Evaluation of a noise-robust DSR front-end on Aurora databases," in Proc. ICSLP'02, Denver, CO, 2002, pp. 17-20.
    • (2002) Proc. ICSLP'02 , pp. 17-20
    • Marcho, D.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.