메뉴 건너뛰기




Volumn 18, Issue 7, 2010, Pages 1676-1691

Reverberation model-based decoding in the logmelspec domain for robust distant-talking speech recognition

Author keywords

Acoustic modeling; distant talking automatic speech recognition (ASR); model based dereverberation; reverberation model; robust ASR

Indexed keywords

ACOUSTIC MODEL; ACOUSTIC MODELING; AUTOMATIC SPEECH RECOGNITION; COMBINATION OPERATION; CONNECTED DIGITS; FEATURE DOMAIN; HIGH FLEXIBILITY; IN-DEPTH ANALYSIS; JOINT DENSITIES; MODEL-BASED; MODEL-BASED DEREVERBERATION; NON-LINEAR OPTIMIZATION ALGORITHMS; OPTIMIZATION OPERATION; OPTIMIZATION PROBLEMS; PARAMETERS ESTIMATED; ROBUST ASR; STATISTICAL PROPERTIES;

EID: 77955683144     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2050511     Document Type: Article
Times cited : (53)

References (42)
  • 1
    • 44949167884 scopus 로고    scopus 로고
    • Distant-talking continuous speech recognition based on a novel reverberation model in the feature domain
    • A. Sehr, M. Zeller, and W. Kellermann, "Distant-talking continuous speech recognition based on a novel reverberation model in the feature domain," in Proc. Interspeech, 2006, pp. 769-772.
    • (2006) Proc. Interspeech , pp. 769-772
    • Sehr, A.1    Zeller, M.2    Kellermann, W.3
  • 2
    • 70349227947 scopus 로고    scopus 로고
    • The application of hidden Markov models in speech recognition
    • M. Gales and S. Young, "The application of hidden Markov models in speech recognition," Foundat. Trends Signal Process., vol.1, no.3, pp. 195-304, 2007.
    • (2007) Foundat. Trends Signal Process. , vol.1 , Issue.3 , pp. 195-304
    • Gales, M.1    Young, S.2
  • 3
    • 85132840106 scopus 로고    scopus 로고
    • Towards robust distant-talking automatic speech recognition in reverberant environments
    • E. Hänsler and G. Schmidt, Eds. Berlin: Springer
    • A. Sehr and W. Kellermann, "Towards robust distant-talking automatic speech recognition in reverberant environments," in Topics in Speech and Audio Processing in Adverse Environments, E. Hänsler and G. Schmidt, Eds. Berlin: Springer, 2008, pp. 679-728.
    • (2008) Topics in Speech and Audio Processing in Adverse Environments , pp. 679-728
    • Sehr, A.1    Kellermann, W.2
  • 5
    • 50449102864 scopus 로고    scopus 로고
    • The harming part of room acoustics in automatic speech recognition
    • Aug.
    • R. Petrick, K. Lohde, M.Wolff, and R. Hoffmann, "The harming part of room acoustics in automatic speech recognition," in Proc. Interspeech, Aug. 2007, pp. 1094-1097.
    • (2007) Proc. Interspeech , pp. 1094-1097
    • Petrick, R.1    Lohde, K.2    Wolff, M.3    Hoffmann, R.4
  • 8
    • 34247241719 scopus 로고    scopus 로고
    • Inverse filtering for speech dereverberation less sensitive to noise and room transfer function fluctuations
    • T. Hikichi, M. Delcroix, and M. Miyoshi, "Inverse filtering for speech dereverberation less sensitive to noise and room transfer function fluctuations," EURASIP J. Adv. Signal Process., vol.2007, 2007.
    • (2007) EURASIP J. Adv. Signal Process. , vol.2007
    • Hikichi, T.1    Delcroix, M.2    Miyoshi, M.3
  • 10
    • 79957754961 scopus 로고    scopus 로고
    • TRINICON for dereverberation of speech and audio signals
    • P. Naylor and N. Gaubitch, Eds. Berlin: Springer
    • H. Buchner and W. Kellermann, "TRINICON for dereverberation of speech and audio signals," in Speech Dereverberation, P. Naylor and N. Gaubitch, Eds. Berlin: Springer.
    • Speech Dereverberation
    • Buchner, H.1    Kellermann, W.2
  • 11
    • 14344274593 scopus 로고    scopus 로고
    • A new method based on spectral subtraction for speech dereverberation
    • K. Lebart and J. M. Boucher, "A new method based on spectral subtraction for speech dereverberation," Acta Acust., vol.87, pp. 359-366, 2001.
    • (2001) Acta Acust. , vol.87 , pp. 359-366
    • Lebart, K.1    Boucher, J.M.2
  • 12
    • 77955697587 scopus 로고    scopus 로고
    • Late reverberant spectral variance estimation based on a statistical model
    • Sep.
    • E. Habets, S. Gannot, and I. Cohen, "Late reverberant spectral variance estimation based on a statistical model," IEEE Signal Process. Lett., vol.16, pp. 770-773, Sep. 2009.
    • (2009) IEEE Signal Process. Lett. , vol.16 , pp. 770-773
    • Habets, E.1    Gannot, S.2    Cohen, I.3
  • 13
    • 65249167097 scopus 로고    scopus 로고
    • Suppression of late reverberation effect on speech signal using long-term multiplestep linear prediction
    • May
    • K. Kinoshita, M. Delcroix, T. Nakatani, and M. Miyoshi, "Suppression of late reverberation effect on speech signal using long-term multiplestep linear prediction," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.4, pp. 534-545, May 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.4 , pp. 534-545
    • Kinoshita, K.1    Delcroix, M.2    Nakatani, T.3    Miyoshi, M.4
  • 14
    • 0001379957 scopus 로고    scopus 로고
    • Enhancement of reverberant speech using LP residual signal
    • May
    • B. Yegnanarayana and P. S. Murthy, "Enhancement of reverberant speech using LP residual signal," IEEE Trans. Speech Audio Process., vol.8, pp. 267-281, May 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , pp. 267-281
    • Yegnanarayana, B.1    Murthy, P.S.2
  • 17
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol.55, pp. 1304-1312, 1974.
    • (1974) J. Acoust. Soc. Amer. , vol.55 , pp. 1304-1312
    • Atal, B.1
  • 18
    • 33745246234 scopus 로고    scopus 로고
    • Multiresolution channel normalization for ASR in reverberant environments
    • C. Avendano, S. Tibrewala, and H. Hermansky, "Multiresolution channel normalization for ASR in reverberant environments," in Proc. Eurospeech, 1997, pp. 1107-1110.
    • (1997) Proc. Eurospeech , pp. 1107-1110
    • Avendano, C.1    Tibrewala, S.2    Hermansky, H.3
  • 19
    • 70350439261 scopus 로고    scopus 로고
    • Enhanced speech features by single-channel joint compensation of noise and reverberation
    • Feb.
    • M. Wölfel, "Enhanced speech features by single-channel joint compensation of noise and reverberation," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.2, pp. 312-323, Feb. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.2 , pp. 312-323
    • Wölfel, M.1
  • 22
    • 33645784228 scopus 로고    scopus 로고
    • Acoustic model adaptation using first-order linear prediction for reverberant speech
    • Mar.
    • T. Takiguchi, M. Nishimura, and Y. Ariki, "Acoustic model adaptation using first-order linear prediction for reverberant speech," IEICE Trans. Inf. Syst., vol.E89-D, no.3, pp. 908-914, Mar. 2006.
    • (2006) IEICE Trans. Inf. Syst. , vol.E89-D , Issue.3 , pp. 908-914
    • Takiguchi, T.1    Nishimura, M.2    Ariki, Y.3
  • 24
    • 44949247595 scopus 로고    scopus 로고
    • A new HMM adaptation approach for the case of a hands-free speech input in reverberant rooms
    • Sep.
    • H.-G. Hirsch and H. Finster, "A new HMM adaptation approach for the case of a hands-free speech input in reverberant rooms," in Proc. Interspeech, Sep. 2006, pp. 781-783.
    • (2006) Proc. Interspeech , pp. 781-783
    • Hirsch, H.-G.1    Finster, H.2
  • 25
    • 2942539074 scopus 로고    scopus 로고
    • Techniques for handling convolutional distortion with 'missing data' automatic speech recognition
    • Jun.
    • J. B. K. J. Palomäki and G. J. Brown, "Techniques for handling convolutional distortion with 'missing data' automatic speech recognition," Speech Commun., vol.43, no.1-2, pp. 123-142, Jun. 2004.
    • (2004) Speech Commun. , vol.43 , Issue.1-2 , pp. 123-142
    • Palomäki, J.B.K.J.1    Brown, G.J.2
  • 26
    • 70350450398 scopus 로고    scopus 로고
    • Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing
    • Feb.
    • M. Delcroix, T. Nakatani, and S. Watanabe, "Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.2, pp. 324-334, Feb. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.2 , pp. 324-334
    • Delcroix, M.1    Nakatani, T.2    Watanabe, S.3
  • 30
    • 70349707036 scopus 로고    scopus 로고
    • A simplified decoding method for a robust distant-talking ASR concept based on feature-domain dereverberation
    • A. Sehr and W. Kellermann, "A simplified decoding method for a robust distant-talking ASR concept based on feature-domain dereverberation, " in Proc. Int. Workshop Acoust. Echo Noise Control (IWAENC), 2008.
    • (2008) Proc. Int. Workshop Acoust. Echo Noise Control (IWAENC)
    • Sehr, A.1    Kellermann, W.2
  • 33
    • 84863742097 scopus 로고    scopus 로고
    • Maximum likelihood estimation of a reverberation model for robust distant-talking speech recognition
    • A. Sehr, Y. Zheng, E. Nöth, and W. Kellermann, "Maximum likelihood estimation of a reverberation model for robust distant-talking speech recognition," in Proc. Eur. Signal Process. Conf. (EUSIPCO), 2007, pp. 1299-1303.
    • (2007) Proc. Eur. Signal Process. Conf. (EUSIPCO) , pp. 1299-1303
    • Sehr, A.1    Zheng, Y.2    Nöth, E.3    Kellermann, W.4
  • 35
    • 84863766806 scopus 로고    scopus 로고
    • Blind estimation of a feature-domain reverberation model in non-diffuse environments with variance adjustment
    • J. Y. C. Wen, A. Sehr, P. A. Naylor, and W. Kellermann, "Blind estimation of a feature-domain reverberation model in non-diffuse environments with variance adjustment," in Proc. Eur. Signal Process. Conf. (EUSIPCO), 2009, pp. 175-179.
    • (2009) Proc. Eur. Signal Process. Conf. (EUSIPCO) , pp. 175-179
    • Wen, J.Y.C.1    Sehr, A.2    Naylor, P.A.3    Kellermann, W.4
  • 36
    • 0018455820 scopus 로고
    • Image method for efficiently simulating small-room acoustics
    • Apr.
    • J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Amer., vol.65, no.4, pp. 943-950, Apr. 1979.
    • (1979) J. Acoust. Soc. Amer. , vol.65 , Issue.4 , pp. 943-950
    • Allen, J.B.1    Berkley, D.A.2
  • 38
    • 77955672368 scopus 로고    scopus 로고
    • [Online]. Available
    • HTK [Online]. Available: http://htk.eng.cam.ac.uk/
  • 39
    • 29144523061 scopus 로고    scopus 로고
    • On the implementation of a primal-dual interior point filter line search algorithm for large-scale nonlinear programming
    • A. Wächter and L. Biegler, "On the implementation of a primal-dual interior point filter line search algorithm for large-scale nonlinear programming," Math. Program., vol.106, no.1, pp. 25-57, 2006.
    • (2006) Math. Program. , vol.106 , Issue.1 , pp. 25-57
    • Wächter, A.1    Biegler, L.2
  • 41
    • 77955671363 scopus 로고    scopus 로고
    • Sound scene database in real acoustical environments
    • "Sound scene database in real acoustical environments," Real World Computing Partnership, 2001.
    • (2001) Real World Computing Partnership


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.