2011, Pages 5484-5487

Frame-wise HMM adaptation using state-dependent reverberation estimates

Author keywords

acoustic modeling; distant-talking ASR; frame-wise HMM adaptation; reverberation modeling; robust speech recognition

Indexed keywords

ACOUSTIC ENVIRONMENT; ACOUSTIC MODELING; CEPSTRAL FEATURES; DECODING COMPLEXITY; DISTANT-TALKING ASR; FEATURE VECTORS; FRAME-WISE HMM ADAPTATION; MODEL ADAPTATION; PARTIAL STATE; ROBUST SPEECH RECOGNITION; SPEECH RECORDING; STATE-DEPENDENT

EID: 80051616058     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2011.5947600     Document Type: Conference Paper
Times cited: 7

References (10)
  • 1
    • 33947694706
    • Model adaptation for long convolutional distortion by maximum likelihood based state filtering approach
    • May
    • C. K. Raut, T. Nishimoto, and S. Sagayama, "Model adaptation for long convolutional distortion by maximum likelihood based state filtering approach," Proc. ICASSP, pp. I-1133 - I-1136, May 2006.
    • (2006) Proc. ICASSP
    • Raut, C.K.1    Nishimoto, T.2    Sagayama, S.3
  • 2
    • 44949247595
    • A new HMM adaptation approach for the case of a hands-free speech input in reverberant rooms
    • September
    • H.-G. Hirsch and H. Finster, "A new HMM adaptation approach for the case of a hands-free speech input in reverberant rooms," Proc. INTERSPEECH, pp. 781-783, September 2006.
    • (2006) Proc. INTERSPEECH , pp. 781-783
    • Hirsch, H.-G.1    Finster, H.2
  • 3
    • 80051624029
    • Adapting HMMs of distant-talking ASR systems using feature-domain reverberation models
    • A. Sehr, M. Gardill, and W. Kellermann, "Adapting HMMs of distant-talking ASR systems using feature-domain reverberation models," Proc. EUSIPCO, 2009.
    • Proc. EUSIPCO, 2009
    • Sehr, A.1    Gardill, M.2    Kellermann, W.3
  • 4
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech & Language, vol. 9, no. 2, pp. 171-185, 1995.
    • (1995) Computer Speech & Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.1    Woodland, P.2
  • 5
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observation of Markov chains
    • J. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observation of Markov chains," IEEE Trans. Speech and Audio Process., vol. 2, no. 2, 1994.
    • (1994) IEEE Trans. Speech and Audio Process. , vol.2 , Issue.2
    • Gauvain, J.1    Lee, C.-H.2
  • 6
    • 33645784228
    • Acoustic model adaptation using first-order linear prediction for reverberant speech
    • March
    • T. Takiguchi, M. Nishimura, and Y. Ariki, "Acoustic model adaptation using first-order linear prediction for reverberant speech," IEICE Trans. on Inform. & Systems, vol. E89-D, no. 3, pp. 908-914, March 2006.
    • (2006) IEICE Trans. on Inform. & Systems , vol.E89-D , Issue.3 , pp. 908-914
    • Takiguchi, T.1    Nishimura, M.2    Ariki, Y.3
  • 7
    • 0032673963
    • Probabilistic-trajectory segmental HMMs
    • W. Holmes and M. Russell, "Probabilistic-trajectory segmental HMMs," Computer Speech & Language, vol. 13, pp. 3-37, 1999.
    • (1999) Computer Speech & Language , vol.13 , pp. 3-37
    • Holmes, W.1    Russell, M.2
  • 8
    • 77955683144
    • Reverberation model-based decoding in the logmelspec domain for robust distant-talking speech recognition
    • A. Sehr, R. Maas, and W. Kellermann, "Reverberation model-based decoding in the logmelspec domain for robust distant-talking speech recognition," IEEE Trans. on Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1676-1691, 2010.
    • (2010) IEEE Trans. on Audio, Speech, Lang. Process. , vol.18 , Issue.7 , pp. 1676-1691
    • Sehr, A.1    Maas, R.2    Kellermann, W.3
  • 9
    • 80051610960
    • "HTK webpage," http://htk.eng.cam.ac.uk/.
    • HTK Webpage
  • 10
    • 0021226391
    • A database for speaker-independent digit recognition
    • R. Leonard, "A database for speaker-independent digit recognition," Proc. ICASSP, pp. 42.11.1-42.11.4, 1984.
    • (1984) Proc. ICASSP
    • Leonard, R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.