



2009, Pages 3833-3836

Ensemble speaker and speaking environment modeling approach with advanced online estimation process

Author keywords

Ensemble speaker and speaking environment modeling; N best transcription; Noise robustness

Indexed keywords

ACOUSTIC MODEL; ADAPTATION TECHNIQUES; ENSEMBLE SPEAKER AND SPEAKING ENVIRONMENT MODELING; ENVIRONMENT MODELING; INFORMATION TECHNIQUES; MULTIPLE SET; N-BEST TRANSCRIPTION; NOISE ROBUSTNESS; ON-LINE ESTIMATION; SPEAKER VARIABILITY; UNKNOWN ENVIRONMENTS; UNSUPERVISED ADAPTATION; WORD ERROR RATE;

EID: 70349194598; PISSN: 15206149; EISSN: None; Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2009.4960463; Document Type: Conference Paper
Times cited: 11

References (15)
  • 1
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • April
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Processing, vol. 2, pp. 291-299, April 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 291-299
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 2
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol.12, no. 2, pp.75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 3
    • 0009625231
    • A comparison of novel techniques for rapid speaker adaptation
    • T. J. Hazen, "A comparison of novel techniques for rapid speaker adaptation," Speech Comm., pp.15-33, 2000.
    • (2000) Speech Comm , pp. 15-33
    • Hazen, T.J.1
  • 4
    • 0034227757
    • Cluster adaptive training of hidden Markov models
    • M. J. F. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Proc., pp. 417-428, 2000.
    • (2000) IEEE Trans. Speech Audio Proc , pp. 417-428
    • Gales, M.J.F.1
  • 6
    • 0030149866
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," Trans. Speech Audio Proc., pp.190-202, 1996.
    • (1996) Trans. Speech Audio Proc , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 7
    • 44849130443
    • Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition
    • Y. Tsao and C.-H. Lee, "Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition," in ASRU, 2007.
    • (2007) ASRU
    • Tsao, Y.1    Lee, C.-H.2
  • 8
    • 84867201606
    • Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process
    • Y. Tsao and C.-H. Lee, "Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process," in Interspeech 2008.
    • Interspeech 2008
    • Tsao, Y.1    Lee, C.-H.2
  • 9
    • 0032627251
    • N-best based supervised and unsupervised adaptation for native and nonnative speakers in cars
    • P. Nguyen, P. Gelin, J.-C. Junqua, and J.-T. Chien, "N-best based supervised and unsupervised adaptation for native and nonnative speakers in cars," ICASSP'97, pp. 257-265, 1997.
    • (1997) ICASSP'97 , pp. 257-265
    • Nguyen, P.1    Gelin, P.2    Junqua, J.-C.3    Chien, J.-T.4
  • 10
    • 67651157825
    • Soft margin estimation for automatic speech recognition
    • Ph.D. Dissertation, School of ECE, Georgia Institute of Technology
    • J. Li, "Soft margin estimation for automatic speech recognition," Ph.D. Dissertation, School of ECE, Georgia Institute of Technology, 2008.
    • (2008)
    • Li, J.1
  • 11
    • 70349200182
    • Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers
    • B. Mak, T.-C. Lai, and R. Hsiao, "Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers," in ICASSP 99, vol. 1, pp. 173-176, 1999.
  • 13
    • 0003200767
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • D. Pearce and H.-G. Hirsch, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ISCA ITRW ASR'2000.
    • Proc. ISCA ITRW ASR'2000
    • Pearce, D.1    Hirsch, H.-G.2
  • 14
    • 85009181040
    • Several HKU approaches for robust speech recognition and their evaluation on Aurora connected digit recognition tasks
    • J. Wu and Q. Huo, "Several HKU approaches for robust speech recognition and their evaluation on Aurora connected digit recognition tasks," in Eurospeech 2003.
    • Eurospeech 2003
    • Wu, J.1    Huo, Q.2
  • 15
    • 0031139839
    • Minimum classification error rate methods for speech recognition
    • B.-H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Proc., pp. 257-265, 1997.
    • (1997) IEEE Trans. Speech Audio Proc , pp. 257-265
    • Juang, B.-H.1    Chou, W.2    Lee, C.-H.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.