SCOPUS Information Search Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume , Issue , 2009, Pages 3833-3836

Ensemble speaker and speaking environment modeling approach with advanced online estimation process

(3) Tsao, Yu (a); Li, Jinyu (a); Lee, Chin Hui (a)

(a) GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Ensemble speaker and speaking environment modeling; N best transcription; Noise robustness

Indexed keywords

ACOUSTIC MODEL; ADAPTATION TECHNIQUES; ENSEMBLE SPEAKER AND SPEAKING ENVIRONMENT MODELING; ENVIRONMENT MODELING; INFORMATION TECHNIQUES; MULTIPLE SET; N-BEST TRANSCRIPTION; NOISE ROBUSTNESS; ON-LINE ESTIMATION; SPEAKER VARIABILITY; UNKNOWN ENVIRONMENTS; UNSUPERVISED ADAPTATION; WORD ERROR RATE;

ACOUSTIC NOISE; SIGNAL PROCESSING; TRANSCRIPTION;

ACOUSTICS;

EID: 70349194598 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2009.4960463 Document Type: Conference Paper

Times cited : (11)

References (15)

1
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- April
- J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Processing, vol. 2, pp. 291-299, April 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 291-299
- Gauvain, J.-L.¹ Lee, C.-H.²

2
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol.12, no. 2, pp.75-98, 1998.
- (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

3
- 0009625231
- A comparison of novel techniques for rapid speaker adaptation
- T. J. Hazen, "A comparison of novel techniques for rapid speaker adaptation," Speech Comm., pp.15-33, 2000.
- (2000) Speech Comm , pp. 15-33
- Hazen, T.J.¹

4
- 0034227757
- Cluster adaptive training of hidden Markov models
- M. J. F. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Proc., pp. 417-428, 2000.
- (2000) IEEE Trans. Speech Audio Proc , pp. 417-428
- Gales, M.J.F.¹

5
- 0034320005
- Rapid speaker adaptation in Eigenvoice space
- Nov
- R. Kuhn, J.-C. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in Eigenvoice space," IEEE Trans. Speech Audio Processing, vol. 8, pp.695-707, Nov. 2000.
- (2000) IEEE Trans. Speech Audio Processing , vol.8 , pp. 695-707
- Kuhn, R.¹ Junqua, J.-C.² Nguyen, P.³ Niedzielski, N.⁴

6
- 0030149866
- A maximum-likelihood approach to stochastic matching for robust speech recognition
- A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," Trans. Speech Audio Proc., pp.190-202, 1996.
- (1996) Trans. Speech Audio Proc , pp. 190-202
- Sankar, A.¹ Lee, C.-H.²

7
- 44849130443
- Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition
- Y. Tsao and C.-H. Lee, "Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition," in ASRU, 2007.
- (2007) ASRU
- Tsao, Y.¹ Lee, C.-H.²

8
- 84867201606
- Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process
- Y. Tsao and C.-H. Lee, "Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process," in Interspeech 2008.
- Interspeech 2008
- Tsao, Y.¹ Lee, C.-H.²

9
- 0032627251
- N-best based supervised and unsupervised adaptation for native and nonnative speakers in cars
- P. Nguyen, P. Gelin, J.-C. Junqua, and J.-T. Chien, "N-best based supervised and unsupervised adaptation for native and nonnative speakers in cars," ICASSP'97, pp. 257-265, 1997.
- (1997) ICASSP'97 , pp. 257-265
- Nguyen, P.¹ Gelin, P.² Junqua, J.-C.³ Chien, J.-T.⁴

10
- 67651157825
- Soft margin estimation for automatic speech recognition
- Ph.D. Dissertation, School of ECE, Georgia Institute of Technology
- J. Li, "Soft margin estimation for automatic speech recognition," Ph.D. Dissertation, School of ECE, Georgia Institute of Technology, 2008.
- (2008)
- Li, J.¹

11
- 70349200182
- Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers
- B. Mak, T.-C. Lai, and R. Hsiao "Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers," in ICASSP 99, vol. 1, pp. 173-176, 1999.

12
- 0032131292
- Atomic Decomposition by Basis Pursuit
- S. Chen, D. Donoho, and M. A. Saunders, "Atomic Decomposition by Basis Pursuit", in SIAM J. on Scientific Computing, vol. 20, No. 1, pp. 33-61, 1998.
- (1998) SIAM J. on Scientific Computing , vol.20 , Issue.1 , pp. 33-61
- Chen, S.¹ Donoho, D.² Saunders, M.A.³

13
- 0003200767
- The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- D. Pearce and H.-G. Hirsch, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ISCA ITRW ASR'2000.
- Proc. ISCA ITRW ASR'2000
- Pearce, D.¹ Hirsch, H.-G.²

14
- 85009181040
- Several HKU approaches for robust speech recognition and their evaluation on Aurora connected digit recognition tasks
- J. Wu and Q. Huo, "Several HKU approaches for robust speech recognition and their evaluation on Aurora connected digit recognition tasks," in Eurospeech 2003.
- Eurospeech 2003
- Wu, J.¹ Huo, Q.²

15
- 0031139839
- Minimum classification error rate methods for speech recognition
- B.-H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Proc., pp. 257-265, 1997.
- (1997) IEEE Trans. Speech Audio Proc , pp. 257-265
- Juang, B.-H.¹ Chou, W.² Lee, C.-H.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.