SCOPUS Information Retrieval Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume , Issue , 2014, Pages 845-849

Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks

(4) Huang, Yan (a); Slaney, Malcolm (a); Seltzer, Michael L. (a); Gong, Yifan (a)

(a) Microsoft (United States)

Author keywords

CD DNN HMM; Channel compensation; Deep learning; Multi task learning; Noise robustness

Indexed keywords

SPEECH COMMUNICATION;

CD-DNN-HMM; CHANNEL COMPENSATION; DEEP LEARNING; MULTITASK LEARNING; NOISE ROBUSTNESS;

SPEECH RECOGNITION;

EID: 84910069710 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (22)

References (24)

1
- 84055222005
- Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
- Dahl, G.E., Yu, D., Deng, L., and Acero, A., "Context-Dependent Pre-trained Deep Neural Networks for Large Vocabulary Speech Recognition, " IEEE Transactions on Audio, Speech, and Language Processing (TASLP) - Special Issue on Deep Learning for Speech and Language Processing, Volume: 1, No. 1, Page(s): 33-42, Jan 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing (TASLP) - Special Issue on Deep Learning for Speech and Language Processing , vol.1 , Issue.1 , pp. 33-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

2
- 84910037211
- Conversational speech transcription using context-dependent deep neural networks
- Seide, F., Li, G., and Yu, D., "Conversational Speech Transcription Using Context-Dependent Deep Neural Networks, " in the Proceedings of Interspeech 2012.
- (2012) The Proceedings of Interspeech
- Seide, F.¹ Li, G.² Yu, D.³

3
- 84878379108
- Scalable minimum Bayes risk training of deep neural network acoustic models using distributed hessian-free optimization
- Kingsbury, B., Sainath, T. N., and Soltau, H., "Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization, " in the Proceedings of Interspeech 2012.
- (2012) The Proceedings of Interspeech
- Kingsbury, B.¹ Sainath, T.N.² Soltau, H.³

4
- 84878539964
- Application of pretrained deep neural networks to large vocabulary speech recognition
- Jaitly, N., Nguyen, P., Senior, A., and Vanhoucke, V., "Application of Pretrained Deep Neural Networks to Large Vocabulary Speech Recognition, " in the Proceedings of Interspeech 2012.
- (2012) The Proceedings of Interspeech
- Jaitly, N.¹ Nguyen, P.² Senior, A.³ Vanhoucke, V.⁴

5
- 84890491198
- Recent advances in deep learning for speech research at Microsoft
- Deng, L., Li, J., Huang, J., Yao, K., Yu, D., Seide, F., Seltzer, M., Zweig, G., He, X., Williams, J., Gong, Y., and Acero, A., "Recent Advances in Deep Learning for Speech Research at Microsoft, " in the Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013.
- (2013) The Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Deng, L.¹ Li, J.² Huang, J.³ Yao, K.⁴ Yu, D.⁵ Seide, F.⁶ Seltzer, M.⁷ Zweig, G.⁸ He, X.⁹ Williams, J.¹⁰ Gong, Y.¹¹ Acero, A.¹²

6
- 84910027886
- A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden Markov Models
- Huang, Y., Yu, D., Liu, C., and Gong, Y., "A Comparative Analytic Study on the Gaussian Mixture and Context Dependent Deep Neural Network Hidden Markov Models, " submitted to the Interspeech 2014.
- (2014) Interspeech
- Huang, Y.¹ Yu, D.² Liu, C.³ Gong, Y.⁴

7
- 0034227757
- Cluster adaptive training of hidden Markov models
- Gales, M. J. F., "Cluster adaptive training of hidden Markov models, " IEEE Trans. Speech Audio Processing, vol. 8, no. 4, pp. 417-428, July 2000.
- (2000) IEEE Trans. Speech Audio Processing , vol.8 , Issue.4 , pp. 417-428
- Gales, M.J.F.¹

8
- 51449089757
- Ph.D. Thesis, Cambridge University
- Yu, K., "Adaptive Training for Large Vocabulary Continuous Speech Recognition, " Ph.D. Thesis, Cambridge University, 2006.
- (2006) Adaptive Training for Large Vocabulary Continuous Speech Recognition
- Yu, K.¹

9
- 84962787636
- Acoustic factorisation
- Gales, M. J. F., "Acoustic Factorisation, " in Proceedings of ASRU 2001.
- (2001) Proceedings of ASRU
- Gales, M.J.F.¹

10
- 84862293102
- Speaker and noise factorization for robust speech recognition
- Wang, Y., Gales, M. J. F., "Speaker and Noise Factorization for Robust Speech Recognition, " in IEEE Transactions on Audio, Speech, and Language Processing (TASLP), vol. 20, no. 7, September 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing (TASLP) , vol.20 , Issue.7
- Wang, Y.¹ Gales, M.J.F.²

11
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- Seide, F., Li, G., Chen, X., and Yu, D., "Feature Engineering in Context-Dependent Deep Neural Networks for Conversational Speech Transcription, " in the Proceedings of the ASRU, 2011.
- (2011) The Proceedings of the ASRU
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

12
- 79959849500
- Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems
- Li, B. and Sim, K. C., "Comparison of Discriminative Input and Output Transformations for Speaker Adaptation in the Hybrid NN/HMM Systems, " in the Proceedings of Interspeech 2010.
- (2010) The Proceedings of Interspeech
- Li, B.¹ Sim, K.C.²

13
- 84910096010
- Speaker adaptation of context dependent deep neural networks
- Liao, H., "Speaker Adaptation of Context Dependent Deep Neural Networks, " in the Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013.
- (2013) The Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Liao, H.¹

14
- 0030784572
- Stochastic trajectory modeling and sentence searching for continuous speech recognition
- Gong, Y., "Stochastic trajectory modeling and sentence searching for continuous speech recognition, " IEEE Transactions on Speech Audio Processing, vol. 5, no. 1, pp. 33-44, 1997.
- (1997) IEEE Transactions on Speech Audio Processing , vol.5 , Issue.1 , pp. 33-44
- Gong, Y.¹

15
- 0030672089
- Generalized mixture of HMMs for continuous speech recognition
- Korkmazskiy, F., Juang, B. H., and Soong, F. K., "Generalized mixture of HMMs for continuous speech recognition, " in the Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1997.
- (1997) The Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Korkmazskiy, F.¹ Juang, B.H.² Soong, F.K.³

16
- 84890509740
- Modeling heterogeneous data sources for speech recognition using synchronous hidden Markov models
- Zhao, Y. and Juang, B. H., "Modeling Heterogeneous Data Sources for Speech Recognition Using Synchronous Hidden Markov Models, " in the Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013.
- (2013) The Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Zhao, Y.¹ Juang, B.H.²

17
- 84890492030
- An investigation of deep neural networks for noise robust speech recognition
- Seltzer, M., Yu, D., Wang, Y., "An Investigation of Deep Neural Networks for Noise Robust Speech Recognition, " in the Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013.
- (2013) The Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Seltzer, M.¹ Yu, D.² Wang, Y.³

18
- 84890452886
- Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code
- Abdel-Hamid, O. and Jiang, H., "Fast Speaker Adaptation of Hybrid NN/HMM Model for Speech Recognition Based on Discriminative Learning of Speaker Code, " in the Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013.
- (2013) The Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Abdel-Hamid, O.¹ Jiang, H.²

19
- 84893691530
- Speaker adaptation of neural network acoustic models using I-Vectors
- Saon, G., Soltau, H., Nahamoo, D., and Picheny, M., "Speaker Adaptation of Neural Network Acoustic Models Using I-Vectors, " in the Proceedings of the ASRU 2013.
- (2013) The Proceedings of the ASRU
- Saon, G.¹ Soltau, H.² Nahamoo, D.³ Picheny, M.⁴

20
- 84890527497
- Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers
- Huang, J., Li, J., Yu, D., Deng, L., and Gong, Y., "Cross-Language Knowledge Transfer Using Multilingual Deep Neural Network With Shared Hidden Layers, " in the Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013.
- (2013) The Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Huang, J.¹ Li, J.² Yu, D.³ Deng, L.⁴ Gong, Y.⁵

21
- 84910096007
- Multilingual training of deep neural networks
- Ghoshal, A., Swietojanski, P., and Renals, S., "Multilingual training of deep neural networks, " in the Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013.
- (2013) The Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Ghoshal, A.¹ Swietojanski, P.² Renals, S.³

22
- 85083953021
- Feature learning in deep neural networks - Studies on speech recognition tasks
- Yu, D., Seltzer, M., Li, J., Huang, J., and Seide, F., "Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks, " in the Proceedings of 2013 International Conference on Learning Representations, 2013.
- (2013) The Proceedings of 2013 International Conference on Learning Representations
- Yu, D.¹ Seltzer, M.² Li, J.³ Huang, J.⁴ Seide, F.⁵

23
- 51449120120
- Boosted MMI for model and feature-space discriminative training
- Povey, D., Kingsbury, B., Ramabhadran, B., Saon, G., Soltau, H., and Visweswariah, K., "Boosted MMI for model and feature-space discriminative training, " in the Proceedings of the 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2008.
- (2008) The Proceedings of the 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Povey, D.¹ Kingsbury, B.² Ramabhadran, B.³ Saon, G.⁴ Soltau, H.⁵ Visweswariah, K.⁶

24
- 33646788786
- fMPE: Discriminatively trained features for speech recognition
- Povey, D., Kingsbury, B., Mangu, L., Saon, G., Soltau, H., and Zweig, G., "fMPE: Discriminatively Trained Features for Speech Recognition, " in the Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2005.
- (2005) The Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Povey, D.¹ Kingsbury, B.² Mangu, L.³ Saon, G.⁴ Soltau, H.⁵ Zweig, G.⁶

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.