SCOPUS Information Search Platform

Volume , Issue , 2014, Pages 1895-1899

A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden Markov models

Author keywords

CD DNN HMM; Channel compensation; GMM HMM; Noise robustness; Speaking rate normalization

Indexed keywords

IMAGE CODING; SIGNAL TO NOISE RATIO; SPEECH; SPEECH COMMUNICATION; SPEECH ENHANCEMENT; SPEECH RECOGNITION; TELEPHONE SETS; TRELLIS CODES;

CD-DNN-HMM; CHANNEL COMPENSATION; GMM-HMM; NOISE ROBUSTNESS; SPEAKING RATE;

HIDDEN MARKOV MODELS;

EID: 84910027886 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (35)

References (13)

1
- 80051616844
- Large vocabulary continuous speech recognition with context-dependent DBN-HMMs
- Dahl, G.E., Yu, D., Deng, L., and Acero, A., "Large Vocabulary Continuous Speech Recognition With Context-Dependent DBN-HMMs", in the Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2011.
- (2011) The Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

2
- 84910037211
- Conversational speech transcription using context-dependent deep neural networks
- Seide, F., Li, G., and Yu, D., "Conversational Speech Transcription Using Context-Dependent Deep Neural Networks", in the Proceedings of Interspeech 2012.
- (2012) The Proceedings of Interspeech
- Seide, F.¹ Li, G.² Yu, D.³

3
- 84878379108
- Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
- Kingsbury, B., Sainath, T. N., and Soltau, H., "Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization", in the Proceedings of Interspeech 2012.
- (2012) The Proceedings of Interspeech
- Kingsbury, B.¹ Sainath, T.N.² Soltau, H.³

4
- 84878539964
- Application of pretrained deep neural networks to large vocabulary speech recognition
- Jaitly, N., Nguyen, P., Senior, A., and Vanhoucke, V., "Application of Pretrained Deep Neural Networks to Large Vocabulary Speech Recognition", in the Proceedings of Interspeech 2012.
- (2012) The Proceedings of Interspeech
- Jaitly, N.¹ Nguyen, P.² Senior, A.³ Vanhoucke, V.⁴

5
- 84890491198
- Recent advances in deep learning for speech research at Microsoft
- Deng, L., Li, J., Huang, J., Yao, K., Yu, D., Seide, F., Seltzer, M., Zweig, G., He, X., Williams, J., Gong, Y., and Acero, A., "Recent Advances in Deep Learning for Speech Research at Microsoft", in the Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013.
- (2013) The Proceedings of the 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Deng, L.¹ Li, J.² Huang, J.³ Yao, K.⁴ Yu, D.⁵ Seide, F.⁶ Seltzer, M.⁷ Zweig, G.⁸ He, X.⁹ Williams, J.¹⁰ Gong, Y.¹¹ Acero, A.¹²

7
- 85083953021
- Feature learning in deep neural networks - studies on speech recognition tasks
- Yu, D., Seltzer, M., Li, J., Huang, J., and Seide, F., "Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks", in the Proceedings of the 2013 International Conference on Learning Representations, 2013.
- (2013) The Proceedings of the 2013 International Conference on Learning Representations
- Yu, D.¹ Seltzer, M.² Li, J.³ Huang, J.⁴ Seide, F.⁵

8
- 84910031892
- Three classes of deep learning architectures and their applications: A tutorial survey
- Deng, L., "Three Classes of Deep Learning Architectures and Their Applications: A Tutorial Survey", APSIPA Transactions on Signal and Information Processing, 2013.
- (2013) APSIPA Transactions on Signal and Information Processing
- Deng, L.¹

9
- 51449120120
- Boosted MMI for model and feature-space discriminative training
- Povey, D., Kingsbury, B., Ramabhadran, B., Saon, G., Soltau, H., and Visweswariah, K., "Boosted MMI for Model and Feature-space Discriminative Training", in the Proceedings of the 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2008.
- (2008) The Proceedings of the 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Soltau, H.¹ Povey, D.² Kingsbury, B.³ Ramabhadran, B.⁴ Saon, G.⁵ Visweswariah, K.⁶

10
- 33646788786
- fMPE: Discriminatively trained features for speech recognition
- Povey, D., Kingsbury, B., Mangu, L., Saon, G., Soltau, H., and Zweig, G., "fMPE: Discriminatively Trained Features for Speech Recognition", in the Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2005.
- (2005) The Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Povey, D.¹ Kingsbury, B.² Mangu, L.³ Saon, G.⁴ Soltau, H.⁵ Zweig, G.⁶

11
- 78049388975
- Speaking rate adaptation using continuous frame rate normalization
- Chu, S. and Povey, D., "Speaking Rate Adaptation Using Continuous Frame Rate Normalization", in the Proceedings of the 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010.
- (2010) The Proceedings of the 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Chu, S.¹ Povey, D.²

12
- 84906262717
- Speaking rate normalization with lattice-based context-dependent phoneme duration modeling for personalized speech recognizers on mobile devices
- Yeh, C., Lee, H., and Leem, L., "Speaking Rate Normalization with Lattice-based Context-dependent Phoneme Duration Modeling for Personalized Speech Recognizers on Mobile Devices", in the Proceedings of Interspeech 2013.
- (2013) The Proceedings of Interspeech
- Yeh, C.¹ Lee, H.² Leem, L.³

13
- 0033690878
- On the use of variable frame rate analysis in speech recognition
- Zhu, Q. and Alwan, A., "On the Use of Variable Frame Rate Analysis in Speech Recognition", in the Proceedings of the 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2000.
- (2000) The Proceedings of the 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Zhu, Q.¹ Alwan, A.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.