SCOPUS Information Retrieval Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

2013, Pages 7304-7308

Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers

Authors (5): Huang, Jui Ting (a); Li, Jinyu (a); Yu, Dong (b); Deng, Li (b); Gong, Yifan (a)

a MICROSOFT (United States)

b MICROSOFT RESEARCH (United States)

Author keywords

CD-DNN-HMM; deep neural network; multilingual speech recognition; multitask learning; transfer learning

Indexed keywords

CD-DNN-HMM; DEEP NEURAL NETWORKS; MULTILINGUAL SPEECH RECOGNITION; MULTITASK LEARNING; TRANSFER LEARNING; KNOWLEDGE MANAGEMENT; LEARNING ALGORITHMS; LINEAR TRANSFORMATIONS; MATHEMATICAL TRANSFORMATIONS; SIGNAL PROCESSING; NEURAL NETWORKS

EID: 84890527497
PISSN: 15206149
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2013.6639081
Document Type: Conference Paper

Times cited: 655

References (25)

1
- 84055222005
- Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
- G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. Speech and Audio Proc., vol. 20, no. 1, pp. 30-42, 2012
- (2012) IEEE Trans. Speech and Audio Proc. , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.¹ Yu, D.² Deng, L.³ Acero, A.⁴

2
- 84055163920
- Roles of pretraining and finetuning in context-dependent DNN-HMMs for real-world speech recognition
- D. Yu, L. Deng, and G. Dahl, "Roles of pretraining and finetuning in context-dependent DNN-HMMs for real-world speech recognition," in Proc. NIPS Workshop on Deep Learning and Unsupervised Feature Learning, Dec. 2010.
- (2010) Proc. NIPS Workshop on Deep Learning and Unsupervised Feature Learning
- Yu, D.¹ Deng, L.² Dahl, G.³

3
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech, pp. 437-440, 2011.
- (2011) Proc. Interspeech , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

4
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- F. Seide, G. Li, X. Chen, D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU, pp. 24-29, 2011.
- (2011) Proc. ASRU , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

5
- 84055211743
- Acoustic modeling using deep belief networks
- A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Trans. on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 14-22, 2012.
- (2012) IEEE Trans. on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

6
- 84878539964
- Application of pretrained deep neural networks to large vocabulary speech recognition
- N. Jaitly, P. Nguyen, and V. Vanhoucke, "Application of pretrained deep neural networks to large vocabulary speech recognition," in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Jaitly, N.¹ Nguyen, P.² Vanhoucke, V.³

7
- 84858972572
- Making deep belief networks effective for large vocabulary continuous speech recognition
- T. N. Sainath, B. Kingsbury, B. Ramabhadran, P. Fousek, P. Novak, and A.-r. Mohamed, "Making deep belief networks effective for large vocabulary continuous speech recognition," in Proc. ASRU, pp. 30-35, 2011.
- (2011) Proc. ASRU , pp. 30-35
- Sainath, T.N.¹ Kingsbury, B.² Ramabhadran, B.³ Fousek, P.⁴ Novak, P.⁵ Mohamed, A.-R.⁶

8
- 84878379108
- Scalable minimum bayes risk training of deep neural network acoustic models using distributed hessian-free optimization
- B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum bayes risk training of deep neural network acoustic models using distributed hessian-free optimization," in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Kingsbury, B.¹ Sainath, T.N.² Soltau, H.³

9
- 84890543852
- Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription
- H. Su, G. Li, D. Yu, F. Seide, "Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription", in Proc. ICASSP 2013.
- (2013) Proc. ICASSP
- Su, H.¹ Li, G.² Yu, D.³ Seide, F.⁴

10
- 84890492030
- An investigation of deep neural networks for noise robust speech recognition
- M. Seltzer, D. Yu, Y. Wang, "An investigation of deep neural networks for noise robust speech recognition", in Proc. ICASSP 2013.
- (2013) Proc. ICASSP
- Seltzer, M.¹ Yu, D.² Wang, Y.³

11
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition
- G. Hinton, L. Deng, D. Yu, G. Dahl, A.-R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Processing Magazine, 2012.
- (2012) IEEE Signal Processing Magazine
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.-R.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

12
- 0031189914
- Multitask learning
- R. Caruana, "Multitask Learning," Machine Learning, Vol. 28, pp. 41-75, Kluwer Academic Publishers, 1997
- (1997) Machine Learning , vol.28 , pp. 41-75
- Caruana, R.¹

13
- 84867135575
- Building high-level features using large scale unsupervised learning
- Q. Le, M. Ranzato, R. Monga, M. Devin, K. Chen, G. Corrado, J. Dean, A. Ng, "Building high-level features using large scale unsupervised learning," International Conference in Machine Learning, 2012
- (2012) International Conference in Machine Learning
- Le, Q.¹ Ranzato, M.² Monga, R.³ Devin, M.⁴ Chen, K.⁵ Corrado, G.⁶ Dean, J.⁷ Ng, A.⁸

14
- 84874278045
- Unsupervised crosslingual knowledge transfer in DNN-based LVCSR
- P. Swietojanski, A. Ghoshal, S. Renals, "Unsupervised crosslingual knowledge transfer in DNN-based LVCSR," in Proc. SLT 2012.
- (2012) Proc. SLT
- Swietojanski, P.¹ Ghoshal, A.² Renals, S.³

15
- 0035426931
- Language independent and language adaptive acoustic modeling for speech recognition
- T. Schultz and A. Waibel, "Language independent and language adaptive acoustic modeling for speech recognition," in Speech Communication, August 2001, Volume 35, Issue 1-2, pp. 31-51
- (2001) Speech Communication, August , vol.35 , Issue.1-2 , pp. 31-51
- Schultz, T.¹ Waibel, A.²

16
- 70349220094
- A study on multilingual acoustic modeling for large vocabulary ASR
- H. Lin, L. Deng, D. Yu, Y. Gong, A. Acero, and C-H Lee, "A study on multilingual acoustic modeling for large vocabulary ASR," in Proc. ICASSP, pp. 4333-4336, 2009
- (2009) Proc. ICASSP , pp. 4333-4336
- Lin, H.¹ Deng, L.² Yu, D.³ Gong, Y.⁴ Acero, A.⁵ Lee, C.-H.⁶

17
- 34250014992
- Language-dependent state clustering for multilingual acoustic modeling
- T. Niesler, "Language-dependent state clustering for multilingual acoustic modeling," Speech Communication, vol. 49, 2007
- (2007) Speech Communication , vol.49
- Niesler, T.¹

18
- 70349197671
- Crosslingual speech recognition under runtime resource constraints
- D. Yu, L. Deng, P. Liu, J. Wu, Y. Gong, A. Acero, "Crosslingual speech recognition under runtime resource constraints," in Proc. ICASSP, pp. 4193-4196, 2009
- (2009) Proc. ICASSP , pp. 4193-4196
- Yu, D.¹ Deng, L.² Liu, P.³ Wu, J.⁴ Gong, Y.⁵ Acero, A.⁶

19
- 78049394188
- Multilingual acoustic modeling for speech recognition based on subspace gaussian mixture models
- L. Burget et al, "Multilingual Acoustic Modeling for Speech Recognition Based on Subspace Gaussian Mixture Models," in Proc. ICASSP, Dallas, 2010
- (2010) Proc. ICASSP
- Burget, L.¹

20
- 33947619591
- Cross-domain and cross-lingual portability of acoustic features estimated by multilayer perceptrons
- A. Stolcke, F. Grézl, M-Y Hwang, X. Lei, N. Morgan, D. Vergyri, "Cross-domain and cross-lingual portability of acoustic features estimated by multilayer perceptrons," in Proc. ICASSP, 2006
- (2006) Proc. ICASSP
- Stolcke, A.¹ Grézl, F.² Hwang, M.-Y.³ Lei, X.⁴ Morgan, N.⁵ Vergyri, D.⁶

21
- 79959819891
- Cross-lingual and multi-stream posterior features for low resource lvcsr systems
- S. Thomas, S. Ganapathy and H. Hermansky, "Cross-lingual and Multi-stream Posterior Features for Low Resource LVCSR Systems," in Proc. Interspeech, 2010
- (2010) Proc. Interspeech
- Thomas, S.¹ Ganapathy, S.² Hermansky, H.³

22
- 84858976609
- Cross-lingual portability of chinese and english neural network features for french and german lvcsr
- C. Plahl, R. Schlueter and H. Ney, "Cross-lingual portability of Chinese and English neural network features for French and German LVCSR," in Proc. ASRU, USA, 2011
- (2011) Proc. ASRU
- Plahl, C.¹ Schlueter, R.² Ney, H.³

23
- 84878559540
- An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance
- N. Vu, W. Breiter, F. Metze, T. Schultz, "An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance," in Proc. Interspeech, 2012
- (2012) Proc. Interspeech
- Vu, N.¹ Breiter, W.² Metze, F.³ Schultz, T.⁴

24
- 84867606552
- Multilingual MLP features for low-resource LVCSR systems
- S. Thomas, S. Ganapathy and H. Hermansky, "Multilingual MLP features for low-resource LVCSR systems," in Proc. ICASSP, 2012
- (2012) Proc. ICASSP
- Thomas, S.¹ Ganapathy, S.² Hermansky, H.³

25
- 56449095373
- A unified architecture for natural language processing: Deep neural networks with multitask learning
- R. Collobert and J. Weston, "A unified architecture for natural language processing: deep neural networks with multitask learning," in International Conference in Machine Learning, 2008.
- (2008) International Conference in Machine Learning
- Collobert, R.¹ Weston, J.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.