메뉴 건너뛰기




Volumn , Issue , 2013, Pages 7304-7308

Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers

Author keywords

CD DNN HMM; deep neural network; multilingual speech recognition; multitask learning; transfer learning

Indexed keywords

CD-DNN-HMM; DEEP NEURAL NETWORKS; MULTILINGUAL SPEECH RECOGNITION; MULTITASK LEARNING; TRANSFER LEARNING;

EID: 84890527497     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6639081     Document Type: Conference Paper
Times cited : (655)

References (25)
  • 1
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
    • G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. Speech and Audio Proc., vol. 20, no. 1, pp. 30-42, 2012
    • (2012) IEEE Trans. Speech and Audio Proc. , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 3
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech, pp. 437-440, 2011.
    • (2011) Proc. Interspeech , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 4
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F. Seide, G. Li, X. Chen, D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU, pp. 24-29, 2011.
    • (2011) Proc. ASRU , pp. 24-29
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 6
    • 84878539964 scopus 로고    scopus 로고
    • Application of pretrained deep neural networks to large vocabulary speech recognition
    • N. Jaitly, P. Nguyen, and V. Vanhoucke, "application of pretrained deep neural networks to large vocabulary speech recognition," in Proc. Interspeech, 2012.
    • (2012) Proc. Interspeech
    • Jaitly, N.1    Nguyen, P.2    Vanhoucke, V.3
  • 7
  • 8
    • 84878379108 scopus 로고    scopus 로고
    • Scalable minimum bayes risk training of deep neural network acoustic models using distributed hessian-free optimization
    • B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum bayes risk training of deep neural network acoustic models using distributed hessian-free optimization," in Proc. Interspeech, 2012.
    • (2012) Proc. Interspeech
    • Kingsbury, B.1    Sainath, T.N.2    Soltau, H.3
  • 9
    • 84890543852 scopus 로고    scopus 로고
    • Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription
    • H. Su, G. Li, D. Yu, F. Seide, "Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription", in Proc. ICASSP 2013.
    • (2013) Proc. ICASSP
    • Su, H.1    Li, G.2    Yu, D.3    Seide, F.4
  • 10
    • 84890492030 scopus 로고    scopus 로고
    • An investigation of deep neural networks for noise robust speech recognition
    • M. Seltzer, D. Yu, Y. Wang, "An investigation of deep neural networks for noise robust speech recognition", in Proc. ICASSP 2013.
    • (2013) Proc. ICASSP
    • Seltzer, M.1    Yu, D.2    Wang, Y.3
  • 12
    • 0031189914 scopus 로고    scopus 로고
    • Multitask learning
    • Kluwer Academic Publishers
    • R. Caruana, "Multitask Learning," Machine Learning, Vol. 28, pp. 41-75, Kluwer Academic Publishers, 1997
    • (1997) Machine Learning , vol.28 , pp. 41-75
    • Caruana, R.1
  • 14
    • 84874278045 scopus 로고    scopus 로고
    • Unsupervised crosslingual knowledge transfer in DNN-based LVCSR
    • P. Swietojanski, A. Ghoshal, S. Renals, "Unsupervised crosslingual knowledge transfer in DNN-based LVCSR," in Proc. SLT 2012.
    • (2012) Proc. SLT
    • Swietojanski, P.1    Ghoshal, A.2    Renals, S.3
  • 15
    • 0035426931 scopus 로고    scopus 로고
    • Language independent and language adaptive acoustic modeling for speech recognition
    • T. Schultz and A. Waibel, "Language independent and language adaptive acoustic modeling for speech recognition," in Speech Communication, August 2001, Volume 35, Issue 1-2, pp. 31-51
    • (2001) Speech Communication, August , vol.35 , Issue.1-2 , pp. 31-51
    • Schultz, T.1    Waibel, A.2
  • 16
    • 70349220094 scopus 로고    scopus 로고
    • A study on multilingual acoustic modeling for large vocabulary ASR
    • H. Lin, L. Deng, D. Yu, Y. Gong, A. Acero, and C-H Lee, "A study on multilingual acoustic modeling for large vocabulary ASR," in Proc. ICASSP, pp. 4333-4336, 2009
    • (2009) Proc. ICASSP , pp. 4333-4336
    • Lin, H.1    Deng, L.2    Yu, D.3    Gong, Y.4    Acero, A.5    Lee, C.-H.6
  • 17
    • 34250014992 scopus 로고    scopus 로고
    • Language-dependent state clustering for multilingual acoustic modeling
    • T. Niesler, "Language-dependent state clustering for multilingual acoustic modeling," Speech Communication, vol. 49, 2007
    • (2007) Speech Communication , vol.49
    • Niesler, T.1
  • 18
    • 70349197671 scopus 로고    scopus 로고
    • Crosslingual speech recognition under runtime resource constraints
    • D. Yu, L. Deng, P. Liu, J. Wu, Y. Gong, A. Acero, "crosslingual speech recognition under runtime resource constraints," in Proc. ICASSP, pp. 4193-4196, 2009
    • (2009) Proc. ICASSP , pp. 4193-4196
    • Yu, D.1    Deng, L.2    Liu, P.3    Wu, J.4    Gong, Y.5    Acero, A.6
  • 19
    • 78049394188 scopus 로고    scopus 로고
    • Multilingual acoustic modeling for speech recognition based on subspace gaussian mixture models
    • Dallas
    • L. Burget et al, "Multilingual Acoustic Modeling for Speech Recognition Based on Subspace Gaussian Mixture Models," in Proc. ICASSP, Dallas, 2010
    • (2010) Proc. ICASSP
    • Burget, L.1
  • 20
    • 33947619591 scopus 로고    scopus 로고
    • Cross-domain and cross-lingual portability of acoustic features estimated by multilayer perceptrons
    • A. Stolcke, F. Grzl, M-Y Hwang, X. Lei, N. Morgan, D. Vergyri, "Cross-domain and cross-lingual portability of acoustic features estimated by multilayer perceptrons," in Proc. ICASSP, 2006
    • (2006) Proc. ICASSP
    • Stolcke, A.1    Grzl, F.2    Hwang, M.-Y.3    Lei, X.4    Morgan, N.5    Vergyri, D.6
  • 21
    • 79959819891 scopus 로고    scopus 로고
    • Cross-lingual and multi-stream posterior features for low resource lvcsr systems
    • S. Thomas, S. Ganapathy and H. Hermansky, "Cross-lingual and Multi-stream Posterior Features for Low Resource LVCSR Systems," in Proc. Interspeech, 2010
    • (2010) Proc. Interspeech
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 22
    • 84858976609 scopus 로고    scopus 로고
    • Cross-lingual portability of chinese and english neural network features for french and german lvcsr
    • USA
    • C. Plahl, R. Schlueter and H. Ney, "Cross-lingual portability of Chinese and English neural network features for French and German LVCSR," in Proc. ASRU, USA, 2011
    • (2011) Proc. ASRU
    • Plahl, C.1    Schlueter, R.2    Ney, H.3
  • 23
    • 84878559540 scopus 로고    scopus 로고
    • An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance
    • N. Vu, W. Breiter, F. Metze, T. Schultz, "An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance," in Proc. Interspeech, 2012
    • (2012) Proc. Interspeech
    • Vu, N.1    Breiter, W.2    Metze, F.3    Schultz, T.4
  • 24
    • 84867606552 scopus 로고    scopus 로고
    • Multilingual MLP features for low-resource LVCSR systems
    • S. Thomas, S. Ganapathy and H. Hermansky, "Multilingual MLP features for low-resource LVCSR systems," in Proc. ICASSP, 2012
    • (2012) Proc. ICASSP
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 25
    • 56449095373 scopus 로고    scopus 로고
    • A unified architecture for natural language processing: Deep neural networks with multitask learning
    • R. Collobert and J. Weston, "A unified architecture for natural language processing: deep neural networks with multitask learning," in International Conference in Machine Learning, 2008.
    • (2008) International Conference in Machine Learning
    • Collobert, R.1    Weston, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.