2013, Pages 7893-7897

KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition

Author keywords

CD DNN HMM; deep neural network; Kullback Leibler divergence regularization; speaker adaptation

Indexed keywords

ADAPTATION TECHNIQUES; CD-DNN-HMM; DEEP NEURAL NETWORKS; KULLBACK LEIBLER DIVERGENCE; LARGE VOCABULARY SPEECH RECOGNITION; SPEAKER ADAPTATION; SPEECH TRANSCRIPTIONS; UNSUPERVISED ADAPTATION;

EID: 84890542079     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6639201     Document Type: Conference Paper
Times cited : (464)

References (30)
  • 1
    • 84055222005
    • Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) IEEE Trans. on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 3
    • 84865801985
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech'11, pp. 437-440, 2011.
    • (2011) Proc. Interspeech'11 , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 4
    • 44049108531
    • Automated directory assistance system-From theory to practice
    • D. Yu, Y.-C. Ju, Y.-Y. Wang, G. Zweig, and A. Acero, "Automated Directory Assistance System-from Theory to Practice", in Proc. Interspeech'07, pp. 2709-2712, 2007.
    • (2007) Proc. Interspeech'07 , pp. 2709-2712
    • Yu, D.1    Ju, Y.-C.2    Wang, Y.-Y.3    Zweig, G.4    Acero, A.5
  • 6
    • 84878539964
    • Application of pretrained deep neural networks to large vocabulary speech recognition
    • N. Jaitly, P. Nguyen, and V. Vanhoucke, "Application of pretrained deep neural networks to large vocabulary speech recognition", in Proc. Interspeech'12, 2012.
    • (2012) Proc. Interspeech'12
    • Jaitly, N.1    Nguyen, P.2    Vanhoucke, V.3
  • 8
    • 84878379108
    • Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
    • B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in Proc. Interspeech'12, 2012.
    • (2012) Proc. Interspeech'12
    • Kingsbury, B.1    Sainath, T.N.2    Soltau, H.3
  • 9
    • 84890543852
    • Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription
    • Hang Su, Gang Li, Dong Yu, Frank Seide, "Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription", in Proc. ICASSP 2013.
    • (2013) Proc. ICASSP
    • Su, H.1    Li, G.2    Yu, D.3    Seide, F.4
  • 10
    • 84890492030
    • An investigation of deep neural networks for noise robust speech recognition
    • Michael Seltzer, Dong Yu, Yongqiang Wang, "An investigation of deep neural networks for noise robust speech recognition", in Proc. ICASSP 2013.
    • (2013) Proc. ICASSP
    • Seltzer, M.1    Yu, D.2    Wang, Y.3
  • 12
    • 84937880519
    • Connectionist speaker normalization and adaptation
    • V. Abrash, H. Franco, A. Sankar, and M. Cohen, "Connectionist speaker normalization and adaptation," in Proc. EUROSPEECH'95, pp. 2183-2186, 1995.
    • (1995) Proc. EUROSPEECH'95 , pp. 2183-2186
    • Abrash, V.1    Franco, H.2    Sankar, A.3    Cohen, M.4
  • 14
    • 0343476363
    • Hybrid HMM-NN modeling of stationary-transitional units for continuous speech recognition
    • D. Albesano, R. Gemello, and F. Mana, "Hybrid HMM-NN modeling of stationary-transitional units for continuous speech recognition", in Proc. NIPS'97, pp. 1112-1115, 1997.
    • (1997) Proc. NIPS'97 , pp. 1112-1115
    • Albesano, D.1    Gemello, R.2    Mana, F.3
  • 15
    • 79959849500
    • Comparison of discriminative input and output transformations for speaker adaptation in the Hybrid NN/HMM systems
    • B. Li and K. C. Sim, "Comparison of discriminative input and output transformations for speaker adaptation in the Hybrid NN/HMM systems", in Proc. Interspeech'10, pp. 526-529, 2010.
    • (2010) Proc. Interspeech'10 , pp. 526-529
    • Li, B.1    Sim, K.C.2
  • 16
    • 34548012893
    • Linear hidden transformations for adaptation of hybrid ANN/HMM models
    • R. Gemello, F. Mana, S. Scanzio, P. Laface, and R. De Mori, "Linear hidden transformations for adaptation of hybrid ANN/HMM models", Speech Communication 49, no. 10, pp. 827-83, 2007.
    • (2007) Speech Communication , vol.49 , Issue.10 , pp. 827-883
    • Gemello, R.1    Mana, F.2    Scanzio, S.3    Laface, P.4    De Mori, R.5
  • 17
    • 84865740155
    • Improving LVCSR system combination using neural network language model cross adaptation
    • X. Liu, M. J. F. Gales, and P. C. Woodland, "Improving LVCSR system combination using neural network language model cross adaptation," in Proc. Interspeech'11, pp. 2857-2860, 2011.
    • (2011) Proc. Interspeech'11 , pp. 2857-2860
    • Liu, X.1    Gales, M.J.F.2    Woodland, P.C.3
  • 18
    • 84878535870
    • A initial attempt on task-specific adaptation for deep neural network-based large vocabulary continuous speech recognition
    • Y. Xiao, Z. Zhang, S. Cai, J. Pan, and Y. Yan, "A initial attempt on task-specific adaptation for deep neural network-based large vocabulary continuous speech recognition", in Proc. Interspeech'12, 2012.
    • (2012) Proc. Interspeech'12
    • Xiao, Y.1    Zhang, Z.2    Cai, S.3    Pan, J.4    Yan, Y.5
  • 20
    • 84874226579
    • Adaptation of context-dependent deep neural networks for automatic speech recognition
    • K. Yao, D. Yu, F. Seide, H. Su, L. Deng, and Y. Gong, "Adaptation of context-dependent deep neural networks for automatic speech recognition", in Proc. SLT'12, 2012.
    • (2012) Proc. SLT'12
    • Yao, K.1    Yu, D.2    Seide, F.3    Su, H.4    Deng, L.5    Gong, Y.6
  • 21
    • 33646794050
    • Two-stage speaker adaptation of hybrid tied-posterior acoustic models
    • J. Stadermann and G. Rigoll, "Two-stage speaker adaptation of hybrid tied-posterior acoustic models," in Proc. ICASSP'05, vol. I, pp. 997-1000, 2005.
    • (2005) Proc. ICASSP'05 , vol.1 , pp. 997-1000
    • Stadermann, J.1    Rigoll, G.2
  • 23
    • 33947635130
    • Regularized adaptation of discriminative classifiers
    • X. Li and J. Bilmes, "Regularized adaptation of discriminative classifiers," in Proc. ICASSP'06, 2006.
    • (2006) Proc. ICASSP'06
    • Li, X.1    Bilmes, J.2
  • 24
    • 0033677005
    • Fast speaker adaptation of artificial neural networks for automatic speech recognition
    • S. Dupont and L. Cheboub, "Fast speaker adaptation of artificial neural networks for automatic speech recognition", in Proc. ICASSP'00, vol.3, pp. 1795-1798, 2000.
    • (2000) Proc. ICASSP'00 , vol.3 , pp. 1795-1798
    • Dupont, S.1    Cheboub, L.2
  • 25
    • 84858976070
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU'11, 2011.
    • (2011) Proc. ASRU'11
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 29
    • 68549140008
    • A novel framework and training algorithm for variable-parameter hidden Markov models
    • D. Yu, L. Deng, Y. Gong, and A. Acero, "A novel framework and training algorithm for variable-parameter hidden Markov models", IEEE Trans. on Audio, Speech, and Language Processing, vol. 17, no. 7, pp. 1348-1360, 2009.
    • (2009) IEEE Trans. on Audio, Speech, and Language Processing , vol.17 , Issue.7 , pp. 1348-1360
    • Yu, D.1    Deng, L.2    Gong, Y.3    Acero, A.4
  • 30
    • 79959853780
    • On speaker adaptive training of artificial neural networks
    • J. Trmal, J. Zelinka, and L. Müller, "On speaker adaptive training of artificial neural networks", in Proc. Interspeech'10, pp. 554-557, 2010.
    • (2010) Proc. Interspeech'10 , pp. 554-557
    • Trmal, J.1    Zelinka, J.2    Müller, L.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.