메뉴 건너뛰기




Volumn , Issue , 2013, Pages 344-349

DNN acoustic modeling with modular multi-lingual feature extraction networks

Author keywords

Deep Neural Networks; Large Vocabulary Speech Recognition; Low Resource Acoustic Modeling; Multi Lingual Acoustic Modeling

Indexed keywords

ACOUSTIC FEATURES; ACOUSTIC MODEL; ACOUSTIC MODEL TRAININGS; CONVERSATIONAL TELEPHONE SPEECH; DEEP NEURAL NETWORKS; FEATURE EXTRACTOR; HIGH-LEVEL FEATURES; MULTIPLE LANGUAGES;

EID: 84893642465     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2013.6707754     Document Type: Conference Paper
Times cited : (10)

References (28)
  • 2
    • 84951490428 scopus 로고
    • Review of neural networks for speech recognition
    • R.P. Lippmann, "Review of neural networks for speech recognition, " Neural computation, vol. 1, no. 1, pp. 1-38, 1989.
    • (1989) Neural Computation , vol.1 , Issue.1 , pp. 1-38
    • Lippmann, R.P.1
  • 5
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • G.E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 6
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks, " in Proc. Interspeech, 2011, pp. 437-440.
    • (2011) Proc. Interspeech , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 10
    • 27144439262 scopus 로고    scopus 로고
    • Dataderived nonlinear mapping for feature extraction in hmm
    • Citeseer
    • Hynek Hermansky, Sangita Sharma, and Pratibha Jain, "Dataderived nonlinear mapping for feature extraction in hmm, " in Proc. ASRU. Citeseer, 1999, vol. 99.
    • (1999) Proc. ASRU , vol.99
    • Hermansky, H.1    Sharma, S.2    Jain, P.3
  • 11
    • 70450217311 scopus 로고    scopus 로고
    • Hierarchical processing of the modulation spectrum for gale mandarin lvcsr system
    • F. Valente, M. Magimai-Doss, C. Plahl, and S.V. Ravuri, "Hierarchical processing of the modulation spectrum for GALE Mandarin LVCSR system., " in Proc. Interspeech, 2009, pp. 2963-2966.
    • (2009) Proc. Interspeech , pp. 2963-2966
    • Valente, F.1    Magimai-Doss, M.2    Plahl, C.3    Ravuri, S.V.4
  • 17
    • 79551480483 scopus 로고    scopus 로고
    • Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
    • P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.A. Manzagol, "Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion, " The Journal of Machine Learning Research, vol. 11, pp. 3371- 3408, 2010.
    • (2010) The Journal of Machine Learning Research , vol.11 , pp. 3371-3408
    • Vincent, P.1    Larochelle, H.2    Lajoie, I.3    Bengio, Y.4    Manzagol, P.A.5
  • 18
    • 84878559540 scopus 로고    scopus 로고
    • An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on asr performance
    • N.T. Vu, W. Breiter, F. Metze, and T. Schultz, "An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance, " in Proc. Interspeech, 2012.
    • (2012) Proc. Interspeech
    • Vu, N.T.1    Breiter, W.2    Metze, F.3    Schultz, T.4
  • 28
    • 84874282188 scopus 로고    scopus 로고
    • Improving wideband speech recognition using mixed-bandwidth training data in cd-dnn-hmm
    • 2012 IEEE. IEEE
    • Jinyu Li, Dong Yu, Jui-Ting Huang, and Yifan Gong, "Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM, " in Spoken Language Technology Workshop (SLT), 2012 IEEE. IEEE, 2012, pp. 131-136.
    • (2012) Spoken Language Technology Workshop (SLT) , pp. 131-136
    • Li, J.1    Yu, D.2    Huang, J.-T.3    Gong, Y.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.