메뉴 건너뛰기




Volumn , Issue , 2014, Pages 845-849

Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks

Author keywords

CD DNN HMM; Channel compensation; Deep learning; Multi task learning; Noise robustness

Indexed keywords

SPEECH COMMUNICATION;

EID: 84910069710     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (22)

References (24)
  • 2
    • 84910037211 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • Seide, F., Li, G., and Yu, D., "Conversational Speech Transcription Using Context-Dependent Deep Neural Networks, " in the Proceedings of Interspeech 2012.
    • (2012) The Proceedings of Interspeech
    • Seide, F.1    Li, G.2    Yu, D.3
  • 3
    • 84878379108 scopus 로고    scopus 로고
    • Scalable minimum Bayes risk training of deep neural network acoustic models using distributed hessian-free optimization
    • Kingsbury, B., Sainath, N. T., and Soltau, H., "Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization, " in the Proceedings of Interspeech 2012.
    • (2012) The Proceedings of Interspeech
    • Kingsbury, B.1    Sainath, N.T.2    Soltau, H.3
  • 4
  • 6
    • 84910027886 scopus 로고    scopus 로고
    • A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden Markov Models
    • submitted to the
    • Huang, Y., Yu, D., Liu, C., and Gong, Y., "A Comparative Analytic Study on the Gaussian Mixture and Context Dependent Deep Neural Network Hidden Markov Models, " submitted to the Interspeech 2014.
    • (2014) Interspeech
    • Huang, Y.1    Yu, D.2    Liu, C.3    Gong, Y.4
  • 7
    • 0034227757 scopus 로고    scopus 로고
    • Cluster adaptive training of hidden Markov models
    • July
    • Gales, M. J. F., "Cluster adaptive training of hidden Markov models, " IEEE Trans. Speech Audio Processing, vol. 8, no. 4, pp. 417-428, July 2000.
    • (2000) IEEE Trans. Speech Audio Processing , vol.8 , Issue.4 , pp. 417-428
    • Gales, M.J.F.1
  • 11
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • Seide, F., Li, G., Chen, X., and Yu, D., "Feature Engineering in Context-Dependent Deep Neural Networks for Conversational Speech Transcription, " in the Proceedings of the ASRU, 2011.
    • (2011) The Proceedings of the ASRU
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 12
    • 79959849500 scopus 로고    scopus 로고
    • Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems
    • Li, B. and Sim, K. C., "Comparison of Discriminative Input and Output Transformations for SpeakerAdaptation in the Hybrid NN/HMM Systems, " in the Proceedings of Interspeech 2010.
    • (2010) The Proceedings of Interspeech
    • Li, B.1    Sim, K.C.2
  • 14
    • 0030784572 scopus 로고    scopus 로고
    • Stochastic trajectory modeling and sentence searching for continuous speech recognition
    • Gong, Y., "Stochastic trajectory modeling and sentence searching for continuous speech recognition, " IEEE Transactions on Speech Audio Processing, vol. 5, no. 1, pp. 3344, 1997.
    • (1997) IEEE Transactions on Speech Audio Processing , vol.5 , Issue.1 , pp. 3344
    • Gong, Y.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.