메뉴 건너뛰기




Volumn , Issue , 2014, Pages 1895-1899

A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden markov models

Author keywords

CD DNN HMM; Channel compensation; GMM HMM; Noise robustness; Speaking rate normalization

Indexed keywords

IMAGE CODING; SIGNAL TO NOISE RATIO; SPEECH; SPEECH COMMUNICATION; SPEECH ENHANCEMENT; SPEECH RECOGNITION; TELEPHONE SETS; TRELLIS CODES;

EID: 84910027886     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (35)

References (13)
  • 2
    • 84910037211 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • Seide, F., Li, G., and Yu, D., "Conversational Speech Transcription Using Context-Dependent Deep Neural Networks", in the Proceedings of Interspeech 2012.
    • (2012) The Proceedings of Interspeech
    • Seide, F.1    Li, G.2    Yu, D.3
  • 3
    • 84878379108 scopus 로고    scopus 로고
    • Scalable minimum bayes risk training of deep neural network acoustic models using distributed hessian-free optimization
    • Kingsbury, B., Sainath, N. T., and Soltau, H., "Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization", in the Proceedings of Interspeech 2012.
    • (2012) The Proceedings of Interspeech
    • Kingsbury, B.1    Sainath, N.T.2    Soltau, H.3
  • 8
    • 84910031892 scopus 로고    scopus 로고
    • Three classes of deep learning architectures and their applications: A tutorial survey
    • Li, D., "Three Classes of Deep Learning Architectures and Their Applications: A Tutorial Survey", APSIPA Transactions on Signal and Information Processing, 2013.
    • (2013) APSIPA Transactions on Signal and Information Processing
    • Li, D.1
  • 12
    • 84906262717 scopus 로고    scopus 로고
    • Speaking rate normalization with lattice-based context-dependent phoneme duration modeling for personalized speech recognizers on mobile devices
    • Yeh, C., Lee, H., and Leem, L, "Speaking Rate Normalization with Lattice-based Context-dependent Phoneme Duration Modeling for Personalized Speech Recognizers on Mobile Devices", in the Proceedings of Interspeech 2013.
    • (2013) The Proceedings of Interspeech
    • Yeh, C.1    Lee, H.2    Leem, L.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.