메뉴 건너뛰기




Volumn , Issue , 2014, Pages 2199-2203

Speaker dependent bottleneck layer training for Speaker adaptation in automatic speech recognition

Author keywords

Automatic speech recognition; Bottleneck features; Deep neural networks; Speaker adaptation

Indexed keywords

LINEAR TRANSFORMATIONS; SPEECH COMMUNICATION; SPEECH PROCESSING;

EID: 84910028538     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (19)

References (28)
  • 4
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational Speech Transcription Using Context-Dependent Deep Neural Networks, " In Inter speech 2011.
    • (2011) Inter Speech
    • Seide, F.1    Li, G.2    Yu, D.3
  • 5
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition, " IEEE Trans. on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 3042, 2012.
    • (2012) IEEE Trans. on Audio, Speech, and Language Processing , vol.20 , Issue.1
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 7
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models
    • C. J. Leggetter and P. C.Woodland, "Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models, " Computer Speech & Language, vol. 9, no. 2, pp. 171 - 185, 1995.
    • (1995) Computer Speech & Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 9
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for hmm-based speech recognition
    • M. J. F. Gales, "Maximum Likelihood Linear Transformations for HMM-Based Speech Recognition, " Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 11
    • 84910030053 scopus 로고
    • Rec norm: Simultaneous normalisation and classification applied to speech recognition
    • J. S. Bridle, and S. Cox, "Rec Norm: Simultaneous Normalisation and Classification Applied to Speech Recognition, " in NIPS, page 234-240, 1990.
    • (1990) NIPS , pp. 234-240
    • Bridle, J.S.1    Cox, S.2
  • 13
    • 79959849500 scopus 로고    scopus 로고
    • Comparison of discriminative input and output transformations for speaker adaptation in the hybrid nn/hmm systems
    • B. Li, and K. C. Sim, "Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems, " in INTER SPEECH, 2010.
    • (2010) Inter Speech
    • Li, B.1    Sim, K.C.2
  • 14
    • 33947703156 scopus 로고    scopus 로고
    • Adaptation of hybrid ann/hmm models using linear hidden transformations and conservative training
    • R. Gemello, F. Mana, S. Scanzio, P. Laface, and R. D. Mori, "Adaptation of hybrid ANN/HMM models using linear hidden transformations and conservative training, " in ICASSP, 2006.
    • (2006) ICASSP
    • Gemello, R.1    Mana, F.2    Scanzio, S.3    Laface, P.4    Mori, R.D.5
  • 15
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription, " in ASRU, 2011.
    • (2011) ASRU
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 16
    • 84890478625 scopus 로고    scopus 로고
    • Adaptation of context-dependent deep neural networks for automatic speech recognition
    • K. Yao, D. Yu F. Seide, H. Su, L. Deng and Y. Gong, "Adaptation of context-dependent deep neural networks for automatic speech recognition, " in IEEE SLT, 2012.
    • (2012) IEEE SLT
    • Yao, K.1    Seide, D.Y.F.2    Su, H.3    Deng, L.4    Gong, Y.5
  • 17
    • 84890509526 scopus 로고    scopus 로고
    • Mlp-based factor analysis for tandem speech recognition
    • M. Ferras and H. Bourlard, "MLP-based factor analysis for tandem speech recognition, " in ICASSP, 2013.
    • (2013) ICASSP
    • Ferras, M.1    Bourlard, H.2
  • 18
    • 84890452886 scopus 로고    scopus 로고
    • Fast speaker adaptation of hybrid nn/hmm model for speech recognition based on discriminative learning of speaker code
    • O. Abdel-Hamid and H. Jiang, "Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code, " in ICASSP, 2013.
    • (2013) ICASSP
    • Abdel-Hamid, O.1    Jiang, H.2
  • 19
    • 84906225505 scopus 로고    scopus 로고
    • Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition
    • O. Abdel-Hamid and H. Jiang, "Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition, " in INTERSPEECH, 2013.
    • (2013) Inter Speech
    • Abdel-Hamid, O.1    Jiang, H.2
  • 20
    • 84893691530 scopus 로고    scopus 로고
    • Speaker adaptation of neural network acoustic models using i-vectors
    • G. Soan, H. Soltau, D. Nahamoo and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors, " in ASRU 2013.
    • (2013) ASRU
    • Soan, G.1    Soltau, H.2    Nahamoo, D.3    Picheny, M.4
  • 21
    • 84905269643 scopus 로고    scopus 로고
    • Using neural network front-ends on far field multiple microphones based speech recognition
    • to apper
    • Y. Liu, P. Zhang and T. Hain, "Using neural network front-ends on far field multiple microphones based speech recognition, " to apper ICASSP 2014.
    • (2014) ICASSP
    • Liu, Y.1    Zhang, P.2    Hain, T.3
  • 26
    • 0032289099 scopus 로고    scopus 로고
    • Heteroscedastic discriminant analysis and reduced rank hmms for improved speech recognition
    • N. Kumar and A. G. Andreou, "Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition, " Speech Communication, vol. 26, no. 4, pp. 283 297, 1998.
    • (1998) Speech Communication , vol.26 , Issue.4 , pp. 283-297
    • Kumar, N.1    Andreou, A.G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.