메뉴 건너뛰기




Volumn , Issue , 2014, Pages 6339-6343

Direct adaptation of hybrid DNN/HMM model for fast speaker adaptation in LVCSR based on speaker code

Author keywords

Deep Neural Network (DNN); Fast Speaker Adaptation; Hybrid DNN HMM; Speaker Code

Indexed keywords

BACKPROPAGATION ALGORITHMS; CODES (SYMBOLS); ELECTRIC SWITCHBOARDS; SIGNAL PROCESSING;

EID: 84905284226     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854824     Document Type: Conference Paper
Times cited : (67)

References (25)
  • 2
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate gaussian mixture observations of markov chains
    • J. L. Gauvain and Chin-Hui Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Transactions on Speech and audio processing, vol. 2, no. 2, pp. 291-298, 1994.
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.-H.2
  • 3
    • 0031177213 scopus 로고    scopus 로고
    • Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models
    • S. M. Ahadi and P. C. Woodland, "Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models," Computer speech &language, vol. 11, no. 3, pp. 187-206, 1997.
    • (1997) Computer Speech &Language , vol.11 , Issue.3 , pp. 187-206
    • Ahadi, S.M.1    Woodland, P.C.2
  • 4
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • Christopher Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech &Language, vol. 9, no. 2, pp. 171-185, 1995.
    • (1995) Computer Speech &Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.1    Woodland, P.C.2
  • 5
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Mark J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer speech &language, vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Computer Speech &Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 8
    • 34548012893 scopus 로고    scopus 로고
    • Linear hidden transformations for adaptation of hybrid ANN/HMM models
    • Roberto Gemello, Franco Mana, Stefano Scanzio, Pietro Laface, and Renato De Mori, "Linear hidden transformations for adaptation of hybrid ANN/HMM models," Speech Communication, vol. 49, no. 10, pp. 827-835, 2007.
    • (2007) Speech Communication , vol.49 , Issue.10 , pp. 827-835
    • Gemello, R.1    Mana, F.2    Scanzio, S.3    Laface, P.4    De Mori, R.5
  • 10
    • 84878606732 scopus 로고    scopus 로고
    • Hermitian based hidden activation functions for adaptation of hybrid HMM/ANN models
    • Sabato Marco Siniscalchi, Jinyu Li, and Chin-Hui Lee, "Hermitian based hidden activation functions for adaptation of hybrid HMM/ANN models," in INTERSPEECH, 2012.
    • (2012) INTERSPEECH
    • Siniscalchi, S.M.1    Li, J.2    Lee, C.-H.3
  • 13
    • 84906225505 scopus 로고    scopus 로고
    • Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition
    • Ossama Abdel-Hamid and Hui Jiang, "Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition," in INTERSPEECH, 2013.
    • (2013) INTERSPEECH
    • Ossama, A.-H.1    Jiang, H.2
  • 14
    • 84910030053 scopus 로고
    • RecNorm: Simultaneous normalization and classification applied to speech recognition
    • Bridle J. S. and S. J. Cox, "RecNorm: simultaneous normalization and classification applied to speech recognition," Advances in Neural Information Processing Systems, vol. 3, 1991.
    • (1991) Advances in Neural Information Processing Systems , vol.3
    • Bridle, J.S.1    Cox, S.J.2
  • 19
    • 84874485803 scopus 로고    scopus 로고
    • Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMs in acoustic modeling
    • Jia Pan, Cong Liu, Zhiguo Wang, Yu Hu, and Hui Jiang, "Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMs in acoustic modeling," in 8th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2012, pp. 301-305.
    • (2012) 8th International Symposium on Chinese Spoken Language Processing (ISCSLP) , pp. 301-305
    • Pan, J.1    Liu, C.2    Wang, Z.3    Hu, Y.4    Jiang, H.5
  • 20
    • 84876477729 scopus 로고    scopus 로고
    • Investigation on dimensionality reduction of concatenated features with deep neural network for LVCSR systems
    • Yebo Bao, Hui Jiang, Cong Liu, Yu Hu, and Lirong Dai, "Investigation on dimensionality reduction of concatenated features with deep neural network for LVCSR systems," in IEEE 11th International Conference on Signal Processing (ICSP), 2012, vol. 1, pp. 562-566.
    • (2012) IEEE 11th International Conference on Signal Processing (ICSP) , vol.1 , pp. 562-566
    • Bao, Y.1    Jiang, H.2    Liu, C.3    Hu, Y.4    Dai, L.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.