메뉴 건너뛰기




Volumn , Issue , 2014, Pages 1-5

Speaker adaptation of hybrid NN/HMM model for speech recognition based on singular value decomposition

Author keywords

Deep Neural Network (DNN); Hybrid DNN HMM; singular value decomposition (SVD); Speaker Adaptation

Indexed keywords

CONTINUOUS SPEECH RECOGNITION; DEEP NEURAL NETWORKS; ELECTRIC SWITCHBOARDS; NEURAL NETWORKS; SPEECH; VOCABULARY CONTROL;

EID: 84912109599     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ISCSLP.2014.6936583     Document Type: Conference Paper
Times cited : (19)

References (29)
  • 1
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J. L. Gauvain and Chin-Hui Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Transactions on Speech and audio processing, vol. 2, no. 2, pp. 291-298, 1994.
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.2
  • 2
    • 0031177213 scopus 로고    scopus 로고
    • Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models
    • S. M. Ahadi and P. C. Woodland, "Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models," Computer speech & language, vol. 11, no. 3, pp. 187-206, 1997.
    • (1997) Computer Speech & Language , vol.11 , Issue.3 , pp. 187-206
    • Ahadi, S.M.1    Woodland, P.C.2
  • 3
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • Christopher Leggetter and P. C.Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech & Language, vol. 9, no. 2, pp. 171-185, 1995.
    • (1995) Computer Speech & Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.C.1    Woodland, P.2
  • 4
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Mark J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer speech & language, vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Computer Speech & Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 9
    • 34548012893 scopus 로고    scopus 로고
    • Linear hidden transformations for adaptation of hybrid ANN/HMM models
    • Roberto Gemello, Franco Mana, Stefano Scanzio, Pietro Laface, and Renato De Mori, "Linear hidden transformations for adaptation of hybrid ANN/HMM models," Speech Communication, vol. 49, no. 10, pp. 827-835, 2007.
    • (2007) Speech Communication , vol.49 , Issue.10 , pp. 827-835
    • Gemello, R.1    Mana, F.2    Scanzio, S.3    Laface, P.4    De Mori, R.5
  • 11
  • 13
    • 84874226579 scopus 로고    scopus 로고
    • Adaptation of context-dependent deep neural networks for automatic speech recognition
    • IEEE
    • Kaisheng Yao, Dong Yu, Frank Seide, Hang Su, Li Deng, and Yifan Gong, "Adaptation of context-dependent deep neural networks for automatic speech recognition," in Spoken Language Technology Workshop (SLT). IEEE, 2012, pp. 366-369.
    • (2012) Spoken Language Technology Workshop (SLT) , pp. 366-369
    • Yao, K.1    Yu, D.2    Seide, F.3    Su, H.4    Deng, L.5    Gong, Y.6
  • 17
    • 84890452886 scopus 로고    scopus 로고
    • Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code
    • IEEE
    • Ossama Abdel-Hamid and Hui Jiang, "Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code," in IEEE International Conference of Acoustics,Speech and Signal Processing (ICASSP). IEEE, 2013, pp. 7942-7946.
    • (2013) IEEE International Conference of Acoustics,Speech and Signal Processing (ICASSP) , pp. 7942-7946
    • Abdel-Hamid, O.1    Jiang, H.2
  • 18
    • 84906225505 scopus 로고    scopus 로고
    • Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition
    • Ossama Abdel-Hamid and Hui Jiang, "Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition," in INTERSPEECH, 2013.
    • (2013) INTERSPEECH
    • Abdel-Hamid, O.1    Jiang, H.2
  • 24
    • 84906227589 scopus 로고    scopus 로고
    • Restructuring of deep neural network acoustic models with singular value decomposition
    • Jian Xue, Jinyu Li, and Yifan Gong, "Restructuring of deep neural network acoustic models with singular value decomposition," in INTERSPEECH, 2013.
    • (2013) INTERSPEECH
    • Xue, J.1    Li, J.2    Gong, Y.3
  • 26
    • 84874485803 scopus 로고    scopus 로고
    • Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMs in acoustic modeling
    • Jia Pan, Cong Liu, Zhiguo Wang, Yu Hu, and Hui Jiang, "Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMs in acoustic modeling," in 8th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2012, pp. 301-305.
    • (2012) 8th International Symposium on Chinese Spoken Language Processing (ISCSLP) , pp. 301-305
    • Pan, J.1    Liu, C.2    Wang, Z.3    Hu, Y.4    Jiang, H.5
  • 27
    • 84876477729 scopus 로고    scopus 로고
    • Investigation on dimensionality reduction of concatenated features with deep neural network for LVCSR systems
    • Yebo Bao, Hui Jiang, Cong Liu, Yu Hu, and Lirong Dai, "Investigation on dimensionality reduction of concatenated features with deep neural network for LVCSR systems," in IEEE 11th International Conference on Signal Processing (ICSP), 2012, vol. 1, pp. 562-566.
    • (2012) IEEE 11th International Conference on Signal Processing (ICSP) , vol.1 , pp. 562-566
    • Bao, Y.1    Jiang, H.2    Liu, C.3    Hu, Y.4    Dai, L.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.