메뉴 건너뛰기




Volumn , Issue , 2014, Pages 6329-6333

Deep neural network trained with speaker representation for speaker normalization

Author keywords

Neural networks; speaker adaptation; speaker normalization

Indexed keywords

FEATURE EXTRACTION; HIDDEN MARKOV MODELS; NEURAL NETWORKS; SPEECH RECOGNITION;

EID: 84905265988     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854822     Document Type: Conference Paper
Times cited : (3)

References (18)
  • 1
    • 85032751458 scopus 로고    scopus 로고
    • Deep neural networks for acoustic modeling in speech recognition
    • November
    • G. Hinton, L. Deng, D. Yu, et al., "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Processing Magazine, vol. 29, pp. 82-97, November 2012.
    • (2012) IEEE Signal Processing Magazine , vol.29 , pp. 82-97
    • Hinton, G.1    Deng, L.2    Yu, D.3
  • 2
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • Daniel P. W. Ellis Hynek Hermansky and Sangita Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in ICASSP, 2000, pp. 1635-1638.
    • (2000) ICASSP , pp. 1635-1638
    • Ellis, D.P.W.1    Hermansky, H.2    Sharma, S.3
  • 3
    • 84867593213 scopus 로고    scopus 로고
    • Auto-encoder bottleneck features using deep belief networks
    • T. Sainath, B. Kingsbury, and B. Ramabhadran, "Auto-encoder bottleneck features using deep belief networks," in ICASSP, 2012, pp. 4153-4156.
    • (2012) ICASSP , pp. 4153-4156
    • Sainath, T.1    Kingsbury, B.2    Ramabhadran, B.3
  • 5
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, pp. 75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 6
    • 84890509526 scopus 로고    scopus 로고
    • MLP-based factor analysis for tandem speech recognition
    • M. Ferras and H. Bourlard, "MLP-based factor analysis for tandem speech recognition," in ICASSP, 2013.
    • (2013) ICASSP
    • Ferras, M.1    Bourlard, H.2
  • 7
    • 84890452886 scopus 로고    scopus 로고
    • Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code
    • O. Abdel-Hamid and H. Jiang, "Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code," in ICASSP, 2013.
    • (2013) ICASSP
    • Abdel-Hamid, O.1    Jiang, H.2
  • 8
    • 84906225505 scopus 로고    scopus 로고
    • Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition
    • O. Abdel-Hamid and H. Jiang, "Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition," in INTERSPEECH, 2013.
    • (2013) INTERSPEECH
    • Abdel-Hamid, O.1    Jiang, H.2
  • 9
    • 84876231242 scopus 로고    scopus 로고
    • Imagenet classification with deep convolutional neural networks
    • A. Krizhevsky, I. Sutskever, and G. Hinton, "Imagenet classification with deep convolutional neural networks," in NIPS, 2012.
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.3
  • 10
    • 84890527827 scopus 로고    scopus 로고
    • Improving deep neural network for LVCSR using recified linear units and dropout
    • G. Dahl, T. Sainath, and G. Hinton, "Improving deep neural network for LVCSR using recified linear units and dropout," in ICASSP, 2013.
    • (2013) ICASSP
    • Dahl, G.1    Sainath, T.2    Hinton, G.3
  • 13
    • 34548012893 scopus 로고    scopus 로고
    • Linear hidden transformations for adaptation of hybrid ANN/HMM models
    • R. Gemello, F. Mana, S. Scanzio, P. Laface, and R. de Mori, "Linear hidden transformations for adaptation of hybrid ANN/HMM models," Speech Communication, vol. 49, no. 10-11, pp. 827-835, 2007.
    • (2007) Speech Communication , vol.49 , Issue.10-11 , pp. 827-835
    • Gemello, R.1    Mana, F.2    Scanzio, S.3    Laface, P.4    De Mori, R.5
  • 14
    • 0030362995 scopus 로고    scopus 로고
    • A compact model for speaker-adaptive training
    • vol. 2
    • T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training," in ICSLP, 1996, vol. 2, pp. 1137-1140 vol. 2.
    • (1996) ICSLP , vol.2 , pp. 1137-1140
    • Anastasakos, T.1    McDonough, J.2    Schwartz, R.3    Makhoul, J.4
  • 16
    • 78149337911 scopus 로고    scopus 로고
    • Tech. Rep., University of Toronto, Department of Computer Science
    • Volodymyr Mnih, "Cudamat: a CUDA-based matrix class for python," Tech. Rep., University of Toronto, Department of Computer Science, 2009.
    • (2009) Cudamat: A CUDA-based Matrix Class for Python
    • Mnih, V.1
  • 17
    • 84865742011 scopus 로고    scopus 로고
    • A study on speaker normalized MLP features in LVCSR
    • Z. Tuske, C. Plahl, and R. Schluter, "A study on speaker normalized MLP features in LVCSR," in Interspeech, Auguest 2011, pp. 1089-1092.
    • (2011) Interspeech, Auguest , pp. 1089-1092
    • Tuske, Z.1    Plahl, C.2    Schluter, R.3
  • 18
    • 84890519798 scopus 로고    scopus 로고
    • Tandem system adaptation using multiple linear feature transforms
    • Y. Wang and M. J. F. Gales, "Tandem system adaptation using multiple linear feature transforms," in ICASSP, 2013.
    • (2013) ICASSP
    • Wang, Y.1    Gales, M.J.F.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.