메뉴 건너뛰기




Volumn 2016-May, Issue , 2016, Pages 5010-5014

SAT-LHUC: Speaker adaptive training for learning hidden unit contributions

Author keywords

Deep Neural Networks; LHUC; SAT

Indexed keywords


EID: 84973299594     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2016.7472631     Document Type: Conference Paper
Times cited : (18)

References (44)
  • 3
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • CJ Leggetter and PC Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, " Computer Speech & Language, vol. 9, pp. 171-185, 1995.
    • (1995) Computer Speech & Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 4
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMMbased speech recognition
    • April
    • MJF Gales, "Maximum likelihood linear transformations for HMMbased speech recognition, " Computer Speech and Language, vol. 12, pp. 75-98, April 1998.
    • (1998) Computer Speech and Language , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 7
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F Seide, X Chen, and D Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription, " in Proc IEEE ASRU, 2011.
    • (2011) Proc IEEE ASRU
    • Seide, F.1    Chen, X.2    Yu, D.3
  • 8
  • 9
    • 84893691530 scopus 로고    scopus 로고
    • Speaker adaptation of neural network acoustic models using i-vectors
    • G Saon, H Soltau, D Nahamoo, and M Picheny, "Speaker adaptation of neural network acoustic models using i-vectors., " in Proc IEEE ASRU, 2013, pp. 55-59.
    • (2013) Proc IEEE ASRU , pp. 55-59
    • Saon, G.1    Soltau, H.2    Nahamoo, D.3    Picheny, M.4
  • 11
    • 84937880519 scopus 로고
    • Connectionist speaker normalization and adaptation
    • V Abrash, H Franco, A Sankar, and M Cohen, "Connectionist speaker normalization and adaptation, " in Proc Eurospeech, 1995, pp. 2183-2186.
    • (1995) Proc Eurospeech , pp. 2183-2186
    • Abrash, V.1    Franco, H.2    Sankar, A.3    Cohen, M.4
  • 12
    • 84874226579 scopus 로고    scopus 로고
    • Adaptation of context-dependent deep neural networks for automatic speech recognition
    • K Yao, D Yu, F Seide, H Su, L Deng, and Y Gong, "Adaptation of context-dependent deep neural networks for automatic speech recognition., " in Proc IEEE SLT, 2012.
    • (2012) Proc IEEE SLT
    • Yao, K.1    Yu, D.2    Seide, F.3    Su, H.4    Deng, L.5    Gong, Y.6
  • 13
    • 84890542079 scopus 로고    scopus 로고
    • KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition
    • D Yu, K Yao, H Su, G Li, and F Seide, "KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition., " in Proc IEEE ICASSP, 2013, pp. 7893-7897.
    • (2013) Proc IEEE ICASSP , pp. 7893-7897
    • Yu, D.1    Yao, K.2    Su, H.3    Li, G.4    Seide, F.5
  • 14
    • 84890521103 scopus 로고    scopus 로고
    • Speaker adaptation of context dependent deep neural networks
    • IEEE
    • H Liao, "Speaker adaptation of context dependent deep neural networks., " in In Proc. ICASSP. 2013, pp. 7947-7951, IEEE.
    • (2013) Proc. ICASSP. , pp. 7947-7951
    • Liao, H.1
  • 15
    • 84906225505 scopus 로고    scopus 로고
    • Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition
    • ISCA
    • O Abdel-Hamid and H Jiang, "Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition., " in Proc. Interspeech. pp. 1248-1252, ISCA.
    • Proc. Interspeech. , pp. 1248-1252
    • Abdel-Hamid, O.1    Jiang, H.2
  • 16
    • 84983119674 scopus 로고    scopus 로고
    • Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models
    • P Swietojanski and S Renals, "Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models, " in Proc. IEEE SLT, 2014.
    • (2014) Proc. IEEE SLT
    • Swietojanski, P.1    Renals, S.2
  • 17
    • 84946032695 scopus 로고    scopus 로고
    • Differentiable pooling for unsupervised speaker adaptation
    • P Swietojanski and S Renals, "Differentiable pooling for unsupervised speaker adaptation, " in Proc. IEEE ICASSP, 2015.
    • (2015) Proc. IEEE ICASSP
    • Swietojanski, P.1    Renals, S.2
  • 20
    • 84910031119 scopus 로고    scopus 로고
    • Towards speaker adaptive training of deep neural network acoustic models
    • Y Miao, H Zhang, and F Metze, "Towards speaker adaptive training of deep neural network acoustic models, " in Proc. Interspeech, 2014.
    • (2014) Proc. Interspeech
    • Miao, Y.1    Zhang, H.2    Metze, F.3
  • 21
    • 84890537527 scopus 로고    scopus 로고
    • Multi-level adaptive networks in tandem and hybrid ASR systems
    • P Bell, P Swietojanski, and S Renals, "Multi-level adaptive networks in tandem and hybrid ASR systems, " in Proc IEEE ICASSP, 2013.
    • (2013) Proc IEEE ICASSP
    • Bell, P.1    Swietojanski, P.2    Renals, S.3
  • 22
    • 84946036535 scopus 로고    scopus 로고
    • An investigation into speaker informed DNN front-end for LVCSR
    • Y Liu, P Karanasou, and T Hain, "An investigation into speaker informed DNN front-end for LVCSR, " in Proc IEEE ICASSP, 2015.
    • (2015) Proc IEEE ICASSP
    • Liu, Y.1    Karanasou, P.2    Hain, T.3
  • 23
    • 84910030053 scopus 로고
    • Recnorm: Simultaneous normalisation and classification applied to speech recognition
    • JS Bridle and S Cox, "Recnorm: Simultaneous normalisation and classification applied to speech recognition, " in Advances in Neural Information Processing Systems 3, 1990, pp. 234-240.
    • (1990) Advances in Neural Information Processing Systems 3 , pp. 234-240
    • Bridle, J.S.1    Cox, S.2
  • 24
    • 84890521637 scopus 로고    scopus 로고
    • On speaker adaptive training of artificial neural networks
    • J Trmal, J Zelinka, and L Müller, "On speaker adaptive training of artificial neural networks, " in Proc. Interspeech, 2010.
    • (2010) Proc. Interspeech
    • Trmal, J.1    Zelinka, J.2    Müller, L.3
  • 25
    • 84890452886 scopus 로고    scopus 로고
    • Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code
    • O Abdel-Hamid and H Jiang, "Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code, " in Proc IEEE ICASSP, 2013, pp. 4277-4280.
    • (2013) Proc IEEE ICASSP , pp. 4277-4280
    • Abdel-Hamid, O.1    Jiang, H.2
  • 26
    • 84946054484 scopus 로고    scopus 로고
    • Multi-basis adaptive neural network for rapid adaptation in speech recognition
    • IEEE
    • C Wu and M Gales, "Multi-basis adaptive neural network for rapid adaptation in speech recognition, " in Proc. ICASSP. 2015, IEEE.
    • (2015) Proc. ICASSP.
    • Wu, C.1    Gales, M.2
  • 27
  • 28
    • 84946036209 scopus 로고    scopus 로고
    • Context adaptive deep neural networks for fast acoustic model adaptation
    • IEEE
    • M Delcroix, K Kinoshita, T Hori, and T Nakatani, "Context adaptive deep neural networks for fast acoustic model adaptation, " in Proc. ICASSP. 2015, IEEE.
    • (2015) Proc. ICASSP.
    • Delcroix, M.1    Kinoshita, K.2    Hori, T.3    Nakatani, T.4
  • 29
    • 85001124710 scopus 로고    scopus 로고
    • Wit3: Web inventory of transcribed and translated talks
    • M Cettolo, C Girardi, and M Federico, "Wit3: Web inventory of transcribed and translated talks, " in Proc EAMT, 2012, pp. 261-268.
    • (2012) Proc EAMT , pp. 261-268
    • Cettolo, M.1    Girardi, C.2    Federico, M.3
  • 30
    • 85016587886 scopus 로고
    • SWITCHBOARD: Telephone speech corpus for research and development
    • John J Godfrey, Edward C Holliman, and Jane McDaniel, "SWITCHBOARD: Telephone speech corpus for research and development, " in Proc. ICASSP. IEEE, 1992, pp. 517-520.
    • (1992) Proc. ICASSP. IEEE , pp. 517-520
    • Godfrey, J.J.1    Holliman, E.C.2    McDaniel, J.3
  • 31
    • 35948981862 scopus 로고    scopus 로고
    • Unleashing the killer corpus: Experiences in creating the multi-everything AMI meeting corpus
    • J Carletta, "Unleashing the killer corpus: Experiences in creating the multi-everything AMI meeting corpus., " Language Resources and Evaluation, vol. 41, no. 2, pp. 181-190, 2007.
    • (2007) Language Resources and Evaluation , vol.41 , Issue.2 , pp. 181-190
    • Carletta, J.1
  • 34
    • 84890492591 scopus 로고    scopus 로고
    • Revisiting hybrid and GMM-HMM system combination techniques
    • P Swietojanski, A Ghoshal, and S Renals, "Revisiting hybrid and GMM-HMM system combination techniques, " in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Swietojanski, P.1    Ghoshal, A.2    Renals, S.3
  • 36
    • 84906274730 scopus 로고    scopus 로고
    • Sequencediscriminative training of deep neural networks
    • Lyon, France, August
    • K Vesely, A Ghoshal, L Burget, and D Povey, "Sequencediscriminative training of deep neural networks, " in Proc. Interspeech, Lyon, France, August 2013.
    • (2013) Proc. Interspeech
    • Vesely, K.1    Ghoshal, A.2    Burget, L.3    Povey, D.4
  • 38
    • 84893704659 scopus 로고    scopus 로고
    • Hybrid acoustic models for distant and multichannel large vocabulary speech recognition
    • December
    • P Swietojanski, A Ghoshal, and S Renals, "Hybrid acoustic models for distant and multichannel large vocabulary speech recognition, " in Proc. IEEE ASRU, December 2013.
    • (2013) Proc. IEEE ASRU
    • Swietojanski, P.1    Ghoshal, A.2    Renals, S.3
  • 39
    • 84959174678 scopus 로고    scopus 로고
    • Parameterised sigmoid and relu hidden activation functions for DNN acoustic modelling
    • C Zhang and PC Woodland, "Parameterised Sigmoid and ReLU Hidden Activation Functions for DNN Acoustic Modelling, " in Proc. Interspeech, 2015.
    • (2015) Proc. Interspeech
    • Zhang, C.1    Woodland, P.C.2
  • 40
    • 84959177524 scopus 로고    scopus 로고
    • Human vs machine spoofing detection on wideband and narrowband data
    • September
    • M Wester, Z Wu, and J Yamagishi, "Human vs machine spoofing detection on wideband and narrowband data, " in Proc. of Interspeech, September 2015.
    • (2015) Proc. of Interspeech
    • Wester, M.1    Wu, Z.2    Yamagishi, J.3
  • 41
    • 84910084579 scopus 로고    scopus 로고
    • 2000 NIST evaluation of conversational speech recognition over the telephone: English and Mandarin performance results
    • Citeseer
    • J Fiscus, W M Fisher, A F Martin, M A Przybocki, and D S Pallett, "2000 NIST evaluation of conversational speech recognition over the telephone: English and Mandarin performance results, " in Proc. Speech Transcription Workshop. Citeseer, 2000.
    • (2000) Proc. Speech Transcription Workshop
    • Fiscus, J.1    Fisher, W.M.2    Martin, A.F.3    Przybocki, M.A.4    Pallett, D.S.5
  • 44
    • 70349213445 scopus 로고    scopus 로고
    • Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
    • B Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, " in Proc. IEEE ICASSP, 2009, pp. 3761-3764.
    • (2009) Proc. IEEE ICASSP , pp. 3761-3764
    • Kingsbury, B.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.