SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn 2015-August, Issue , 2015, Pages 4315-4319

Multi-basis adaptive neural network for rapid adaptation in speech recognition

(2) Wu, Chunyang a Gales, Mark J F a

a UNIVERSITY OF CAMBRIDGE (United Kingdom)

Author keywords

Adaptation; deep neural network; speech recognition

Indexed keywords

ACOUSTIC PROPERTIES; AUDIO SIGNAL PROCESSING; DEEP NEURAL NETWORKS; SPEECH; SPEECH COMMUNICATION;

ADAPTATION; ADAPTIVE NEURAL NETWORKS; AUTOMATIC SPEECH RECOGNITION SYSTEM; MULTI-LAYER PERCEPTION; NEURAL NETWORK CONFIGURATIONS; RESTRICTED CONNECTIVITY; SPEAKER INDEPENDENTS; UNSUPERVISED ADAPTATION;

SPEECH RECOGNITION;

EID: 84946054484 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178785 Document Type: Conference Paper

Times cited : (73)

References (28)

1
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- George E Dahl, Dong Yu, Li Deng, and Alex Acero, Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 30-42, 2012
- (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

2
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- Geoffrey Hinton et al., Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, 2012
- (2012) Signal Processing Magazine, IEEE , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹

3
- 84878397276
- Pipelined back-propagation for context-dependent deep neural networks
- Xie Chen, Adam Eversole, Gang Li, Dong Yu, and Frank Seide, Pipelined back-propagation for context-dependent deep neural networks., in INTERSPEECH, 2012
- (2012) INTERSPEECH
- Chen, X.¹ Eversole, A.² Li, G.³ Yu, D.⁴ Seide, F.⁵

4
- 84910027886
- A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden markov models
- Yan Huang, Dong Yu, Chaojun Liu, and Yifan Gong, A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden markov models, Interspeech, 2014
- (2014) Interspeech
- Huang, Y.¹ Yu, D.² Liu, C.³ Gong, Y.⁴

5
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains
- Jean-Luc Gauvain and Chin-Hui Lee, Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains, Speech and audio processing, ieee transactions on, vol. 2, no. 2, pp. 291-298, 1994
- (1994) Speech and Audio Processing, Ieee Transactions on , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

6
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models
- Christopher J Leggetter and Philip C Woodland, Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models, Computer Speech &Language, vol. 9, no. 2, pp. 171-185, 1995
- (1995) Computer Speech &Language , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

7
- 0032050110
- Maximum likelihood linear transformations for hmm-based speech recognition
- Mark J.F.G., Maximum likelihood linear transformations for hmm-based speech recognition, Computer speech &language, vol. 12, no. 2, pp. 75-98, 1998
- (1998) Computer Speech &Language , vol.12 , Issue.2 , pp. 75-98
- Mark, J.F.G.¹

8
- 33646794050
- Two-stage speaker adaptation of hybrid tied-posterior acoustic models
- Jan Stadermann and Gerhard Rigoll, Two-stage speaker adaptation of hybrid tied-posterior acoustic models., in ICASSP (1), 2005, pp. 977-980
- (2005) ICASSP , Issue.1 , pp. 977-980
- Stadermann, J.¹ Rigoll, G.²

9
- 33947635130
- Regularized adaptation of discriminative classifiers
- Xiao Li and Jeff Bilmes, Regularized adaptation of discriminative classifiers, in Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on. IEEE, 2006, vol. 1, pp. I-I
- Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference On. IEEE, 2006 , vol.1 , pp. 1-1
- Li, X.¹ Bilmes, J.²

10
- 84890542079
- Kl-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition
- Dong Yu, Kaisheng Yao, Hang Su, Gang Li, and Frank Seide, Kl-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition, in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 7893-7897
- (2013) Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. IEEE , pp. 7893-7897
- Yu, D.¹ Yao, K.² Su, H.³ Li, G.⁴ Seide, F.⁵

11
- 84905259138
- Improving dnn speaker independence with i-vector inputs
- Andrew Senior and Ignacio Lopez-Moreno, Improving dnn speaker independence with i-vector inputs, in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on. IEEE, 2014, pp. 225-229
- (2014) Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference On. IEEE , pp. 225-229
- Senior, A.¹ Ignacio, L.-M.²

12
- 84893691530
- Speaker adaptation of neural network acoustic models using i-vectors
- George Saon, Hagen Soltau, David Nahamoo, and Michael Picheny, Speaker adaptation of neural network acoustic models using i-vectors, in Automatic Speech Recognition and Understanding (ASRU), 2013 IEEEWorkshop on. IEEE, 2013, pp. 55-59
- (2013) Automatic Speech Recognition and Understanding (ASRU), 2013 IEEEWorkshop On. IEEE , pp. 55-59
- Saon, G.¹ Soltau, H.² Nahamoo, D.³ Picheny, M.⁴

13
- 84905262902
- Factorized adaptation for deep neural network
- Jinyu Li, Jui-Ting Huang, and Yifan Gong, Factorized adaptation for deep neural network, in Proc. ICASSP, 2014
- (2014) Proc. ICASSP
- Li, J.¹ Huang, J.-T.² Gong, Y.³

14
- 84890452886
- Fast speaker adaptation of hybrid nn/hmm model for speech recognition based on discriminative learning of speaker code
- Ossama Abdel-Hamid and Hui Jiang, Fast speaker adaptation of hybrid nn/hmm model for speech recognition based on discriminative learning of speaker code, in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 7942-7946
- (2013) Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. IEEE , pp. 7942-7946
- Ossama, A.-H.¹ Jiang, H.²

15
- 84905284226
- Direct adaptation of hybrid dnn/hmm model for fast speaker adaptation in lvcsr based on speaker code
- Shaofei Xue, Ossama Abdel-Hamid, Hui Jiang, and Lirong Dai, Direct adaptation of hybrid dnn/hmm model for fast speaker adaptation in lvcsr based on speaker code, in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on. IEEE, 2014, pp. 6339-6343
- (2014) Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference On. IEEE , pp. 6339-6343
- Xue, S.¹ Ossama, A.-H.² Jiang, H.³ Dai, L.⁴

16
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- Frank Seide, Gang Li, Xie Chen, and Dong Yu, Feature engineering in context-dependent deep neural networks for conversational speech transcription, in Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on. IEEE, 2011, pp. 24-29
- (2011) Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop On. IEEE , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

17
- 0003167795
- Joao Neto, Luís Almeida, Mike Hochberg, Ciro Martins, Luís Nunes, Steve Renals, and Tony Robinson, Speaker-adaptation for hybrid hmm-ann continuous speech recognition system, 1995
- (1995) Speaker-adaptation for Hybrid Hmm-ann Continuous Speech Recognition System
- Neto, J.¹ Almeida, L.² Hochberg, M.³ Martins, C.⁴ Nunes, L.⁵ Renals, S.⁶ Robinson, T.⁷

18
- 34548012893
- Linear hidden transformations for adaptation of hybrid ann/hmm models
- Roberto Gemello, Franco Mana, Stefano Scanzio, Pietro Laface, and Renato De Mori, Linear hidden transformations for adaptation of hybrid ann/hmm models, Speech Communication, vol. 49, no. 10, pp. 827-835, 2007
- (2007) Speech Communication , vol.49 , Issue.10 , pp. 827-835
- Gemello, R.¹ Mana, F.² Scanzio, S.³ Laface, P.⁴ De Mori, R.⁵

19
- 84874226579
- Adaptation of context-dependent deep neural networks for automatic speech recognition
- Kaisheng Yao, Dong Yu, Frank Seide, Hang Su, Li Deng, and Yifan Gong, Adaptation of context-dependent deep neural networks for automatic speech recognition., in SLT, 2012, pp. 366-369
- (2012) SLT , pp. 366-369
- Yao, K.¹ Yu, D.² Seide, F.³ Su, H.⁴ Deng, L.⁵ Gong, Y.⁶

20
- 79959849500
- Comparison of discriminative input and output transformations for speaker adaptation in the hybrid nn/hmm systems
- Bo Li and Khe Chai Sim, Comparison of discriminative input and output transformations for speaker adaptation in the hybrid nn/hmm systems., in INTERSPEECH, 2010, pp. 526-529
- (2010) INTERSPEECH , pp. 526-529
- Li, B.¹ Chai Sim, K.²

21
- 84878606732
- Hermitian based hidden activation functions for adaptation of hybrid hmm/ann models
- Sabato Marco Siniscalchi, Jinyu Li, and Chin-Hui Lee, Hermitian based hidden activation functions for adaptation of hybrid hmm/ann models., in INTERSPEECH, 2012
- (2012) INTERSPEECH
- Marco Siniscalchi, S.¹ Li, J.² Lee, C.-H.³

22
- 84905216195
- Speaker adaptive training using deep neural networks
- Tsubasa Ochiai, Shigeki Matsuda, Xugang Lu, Chiori Hori, and Shigeru Katagiri, Speaker adaptive training using deep neural networks, in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on. IEEE, 2014, pp. 6349-6353
- (2014) Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference On. IEEE , pp. 6349-6353
- Ochiai, T.¹ Matsuda, S.² Lu, X.³ Hori, C.⁴ Katagiri, S.⁵

23
- 84910031119
- Towards speaker adaptive training of deep neural network acoustic models
- Yajie Miao, Hao Zhang, and Florian Metze, Towards speaker adaptive training of deep neural network acoustic models, in Proc. Interspeech, 2014
- (2014) Proc. Interspeech
- Miao, Y.¹ Zhang, H.² Metze, F.³

24
- 0034227757
- Cluster adaptive training of hidden markov models
- Mark JF Gales, Cluster adaptive training of hidden markov models, Speech and Audio Processing, IEEE Transactions on, vol. 8, no. 4, pp. 417-428, 2000
- (2000) Speech and Audio Processing, IEEE Transactions on , vol.8 , Issue.4 , pp. 417-428
- Mark, J.F.G.¹

25
- 84871609195
- Eigenvoices for speaker adaptation
- Roland Kuhn, Patrick Nguyen, Jean-Claude Junqua, Lloyd Goldwasser, Nancy Niedzielski, Steven Fincke, Ken Field, and Matteo Contolini, Eigenvoices for speaker adaptation., in ICSLP, 1998, vol. 98, pp. 1774-1777
- (1998) ICSLP , vol.98 , pp. 1774-1777
- Kuhn, R.¹ Nguyen, P.² Junqua, J.-C.³ Goldwasser, L.⁴ Niedzielski, N.⁵ Fincke, S.⁶ Field, K.⁷ Contolini, M.⁸

26
- 84899024949
- Adaptive multi-column deep neural networks with application to robust image denoising
- Forest Agostinelli, Michael R Anderson, and Honglak Lee, Adaptive multi-column deep neural networks with application to robust image denoising, in Advances in Neural Information Processing Systems, 2013, pp. 1493-1501
- (2013) Advances in Neural Information Processing Systems , pp. 1493-1501
- Agostinelli, F.¹ Anderson, M.R.² Lee, H.³

27
- 0003871508
- Ph.D. thesis, Johns Hopkins University
- Nagendra Kumar and Andreas G Andreou, Investigation of silicon auditory models and generalization of linear discriminant analysis for improved speech recognition, Ph.D. thesis, Johns Hopkins University, 1997
- (1997) Investigation of Silicon Auditory Models and Generalization of Linear Discriminant Analysis for Improved Speech Recognition
- Kumar, N.¹ Andreou, A.G.²

28
- 0036296863
- Minimum phone error and i-smoothing for improved discriminative training
- Daniel Povey and Philip C Woodland, Minimum phone error and i-smoothing for improved discriminative training, in Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on. IEEE, 2002, vol. 1, pp. I-105
- (2002) Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference On. IEEE , vol.1 , pp. 1-105
- Povey, D.¹ Woodland, P.C.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.