SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn 2015-August, Issue , 2015, Pages 4305-4309

Differentiable pooling for unsupervised speaker adaptation

(2) Swietojanski, Pawel a Renals, Steve a

a UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

Deep Neural Networks; Differentiable pooling; LHUC; Speaker Adaptation; TED

Indexed keywords

DEEP NEURAL NETWORKS; SPEECH COMMUNICATION; SPEECH PROCESSING; SPEECH RECOGNITION;

DIFFERENTIABLE POOLING; LHUC; MODEL BASED NEURAL NETWORKS; SPEAKER ADAPTATION; SPEAKER ADAPTIVE TRAININGS; SPEAKER DEPENDENTS; SPEAKER INDEPENDENTS; UNSUPERVISED SPEAKER ADAPTATION;

AUDIO SIGNAL PROCESSING;

EID: 84946032695 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178783 Document Type: Conference Paper

Times cited : (34)

References (41)

1
- 0003573244
- Kluwer Academic Publishers
- H Bourlard and N Morgan, Connectionist Speech Recognition: A Hybrid Approach, Kluwer Academic Publishers, 1994
- (1994) Connectionist Speech Recognition: A Hybrid Approach
- Bourlard, H.¹ Morgan, N.²

2
- 0028194709
- Connectionist probability estimators in HMM speech recognition
- S Renals, N Morgan, H Bourlard, M Cohen, and H Franco, "Connectionist probability estimators in HMM speech recognition" IEEE Trans Speech and Audio Processing, vol. 2, pp. 161-174, 1994
- (1994) IEEE Trans Speech and Audio Processing , vol.2 , pp. 161-174
- Renals, S.¹ Morgan, N.² Bourlard, H.³ Cohen, M.⁴ Franco, H.⁵

3
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- Nov
- G Hinton, L Deng, D Yu, GE Dahl, A Mohamed, N Jaitly, A Senior, V Vanhoucke, P Nguyen, TN Sainath, and B Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups" Signal Processing Magazine,IEEE, vol. 29, no. 6, pp. 82-97, Nov 2012
- (2012) Signal Processing Magazine, IEEE , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.E.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.N.¹⁰ Kingsbury, B.¹¹

4
- 84937854847
- Speaker adaptation for hybrid HMM-ANN continuous speech recognition system
- J Neto, L Almeida, M Hochberg, C Martins, L Nunes, S Renals, and T Robinson, "Speaker adaptation for hybrid HMM-ANN continuous speech recognition system" in Proc Eurospeech, 1995, pp. 2171-2174
- (1995) Proc Eurospeech , pp. 2171-2174
- Neto, J.¹ Almeida, L.² Hochberg, M.³ Martins, C.⁴ Nunes, L.⁵ Renals, S.⁶ Robinson, T.⁷

5
- 84937880519
- Connectionist speaker normalization and adaptation
- V Abrash, H Franco, A Sankar, and M Cohen, "Connectionist speaker normalization and adaptation" in Proc Eurospeech, 1995, pp. 21832186
- (1995) Proc Eurospeech , pp. 2183-2186
- Abrash, V.¹ Franco, H.² Sankar, A.³ Cohen, M.⁴

6
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- F Seide, X Chen, and D Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription" in Proc IEEE ASRU, 2011
- (2011) Proc IEEE ASRU
- Seide, F.¹ Chen, X.² Yu, D.³

7
- 84890492591
- Revisiting hybrid and GMM-HMM system combination techniques
- P Swietojanski, A Ghoshal, and S Renals, "Revisiting hybrid and GMM-HMM system combination techniques" in Proc IEEEICASSP, 2013
- (2013) Proc IEEEICASSP
- Swietojanski, P.¹ Ghoshal, A.² Renals, S.³

8
- 84890542079
- KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition
- D Yu, K Yao, H Su, G Li, and F Seide, "KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition." in Proc IEEE ICASSP, 2013, pp. 7893-7897
- (2013) Proc IEEE ICASSP , pp. 7893-7897
- Yu, D.¹ Yao, K.² Su, H.³ Li, G.⁴ Seide, F.⁵

9
- 84890521103
- Speaker adaptation of context dependent deep neural networks
- IEEE
- H Liao, "Speaker adaptation of context dependent deep neural networks." in In Proc. ICASSP. 2013, pp. 7947-7951, IEEE
- (2013) Proc. ICASSP , pp. 7947-7951
- Liao, H.¹

10
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- April
- MJF Gales, "Maximum likelihood linear transformations for HMM-based speech recognition" Computer Speech and Language, vol. 12, pp. 75-98, April 1998
- (1998) Computer Speech and Language , vol.12 , pp. 75-98
- Gales, M.J.F.¹

11
- 84890537527
- Multi-level adaptive networks in tandem and hybrid ASR systems
- P Bell, P Swietojanski, and S Renals, "Multi-level adaptive networks in tandem and hybrid ASR systems" in Proc IEEE ICASSP, 2013
- (2013) Proc IEEE ICASSP
- Bell, P.¹ Swietojanski, P.² Renals, S.³

12
- 79951609039
- Front end factor analysis for speaker verification
- N Dehak, PJ Kenny, R Dehak, P Dumouchel, and P Ouellet, "Front end factor analysis for speaker verification" IEEE Trans Audio, Speech and Language Processing, vol. 19, pp. 788-798, 2010
- (2010) IEEE Trans Audio, Speech and Language Processing , vol.19 , pp. 788-798
- Dehak, N.¹ Kenny, P.J.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

13
- 84893691530
- Speaker adaptation of neural network acoustic models using i-vectors
- G Saon, H Soltau, D Nahamoo, and M Picheny, "Speaker adaptation of neural network acoustic models using i-vectors." in Proc IEEE ASRU, 2013, pp. 55-59
- (2013) Proc IEEE ASRU , pp. 55-59
- Saon, G.¹ Soltau, H.² Nahamoo, D.³ Picheny, M.⁴

14
- 84874226579
- Adaptation of context-dependent deep neural networks for automatic speech recognition
- K Yao, D Yu, F Seide, H Su, L Deng, and Y Gong, "Adaptation of context-dependent deep neural networks for automatic speech recognition." in Proc IEEE SLT, 2012
- (2012) Proc IEEE SLT
- Yao, K.¹ Yu, D.² Seide, F.³ Su, H.⁴ Deng, L.⁵ Gong, Y.⁶

15
- 84881054791
- Hermitian polynomial for speaker adaptation of connectionist speech recognition systems
- SM Siniscalchi, J Li, and CH Lee, "Hermitian polynomial for speaker adaptation of connectionist speech recognition systems" IEEE Trans Audio, Speech,and Language Processing, vol. 21, pp. 2152-2161, 2013
- (2013) IEEE Trans Audio, Speech, and Language Processing , vol.21 , pp. 2152-2161
- Siniscalchi, S.M.¹ Li, J.² Lee, C.³

16
- 84910030053
- Recnorm: Simultaneous normalisation and classification applied to speech recognition
- JS Bridle and S Cox, "Recnorm: Simultaneous normalisation and classification applied to speech recognition" in Advances in Neural Information Processing Systems 3, 1990, pp. 234-240
- (1990) Advances in Neural Information Processing Systems , vol.3 , pp. 234-240
- Bridle, J.S.¹ Cox, S.²

17
- 84890452886
- Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code
- O Abdel-Hamid and H Jiang, "Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code" in Proc IEEE ICASSP, 2013, pp. 4277-4280
- (2013) Proc IEEE ICASSP , pp. 4277-4280
- Abdel-Hamid, O.¹ Jiang, H.²

18
- 84905229915
- Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network
- J Xue, J Li, D Yu, M Seltzer, and Y Gong, "Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network" in Proc IEEE ICASSP, 2014
- (2014) Proc IEEE ICASSP
- Xue, J.¹ Li, J.² Yu, D.³ Seltzer, M.⁴ Gong, Y.⁵

19
- 84906225505
- Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition
- ISCA
- O Abdel-Hamid and H Jiang, "Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition." in Proc. Interspeech. pp. 1248-1252, ISCA
- Proc. Interspeech , pp. 1248-1252
- Abdel-Hamid, O.¹ Jiang, H.²

20
- 84983119674
- Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models
- P Swietojanski and S Renals, "Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models" in Proc. IEEE SLT, 2014
- (2014) Proc. IEEE SLT
- Swietojanski, P.¹ Renals, S.²

21
- 0020331278
- Neocognitron: A new algoriothm for pattern recognition tolerant of deformations
- K Fukushima and S Miyake, "Neocognitron: A new algoriothm for pattern recognition tolerant of deformations" Pattern Recognition, vol. 15, pp. 455-69, 1982
- (1982) Pattern Recognition , vol.15 , pp. 455-469
- Fukushima, K.¹ Miyake, S.²

22
- 0000359337
- Backpropagation applied to handwritten zip code recognition
- Y LeCun, B Boser, JS Denker, D Henderson, RE Howard, W Hub-bard, and LD Jackel, "Backpropagation applied to handwritten zip code recognition," Neural Computation, vol. 1, pp. 541-551, 1989
- (1989) Neural Computation , vol.1 , pp. 541-551
- LeCun, Y.¹ Boser, B.² Denker, J.S.³ Henderson, D.⁴ Howard, R.E.⁵ Hub-Bard, W.⁶ Jackel, L.D.⁷

23
- 0032203257
- Gradient-based learning applied to document recognition
- Y LeCun, L Bottou, Y Bengio, and P Haffner, "Gradient-based learning applied to document recognition" Proceedings of the IEEE, vol. 86, pp. 2278-2324, 1998
- (1998) Proceedings of the IEEE , vol.86 , pp. 2278-2324
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

24
- 0033316361
- Hierarchical models of object recognition in cortex
- M Riesenhuber and T Poggio, "Hierarchical models of object recognition in cortex" Nature Neuroscience, vol. 2, pp. 1019-1025, 1999
- (1999) Nature Neuroscience , vol.2 , pp. 1019-1025
- Riesenhuber, M.¹ Poggio, T.²

25
- 51249118803
- Unsuper-vised learning of invariant feature hierarchies with applications to object recognition
- MA Ranzato, FJ Huang, Y-L Boureau, and Y LeCun, "Unsuper-vised learning of invariant feature hierarchies with applications to object recognition" in IEEE CVPR, 2007
- (2007) IEEE CVPR
- Ranzato, M.¹ Huang, F.J.² Boureau, Y.-L.³ LeCun, Y.⁴

26
- 77956502203
- A theoretical analysis of feature pooling in visual recognition
- Y-L Boureau, J Ponce, and Y LeCun, "A theoretical analysis of feature pooling in visual recognition" in Proc ICML, 2010
- (2010) Proc ICML
- Boureau, Y.-L.¹ Ponce, J.² LeCun, Y.³

27
- 84892421248
- arXiv:1302.4389
- IJ Goodfellow, D Warde-Farley, M Mirza, A Courville, and Y Bengio, "Maxout networks" arXiv:1302.4389, 2013
- (2013) Maxout Networks
- Goodfellow, I.J.¹ Warde-Farley, D.² Mirza, M.³ Courville, A.⁴ Bengio, Y.⁵

28
- 84893701756
- Deep maxout networks for low-resource speech recognition
- Y. Miao, F. Metze, and S. Rawat, "Deep maxout networks for low-resource speech recognition" in Proc. IEEE ASRU, 2013
- (2013) Proc. IEEE ASRU
- Miao, Y.¹ Metze, F.² Rawat, S.³

29
- 84893651518
- Deep maxout neural networks for speech recognition
- Dec
- M. Cai, Y. Shi, and J. Liu, "Deep maxout neural networks for speech recognition" in Proc. ASRU, Dec 2013, pp. 291-296
- (2013) Proc. ASRU , pp. 291-296
- Cai, M.¹ Shi, Y.² Liu, J.³

30
- 84905270524
- Investigation of maxout networks for speech recognition
- P Swietojanski, J Li, and J-T Huang, "Investigation of maxout networks for speech recognition" in Proc IEEE ICASSP, 2014
- (2014) Proc IEEE ICASSP
- Swietojanski, P.¹ Li, J.² Huang, J.-T.³

31
- 84904512262
- Neural networks for distant speech recognition
- S Renals and P Swietojanski, "Neural networks for distant speech recognition" in Proc HSCMA, 2014
- (2014) Proc HSCMA
- Renals, S.¹ Swietojanski, P.²

32
- 84910069623
- Convolutional deep maxout networks for phone recognition
- L Toth, "Convolutional deep maxout networks for phone recognition" in Proc Interspeech, 2014
- (2014) Proc Interspeech
- Toth, L.¹

33
- 84946063012
- Differentiable pooling for hierarchical feature learning
- abs/1207.0151
- M D Zeiler and R Fergus, "Differentiable pooling for hierarchical feature learning" CoRR, vol. abs/1207.0151, 2012
- (2012) CoRR
- Zeiler, M.D.¹ Fergus, R.²

34
- 84874575248
- Convolutional neural networks applied to house numbers digit classification
- abs/1204.3968
- P Sermanet, S Chintala, and Y LeCun, "Convolutional neural networks applied to house numbers digit classification" CoRR, vol. abs/1204.3968, 2012
- (2012) CoRR
- Sermanet, P.¹ Chintala, S.² LeCun, Y.³

35
- 84893654379
- Improvements to deep convolutional neural networks for LVCSR
- T N Sainath, B Kingsbury, A Mohamed, G E Dahl, G Saon, H Soltau, T Beran, A Y Aravkin, and B Ramabhadran, "Improvements to deep convolutional neural networks for LVCSR," in In Proc. IEEE ASRU, 2013, pp. 315-320
- (2013) Proc. IEEE ASRU , pp. 315-320
- Sainath, T.N.¹ Kingsbury, B.² Mohamed, A.³ Dahl, G.E.⁴ Saon, G.⁵ Soltau, H.⁶ Beran, T.⁷ Aravkin, A.Y.⁸ Ramabhadran, B.⁹

36
- 84905239342
- Improving deep neural network acoustic models using generalized maxout networks
- X Zhang, J Trmal, D Povey, and S Khudanpur, "Improving deep neural network acoustic models using generalized maxout networks" in ICASSP,2014
- (2014) ICASSP
- Zhang, X.¹ Trmal, J.² Povey, D.³ Khudanpur, S.⁴

37
- 84946095296
- Learned-norm pooling for deep neural networks
- abs/1311.1780
- Gulcehre K Cho, R Pascanu, and Y Bengio, "Learned-norm pooling for deep neural networks" CoRR, vol. abs/1311.1780, 2013
- (2013) CoRR
- Gulcehre Cho, K.¹ Pascanu, R.² Bengio, Y.³

38
- 0035024581
- Networks with trainable amplitude of activation functions
- E Trentin, "Networks with trainable amplitude of activation functions" Neural Networs, vol. 14, pp. 471-W3, 2001
- (2001) Neural Networs , vol.14 , pp. 471-W3
- Trentin, E.¹

39
- 85045373614
- Overview of the IWSLT 2012 evaluation campaign
- M Federico, M Cettolo, L Bentivogli, M Paul, and S StUker, "Overview of the IWSLT 2012 evaluation campaign" in Proc IWSLT, 2012
- (2012) Proc IWSLT
- Federico, M.¹ Cettolo, M.² Bentivogli, L.³ Paul, M.⁴ StUker, S.⁵

40
- 84858953642
- The Kaldi speech recognition toolkit
- December
- D Povey, A Ghoshal, G Boulianne, L Burget, O Glembek, N Goel, M Hannemann, P MotliCek, Y Qian, P Schwarz, J Silovsky, G Stem-mer, and K Vesely, "The Kaldi speech recognition toolkit" in Proc. IEEE ASRU, December 2011
- (2011) Proc. IEEE ASRU
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ MotliCek, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovsky, J.¹¹ Stem-Mer, G.¹² Vesely, K.¹³

41
- 84893401626
- arXivpreprintarXiv:1308.4214
- IJ Goodfellow, D Warde-Farley, P Lamblin, V Dumoulin, M Mirza, R Pascanu, J Bergstra, F Bastien, and Y Bengio, "Pylearn2: a machine learning research library" arXivpreprintarXiv:1308.4214, 2013
- (2013) Pylearn2: A Machine Learning Research Library
- Goodfellow, I.J.¹ Warde-Farley, D.² Lamblin, P.³ Dumoulin, V.⁴ Mirza, M.⁵ Pascanu, R.⁶ Bergstra, J.⁷ Bastien, F.⁸ Bengio, Y.⁹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.