SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn 2015-January, Issue , 2015, Pages 3605-3609

Structured output layer with auxiliary targets for context-dependent acoustic modelling

(3) Swietojanski, Pawel a Bell, Peter a Renals, Steve a

a UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

Adaptation; Deep neural networks; Multitask learning; Structured output layer

Indexed keywords

SPEECH RECOGNITION; TREES (MATHEMATICS);

ADAPTATION; CONSTRAINED CONDITIONS; CONTEXT INDEPENDENT; DEEP NEURAL NETWORKS; LARGE VOCABULARY SPEECH RECOGNITION; MULTITASK LEARNING; OUTPUT LAYER; UNSUPERVISED SPEAKER ADAPTATION;

SPEECH COMMUNICATION;

EID: 84959095902 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (27)

References (42)

1
- 0028530231
- State clustering in hidden Markov model-based continuous speech recognition
- S. J. Young and P. C. Woodland, "State clustering in hidden Markov model-based continuous speech recognition, " Computer Speech & Language, vol. 8, no. 4, pp. 369-383, 1994.
- (1994) Computer Speech & Language , vol.8 , Issue.4 , pp. 369-383
- Young, S.J.¹ Woodland, P.C.²

2
- 0003573244
- Kluwer Academic Publishers
- H. Bourlard and N. Morgan, Connectionist Speech Recognition: A Hybrid Approach. Kluwer Academic Publishers, 1994.
- (1994) Connectionist Speech Recognition: A Hybrid Approach
- Bourlard, H.¹ Morgan, N.²

3
- 0028194709
- Connectionist probability estimators in HMM speech recognition
- S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco, "Connectionist probability estimators in HMM speech recognition, " IEEE Transactions on Speech and Audio Processing, vol. 2, no. 1, pp. 161-174, 1994.
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.1 , pp. 161-174
- Renals, S.¹ Morgan, N.² Bourlard, H.³ Cohen, M.⁴ Franco, H.⁵

4
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- G. Hinton, L. Deng, D. Yu, G. Dahl, A.-R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, 2012.
- (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.-R.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

5
- 84055222005
- Context-dependent pretrained deep neural networks for large-vocabulary speech recognition
- G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pretrained deep neural networks for large-vocabulary speech recognition, " IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
- (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.¹ Yu, D.² Deng, L.³ Acero, A.⁴

6
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks, " in Proc. Interspeech, 2011.
- (2011) Proc. Interspeech
- Seide, F.¹ Li, G.² Yu, D.³

7
- 84872565703
- Factoring networks by a statistical method
- N. Morgan and H. Bourlard, "Factoring networks by a statistical method, " Neural Computation, vol. 4, no. 6, pp. 835-838, 1992.
- (1992) Neural Computation , vol.4 , Issue.6 , pp. 835-838
- Morgan, N.¹ Bourlard, H.²

8
- 85009078709
- CDNN: A context dependent neural network for continuous speech recognition
- H. Bourlard, N. Morgan, C. Wooters, and S. Renals, "CDNN: A context dependent neural network for continuous speech recognition, " in Proc. ICASSP, vol. 2, 1992, pp. 349-352.
- (1992) Proc. ICASSP , vol.2 , pp. 349-352
- Bourlard, H.¹ Morgan, N.² Wooters, C.³ Renals, S.⁴

9
- 0028464214
- Context-dependent connectionist probability estimation in a hybrid HMM-neural net speech recognition system
- H. Franco, M. Cohen, N. Morgan, D. Rumelhart, and V. Abrash, "Context-dependent connectionist probability estimation in a hybrid HMM-neural net speech recognition system, " Computer Speech and Language, vol. 8, pp. 211-222, 1994.
- (1994) Computer Speech and Language , vol.8 , pp. 211-222
- Franco, H.¹ Cohen, M.² Morgan, N.³ Rumelhart, D.⁴ Abrash, V.⁵

10
- 84916199887
- Regression-based context-dependent modeling of deep neural networks for speech recognition
- Nov
- G. Wang and K. C. Sim, "Regression-based context-dependent modeling of deep neural networks for speech recognition, " Audio, Speech, and Language Processing, IEEE/ACM Transactions on, vol. 22, no. 11, pp. 1660-1669, Nov 2014.
- (2014) Audio, Speech, and Language Processing, IEEE/ACM Transactions on , vol.22 , Issue.11 , pp. 1660-1669
- Wang, G.¹ Sim, K.C.²

11
- 84905269216
- Context dependent state tying for speech recognition using deep neural network acoustic models
- M. Bacchiani and D. Rybach, "Context dependent state tying for speech recognition using deep neural network acoustic models, " in Proc. ICASSP, 2014, pp. 230-234.
- (2014) Proc. ICASSP , pp. 230-234
- Bacchiani, M.¹ Rybach, D.²

12
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription, " in Proc. ASRU, 2011.
- (2011) Proc. ASRU
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

13
- 84946069752
- unpublished work
- C. Zhang and P. Woodland, "Context independent discriminative pre-training, " unpublished work.
- Context Independent Discriminative Pre-training
- Zhang, C.¹ Woodland, P.²

14
- 84874278045
- Unsupervised crosslingual knowledge transfer in DNN-based LVCSR
- December
- P. Swietojanski, A. Ghoshal, and S. Renals, "Unsupervised crosslingual knowledge transfer in DNN-based LVCSR, " in Proc. IEEE SLT, December 2012, pp. 246-251.
- (2012) Proc. IEEE SLT , pp. 246-251
- Swietojanski, P.¹ Ghoshal, A.² Renals, S.³

15
- 84906273501
- Improving low-resource cd-dnn-hmm using dropout and multilingual dnn training
- Y. Miao and F. Metze, "Improving low-resource cd-dnn-hmm using dropout and multilingual dnn training. " in Proc. Interspeech. ISCA, 2013, pp. 2237-2241.
- (2013) Proc. Interspeech. ISCA , pp. 2237-2241
- Miao, Y.¹ Metze, F.²

16
- 0031189914
- Multitask learning
- R. Caruana, "Multitask learning, " Machine learning, vol. 28, pp. 41-75, 1997.
- (1997) Machine Learning , vol.28 , pp. 41-75
- Caruana, R.¹

17
- 84910044198
- Multitask learning in connectionist robust ASR using recurrent neural networks
- S. Parveen and P. Green, "Multitask learning in connectionist robust ASR using recurrent neural networks, " in Proc. Interspeech, 2003.
- (2003) Proc. Interspeech
- Parveen, S.¹ Green, P.²

18
- 84890527497
- Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers
- J.-T. Huang, J. Li, D. Yu, L. Deng, and Y. Gong, "Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers, " in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Huang, J.-T.¹ Li, J.² Yu, D.³ Deng, L.⁴ Gong, Y.⁵

19
- 84890539009
- Multilingual acoustic models using distributed deep neural networks
- G. Heigold, V. Vanhoucke, A. Senior, P. Nguyen, M. Ranzato, M. Devin, and J. Dean, "Multilingual acoustic models using distributed deep neural networks, " in In Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Heigold, G.¹ Vanhoucke, V.² Senior, A.³ Nguyen, P.⁴ Ranzato, M.⁵ Devin, M.⁶ Dean, J.⁷

20
- 84890461500
- Multilingual training of deep neural networks
- A. Ghoshal, P. Swietojanski, and S. Renals, "Multilingual training of deep neural networks, " in In Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Ghoshal, A.¹ Swietojanski, P.² Renals, S.³

21
- 84864073449
- Greedy layer-wise training of deep networks
- Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, "Greedy layer-wise training of deep networks, " in Advances in Neural Information Processing Systems 19, 2007, pp. 153-160.
- (2007) Advances in Neural Information Processing Systems , vol.19 , pp. 153-160
- Bengio, Y.¹ Lamblin, P.² Popovici, D.³ Larochelle, H.⁴

22
- 84946094751
- Regularization of context-dependent deep neural networks with context-independent multi-task training
- P. Bell and S. Renals, "Regularization of context-dependent deep neural networks with context-independent multi-task training, " in Proc. ICASSP, 2015.
- (2015) Proc. ICASSP
- Bell, P.¹ Renals, S.²

23
- 71149116544
- Curriculum learning
- Y. Bengio, J. Louradour, R. Collobert, and J. Weston, "Curriculum learning, " in Proc. ICML, 2009.
- (2009) Proc. ICML
- Bengio, Y.¹ Louradour, J.² Collobert, R.³ Weston, J.⁴

24
- 84905283791
- Joint acoustic modelling of triphones and trigraphemes by multi-task learning deep neural networks for low-resource speech recognition
- D. Chen, B. Mak, C.-C. Leung, and S. Sivadas, "Joint acoustic modelling of triphones and trigraphemes by multi-task learning deep neural networks for low-resource speech recognition, " in Proc. ICASSP, 2014.
- (2014) Proc. ICASSP
- Chen, D.¹ Mak, B.² Leung, C.-C.³ Sivadas, S.⁴

25
- 84890545600
- Multi-task learning in deep neural networks for improved phoneme recognition
- M. Seltzer and J. Droppo, "Multi-task learning in deep neural networks for improved phoneme recognition, " in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Seltzer, M.¹ Droppo, J.²

26
- 0004290881
- MIT Press
- M. Minsky and S. Papert, Perceptrons. MIT Press, 1969.
- (1969) Perceptrons
- Minsky, M.¹ Papert, S.²

27
- 0000646059
- Learning internal representations by error-propagation
- D. E. Rumelhart, G. E. Hinton, and R. J. Williams, "Learning internal representations by error-propagation, " in Parallel Distributed Processing. MIT Press, 1986, vol. 1, pp. 318-362.
- (1986) Parallel Distributed Processing. MIT Press , vol.1 , pp. 318-362
- Rumelhart, D.E.¹ Hinton, G.E.² Williams, R.J.³

28
- 84908187187
- CoRR abs/1212. 5921
- M. á. Carreira-Perpinãn and W. Wang, "Distributed optimization of deeply nested systems, " CoRR, vol. abs/1212. 5921, 2012. [Online]. Available: http: //arxiv. org/abs/1212. 5921
- (2012) Distributed Optimization of Deeply Nested Systems
- Carreira-Perpinãn, M.Á.¹ Wang, W.²

29
- 85001124710
- Wit3: Web inventory of transcribed and translated talks
- M. Cettolo, C. Girardi, and M. Federico, "Wit3: Web inventory of transcribed and translated talks, " in Proc EAMT, 2012, pp. 261-268.
- (2012) Proc EAMT , pp. 261-268
- Cettolo, M.¹ Girardi, C.² Federico, M.³

30
- 85016587886
- SWITCHBOARD: Telephone speech corpus for research and development
- J. J. Godfrey, E. C. Holliman, and J. McDaniel, "SWITCHBOARD: Telephone speech corpus for research and development, " in Proc. ICASSP. IEEE, 1992, pp. 517-520.
- (1992) Proc. ICASSP. IEEE , pp. 517-520
- Godfrey, J.J.¹ Holliman, E.C.² McDaniel, J.³

31
- 84890543632
- The UEDIN systems for the IWSLT 2012 evaluation
- E. Hasler, P. Bell, A. Ghoshal, B. Haddow, P. Koehn, F. McInnes, S. Renals, and P. Swietojanski, "The UEDIN systems for the IWSLT 2012 evaluation, " in Proc. IWSLT, 2012.
- (2012) Proc. IWSLT
- Hasler, E.¹ Bell, P.² Ghoshal, A.³ Haddow, B.⁴ Koehn, P.⁵ McInnes, F.⁶ Renals, S.⁷ Swietojanski, P.⁸

32
- 84890492591
- Revisiting hybrid and GMM-HMM system combination techniques
- P. Swietojanski, A. Ghoshal, and S. Renals, "Revisiting hybrid and GMM-HMM system combination techniques, " in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Swietojanski, P.¹ Ghoshal, A.² Renals, S.³

33
- 84976431564
- The UEDIN system for the IWSLT 2014 evaluation
- P. Bell, P. Swietojanski, J. Driesen, M. Sinclair, F. McInnes, and S. Renals, "The UEDIN system for the IWSLT 2014 evaluation, " in Proc. IWSLT, 2014.
- (2014) Proc. IWSLT
- Bell, P.¹ Swietojanski, P.² Driesen, J.³ Sinclair, M.⁴ McInnes, F.⁵ Renals, S.⁶

34
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Apr.
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech, " The Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752, Apr. 1990.
- (1990) The Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

35
- 84890454527
- Low-rank matrix factorization for deep neural network training with high-dimensional output targets
- T. N. Sainath, B. Kingsbury, V. Sindhwani, E. Arisoy, and B. Ramabhadran, "Low-rank matrix factorization for deep neural network training with high-dimensional output targets. " in Proc. ICASSP, 2013, pp. 6655-6659.
- (2013) Proc. ICASSP , pp. 6655-6659
- Sainath, T.N.¹ Kingsbury, B.² Sindhwani, V.³ Arisoy, E.⁴ Ramabhadran, B.⁵

36
- 34547548235
- Probabilistic and bottleneck features for LVCSR of meetings
- F. GrÉzl, M. Karafiát, S. Kontar, and J. Cernokcý, "Probabilistic and bottleneck features for LVCSR of meetings, " in Proc. ICASSP, 2007.
- (2007) Proc. ICASSP
- GrÉzl, F.¹ Karafiát, M.² Kontar, S.³ Cernokcý, J.⁴

37
- 84906274730
- Sequencediscriminative training of deep neural networks
- Lyon, France, August
- K. Vesely, A. Ghoshal, L. Burget, and D. Povey, "Sequencediscriminative training of deep neural networks, " in Proc. Interspeech, Lyon, France, August 2013.
- (2013) Proc. Interspeech
- Vesely, K.¹ Ghoshal, A.² Burget, L.³ Povey, D.⁴

38
- 84858953642
- The Kaldi speech recognition toolkit
- December
- D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlícek, Y. Qian, P. Schwarz, J. Silovský, G. Stemmer, and K. Veselý, "The Kaldi speech recognition toolkit, " in Proc. IEEE ASRU, December 2011.
- (2011) Proc. IEEE ASRU
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Motlícek, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovský, J.¹¹ Stemmer, G.¹² Veselý, K.¹³

39
- 84938690750
- Speaker adaptation of deep neural networks using a hierarchy of output layers
- R. Price, K. Iso, and K. Shinoda, "Speaker adaptation of deep neural networks using a hierarchy of output layers, " in Proc. IEEE SLT, 2014.
- (2014) Proc. IEEE SLT
- Price, R.¹ Iso, K.² Shinoda, K.³

40
- 84983119674
- Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models
- P. Swietojanski and S. Renals, "Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models, " in Proc. IEEE SLT, 2014.
- (2014) Proc. IEEE SLT
- Swietojanski, P.¹ Renals, S.²

41
- 84906225505
- Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition
- O. Abdel-Hamid and H. Jiang, "Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition. " in Proc. Interspeech. ISCA, pp. 1248-1252.
- Proc. Interspeech. ISCA , pp. 1248-1252
- Abdel-Hamid, O.¹ Jiang, H.²

42
- 84893401626
- arXiv: 1308. 4214
- I. Goodfellow, D. Warde-Farley, P. Lamblin, V. Dumoulin, M. Mirza, R. Pascanu, J. Bergstra, F. Bastien, and Y. Bengio, "Pylearn2: A machine learning research library, " arXiv: 1308. 4214, 2013.
- (2013) Pylearn2: A Machine Learning Research Library
- Goodfellow, I.¹ Warde-Farley, D.² Lamblin, P.³ Dumoulin, V.⁴ Mirza, M.⁵ Pascanu, R.⁶ Bergstra, J.⁷ Bastien, F.⁸ Bengio, Y.⁹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.