SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn 2015-January, Issue , 2015, Pages 3600-3604

Training deep bidirectional LSTM acoustic model for LVCSR by a context-sensitive-chunk BPTT approach

(3) Chen, Kai a,b Yan, Zhi Jie b Huo, Qiang b

a UNIVERSITY OF SCIENCE AND TECHNOLOGY OF CHINA (China)

b MICROSOFT RESEARCH ASIA (China)

Author keywords

BPTT; Context sensitive chunk; DBLSTM; DNN; Long short term memory; LVCSR

Indexed keywords

BRAIN; CONTINUOUS SPEECH RECOGNITION; SPEECH COMMUNICATION;

BPTT; CONTEXT SENSITIVE; DBLSTM; LONG SHORT TERM MEMORY; LVCSR;

SPEECH RECOGNITION;

EID: 84959076031 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (19)

References (30)

1
- 0028392167
- An application of recurrent nets to phone proba-bility estimation
- A. J. Robinson, "An application of recurrent nets to phone proba-bility estimation, " IEEE Transactions on Neural Networks, vol. 5, no. 2, pp. 298-305, 1994.
- (1994) IEEE Transactions on Neural Networks , vol.5 , Issue.2 , pp. 298-305
- Robinson, A.J.¹

2
- 0001592322
- The use of recurrentneural networks in continuous speech recognition
- C.-H. Lee, F. Soong, and K. Paliwal (Eds.)
- T. Robinson, M. Hochberg, and S. Renals, "The use of recurrentneural networks in continuous speech recognition, " in C.-H. Lee, F. Soong, and K. Paliwal (Eds.), Automatic Speech and SpeakerRecognition, pp. 233-258, 1996.
- (1996) Automatic Speech and SpeakerRecognition , pp. 233-258
- Robinson, T.¹ Hochberg, M.² Renals, S.³

3
- 0031268931
- Bidirectional recurrent neural net-works
- M. Schuster and K. K. Paliwal, "Bidirectional recurrent neural net-works, " IEEE Transactions on Signal Processing, vol. 9, no. 11, pp. 2673-2681, 1997.
- (1997) IEEE Transactions on Signal Processing , vol.9 , Issue.11 , pp. 2673-2681
- Schuster, M.¹ Paliwal, K.K.²

4
- 85032751458
- Deep neural networks for acoustic modeling in speech recogni-tion: The shared views of four research groups
- G. Hinton, L. Deng, D. Yu, G. E. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recogni-tion: The shared views of four research groups, " IEEE Signal Pro-cessing Magazine, vol. 29, no. 16, pp. 82-97, 2012.
- (2012) IEEE Signal Pro-cessing Magazine , vol.29 , Issue.16 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.E.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.N.¹⁰ Kingsbury, B.¹¹

5
- 84923929378
- Springer
- D. Yu and L. Deng, Automatic Speech Recognition: A Deep Learn-ing Approach, Springer, 2015.
- (2015) Automatic Speech Recognition: A Deep Learn-ing Approach
- Yu, D.¹ Deng, L.²

6
- 0031573117
- Long short-term memory
- S. Hochreiter, and J. Schmidhuber, "Long short-term memory, "Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

7
- 0034293152
- Learning to forget: Continual prediction with LSTM
- F. A. Gers, J. Schmidhuber, and F. Cummins, "Learning to forget: continual prediction with LSTM, " Neural Computation, vol. 12, no. 10, pp. 2451-2471, 2000.
- (2000) Neural Computation , vol.12 , Issue.10 , pp. 2451-2471
- Gers, F.A.¹ Schmidhuber, J.² Cummins, F.³

8
- 0041965934
- Learning pre-cise timing with LSTM recurrent networks
- F. A. Gers, N. N. Schraudolph, and J. Schmidhuber, "Learning pre-cise timing with LSTM recurrent networks, " Journal of MachineLearning Research, vol. 3, pp. 115-143, 2003.
- (2003) Journal of MachineLearning Research , vol.3 , pp. 115-143
- Gers, F.A.¹ Schraudolph, N.N.² Schmidhuber, J.³

9
- 27744588611
- Framewise phoneme classifica-tion with bidirectional LSTM and other neural network architec-tures
- A. Graves, and J. Schmidhuber, "Framewise phoneme classifica-tion with bidirectional LSTM and other neural network architec-tures, " Neural Networks vol. 18, no. 5, pp. 602-610, 2005.
- (2005) Neural Networks , vol.18 , Issue.5 , pp. 602-610
- Graves, A.¹ Schmidhuber, J.²

10
- 33646258991
- BidirectionalLSTM networks for improved phoneme classification and recog-nition
- Springer LNCS 3697
- A. Graves, S. Fernánd ez and J. Schmidhuber, "BidirectionalLSTM networks for improved phoneme classification and recog-nition, " Proc. ICANN-2005, Springer LNCS 3697, pp. 799-804.
- Proc. ICANN-2005 , pp. 799-804
- Graves, A.¹ Fernánd Ez, S.² Schmidhuber, J.³

11
- 64849110608
- A novel connectionist system for unconstrainedhand writing recognition
- A. Graves, M. Liwicki, S. Fernánd ez, R. Bertolami, H. Bunke, and J. Schmidhuber, "A novel connectionist system for unconstrainedhand writing recognition, " IEEE Transactions on PAMI, vol. 31, no. 5, pp. 855-868, 2009.
- (2009) IEEE Transactions on PAMI , vol.31 , Issue.5 , pp. 855-868
- Graves, A.¹ Liwicki, M.² Fernánd Ez, S.³ Bertolami, R.⁴ Bunke, H.⁵ Schmidhuber, J.⁶

12
- 34250704813
- Con-nectionist temporal classification: Labelling unsegmented sequencedata with recurrent neural networks
- A. Graves, S. Fernánd ez, F. Gomez, and J. Schmidhuber "Con-nectionist temporal classification: labelling unsegmented sequencedata with recurrent neural networks, " Proc. ICML-2006, pp. 369-376.
- Proc. ICML-2006 , pp. 369-376
- Graves, A.¹ Fernánd Ez, S.² Gomez, F.³ Schmidhuber, J.⁴

13
- 84890543083
- Speech recogni-tion with deep recurrent neural networks
- A. Graves, A. R. Mohamed, and G. Hinton, "Speech recogni-tion with deep recurrent neural networks, " Proc. ICASSP-2013, pp. 6645-6649.
- Proc. ICASSP-2013 , pp. 6645-6649
- Graves, A.¹ Mohamed, A.R.² Hinton, G.³

14
- 84936143793
- Towards end-to-end speech recognitionwith recurrent neural networks
- A. Graves, and N. Jaitly, "Towards end-to-end speech recognitionwith recurrent neural networks, " Proc. ICML-2014, pp. 1764-1772.
- (2014) Proc. ICML , pp. 1764-1772
- Graves, A.¹ Jaitly, N.²

15
- 0003573244
- Kluwer, Norwell, MA
- H. Bourlard and N. Morgan, Connectionist Speech Recognition: A Hybrid Approach, Kluwer, Norwell, MA, 1993.
- (1993) Connectionist Speech Recognition: A Hybrid Approach
- Bourlard, H.¹ Morgan, N.²

16
- 84893701254
- Hybrid speech recog-nition with deep bidirectional LSTM
- A. Graves, N. Jaitly, and A. R. Mohamed, "Hybrid speech recog-nition with deep bidirectional LSTM, " Proc. ASRU-2013, pp. 273-278.
- Proc. ASRU-2013 , pp. 273-278
- Graves, A.¹ Jaitly, N.² Mohamed, A.R.³

17
- 84910046405
- Long short-term memory re-current neural network architectures for large scale acoustic mod-eling
- H. Sak, A. Senior, and F. Beaufays, "Long short-term memory re-current neural network architectures for large scale acoustic mod-eling, " Proc. INTERSPEECH-2014, pp. 338-342.
- Proc. INTERSPEECH-2014 , pp. 338-342
- Sak, H.¹ Senior, A.² Beaufays, F.³

18
- 84910072094
- Sequence discriminative distributed train-ing of long short-term memory recurrent neural networks
- H. Sak, O. Vinyals, G. Heigold, A. Senior, E. McDermott, R. Monga, and M. Mao, "Sequence discriminative distributed train-ing of long short-term memory recurrent neural networks, " Proc. INTERSPEECH-2014, pp. 1209-1213.
- Proc. INTERSPEECH-2014 , pp. 1209-1213
- Sak, H.¹ Vinyals, O.² Heigold, G.³ Senior, A.⁴ McDermott, E.⁵ Monga, R.⁶ Mao, M.⁷

19
- 0001609567
- An efficient gradient-based algo-rithm for on-line training of recurrent network trajectories
- R. J. Williams, and J. Peng, "An efficient gradient-based algo-rithm for on-line training of recurrent network trajectories, " NeuralComputation, vol. 2, no. 4, pp. 490-501, 1990.
- (1990) NeuralComputation , vol.2 , Issue.4 , pp. 490-501
- Williams, R.J.¹ Peng, J.²

20
- 84976226316
- F. Eyben, J. Bergmann, and F. Weninger, CURRENNT: CUDA-enabled Machine Learning Library For Recurrent Neural Net-works, http: //sourceforge. net/projects/currennt/.
- CURRENNT: CUDA-enabled Machine Learning Library for Recurrent Neural Net-works
- Eyben, F.¹ Bergmann, J.² Weninger, F.³

21
- 84962501970
- A context-sensitive-chunkBPTT approach to training deep LSTM/BLSTM recurrent neuralnetworks for offline hand writing recognition
- K. Chen, Z.-J. Yan, and Q. Huo, "A context-sensitive-chunkBPTT approach to training deep LSTM/BLSTM recurrent neuralnetworks for offline hand writing recognition, " Proc. ICDAR-2015.
- Proc. ICDAR-2015
- Chen, K.¹ Yan, Z.-J.² Huo, Q.³

22
- 84942251167
- Fast and robust trainingof recurrent neural networks for offline hand writing recognition
- P. Doetsch, M. Kozielski, and H. Ney, "Fast and robust trainingof recurrent neural networks for offline hand writing recognition, "Proc. ICFHR-2014, pp. 279-284.
- Proc. ICFHR-2014 , pp. 279-284
- Doetsch, P.¹ Kozielski, M.² Ney, H.³

23
- 84910072497
- Un-folded recurrent neural network for speech recognition
- G. Saon, H. Soltau, A. Emami, and M. Picheny, "Un-folded recurrent neural network for speech recognition, " Proc. INTERSPEECH-2014, pp. 343-347.
- Proc. INTERSPEECH-2014 , pp. 343-347
- Saon, G.¹ Soltau, H.² Emami, A.³ Picheny, M.⁴

24
- 84887388950
- An em-pirical study of learning rates in deep neural networks for speechrecognition
- A. Senior, G. Heigold, M. A. Ranzato, and K. Yang, "An em-pirical study of learning rates in deep neural networks for speechrecognition, " Proc. ICASSP-2013, pp. 6724-6728.
- Proc. ICASSP-2013 , pp. 6724-6728
- Senior, A.¹ Heigold, G.² Ranzato, M.A.³ Yang, K.⁴

25
- 84905233897
- Mean-normalized stochastic gradient for large-scale deep learning
- S. Wiesler, A. Richard, R. Schluter, and H. Ney, "Mean-normalized stochastic gradient for large-scale deep learning, " Proc. ICASSP-2014, pp. 180-184.
- Proc. ICASSP-2014 , pp. 180-184
- Wiesler, S.¹ Richard, A.² Schluter, R.³ Ney, H.⁴

26
- 85016587886
- Switch-board: Telephone speech corpus for research and development
- J. J. Godfrey, Edward. C. Holliman, and J. McDaniel, "Switch-board: telephone speech corpus for research and development, "Proc. ICASSP-1992, pp. I-517-520.
- Proc. ICASSP-1992 , pp. I517-520
- Godfrey, J.J.¹ Holliman, E.C.² McDaniel, J.³

27
- 84867720412
- G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors, " http: //arxiv. org/abs/1207. 0580, 2012.
- (2012) Mproving Neural Networks by Preventing Co-adaptation of Feature Detectors
- Hinton, G.E.¹ Srivastava, N.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

28
- 84865801985
- Conversational speech tran-scription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech tran-scription using context-dependent deep neural networks, " Proc. INTERSPEECH-2011, pp. 437-440.
- Proc. INTERSPEECH-2011 , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

29
- 84906225757
- A scalable approach to usingDNN-derived features in GMM-HMM based acoustic modeling forLVCSR
- Z.-J. Yan, Q. Huo, and J. Xu, "A scalable approach to usingDNN-derived features in GMM-HMM based acoustic modeling forLVCSR, " Proc. INTERSPEECH-2013, pp. 104-108.
- Proc. INTERSPEECH-2013 , pp. 104-108
- Yan, Z.-J.¹ Huo, Q.² Xu, J.³

30
- 0030121298
- A study on the use of bi-directional contex-tual dependence in Markov rand om field-based acoustic modellingfor speech recognition
- Q. Huo and C. Chan, "A study on the use of bi-directional contex-tual dependence in Markov rand om field-based acoustic modellingfor speech recognition, " Computer Speech and Language, vol. 10, pp. 95-105, 1996.
- (1996) Computer Speech and Language , vol.10 , pp. 95-105
- Huo, Q.¹ Chan, C.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.