SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2014, Pages 3281-3285

RASR/NN: The RWTH neural network toolkit for speech recognition

(5) Wiesler, Simon a Richard, Alexander a Golik, Pavel a Schluter, Ralf a Ney, Hermann a,b

a RWTH AACHEN UNIVERSITY (Germany)

b UFR 919 Laboratoire d'Informatique Pour la Mécanique et les Sciences de l'Ingénieur (France)

Author keywords

acoustic modeling; GPU; neural networks; open source; RASR; speech recognition

Indexed keywords

ALGORITHMS; ELECTRIC NETWORK TOPOLOGY; NEURAL NETWORKS; OPTIMIZATION; SIGNAL PROCESSING;

ACOUSTIC MODEL; ACTIVATION FUNCTIONS; GPU; OPEN SOURCES; OPTIMIZATION ALGORITHMS; RASR; SPEECH RECOGNITION SYSTEMS; STATE-OF-THE-ART PERFORMANCE;

SPEECH RECOGNITION;

EID: 84905222840 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6854207 Document Type: Conference Paper

Times cited : (71)

References (27)

1
- 84055211743
- Acoustic modeling using deep belief networks
- A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Trans. on Audio, Speech, and Language Processing, no. 99, pp. 14-22, 2010.
- (2010) IEEE Trans. on Audio, Speech, and Language Processing , Issue.99 , pp. 14-22
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

2
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- G. Hinton, L. Deng, D. Yu, G. E. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath et al., "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, 2012.
- (2012) Signal Processing Magazine, IEEE , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.E.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.N.¹⁰

3
- 70450185565
- The RWTH Aachen university open source speech recognition system
- Brighton, UK, Sep.
- D. Rybach, C. Gollan, G. Heigold, B. Hoffmeister, J. Lööf, R. Schlüter, and H. Ney, "The RWTH Aachen university open source speech recognition system," in Proc. Interspeech, Brighton, UK, Sep. 2009, pp. 2111-2114.
- (2009) Proc. Interspeech , pp. 2111-2114
- Rybach, D.¹ Gollan, C.² Heigold, G.³ Hoffmeister, B.⁴ Lööf, J.⁵ Schlüter, R.⁶ Ney, H.⁷

4
- 33645209480
- Sphinx-4: A flexible open source framework for speech recognition
- Inc., Mountain View, CA, USA, Tech. Rep.
- W. Walker, P. Lamere, P. Kwok, B. Raj, R. Singh, E. Gouvea, P. Wolf, and J. Woelfel, "Sphinx-4: A flexible open source framework for speech recognition," Sun Microsystems, Inc., Mountain View, CA, USA, Tech. Rep., 2004.
- (2004) Sun Microsystems
- Walker, W.¹ Lamere, P.² Kwok, P.³ Raj, B.⁴ Singh, R.⁵ Gouvea, E.⁶ Wolf, P.⁷ Woelfel, J.⁸

5
- 84905265035
- The HTK book version 3.4. Cambridge University Engineering Department
- S. Young, G. Evermann, M. Gales, T. Hain, D. Kershaw, X. Liu, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK book version 3.4. Cambridge University Engineering Department, 2006.
- (2006)
- Young, S.¹ Evermann, G.² Gales, M.³ Hain, T.⁴ Kershaw, D.⁵ Liu, X.⁶ Moore, G.⁷ Odell, J.⁸ Ollason, D.⁹ Povey, D.¹⁰ Valtchev, V.¹¹ Woodland, P.¹²

6
- 85009062693
- Julius-an open source real-time large vocabulary recognition engine
- Aalborg, Denmark, Sep.
- A. Lee, T. Kawahara, and K. Shikano, "Julius-an open source real-time large vocabulary recognition engine," in Proc. Interspeech, Aalborg, Denmark, Sep. 2001, pp. 1691-1694.
- (2001) Proc. Interspeech , pp. 1691-1694
- Lee, A.¹ Kawahara, T.² Shikano, K.³

7
- 84858953642
- The Kaldi speech recognition toolkit
- Hawaii, USA, Dec.
- D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlcek, Y. Qian, P. Schwarz, J. Silovský, G. Stemmer, and K. Veselý, "The Kaldi speech recognition toolkit," in Proc. IEEE Automatic Speech Recognition and UnderstandingWorkshop (ASRU), Hawaii, USA, Dec. 2011.
- (2011) Proc. IEEE Automatic Speech Recognition and UnderstandingWorkshop (ASRU)
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Motlcek, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovský, J.¹¹ Stemmer, G.¹² Veselý, K.¹³

8
- 85054983194
- Berkeley
- D. Johnson. (2004) QuickNet, speech group at ICSI, Berkeley. [Online]. Available: http://www.icsi.berkeley.edu/Speech/qn.html
- (2004) QuickNet, Speech Group at ICSI
- Johnson, D.¹

9
- 79959816017
- Parallel training of neural networks for speech recognition
- Makuhari, Japan, Sep.
- K. Veselý, L. Burget, and F. Grézl, "Parallel training of neural networks for speech recognition," in Proc. Interspeech, Makuhari, Japan, Sep. 2010, pp. 2934-2937.
- (2010) Proc. Interspeech , pp. 2934-2937
- Veselý, K.¹ Burget, L.² Grézl, F.³

10
- 84858971297
- Convolutive bottleneck network features for LVCSR
- Hawaii, USA, Dec.
- K. Veselý, M. Karafíat, and F. Grézl, "Convolutive bottleneck network features for LVCSR," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Hawaii, USA, Dec. 2011, pp. 42-47.
- (2011) Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) , pp. 42-47
- Veselý, K.¹ Karafíat, M.² Grézl, F.³

11
- 77956509090
- Rectified linear units improve restricted Boltzmann machines
- Haifa, Israel, Jun.
- V. Nair and G. E. Hinton, "Rectified linear units improve restricted Boltzmann machines," in Proc. of the 27th Int. Conf. on Machine Learning, Haifa, Israel, Jun. 2010, pp. 807-814.
- (2010) Proc. of the 27th Int. Conf. on Machine Learning , pp. 807-814
- Nair, V.¹ Hinton, G.E.²

12
- 0001336749
- Accelerated learning in layered neural networks
- Dec.
- S. A. Solla, E. Levin, and M. Fleisher, "Accelerated learning in layered neural networks," Complex Systems, vol. 2, no. 6, pp. 625-639, Dec. 1988.
- (1988) Complex Systems , vol.2 , Issue.6 , pp. 625-639
- Solla, S.A.¹ Levin, E.² Fleisher, M.³

13
- 77956541496
- Deep learning via Hessian-free optimization
- J. Martens, "Deep learning via Hessian-free optimization," in Proc. of the 27th Int. Conf. on Machine Learning, vol. 951, 2010, p. 2010.
- (2010) Proc. of the 27th Int. Conf. on Machine Learning , vol.951 , pp. 2010
- Martens, J.¹

14
- 84883190472
- Large scale distributed deep networks
- J. Dean, G. Corrado, R. Monga, K. Chen, M. Devin, Q. Le, M. Mao, M. Ranzato, A. Senior, P. Tucker, K. Yang, and A. Ng, "Large scale distributed deep networks," in Advances in Neural Information Processing Systems 25, 2012, pp. 1232-1240.
- (2012) Advances in Neural Information Processing Systems , vol.25 , pp. 1232-1240
- Dean, J.¹ Corrado, G.² Monga, R.³ Chen, K.⁴ Devin, M.⁵ Le, Q.⁶ Mao, M.⁷ Ranzato, M.⁸ Senior, A.⁹ Tucker, P.¹⁰ Yang, K.¹¹ Ng, A.¹²

15
- 84905233897
- Meannormalized stochastic gradient for large-scale deep learning
- Florence, Italy, May
- S. Wiesler, A. Richard, R. Schlüter, and H. Ney, "Meannormalized stochastic gradient for large-scale deep learning," in (submitted to) Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Florence, Italy, May 2014.
- (2014) Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing
- Wiesler, S.¹ Richard, A.² Schlüter, R.³ Ney, H.⁴

16
- 84943274699
- A direct adaptive method for faster backpropagation learning: The RPROP algorithm
- M. Riedmiller and H. Braun, "A direct adaptive method for faster backpropagation learning: The RPROP algorithm," in Proc. of the Int. Conf. on Neural Networks, 1993, pp. 586-591.
- (1993) Proc. of the Int. Conf. on Neural Networks , pp. 586-591
- Riedmiller, M.¹ Braun, H.²

17
- 84905270596
- Cross-entropy vs squared error training: A theoretical and experimental comparison
- Lyon, France, Aug.
- P. Golik, P. Doetsch, and H. Ney, "Cross-entropy vs. squared error training: A theoretical and experimental comparison," in Proc. Interspeech, Lyon, France, Aug. 2013, pp. 1756-1760.
- (2013) Proc. Interspeech , pp. 1756-1760
- Golik, P.¹ Doetsch, P.² Ney, H.³

18
- 84887388950
- An empirical study of learning rates in deep neural networks for speech recognition
- A. Senior, G. Heigold, M. Ranzato, and K. Yang, "An empirical study of learning rates in deep neural networks for speech recognition," in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, vol. 1, 2013, pp. 6724-6728.
- (2013) Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing , vol.1 , pp. 6724-6728
- Senior, A.¹ Heigold, G.² Ranzato, M.³ Yang, K.⁴

19
- 0000635720
- Progress in dynamic programming search for LVCSR
- Aug.
- H. Ney and S. Ortmanns, "Progress in dynamic programming search for LVCSR," Proc. of the IEEE, vol. 88, no. 8, pp. 1224-1240, Aug. 2000.
- (2000) Proc. of the IEEE , vol.88 , Issue.8 , pp. 1224-1240
- Ney, H.¹ Ortmanns, S.²

20
- 0003573244
- Norwell, MA, USA: Kluwer Academic Publishers
- H. A. Bourlard and N. Morgan, Connectionist speech recognition: A hybrid approach. Norwell, MA, USA: Kluwer Academic Publishers, 1993.
- (1993) Connectionist Speech Recognition: A Hybrid Approach
- Bourlard, H.A.¹ Morgan, N.²

21
- 0033709098
- Tandem connectionist feature extraction for conventional HMM systems
- Istanbul, Turkey, Jun.
- H. Hermansky, D. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, vol. 3, Istanbul, Turkey, Jun. 2000, pp. 1635-1638.
- (2000) Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing , vol.3 , pp. 1635-1638
- Hermansky, H.¹ Ellis, D.² Sharma, S.³

22
- 33745213373
- Multi-resolution RASTA filtering for TANDEM-based ASR
- Lisbon, Portugal, Sep.
- H. Hermansky and P. Fousek, "Multi-resolution RASTA filtering for TANDEM-based ASR," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005, pp. 361-364.
- (2005) Proc. Interspeech , pp. 361-364
- Hermansky, H.¹ Fousek, P.²

23
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- Hawaii, USA, Dec.
- F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Hawaii, USA, Dec. 2011, pp. 24-29.
- (2011) Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

24
- 80051609102
- The RWTH 2010 QUAERO ASR evaluation system for English, French, and German
- Prague, Czech, May
- M. Sundermeyer, M. Nusbaum-Thom, S. Wiesler, C. Plahl, A. E. Mousa, S. Hahn, D. Nolden, R. Schlüter, and H. Ney, "The RWTH 2010 QUAERO ASR evaluation system for English, French, and German," in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Prague, Czech, May 2011, pp. 2212-2215.
- (2011) Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing , pp. 2212-2215
- Sundermeyer, M.¹ Nusbaum-Thom, M.² Wiesler, S.³ Plahl, C.⁴ Mousa, A.E.⁵ Hahn, S.⁶ Nolden, D.⁷ Schlüter, R.⁸ Ney, H.⁹

25
- 84893701254
- Hybrid speech recognition with deep bidirectional LSTM
- Olomouc, Czech Republic, Dec.
- A. Graves, N. Jaitly, and A.-r. Mohamed, "Hybrid speech recognition with deep bidirectional LSTM," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Olomouc, Czech Republic, Dec. 2013, pp. 273-278.
- (2013) Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) , pp. 273-278
- Graves, A.¹ Jaitly, N.² Mohamed, A.-R.³

26
- 84867605836
- Applying convolutional neural networks concepts to hybrid NNHMM model for speech recognition
- Kyoto, Japan, Mar.
- O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NNHMM model for speech recognition," in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Kyoto, Japan, Mar. 2012, pp. 4277-4280.
- (2012) Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing , pp. 4277-4280
- Abdel-Hamid, O.¹ Mohamed, A.² Jiang, H.³ Penn, G.⁴

27
- 70349213445
- Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
- Taipei, Taiwan, Apr.
- B. Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling," in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Taipei, Taiwan, Apr. 2009, pp. 3761-3764.
- (2009) Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing , pp. 3761-3764
- Kingsbury, B.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.