-
1
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Trans. on Audio, Speech, and Language Processing, no. 99, pp. 14-22, 2010.
-
(2010)
IEEE Trans. on Audio, Speech, and Language Processing
, Issue.99
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.2
Hinton, G.3
-
2
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. E. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath et al., "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
Signal Processing Magazine, IEEE
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
-
3
-
-
70450185565
-
The RWTH Aachen university open source speech recognition system
-
Brighton, UK, Sep.
-
D. Rybach, C. Gollan, G. Heigold, B. Hoffmeister, J. Lööf, R. Schlüter, and H. Ney, "The RWTH Aachen university open source speech recognition system," in Proc. Interspeech, Brighton, UK, Sep. 2009, pp. 2111-2114.
-
(2009)
Proc. Interspeech
, pp. 2111-2114
-
-
Rybach, D.1
Gollan, C.2
Heigold, G.3
Hoffmeister, B.4
Lööf, J.5
Schlüter, R.6
Ney, H.7
-
4
-
-
33645209480
-
Sphinx-4: A flexible open source framework for speech recognition
-
Inc., Mountain View, CA, USA, Tech. Rep.
-
W. Walker, P. Lamere, P. Kwok, B. Raj, R. Singh, E. Gouvea, P. Wolf, and J. Woelfel, "Sphinx-4: A flexible open source framework for speech recognition," Sun Microsystems, Inc., Mountain View, CA, USA, Tech. Rep., 2004.
-
(2004)
Sun Microsystems
-
-
Walker, W.1
Lamere, P.2
Kwok, P.3
Raj, B.4
Singh, R.5
Gouvea, E.6
Wolf, P.7
Woelfel, J.8
-
5
-
-
84905265035
-
-
The HTK book version 3.4. Cambridge University Engineering Department
-
S. Young, G. Evermann, M. Gales, T. Hain, D. Kershaw, X. Liu, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK book version 3.4. Cambridge University Engineering Department, 2006.
-
(2006)
-
-
Young, S.1
Evermann, G.2
Gales, M.3
Hain, T.4
Kershaw, D.5
Liu, X.6
Moore, G.7
Odell, J.8
Ollason, D.9
Povey, D.10
Valtchev, V.11
Woodland, P.12
-
6
-
-
85009062693
-
Julius-an open source real-time large vocabulary recognition engine
-
Aalborg, Denmark, Sep.
-
A. Lee, T. Kawahara, and K. Shikano, "Julius-an open source real-time large vocabulary recognition engine," in Proc. Interspeech, Aalborg, Denmark, Sep. 2001, pp. 1691-1694.
-
(2001)
Proc. Interspeech
, pp. 1691-1694
-
-
Lee, A.1
Kawahara, T.2
Shikano, K.3
-
7
-
-
84858953642
-
The Kaldi speech recognition toolkit
-
Hawaii, USA, Dec.
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlcek, Y. Qian, P. Schwarz, J. Silovský, G. Stemmer, and K. Veselý, "The Kaldi speech recognition toolkit," in Proc. IEEE Automatic Speech Recognition and UnderstandingWorkshop (ASRU), Hawaii, USA, Dec. 2011.
-
(2011)
Proc. IEEE Automatic Speech Recognition and UnderstandingWorkshop (ASRU)
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlcek, P.8
Qian, Y.9
Schwarz, P.10
Silovský, J.11
Stemmer, G.12
Veselý, K.13
-
9
-
-
79959816017
-
Parallel training of neural networks for speech recognition
-
Makuhari, Japan, Sep.
-
K. Veselý, L. Burget, and F. Grézl, "Parallel training of neural networks for speech recognition," in Proc. Interspeech, Makuhari, Japan, Sep. 2010, pp. 2934-2937.
-
(2010)
Proc. Interspeech
, pp. 2934-2937
-
-
Veselý, K.1
Burget, L.2
Grézl, F.3
-
10
-
-
84858971297
-
Convolutive bottleneck network features for LVCSR
-
Hawaii, USA, Dec.
-
K. Veselý, M. Karafíat, and F. Grézl, "Convolutive bottleneck network features for LVCSR," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Hawaii, USA, Dec. 2011, pp. 42-47.
-
(2011)
Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
, pp. 42-47
-
-
Veselý, K.1
Karafíat, M.2
Grézl, F.3
-
11
-
-
77956509090
-
Rectified linear units improve restricted Boltzmann machines
-
Haifa, Israel, Jun.
-
V. Nair and G. E. Hinton, "Rectified linear units improve restricted Boltzmann machines," in Proc. of the 27th Int. Conf. on Machine Learning, Haifa, Israel, Jun. 2010, pp. 807-814.
-
(2010)
Proc. of the 27th Int. Conf. on Machine Learning
, pp. 807-814
-
-
Nair, V.1
Hinton, G.E.2
-
12
-
-
0001336749
-
Accelerated learning in layered neural networks
-
Dec.
-
S. A. Solla, E. Levin, and M. Fleisher, "Accelerated learning in layered neural networks," Complex Systems, vol. 2, no. 6, pp. 625-639, Dec. 1988.
-
(1988)
Complex Systems
, vol.2
, Issue.6
, pp. 625-639
-
-
Solla, S.A.1
Levin, E.2
Fleisher, M.3
-
14
-
-
84883190472
-
Large scale distributed deep networks
-
J. Dean, G. Corrado, R. Monga, K. Chen, M. Devin, Q. Le, M. Mao, M. Ranzato, A. Senior, P. Tucker, K. Yang, and A. Ng, "Large scale distributed deep networks," in Advances in Neural Information Processing Systems 25, 2012, pp. 1232-1240.
-
(2012)
Advances in Neural Information Processing Systems
, vol.25
, pp. 1232-1240
-
-
Dean, J.1
Corrado, G.2
Monga, R.3
Chen, K.4
Devin, M.5
Le, Q.6
Mao, M.7
Ranzato, M.8
Senior, A.9
Tucker, P.10
Yang, K.11
Ng, A.12
-
15
-
-
84905233897
-
Meannormalized stochastic gradient for large-scale deep learning
-
Florence, Italy, May
-
S. Wiesler, A. Richard, R. Schlüter, and H. Ney, "Meannormalized stochastic gradient for large-scale deep learning," in (submitted to) Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Florence, Italy, May 2014.
-
(2014)
Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing
-
-
Wiesler, S.1
Richard, A.2
Schlüter, R.3
Ney, H.4
-
16
-
-
84943274699
-
A direct adaptive method for faster backpropagation learning: The RPROP algorithm
-
M. Riedmiller and H. Braun, "A direct adaptive method for faster backpropagation learning: The RPROP algorithm," in Proc. of the Int. Conf. on Neural Networks, 1993, pp. 586-591.
-
(1993)
Proc. of the Int. Conf. on Neural Networks
, pp. 586-591
-
-
Riedmiller, M.1
Braun, H.2
-
17
-
-
84905270596
-
Cross-entropy vs squared error training: A theoretical and experimental comparison
-
Lyon, France, Aug.
-
P. Golik, P. Doetsch, and H. Ney, "Cross-entropy vs. squared error training: A theoretical and experimental comparison," in Proc. Interspeech, Lyon, France, Aug. 2013, pp. 1756-1760.
-
(2013)
Proc. Interspeech
, pp. 1756-1760
-
-
Golik, P.1
Doetsch, P.2
Ney, H.3
-
18
-
-
84887388950
-
An empirical study of learning rates in deep neural networks for speech recognition
-
A. Senior, G. Heigold, M. Ranzato, and K. Yang, "An empirical study of learning rates in deep neural networks for speech recognition," in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, vol. 1, 2013, pp. 6724-6728.
-
(2013)
Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, vol.1
, pp. 6724-6728
-
-
Senior, A.1
Heigold, G.2
Ranzato, M.3
Yang, K.4
-
19
-
-
0000635720
-
Progress in dynamic programming search for LVCSR
-
Aug.
-
H. Ney and S. Ortmanns, "Progress in dynamic programming search for LVCSR," Proc. of the IEEE, vol. 88, no. 8, pp. 1224-1240, Aug. 2000.
-
(2000)
Proc. of the IEEE
, vol.88
, Issue.8
, pp. 1224-1240
-
-
Ney, H.1
Ortmanns, S.2
-
21
-
-
0033709098
-
Tandem connectionist feature extraction for conventional HMM systems
-
Istanbul, Turkey, Jun.
-
H. Hermansky, D. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, vol. 3, Istanbul, Turkey, Jun. 2000, pp. 1635-1638.
-
(2000)
Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, vol.3
, pp. 1635-1638
-
-
Hermansky, H.1
Ellis, D.2
Sharma, S.3
-
22
-
-
33745213373
-
Multi-resolution RASTA filtering for TANDEM-based ASR
-
Lisbon, Portugal, Sep.
-
H. Hermansky and P. Fousek, "Multi-resolution RASTA filtering for TANDEM-based ASR," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005, pp. 361-364.
-
(2005)
Proc. Interspeech
, pp. 361-364
-
-
Hermansky, H.1
Fousek, P.2
-
23
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
Hawaii, USA, Dec.
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Hawaii, USA, Dec. 2011, pp. 24-29.
-
(2011)
Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
, pp. 24-29
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
24
-
-
80051609102
-
The RWTH 2010 QUAERO ASR evaluation system for English, French, and German
-
Prague, Czech, May
-
M. Sundermeyer, M. Nusbaum-Thom, S. Wiesler, C. Plahl, A. E. Mousa, S. Hahn, D. Nolden, R. Schlüter, and H. Ney, "The RWTH 2010 QUAERO ASR evaluation system for English, French, and German," in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Prague, Czech, May 2011, pp. 2212-2215.
-
(2011)
Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, pp. 2212-2215
-
-
Sundermeyer, M.1
Nusbaum-Thom, M.2
Wiesler, S.3
Plahl, C.4
Mousa, A.E.5
Hahn, S.6
Nolden, D.7
Schlüter, R.8
Ney, H.9
-
25
-
-
84893701254
-
Hybrid speech recognition with deep bidirectional LSTM
-
Olomouc, Czech Republic, Dec.
-
A. Graves, N. Jaitly, and A.-r. Mohamed, "Hybrid speech recognition with deep bidirectional LSTM," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Olomouc, Czech Republic, Dec. 2013, pp. 273-278.
-
(2013)
Proc. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
, pp. 273-278
-
-
Graves, A.1
Jaitly, N.2
Mohamed, A.-R.3
-
26
-
-
84867605836
-
Applying convolutional neural networks concepts to hybrid NNHMM model for speech recognition
-
Kyoto, Japan, Mar.
-
O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NNHMM model for speech recognition," in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Kyoto, Japan, Mar. 2012, pp. 4277-4280.
-
(2012)
Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, pp. 4277-4280
-
-
Abdel-Hamid, O.1
Mohamed, A.2
Jiang, H.3
Penn, G.4
-
27
-
-
70349213445
-
Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
-
Taipei, Taiwan, Apr.
-
B. Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling," in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Taipei, Taiwan, Apr. 2009, pp. 3761-3764.
-
(2009)
Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, pp. 3761-3764
-
-
Kingsbury, B.1
|