-
1
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
Lecun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
2
-
-
84945900998
-
Best practices for convolutional neural networks applied to visual document analysis
-
P. Y. Simard, D. Steinkraus, and J. C. Platt, "Best practices for convolutional neural networks applied to visual document analysis," in International Conference on Document Analysis and Recognition (ICDAR), 2003, pp. 958-963.
-
(2003)
International Conference on Document Analysis and Recognition (ICDAR)
, pp. 958-963
-
-
Simard, P.Y.1
Steinkraus, D.2
Platt, J.C.3
-
4
-
-
84876231242
-
ImageNet classification with deep convolutional neural networks
-
A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in Neural Information Processing Systems (NIPS), 2012, pp. 1106-1114.
-
(2012)
Neural Information Processing Systems (NIPS)
, pp. 1106-1114
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
5
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Interspeech, 2011, pp. 437-440.
-
(2011)
Interspeech
, pp. 437-440
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
6
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Automatic Speech Recognition and Understanding Workshop (ASRU), 2011, pp. 24-29.
-
(2011)
Automatic Speech Recognition and Understanding Workshop (ASRU)
, pp. 24-29
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
7
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
November
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," in IEEE Signal Processing Magazine, November 2012, pp. 82-97.
-
(2012)
IEEE Signal Processing Magazine
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
Kingsbury, B.11
-
8
-
-
84878379108
-
Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
-
B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in Interspeech, 2012.
-
(2012)
Interspeech
-
-
Kingsbury, B.1
Sainath, T.N.2
Soltau, H.3
-
9
-
-
84905249354
-
-
http://www.iarpa.gov/Programs/ia/Babel/babel.html.
-
-
-
-
10
-
-
84890507010
-
Developing speech recognition systems for corpus indexing under the IARPA Babel program
-
J. Cui, X. Cui, B. Ramabhadran, J. Kim, B. Kingsbury, J. Mamou, L. Mangu, M. Picheny, T. N. Sainath, and A. Sethy, "Developing speech recognition systems for corpus indexing under the IARPA Babel program," in International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2013, pp. 6753-6757.
-
(2013)
International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 6753-6757
-
-
Cui, J.1
Cui, X.2
Ramabhadran, B.3
Kim, J.4
Kingsbury, B.5
Mamou, J.6
Mangu, L.7
Picheny, M.8
Sainath, T.N.9
Sethy, A.10
-
11
-
-
84890537373
-
A high-performance Cantonese keyword search system
-
B. Kingsbury, J. Cui, X. Cui, M. J. F. Gales, K. Knill, J. Mamou, L. Mangu, D. Nolden, M. Picheny, B. Ramabhadran, R. Schluter, A. Sethy, and P. C. Woodland, "A high-performance Cantonese keyword search system," in International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2013, pp. 8277-8281.
-
(2013)
International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 8277-8281
-
-
Kingsbury, B.1
Cui, J.2
Cui, X.3
Gales, M.J.F.4
Knill, K.5
Mamou, J.6
Mangu, L.7
Nolden, D.8
Picheny, M.9
Ramabhadran, B.10
Schluter, R.11
Sethy, A.12
Woodland, P.C.13
-
12
-
-
0031647824
-
A frequency warping approach to speaker normalization
-
L. Lee and R. Rose, "A frequency warping approach to speaker normalization," IEEE Transactions on Speech and Audio Processing, vol. 6, no. 1, pp. 49-60, 1998.
-
(1998)
IEEE Transactions on Speech and Audio Processing
, vol.6
, Issue.1
, pp. 49-60
-
-
Lee, L.1
Rose, R.2
-
13
-
-
0032638856
-
Semi-tied covariance matrices for hidden Markov models
-
M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Transactions on Speech and Audio Processing, vol. 7, no. 3, pp. 272-281, 1999.
-
(1999)
IEEE Transactions on Speech and Audio Processing
, vol.7
, Issue.3
, pp. 272-281
-
-
Gales, M.J.F.1
-
14
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, pp. 75-98, 1998.
-
(1998)
Computer Speech and Language
, vol.12
, pp. 75-98
-
-
Gales, M.J.F.1
-
15
-
-
79951796005
-
The IBM Attila speech recognition toolkit
-
H. Soltau, G. Saon, and B. Kingsbury, "The IBM Attila speech recognition toolkit," in Spoken Language Technology Workshop (SLT), 2010, pp. 97-101.
-
(2010)
Spoken Language Technology Workshop (SLT)
, pp. 97-101
-
-
Soltau, H.1
Saon, G.2
Kingsbury, B.3
-
16
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech and Language, vol. 9, pp. 171-185, 1995.
-
(1995)
Computer Speech and Language
, vol.9
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
17
-
-
84865265602
-
Hidden Markov acoustic modeling with bootstrap and restructuring for low-resourced languages
-
X. Cui, J. Xue, X. Chen, P. A. Olsen, P. L. Dognin, U. V. Chaudhari, J. R. Hershey, and B. Zhou, "Hidden Markov acoustic modeling with bootstrap and restructuring for low-resourced languages," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 8, pp. 2252-2264, 2012.
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.8
, pp. 2252-2264
-
-
Cui, X.1
Xue, J.2
Chen, X.3
Olsen, P.A.4
Dognin, P.L.5
Chaudhari, U.V.6
Hershey, J.R.7
Zhou, B.8