-
1
-
-
85010720695
-
Deep speech 2: End-to-end speech recognition in english and Mandarin
-
abs/1512.02595
-
Dario Amodei, Rishita Anubhai, Eric Battenberg, Carl Case, Jared Casper, Bryan C. Catanzaro, Jingdong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse Engel, Linxi Fan, Christopher Fougner, Tony Han, Awni Y. Hannun, Billy Jun, Patrick LeGresley, Libby Lin, Sharan Narang, Andrew Y. Ng, Sherjil Ozair, Ryan Prenger, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Yi Wang, Zhiqian Wang, Chong Wang, Bo Xiao, Dani Yogatama, Jun Zhan, and Zhenyao Zhu. Deep speech 2: End-to-end speech recognition in english and mandarin. CoRR, abs/1512.02595, 2015. URL http://arxiv.org/abs/1512.02595.
-
(2015)
CoRR
-
-
Amodei, D.1
Anubhai, R.2
Battenberg, E.3
Case, C.4
Casper, J.5
Catanzaro, B.C.6
Chen, J.7
Chrzanowski, M.8
Coates, A.9
Diamos, G.10
Elsen, E.11
Engel, J.12
Fan, L.13
Fougner, C.14
Han, T.15
Hannun, A.Y.16
Jun, B.17
LeGresley, P.18
Lin, L.19
Narang, S.20
Ng, A.Y.21
Ozair, S.22
Prenger, R.23
Raiman, J.24
Satheesh, S.25
Seetapun, D.26
Sengupta, S.27
Wang, Y.28
Wang, Z.29
Wang, C.30
Xiao, B.31
Yogatama, D.32
Zhan, J.33
Zhu, Z.34
more..
-
3
-
-
0001163081
-
Number of stable points for spin-glasses and neural networks of higher orders
-
Pierre Baldi and Santosh S Venkatesh. Number of stable points for spin-glasses and neural networks of higher orders. Physical Review Letters, 58(9):913, 1987.
-
(1987)
Physical Review Letters
, vol.58
, Issue.9
, pp. 913
-
-
Baldi, P.1
Venkatesh, S.S.2
-
4
-
-
84955254040
-
Nanoconnectomic upper bound on the variability of synaptic plasticity
-
Thomas M Bartol, Cailey Bromer, Justin Kinney, Michael A Chirillo, Jennifer N Bourne, Kristen M Harris, and Terrence J Sejnowski. Nanoconnectomic upper bound on the variability of synaptic plasticity. eLife, 4: e10778, 2016.
-
(2016)
eLife
, vol.4
-
-
Bartol, T.M.1
Bromer, C.2
Kinney, J.3
Chirillo, M.A.4
Bourne, J.N.5
Harris, K.M.6
Sejnowski, T.J.7
-
5
-
-
84899857287
-
Short-term memory capacity in networks via the restricted isometry property
-
Adam S Charles, Han Lun Yap, and Christopher J Rozell. Short-term memory capacity in networks via the restricted isometry property. Neural computation, 26(6):1198-1235, 2014.
-
(2014)
Neural Computation
, vol.26
, Issue.6
, pp. 1198-1235
-
-
Charles, A.S.1
Yap, H.L.2
Rozell, C.J.3
-
6
-
-
84961291190
-
-
arXiv preprint
-
Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078, 2014.
-
(2014)
Learning Phrase Representations Using Rnn Encoder-Decoder for Statistical Machine Translation
-
-
Cho, K.1
Van Merriënboer, B.2
Gulcehre, C.3
Bahdanau, D.4
Bougares, F.5
Schwenk, H.6
Bengio, Y.7
-
8
-
-
84918441630
-
Geometrical and statistical properties of systems of linear inequalities with applications in pattern recognition
-
Thomas M Cover. Geometrical and statistical properties of systems of linear inequalities with applications in pattern recognition. IEEE transactions on electronic computers, (3):326-334, 1965.
-
(1965)
IEEE Transactions on Electronic Computers
, Issue.3
, pp. 326-334
-
-
Cover, T.M.1
-
9
-
-
84942597150
-
Parallelizing exploration-exploitation tradeoffs in Gaussian process bandit optimization
-
Thomas Desautels, Andreas Krause, and Joel W Burdick. Parallelizing exploration-exploitation tradeoffs in gaussian process bandit optimization. The Journal of Machine Learning Research, 15(1):3873-3923, 2014.
-
(2014)
The Journal of Machine Learning Research
, vol.15
, Issue.1
, pp. 3873-3923
-
-
Desautels, T.1
Krause, A.2
Burdick, J.W.3
-
11
-
-
85071025850
-
Intelligible language modeling with input switched affine networks
-
Jakob Foerster, Justin Gilmer, Jan Chorowski, Jascha Sohl-Dickstein, and David Sussillo. Intelligible language modeling with input switched affine networks. ICLR 2017 submission, 2016.
-
(2016)
ICLR 2017 Submission
-
-
Foerster, J.1
Gilmer, J.2
Chorowski, J.3
Sohl-Dickstein, J.4
Sussillo, D.5
-
12
-
-
57749113625
-
Memory traces in dynamical systems
-
Surya Ganguli, Dongsung Huh, and Haim Sompolinsky. Memory traces in dynamical systems. Proceedings of the National Academy of Sciences, 105(48):18970-18975, 2008.
-
(2008)
Proceedings of the National Academy of Sciences
, vol.105
, Issue.48
, pp. 18970-18975
-
-
Ganguli, S.1
Huh, D.2
Sompolinsky, H.3
-
13
-
-
36149029786
-
The space of interactions in neural network models
-
Elizabeth Gardner. The space of interactions in neural network models. Journal of physics A: Mathematical and general, 21(1):257, 1988.
-
(1988)
Journal of Physics A: Mathematical and General
, vol.21
, Issue.1
, pp. 257
-
-
Gardner, E.1
-
14
-
-
0033344091
-
Learning to forget: Continual prediction with lstm
-
Felix A. Gers, Jurgen Schmidhuber, and Fred Cummins. Learning to forget: Continual prediction with lstm. Artificial Neural Networks, ICANN 99. Ninth International Conference on (Conf. Publ. No. 470), 2:850-855, 1999.
-
(1999)
Artificial Neural Networks, ICANN 99. Ninth International Conference on (Conf. Publ. No. 470)
, vol.2
, pp. 850-855
-
-
Gers, F.A.1
Schmidhuber, J.2
Cummins, F.3
-
15
-
-
64849110608
-
A novel connectionist system for unconstrained handwriting recognition. Pattern analysis and machine Intelligence
-
Alex Graves, Marcus Liwicki, Santiago Fernández, Roman Bertolami, Horst Bunke, and Jürgen Schmidhuber. A novel connectionist system for unconstrained handwriting recognition. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 31(5):855-868, 2009.
-
(2009)
IEEE Transactions on
, vol.31
, Issue.5
, pp. 855-868
-
-
Graves, A.1
Liwicki, M.2
Fernández, S.3
Bertolami, R.4
Bunke, H.5
Schmidhuber, J.6
-
16
-
-
84943739264
-
-
arXiv preprint
-
Klaus Greff, Rupesh Kumar Srivastava, Jan Koutník, Bas R Steunebrink, and Jürgen Schmidhuber. Lstm: A search space odyssey. arXiv preprint arXiv:1503.04069, 2015.
-
(2015)
Lstm: A Search Space Odyssey
-
-
Greff, K.1
Srivastava, R.K.2
Koutník, J.3
Steunebrink, B.R.4
Schmidhuber, J.5
-
22
-
-
1842421269
-
Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication
-
Herbert Jaeger and Harald Haas. Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication. science, 304(5667):78-80, 2004.
-
(2004)
Science
, vol.304
, Issue.5667
, pp. 78-80
-
-
Jaeger, H.1
Haas, H.2
-
24
-
-
84994193137
-
Exploring the limits of language modeling
-
abs
-
Rafal Józefowicz, Oriol Vinyals, Mike Schuster, Noam Shazeer, and Yonghui Wu. Exploring the limits of language modeling. CoRR, abs/1602.02410, 2016. URL http://arxiv.org/abs/1602.02410.
-
(2016)
CoRR
-
-
Józefowicz, R.1
Vinyals, O.2
Schuster, M.3
Shazeer, N.4
Wu, Y.5
-
26
-
-
85083951076
-
ADaM: A method for stochastic optimization
-
abs
-
Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization. CoRR, abs/1412.6980, 2014. URL http://arxiv.org/abs/1412.6980.
-
(2014)
CoRR
-
-
Kingma, D.P.1
Ba, J.2
-
27
-
-
0002702215
-
Vapnik-chervonenkis dimension of recurrent neural networks
-
Pascal Koiran and Eduardo D Sontag. Vapnik-chervonenkis dimension of recurrent neural networks. Discrete Applied Mathematics, 86(1):63-79, 1998.
-
(1998)
Discrete Applied Mathematics
, vol.86
, Issue.1
, pp. 63-79
-
-
Koiran, P.1
Sontag, E.D.2
-
29
-
-
0036834701
-
Real-time computing without stable states: A new framework for neural computation based on perturbations
-
Wolfgang Maass, Thomas Natschläger, and Henry Markram. Real-time computing without stable states: A new framework for neural computation based on perturbations. Neural computation, 14(11):2531-2560, 2002.
-
(2002)
Neural Computation
, vol.14
, Issue.11
, pp. 2531-2560
-
-
Maass, W.1
Natschläger, T.2
Markram, H.3
-
31
-
-
84887390404
-
Context-dependent computation by recurrent dynamics in prefrontal cortex
-
Valerio Mante, David Sussillo, Krishna V Shenoy, and William T Newsome. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature, 503(7474):78-84, 2013.
-
(2013)
Nature
, vol.503
, Issue.7474
, pp. 78-84
-
-
Mante, V.1
Sussillo, D.2
Shenoy, K.V.3
Newsome, W.T.4
-
33
-
-
84965163874
-
Deep knowledge tracing
-
Chris Piech, Jonathan Bassen, Jonathan Huang, Surya Ganguli, Mehran Sahami, Leonidas J Guibas, and Jascha Sohl-Dickstein. Deep knowledge tracing. In Advances in Neural Information Processing Systems, pp. 505-513, 2015.
-
(2015)
Advances in Neural Information Processing Systems
, pp. 505-513
-
-
Piech, C.1
Bassen, J.2
Huang, J.3
Ganguli, S.4
Sahami, M.5
Guibas, L.J.6
Sohl-Dickstein, J.7
-
34
-
-
85088226307
-
Outrageously large neural networks: The sparsely-gated mixture-of-experts layer
-
abs
-
Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc V. Le, Geoffrey E. Hinton, and Jeff Dean. Outrageously large neural networks: The sparsely-gated mixture-of-experts layer. CoRR, abs/1701.06538, 2017. URL http://arxiv.org/abs/1701.06538.
-
(2017)
CoRR
-
-
Shazeer, N.1
Mirhoseini, A.2
Maziarz, K.3
Davis, A.4
Le, Q.V.5
Hinton, G.E.6
Dean, J.7
-
37
-
-
84877827546
-
Opening the black box: Low-dimensional dynamics in high-dimensional recurrent neural networks
-
David Sussillo and Omri Barak. Opening the black box: low-dimensional dynamics in high-dimensional recurrent neural networks. Neural computation, 25(3):626-649, 2013.
-
(2013)
Neural Computation
, vol.25
, Issue.3
, pp. 626-649
-
-
Sussillo, D.1
Barak, O.2
-
39
-
-
84893343292
-
Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude
-
Tijmen Tieleman and Geoffrey. Hinton. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4, 2012.
-
(2012)
COURSERA: Neural Networks for Machine Learning
, vol.4
-
-
Tieleman, T.1
Hinton, G.2
-
40
-
-
2342592517
-
Short-term memory in orthogonal neural networks
-
Olivia L White, Daniel D Lee, and Haim Sompolinsky. Short-term memory in orthogonal neural networks. Physical review letters, 92(14):148102, 2004.
-
(2004)
Physical Review Letters
, vol.92
, Issue.14
, pp. 148102
-
-
White, O.L.1
Lee, D.D.2
Sompolinsky, H.3
-
41
-
-
84973904224
-
Deep fried convnets
-
Zichao Yang, Marcin Moczulski, Misha Denil, Nando de Freitas, Alex Smola, Le Song, and Ziyu Wang. Deep fried convnets. In Proceedings of the IEEE International Conference on Computer Vision, pp. 1476-1483, 2015.
-
(2015)
Proceedings of the IEEE International Conference on Computer Vision
, pp. 1476-1483
-
-
Yang, Z.1
Moczulski, M.2
Denil, M.3
De Freitas, N.4
Smola, A.5
Song, L.6
Wang, Z.7
-
42
-
-
84975705947
-
Minimal gated unit for recurrent neural networks
-
Guo-Bing Zhou, Jianxin Wu, Chen-Lin Zhang, and Zhi-Hua Zhou. Minimal gated unit for recurrent neural networks. International Journal of Automation and Computing, 13(3):226-234, 2016. ISSN 1751-8520. doi: 10.1007/s11633-016-1006-2. URL http://dx.doi.org/10.1007/s11633-016-1006-2.
-
(2016)
International Journal of Automation and Computing
, vol.13
, Issue.3
, pp. 226-234
-
-
Zhou, G.-B.1
Wu, J.2
Zhang, C.-L.3
Zhou, Z.-H.4
|