-
1
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
Nov.
-
Geoffrey Hinton, Li Deng, George E. Dahl, Abdel-rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara Sainath, and Brian Kingsbury. Deep neural networks for acoustic modeling in speech recognition. IEEE Signal Processing Magazine, 29(6):82-97, Nov. 2012.
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Dahl, G.E.3
Mohamed, A.-R.4
Jaitly, N.5
Senior, A.6
Vanhoucke, V.7
Nguyen, P.8
Sainath, T.9
Kingsbury, B.10
-
3
-
-
84876231242
-
ImageNet classification with deep convolutional neural networks
-
A. Krizhevsky, I. Sutskever, and G. Hinton. ImageNet classification with deep convolutional neural networks. In NIPS'2012. 2012.
-
(2012)
NIPS'2012
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.3
-
4
-
-
84964983441
-
-
Technical report, arXiv:1409.4842
-
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. Going deeper with convolutions. Technical report, arXiv:1409.4842, 2014.
-
(2014)
Going Deeper with Convolutions
-
-
Szegedy, C.1
Liu, W.2
Jia, Y.3
Sermanet, P.4
Reed, S.5
Anguelov, D.6
Erhan, D.7
Vanhoucke, V.8
Rabinovich, A.9
-
5
-
-
84906921986
-
Fast and robust neural network joint models for statistical machine translation
-
Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard Schwartz, and John Makhoul. Fast and robust neural network joint models for statistical machine translation. In Proc. ACL'2014, 2014.
-
(2014)
Proc. ACL'2014
-
-
Devlin, J.1
Zbib, R.2
Huang, Z.3
Lamar, T.4
Schwartz, R.5
Makhoul, J.6
-
6
-
-
84928547704
-
Sequence to sequence learning with neural networks
-
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. Sequence to sequence learning with neural networks. In NIPS'2014, 2014.
-
(2014)
NIPS'2014
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.3
-
7
-
-
85083953689
-
Neural machine translation by jointly learning to align and translate
-
arXiv:1409.0473
-
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. In ICLR'2015, arXiv:1409.0473, 2015.
-
(2015)
ICLR'2015
-
-
Bahdanau, D.1
Cho, K.2
Bengio, Y.3
-
8
-
-
71149105669
-
Large-scale deep unsupervised learning using graphics processors
-
Rajat Raina, Anand Madhavan, and Andrew Y. Ng. Large-scale deep unsupervised learning using graphics processors. In ICML'2009, 2009.
-
(2009)
ICML'2009
-
-
Raina, R.1
Madhavan, A.2
Ng, A.Y.3
-
9
-
-
0142166851
-
A neural probabilistic language model
-
Yoshua Bengio, Réjean Ducharme, Pascal Vincent, and Christian Jauvin. A neural probabilistic language model. Journal of Machine Learning Research, 3:1137-1155, 2003.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 1137-1155
-
-
Bengio, Y.1
Ducharme, R.2
Vincent, P.3
Jauvin, C.4
-
10
-
-
84877760312
-
Large scale distributed deep networks
-
J. Dean, G. S. Corrado, R. Monga, K. Chen, M. Devin, Q. V. Le, M. Z. Mao, M. A. Ranzato, A. Senior, P. Tucker, K. Yang, and A. Y. Ng. Large scale distributed deep networks. In NIPS'2012, 2012.
-
(2012)
NIPS'2012
-
-
Dean, J.1
Corrado, G.S.2
Monga, R.3
Chen, K.4
Devin, M.5
Le, Q.V.6
Mao, M.Z.7
Ranzato, M.A.8
Senior, A.9
Tucker, P.10
Yang, K.11
Ng, A.Y.12
-
11
-
-
70449805398
-
A highly scalable restricted Boltzmann machine FPGA implementation
-
IEEE
-
Sang Kyun Kim, Lawrence C McAfee, Peter Leonard McMahon, and Kunle Olukotun. A highly scalable restricted Boltzmann machine FPGA implementation. In Field Programmable Logic and Applications, 2009. FPL 2009. International Conference on, pages 367-372. IEEE, 2009.
-
(2009)
Field Programmable Logic and Applications, 2009. FPL 2009. International Conference on
, pp. 367-372
-
-
Kim, S.K.1
McAfee, L.C.2
McMahon, P.L.3
Olukotun, K.4
-
12
-
-
84897780584
-
DianNao: A small-footprint high-throughput accelerator for ubiquitous machine-learning
-
ACM
-
Tianshi Chen, Zidong Du, Ninghui Sun, Jia Wang, Chengyong Wu, Yunji Chen, and Olivier Temam. DianNao: A small-footprint high-throughput accelerator for ubiquitous machine-learning. In Proceedings of the 19th international conference on Architectural support for programming languages and operating systems, pages 269-284. ACM, 2014.
-
(2014)
Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 269-284
-
-
Chen, T.1
Du, Z.2
Sun, N.3
Wang, J.4
Wu, C.5
Chen, Y.6
Temam, O.7
-
13
-
-
84937706638
-
DaDianNao: A machine-learning supercomputer
-
IEEE
-
Yunji Chen, Tao Luo, Shaoli Liu, Shijin Zhang, Liqiang He, Jia Wang, Ling Li, Tianshi Chen, Zhiwei Xu, Ninghui Sun, et al. DaDianNao: A machine-learning supercomputer. In Microarchitecture (MICRO), 2014 47th Annual IEEE/ACM International Symposium on, pages 609-622. IEEE, 2014.
-
(2014)
Microarchitecture (MICRO), 2014 47th Annual IEEE/ACM International Symposium on
, pp. 609-622
-
-
Chen, Y.1
Luo, T.2
Liu, S.3
Zhang, S.4
He, L.5
Wang, J.6
Li, L.7
Chen, T.8
Xu, Z.9
Sun, N.10
-
17
-
-
84941258385
-
-
bioRxiv
-
Thomas M Bartol, Cailey Bromer, Justin P Kinney, Michael A Chirillo, Jennifer N Bourne, Kristen M Harris, and Terrence J Sejnowski. Hippocampal spine head sizes are highly precise. bioRxiv, 2015.
-
(2015)
Hippocampal Spine Head Sizes Are Highly Precise
-
-
Bartol, T.M.1
Bromer, C.2
Kinney, J.P.3
Chirillo, M.A.4
Bourne, J.N.5
Harris, K.M.6
Sejnowski, T.J.7
-
18
-
-
85162557101
-
Practical variational inference for neural networks
-
J. Shawe-Taylor, R. S. Zemel, P. L. Bartlett, F. Pereira, and K. Q. Weinberger, editors, Curran Associates, Inc.
-
Alex Graves. Practical variational inference for neural networks. In J. Shawe-Taylor, R. S. Zemel, P. L. Bartlett, F. Pereira, and K. Q. Weinberger, editors, Advances in Neural Information Processing Systems 24, pages 2348-2356. Curran Associates, Inc., 2011.
-
(2011)
Advances in Neural Information Processing Systems
, vol.24
, pp. 2348-2356
-
-
Graves, A.1
-
20
-
-
84904163933
-
Dropout: A simple way to prevent neural networks from overfitting
-
Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15:1929-1958, 2014.
-
(2014)
Journal of Machine Learning Research
, vol.15
, pp. 1929-1958
-
-
Srivastava, N.1
Hinton, G.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
21
-
-
84897550107
-
Regularization of neural networks using dropconnect
-
Li Wan, Matthew Zeiler, Sixin Zhang, Yann Le Cun, and Rob Fergus. Regularization of neural networks using dropconnect. In ICML'2013, 2013.
-
(2013)
ICML'2013
-
-
Wan, L.1
Zeiler, M.2
Zhang, S.3
Le Cun, Y.4
Fergus, R.5
-
22
-
-
34548811405
-
Hardware complexity of modular multiplication and exponentiation
-
Oct.
-
J. P. David, K. Kalach, and N. Tittley. Hardware complexity of modular multiplication and exponentiation. Computers, IEEE Transactions on, 56(10):1308-1319, Oct. 2007.
-
(2007)
Computers, IEEE Transactions on
, vol.56
, Issue.10
, pp. 1308-1319
-
-
David, J.P.1
Kalach, K.2
Tittley, N.3
-
25
-
-
79951563340
-
Understanding the difficulty of training deep feedforward neural networks
-
Xavier Glorot and Yoshua Bengio. Understanding the difficulty of training deep feedforward neural networks. In AISTATS'2010, 2010.
-
(2010)
AISTATS'2010
-
-
Glorot, X.1
Bengio, Y.2
-
28
-
-
34548480020
-
A method for unconstrained convex minimization problem with the rate of convergence O(1/k^2)
-
Yu Nesterov. A method for unconstrained convex minimization problem with the rate of convergence O(1/k^2). Doklady AN SSSR (translated as Soviet Math. Dokl.), 269:543-547, 1983.
-
(1983)
Doklady AN SSSR (translated as Soviet Math. Dokl.)
, vol.269
, pp. 543-547
-
-
Nesterov, Y.1
-
29
-
-
84892421248
-
-
Technical Report Arxiv, Université de Montréal, February
-
Ian J. Goodfellow, David Warde-Farley, Mehdi Mirza, Aaron Courville, and Yoshua Bengio. Maxout networks. Technical Report Arxiv report 1302.4389, Université de Montréal, February 2013.
-
(2013)
Maxout Networks
-
-
Goodfellow, I.J.1
Warde-Farley, D.2
Mirza, M.3
Courville, A.4
Bengio, Y.5
-
32
-
-
84943645147
-
-
arXiv preprint arXiv:1409.5185
-
Chen-Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, and Zhuowen Tu. Deeply-supervised nets. arXiv preprint arXiv:1409.5185, 2014.
-
(2014)
Deeply-supervised Nets
-
-
Lee, C.-Y.1
Xie, S.2
Gallagher, P.3
Zhang, Z.4
Tu, Z.5
-
33
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
November
-
Yann Le Cun, Leon Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, November 1998.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
Le Cun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
34
-
-
77956509090
-
Rectified linear units improve restricted Boltzmann machines
-
V. Nair and G. E. Hinton. Rectified linear units improve restricted Boltzmann machines. In ICML'2010, 2010.
-
(2010)
ICML'2010
-
-
Nair, V.1
Hinton, G.E.2
-
36
-
-
85083953063
-
Very deep convolutional networks for large-scale image recognition
-
Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
-
(2015)
ICLR
-
-
Simonyan, K.1
Zisserman, A.2
-
37
-
-
84937908919
-
Expectation backpropagation: Parameter-free training of multilayer neural networks with continuous or discrete weights
-
Daniel Soudry, Itay Hubara, and Ron Meir. Expectation backpropagation: Parameter-free training of multilayer neural networks with continuous or discrete weights. In NIPS'2014, 2014.
-
(2014)
NIPS'2014
-
-
Soudry, D.1
Hubara, I.2
Meir, R.3
-
39
-
-
84920265200
-
Fixed-point feedforward deep neural network design using weights +1, 0, and -1
-
IEEE
-
Kyuyeon Hwang and Wonyong Sung. Fixed-point feedforward deep neural network design using weights +1, 0, and -1. In Signal Processing Systems (SiPS), 2014 IEEE Workshop on, pages 1-6. IEEE, 2014.
-
(2014)
Signal Processing Systems (SiPS), 2014 IEEE Workshop on
, pp. 1-6
-
-
Hwang, K.1
Sung, W.2
-
40
-
-
84905216479
-
X1000 real-time phoneme recognition VLSI using feed-forward deep neural networks
-
IEEE
-
Jonghong Kim, Kyuyeon Hwang, and Wonyong Sung. X1000 real-time phoneme recognition VLSI using feed-forward deep neural networks. In Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pages 7510-7514. IEEE, 2014.
-
(2014)
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
, pp. 7510-7514
-
-
Kim, J.1
Hwang, K.2
Sung, W.3
-
41
-
-
0345978970
-
Expectation propagation for approximate Bayesian inference
-
Thomas P Minka. Expectation propagation for approximate Bayesian inference. In UAI'2001, 2001.
-
(2001)
UAI'2001
-
-
Minka, T.P.1
-
42
-
-
84857819132
-
Theano: A CPU and GPU math expression compiler
-
June, Oral Presentation
-
James Bergstra, Olivier Breuleux, Frédéric Bastien, Pascal Lamblin, Razvan Pascanu, Guillaume Desjardins, Joseph Turian, David Warde-Farley, and Yoshua Bengio. Theano: a CPU and GPU math expression compiler. In Proceedings of the Python for Scientific Computing Conference (SciPy), June 2010. Oral Presentation.
-
(2010)
Proceedings of the Python for Scientific Computing Conference (SciPy)
-
-
Bergstra, J.1
Breuleux, O.2
Bastien, F.3
Lamblin, P.4
Pascanu, R.5
Desjardins, G.6
Turian, J.7
Warde-Farley, D.8
Bengio, Y.9
-
43
-
-
84897544737
-
Theano: New features and speed improvements
-
Frédéric Bastien, Pascal Lamblin, Razvan Pascanu, James Bergstra, Ian J. Goodfellow, Arnaud Bergeron, Nicolas Bouchard, and Yoshua Bengio. Theano: new features and speed improvements. Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop, 2012.
-
(2012)
Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop
-
-
Bastien, F.1
Lamblin, P.2
Pascanu, R.3
Bergstra, J.4
Goodfellow, I.J.5
Bergeron, A.6
Bouchard, N.7
Bengio, Y.8
-
44
-
-
84893401626
-
-
arXiv preprint arXiv:1308.4214
-
Ian J. Goodfellow, David Warde-Farley, Pascal Lamblin, Vincent Dumoulin, Mehdi Mirza, Razvan Pascanu, James Bergstra, Frédéric Bastien, and Yoshua Bengio. Pylearn2: a machine learning research library. arXiv preprint arXiv:1308.4214, 2013.
-
(2013)
Pylearn2: A Machine Learning Research Library
-
-
Goodfellow, I.J.1
Warde-Farley, D.2
Lamblin, P.3
Dumoulin, V.4
Mirza, M.5
Pascanu, R.6
Bergstra, J.7
Bastien, F.8
Bengio, Y.9