-
1
-
-
84857710417
-
Optimization with sparsity-inducing penalties
-
Francis Bach, Rodolphe Jenatton, Mien Mairal, and Guillaume Obozinski. 2012. Optimization with sparsity-inducing penalties. Foundations and Trends in Machine Learning, 4(1):1-106.
-
(2012)
Foundations and Trends in Machine Learning
, vol.4
, Issue.1
, pp. 1-106
-
-
Bach, F.1
Jenatton, R.2
Mairal, M.3
Obozinski, G.4
-
3
-
-
0142166851
-
A neural probabilistic language model
-
Yoshua Bengio, Rejean Ducharme, Pascal Vincent, and Christian Janvin. 2003. A neural probabilistic language model. J. Machine Learning Research, 3:1137-1155.
-
(2003)
J. Machine Learning Research
, vol.3
, pp. 1137-1155
-
-
Bengio, Y.1
Ducharme, R.2
Vincent, P.3
Janvin, C.4
-
4
-
-
84906921986
-
Fast and robust neural network joint models for statistical machine translation
-
Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard Schwartz, and John Makhoul. 2014. Fast and robust neural network joint models for statistical machine translation. In Proc. ACL, pages 1370-1380.
-
(2014)
Proc. ACL
, pp. 1370-1380
-
-
Devlin, J.1
Zbib, R.2
Huang, Z.3
Lamar, T.4
Schwartz, R.5
Makhoul, J.6
-
5
-
-
75249102673
-
Efficient online and batch learning using forward backward splitting
-
John Duchi and Yoram Singer. 2009. Efficient online and batch learning using forward backward splitting. J. Machine Learning Research, 10:2899-2934.
-
(2009)
J. Machine Learning Research
, vol.10
, pp. 2899-2934
-
-
Duchi, J.1
Singer, Y.2
-
7
-
-
85110867932
-
Moses: Open source toolkit for statistical machine translation
-
Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondfej Bojar, Alexandra Constantin, and Evan Herbst. 2007. Moses: Open source toolkit for statistical machine translation. In Proc. ACL, Interactive Poster and Demonstration Sessions, pages 177-180.
-
(2007)
Proc. ACL, Interactive Poster and Demonstration Sessions
, pp. 177-180
-
-
Koehn, P.1
Hoang, H.2
Birch, A.3
Callison-Burch, C.4
Federico, M.5
Bertoldi, N.6
Cowan, B.7
Shen, W.8
Moran, C.9
Zens, R.10
Dyer, C.11
Bojar, O.12
Constantin, A.13
Herbst, E.14
-
8
-
-
0000494466
-
Optimal brain damage
-
Yann LeCun, John S. Denker, Sara A. Sofia, Richard E. Howard, and Lawrence D. Jackel. 1989. Optimal brain damage. In Proc. NIPS, volume 2, pages 598-605.
-
(1989)
Proc. NIPS
, vol.2
, pp. 598-605
-
-
Le Cun, Y.1
Denker, J.S.2
Sofia, S.A.3
Howard, R.E.4
Jackel, L.D.5
-
9
-
-
84901784231
-
RNNLM-recurrent neural network language modeling toolkit
-
Tomas Mikolov, Stefan Kombrink, Anoop Deoras, Lukar Burget, and Jan Cernocky. 2011. RNNLM-recurrent neural network language modeling toolkit. In Proc. ASRU, pages 196-201.
-
(2011)
Proc. ASRU
, pp. 196-201
-
-
Mikolov, T.1
Kombrink, S.2
Deoras, A.3
Burget, L.4
Cernocky, J.5
-
10
-
-
84867118996
-
A fast and simple algorithm for training neural probabilistic language models
-
Andriy Mnih and Yee Whye Teh. 2012. A fast and simple algorithm for training neural probabilistic language models. In Proc. ICML, pages 1751-1758.
-
(2012)
Proc. ICML
, pp. 1751-1758
-
-
Mnih, A.1
Teh, Y.W.2
-
11
-
-
77956509090
-
Rectified linear units improve restricted boltzmann machines
-
Vinod Nair and Geoffrey E Hinton. 2010. Rectified linear units improve Restricted Boltzmann Machines. In Proc. ICML, pages 807-814.
-
(2010)
Proc. ICML
, pp. 807-814
-
-
Nair, V.1
Hinton, G.E.2
-
12
-
-
0001765492
-
Simplifying neural networks by soft weight-sharing
-
Steven J. Nowland and Geoffrey E. Hinton. 1992. Simplifying neural networks by soft weight-sharing. Neural Computation, 4:473-493.
-
(1992)
Neural Computation
, vol.4
, pp. 473-493
-
-
Nowland, S.J.1
Hinton, G.E.2
-
16
-
-
84904163933
-
Dropout: A simple way to prevent neural networks from overfitting
-
Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: A simple way to prevent neural networks from overfitting. J. Machine Learning Research, 15(1):1929-1958.
-
(2014)
J. Machine Learning Research
, vol.15
, Issue.1
, pp. 1929-1958
-
-
Srivastava, N.1
Hinton, G.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
17
-
-
84924036578
-
From feedforward to recurrent LSTM neural networks for language modeling
-
Martin Sundermeyer, Hermann Ney, and Ralf Schliiter. 2015. From feedforward to recurrent LSTM neural networks for language modeling. Trans. Audio, Speech, and Language, 23(3):517-529.
-
(2015)
Trans. Audio, Speech, and Language
, vol.23
, Issue.3
, pp. 517-529
-
-
Sundermeyer, M.1
Ney, H.2
Schliiter, R.3
-
18
-
-
84926298172
-
Decoding with large-scale neural language models improves translation
-
Ashish Vaswani, Yinggong Zhao, Victoria Fossum, and David Chiang. 2013. Decoding with large-scale neural language models improves translation. In Proc. EMNLP, pages 1387-1392.
-
(2013)
Proc. EMNLP
, pp. 1387-1392
-
-
Vaswani, A.1
Zhao, Y.2
Fossum, V.3
Chiang, D.4
|