-
2
-
-
85083953689
-
Neural machine translation by jointly learning to align and translate
-
D. Bahdanau, K. Cho, and Y. Bengio. 2015. Neural machine translation by jointly learning to align and translate. In ICLR.
-
(2015)
ICLR
-
-
Bahdanau, D.1
Cho, K.2
Bengio, Y.3
-
3
-
-
0142166851
-
A neural probabilistic language model
-
Y. Bengio, R. Ducharme, P. Vincent, and C. Janvin. 2003. A neural probabilistic language model. JMLR, 3:1137–1155.
-
(2003)
JMLR
, vol.3
, pp. 1137-1155
-
-
Bengio, Y.1
Ducharme, R.2
Vincent, P.3
Janvin, C.4
-
4
-
-
69349090197
-
Learning deep architectures for ai
-
January
-
Yoshua Bengio. 2009. Learning deep architectures for ai. Foundations and Trends in Machine Learning, 2(1):1–127, January.
-
(2009)
Foundations and Trends in Machine Learning
, vol.2
, Issue.1
, pp. 1-127
-
-
Bengio, Y.1
-
6
-
-
84906921986
-
Fast and robust neural network joint models for statistical machine translation
-
J. Devlin, R. Zbib, Z. Huang, T. Lamar, R. Schwartz, and J. Makhoul. 2014. Fast and robust neural network joint models for statistical machine translation. In ACL.
-
(2014)
ACL
-
-
Devlin, J.1
Zbib, R.2
Huang, Z.3
Lamar, T.4
Schwartz, R.5
Makhoul, J.6
-
7
-
-
84921631157
-
An empirical comparison of features and tuning for phrase-based machine translation
-
S. Green, D. Cer, and C. D. Manning. 2014. An empirical comparison of features and tuning for phrase-based machine translation. In WMT.
-
(2014)
WMT
-
-
Green, S.1
Cer, D.2
Manning, C.D.3
-
8
-
-
84857892556
-
Noise-contrastive estimation of unnormalized statistical models, with applications to natural image statistics
-
Michael Gutmann and Aapo Hyvärinen. 2012. Noise-contrastive estimation of unnormalized statistical models, with applications to natural image statistics. JMLR, 13:307–361.
-
(2012)
JMLR
, vol.13
, pp. 307-361
-
-
Gutmann, M.1
Hyvärinen, A.2
-
9
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury. 2012. Deep neural networks for acoustic modeling in speech recognition. IEEE Signal Processing Magazine.
-
(2012)
IEEE Signal Processing Magazine
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
10
-
-
84943744936
-
On using very large target vocabulary for neural machine translation
-
Sébastien Jean, Kyunghyun Cho, Roland Memisevic, and Yoshua Bengio. 2015. On using very large target vocabulary for neural machine translation. In ACL.
-
(2015)
ACL
-
-
Jean, S.1
Cho, K.2
Memisevic, R.3
Bengio, Y.4
-
11
-
-
84926283798
-
Recurrent continuous translation models
-
N. Kalchbrenner and P. Blunsom. 2013. Recurrent continuous translation models. In EMNLP.
-
(2013)
EMNLP
-
-
Kalchbrenner, N.1
Blunsom, P.2
-
12
-
-
84876231242
-
ImageNet classification with deep convolutional neural networks
-
A. Krizhevsky, I. Sutskever, and G. E. Hinton. 2012. ImageNet classification with deep convolutional neural networks. In NIPS.
-
(2012)
NIPS
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
16
-
-
67650453038
-
Three new graphical models for statistical language modelling
-
Andriy Mnih and Geoffrey Hinton. 2007. Three new graphical models for statistical language modelling. In ICML.
-
(2007)
ICML
-
-
Mnih, A.1
Hinton, G.2
-
17
-
-
84858779990
-
A scalable hierarchical distributed language model
-
Andriy Mnih and Geoffrey Hinton. 2009. A scalable hierarchical distributed language model. In NIPS.
-
(2009)
NIPS
-
-
Mnih, A.1
Hinton, G.2
-
18
-
-
84867118996
-
A fast and simple algorithm for training neural probabilistic language models
-
Andriy Mnih and Yee Whye Teh. 2012. A fast and simple algorithm for training neural probabilistic language models. In ICML.
-
(2012)
ICML
-
-
Mnih, A.1
Teh, Y.W.2
-
19
-
-
34547997987
-
Hierarchical probabilistic neural network language model
-
Frederic Morin. 2005. Hierarchical probabilistic neural network language model. In AISTATS.
-
(2005)
AISTATS
-
-
Morin, F.1
-
20
-
-
77956509090
-
Rectified linear units improve restricted boltzmann machines
-
Vinod Nair and Geoffrey E. Hinton. 2010. Rectified linear units improve restricted boltzmann machines. In ICML.
-
(2010)
ICML
-
-
Nair, V.1
Hinton, G.E.2
-
22
-
-
85045980083
-
Large, pruned or continuous space language models on a GPU for statistical machine translation
-
H. Schwenk, A. Rousseau, and M. Attik. 2012. Large, pruned or continuous space language models on a gpu for statistical machine translation. In NAACL WLM workshop.
-
(2012)
NAACL WLM Workshop
-
-
Schwenk, H.1
Rousseau, A.2
Attik, M.3
-
23
-
-
85044798389
-
Continuous space language models for statistical machine translation
-
H. Schwenk. 2010. Continuous space language models for statistical machine translation. The Prague Bulletin of Mathematical Linguistics, (93):137–146.
-
(2010)
The Prague Bulletin of Mathematical Linguistics
, Issue.93
, pp. 137-146
-
-
Schwenk, H.1
-
24
-
-
84926203469
-
Continuous space translation models with neural networks
-
Le Hai Son, Alexandre Allauzen, and François Yvon. 2012. Continuous space translation models with neural networks. In NAACL-HLT.
-
(2012)
NAACL-HLT
-
-
Son, L.H.1
Allauzen, A.2
Yvon, F.3
-
25
-
-
84928547704
-
Sequence to sequence learning with neural networks
-
I. Sutskever, O. Vinyals, and Q. V. Le. 2014. Sequence to sequence learning with neural networks. In NIPS.
-
(2014)
NIPS
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.3
-
26
-
-
84926298172
-
Decoding with large-scale neural language models improves translation
-
Ashish Vaswani, Yinggong Zhao, Victoria Fossum, and David Chiang. 2013. Decoding with large-scale neural language models improves translation. In EMNLP.
-
(2013)
EMNLP
-
-
Vaswani, A.1
Zhao, Y.2
Fossum, V.3
Chiang, D.4
|