-
4
-
-
85083953689
-
Neural machine translation by jointly learning to align and translate
-
D. Bahdanau, K. Cho, and Y. Bengio. Neural machine translation by jointly learning to align and translate. ICLR, 2015.
-
(2015)
ICLR
-
-
Bahdanau, D.1
Cho, K.2
Bengio, Y.3
-
6
-
-
84961291190
-
-
K. Cho, B. Van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv:1406.1078, 2014.
-
(2014)
Learning Phrase Representations Using RNN Encoder-Decoder for Statistical Machine Translation
-
-
Cho, K.1
Van Merriënboer, B.2
Gulcehre, C.3
Bahdanau, D.4
Bougares, F.5
Schwenk, H.6
Bengio, Y.7
-
10
-
-
84965139942
-
Teaching machines to read and comprehend
-
K. M. Hermann, T. Kocisky, E. Grefenstette, L. Espeholt, W. Kay, M. Suleyman, and P. Blunsom. Teaching machines to read and comprehend. In NIPS, 2015.
-
(2015)
NIPS
-
-
Hermann, K.M.1
Kocisky, T.2
Grefenstette, E.3
Espeholt, L.4
Kay, W.5
Suleyman, M.6
Blunsom, P.7
-
15
-
-
85083951713
-
Regularizing RNNs by stabilizing activations
-
D. Krueger and R. Memisevic. Regularizing RNNs by stabilizing activations. ICLR, 2016.
-
(2016)
ICLR
-
-
Krueger, D.1
Memisevic, R.2
-
16
-
-
85018911798
-
-
David Krueger, Tegan Maharaj, János Kramár, Mohammad Pezeshki, Nicolas Ballas, Nan Rosemary Ke, Anirudh Goyal, Yoshua Bengio, Hugo Larochelle, and Aaron Courville. Zoneout: Regularizing RNNs by randomly preserving hidden activations. arXiv:1606.01305, 2016.
-
(2016)
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations
-
-
Krueger, D.1
Maharaj, T.2
Kramár, J.3
Pezeshki, M.4
Ballas, N.5
Ke, N.R.6
Goyal, A.7
Bengio, Y.8
Larochelle, H.9
Courville, A.10
-
17
-
-
84973326024
-
Batch normalized recurrent neural networks
-
C. Laurent, G. Pereyra, P. Brakel, Y. Zhang, and Y. Bengio. Batch normalized recurrent neural networks. ICASSP, 2016.
-
(2016)
ICASSP
-
-
Laurent, C.1
Pereyra, G.2
Brakel, P.3
Zhang, Y.4
Bengio, Y.5
-
22
-
-
80053451847
-
Learning recurrent neural networks with Hessian-free optimization
-
J. Martens and I. Sutskever. Learning recurrent neural networks with Hessian-free optimization. In ICML, 2011.
-
(2011)
ICML
-
-
Martens, J.1
Sutskever, I.2
-
23
-
-
84897527816
-
-
preprint
-
T. Mikolov, I. Sutskever, A. Deoras, H. Le, S. Kombrink, and J. Cernocky. Subword language modeling with neural networks. preprint, 2012.
-
(2012)
Subword Language Modeling with Neural Networks
-
-
Mikolov, T.1
Sutskever, I.2
Deoras, A.3
Le, H.4
Kombrink, S.5
Cernocky, J.6
-
27
-
-
0037527188
-
Improving predictive inference under covariate shift by weighting the log-likelihood function
-
H. Shimodaira. Improving predictive inference under covariate shift by weighting the log-likelihood function. Journal of statistical planning and inference, 2000.
-
(2000)
Journal of Statistical Planning and Inference
-
-
Shimodaira, H.1
-
30
-
-
84962564531
-
-
CoRR, abs/1506.00619
-
Bart van Merriënboer, Dzmitry Bahdanau, Vincent Dumoulin, Dmitriy Serdyuk, David Warde-Farley, Jan Chorowski, and Yoshua Bengio. Blocks and fuel: Frameworks for deep learning. CoRR, abs/1506.00619, 2015. URL http://arxiv.org/abs/1506.00619.
-
(2015)
Blocks and Fuel: Frameworks for Deep Learning
-
-
Van Merriënboer, B.1
Bahdanau, D.2
Dumoulin, V.3
Serdyuk, D.4
Warde-Farley, D.5
Chorowski, J.6
Bengio, Y.7
-
31
-
-
84939821074
-
-
K. Xu, J. Ba, R. Kiros, A. Courville, R. Salakhutdinov, R. Zemel, and Y. Bengio. Show, attend and tell: Neural image caption generation with visual attention. arXiv:1502.03044, 2015.
-
(2015)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
-
-
Xu, K.1
Ba, J.2
Kiros, R.3
Courville, A.4
Salakhutdinov, R.5
Zemel, R.6
Bengio, Y.7
-
32
-
-
84973884896
-
Describing videos by exploiting temporal structure
-
L. Yao, A. Torabi, K. Cho, N. Ballas, C. Pal, H. Larochelle, and A. Courville. Describing videos by exploiting temporal structure. In ICCV, 2015.
-
(2015)
ICCV
-
-
Yao, L.1
Torabi, A.2
Cho, K.3
Ballas, N.4
Pal, C.5
Larochelle, H.6
Courville, A.7
-
33
-
-
85014939893
-
-
S. Zhang, Y. Wu, T. Che, Z. Lin, R. Memisevic, R. Salakhutdinov, and Y. Bengio. Architectural complexity measures of recurrent neural networks. arXiv:1602.08210, 2016.
-
(2016)
Architectural Complexity Measures of Recurrent Neural Networks
-
-
Zhang, S.1
Wu, Y.2
Che, T.3
Lin, Z.4
Memisevic, R.5
Salakhutdinov, R.6
Bengio, Y.7