-
1
-
-
85083953689
-
Neural machine translation by jointly learning to align and translate
-
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. In ICLR, 2015.
-
(2015)
ICLR
-
-
Bahdanau, D.1
Cho, K.2
Bengio, Y.3
-
2
-
-
84999018249
-
Strongly-typed recurrent neural networks
-
David Balduzzi and Muhammad Ghifary. Strongly-typed recurrent neural networks. In ICML, 2016.
-
(2016)
ICML
-
-
Balduzzi, D.1
Ghifary, M.2
-
3
-
-
85059684387
-
MetaMind neural machine translation system for WMT 2016
-
Berlin, Germany. Association for Computational Linguistics
-
James Bradbury and Richard Socher. MetaMind neural machine translation system for WMT 2016. In Proceedings of the First Conference on Machine Translation, Berlin, Germany. Association for Computational Linguistics, 2016.
-
(2016)
Proceedings of the First Conference on Machine Translation
-
-
Bradbury, J.1
Socher, R.2
-
4
-
-
85019171807
-
A theoretically grounded application of dropout in recurrent neural networks
-
Yarin Gal and Zoubin Ghahramani. A theoretically grounded application of dropout in recurrent neural networks. In NIPS, 2016.
-
(2016)
NIPS
-
-
Gal, Y.1
Ghahramani, Z.2
-
5
-
-
0031573117
-
Long short-term memory
-
Nov
-
Sepp Hochreiter and Jürgen Schmidhuber. Long short-term memory. Neural Computation, 9(8): 1735-1780, Nov 1997. ISSN 0899-7667.
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
8
-
-
85021676739
-
-
arXiv preprint
-
Nal Kalchbrenner, Lasse Espeholt, Karen Simonyan, Aaron van den Oord, Alex Graves, and Koray Kavukcuoglu. Neural machine translation in linear time. arXiv preprint arXiv:1610.10099, 2016.
-
(2016)
Neural Machine Translation in Linear Time
-
-
Kalchbrenner, N.1
Espeholt, L.2
Simonyan, K.3
Van Den Oord, A.4
Graves, A.5
Kavukcuoglu, K.6
-
11
-
-
84876231242
-
ImageNet classification with deep convolutional neural networks
-
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. ImageNet classification with deep convolutional neural networks. In NIPS, 2012.
-
(2012)
NIPS
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
12
-
-
85018911798
-
-
arXiv preprint
-
David Krueger, Tegan Maharaj, János Kramár, Mohammad Pezeshki, Nicolas Ballas, Nan Rosemary Ke, Anirudh Goyal, Yoshua Bengio, Hugo Larochelle, Aaron Courville, et al. Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations. arXiv preprint arXiv:1606.01305, 2016.
-
(2016)
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations
-
-
Krueger, D.1
Maharaj, T.2
Kramár, J.3
Pezeshki, M.4
Ballas, N.5
Ke, N.R.6
Goyal, A.7
Bengio, Y.8
Larochelle, H.9
Courville, A.10
-
13
-
-
84998698731
-
Ask me Anything: Dynamic memory networks for natural language processing
-
Ankit Kumar, Ozan Irsoy, Peter Ondruska, Mohit Iyyer, James Bradbury, Ishaan Gulrajani, Victor Zhong, Romain Paulus, and Richard Socher. Ask me anything: Dynamic memory networks for natural language processing. In ICML, 2016.
-
(2016)
ICML
-
-
Kumar, A.1
Irsoy, O.2
Ondruska, P.3
Iyyer, M.4
Bradbury, J.5
Gulrajani, I.6
Zhong, V.7
Paulus, R.8
Socher, R.9
-
15
-
-
85070972450
-
A way out of the odyssey: Analyzing and combining recent insights for LSTMs
-
Shayne Longpre, Sabeek Pradhan, Caiming Xiong, and Richard Socher. A way out of the odyssey: Analyzing and combining recent insights for LSTMs. Submitted to ICLR, 2016.
-
(2016)
ICLR
-
-
Longpre, S.1
Pradhan, S.2
Xiong, C.3
Socher, R.4
-
16
-
-
84959874994
-
Effective approaches to attention-based neural machine translation
-
M. T. Luong, H. Pham, and C. D. Manning. Effective approaches to attention-based neural machine translation. In EMNLP, 2015.
-
(2015)
EMNLP
-
-
Luong, M.T.1
Pham, H.2
Manning, C.D.3
-
20
-
-
79959829092
-
Recurrent neural network based language model
-
Tomas Mikolov, Martin Karafiát, Lukás Burget, Jan Cernocký, and Sanjeev Khudanpur. Recurrent neural network based language model. In INTERSPEECH, 2010.
-
(2010)
INTERSPEECH
-
-
Mikolov, T.1
Karafiát, M.2
Burget, L.3
Cernocký, J.4
Khudanpur, S.5
-
22
-
-
84961289992
-
Glove: Global vectors for word representation
-
Jeffrey Pennington, Richard Socher, and Christopher D Manning. GloVe: Global vectors for word representation. In EMNLP, 2014.
-
(2014)
EMNLP
-
-
Pennington, J.1
Socher, R.2
Manning, C.D.3
-
23
-
-
85083951479
-
Sequence level training with recurrent neural networks
-
Marc'Aurelio Ranzato, Sumit Chopra, Michael Auli, and Wojciech Zaremba. Sequence level training with recurrent neural networks. In ICLR, 2016.
-
(2016)
ICLR
-
-
Ranzato, M.1
Chopra, S.2
Auli, M.3
Zaremba, W.4
-
25
-
-
84893343292
-
Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude
-
Tijmen Tieleman and Geoffrey Hinton. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4(2), 2012.
-
(2012)
COURSERA: Neural Networks for Machine Learning
, vol.4
, Issue.2
-
-
Tieleman, T.1
Hinton, G.2
-
28
-
-
85018927054
-
-
arXiv preprint
-
Aaron van den Oord, Nal Kalchbrenner, Oriol Vinyals, Lasse Espeholt, Alex Graves, and Koray Kavukcuoglu. Conditional image generation with PixelCNN decoders. arXiv preprint arXiv:1606.05328, 2016b.
-
(2016)
Conditional Image Generation with PixelCNN Decoders
-
-
Van Den Oord, A.1
Kalchbrenner, N.2
Vinyals, O.3
Espeholt, L.4
Graves, A.5
Kavukcuoglu, K.6
-
29
-
-
84875872773
-
Baselines and bigrams: Simple, good sentiment and topic classification
-
Sida Wang and Christopher D Manning. Baselines and bigrams: Simple, good sentiment and topic classification. In ACL, 2012.
-
(2012)
ACL
-
-
Wang, S.1
Manning, C.D.2
-
30
-
-
84943794421
-
Predicting polarities of tweets by composing word embeddings with long short-term memory
-
Xin Wang, Yuanchao Liu, Chengjie Sun, Baoxun Wang, and Xiaolong Wang. Predicting polarities of tweets by composing word embeddings with long short-term memory. In ACL, 2015.
-
(2015)
ACL
-
-
Wang, X.1
Liu, Y.2
Sun, C.3
Wang, B.4
Wang, X.5
-
32
-
-
85018271332
-
-
arXiv preprint
-
Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, et al. Google's neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144, 2016.
-
(2016)
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
-
-
Wu, Y.1
Schuster, M.2
Chen, Z.3
Le, Q.V.4
Norouzi, M.5
Macherey, W.6
Krikun, M.7
Cao, Y.8
Gao, Q.9
Macherey, K.10
-
34
-
-
84999008900
-
Dynamic memory networks for visual and textual question answering
-
Caiming Xiong, Stephen Merity, and Richard Socher. Dynamic memory networks for visual and textual question answering. In ICML, 2016.
-
(2016)
ICML
-
-
Xiong, C.1
Merity, S.2
Socher, R.3
-
36
-
-
84965162393
-
Character-level convolutional networks for text classification
-
Xiang Zhang, Junbo Zhao, and Yann LeCun. Character-level convolutional networks for text classification. In NIPS, 2015.
-
(2015)
NIPS
-
-
Zhang, X.1
Zhao, J.2
LeCun, Y.3
|