-
1
-
-
85011954517
-
Globally normalized transition-based neural networks
-
Daniel Andor, Chris Alberti, David Weiss, Aliaksei Severyn, Alessandro Presta, Kuzman Ganchev, Slav Petrov, and Michael Collins. Globally Normalized Transition-Based Neural Networks. In Proceedings of ACL, 2016.
-
(2016)
Proceedings of ACL
-
-
Andor, D.1
Alberti, C.2
Weiss, D.3
Severyn, A.4
Presta, A.5
Ganchev, K.6
Petrov, S.7
Collins, M.8
-
2
-
-
85083953689
-
Neural machine translation by jointly learning to align and translate
-
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural Machine Translation by Jointly Learning to Align and Translate. In Proceedings of ICLR, 2015.
-
(2015)
Proceedings of ICLR
-
-
Bahdanau, D.1
Cho, K.2
Bengio, Y.3
-
6
-
-
85021645849
-
A fast unified model for parsing and sentence understanding
-
Samuel R. Bowman, Jon Gauthier, Abhinav Rastogi, Raghav Gupta, Christopher D. Manning, and Christopher Potts. A Fast Unified Model for Parsing and Sentence Understanding. In Proceedings of ACL, 2016.
-
(2016)
Proceedings of ACL
-
-
Bowman, S.R.1
Gauthier, J.2
Rastogi, A.3
Gupta, R.4
Manning, C.D.5
Potts, C.6
-
7
-
-
84994328213
-
-
William Chan, Navdeep Jaitly, Quoc Le, and Oriol Vinyals. Listen, Attend and Spell. arXiv:1508.01211, 2015.
-
(2015)
Listen, Attend and Spell
-
-
Chan, W.1
Jaitly, N.2
Le, Q.3
Vinyals, O.4
-
9
-
-
85031930724
-
-
Qian Chen, Xiaodan Zhu, Zhenhua Ling, Si Wei, and Hui Jiang. Enhancing and Combining Sequential and Tree LSTM for Natural Language Inference. arXiv:1609.06038, 2016.
-
(2016)
Enhancing and Combining Sequential and Tree LSTM for Natural Language Inference
-
-
Chen, Q.1
Zhu, X.2
Ling, Z.3
Wei, S.4
Jiang, H.5
-
10
-
-
84946763507
-
Describing Multimedia Content using Attention-based Encoder-Decoder Networks
-
Kyunghyun Cho, Aaron Courville, and Yoshua Bengio. Describing Multimedia Content using Attention-based Encoder-Decoder Networks. In IEEE Transactions on Multimedia, 2015.
-
(2015)
IEEE Transactions on Multimedia
-
-
Cho, K.1
Courville, A.2
Bengio, Y.3
-
11
-
-
84965139600
-
Attention-based models for speech recognition
-
Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, Kyunghyun Cho, and Yoshua Bengio. Attention-Based Models for Speech Recognition. In Proceedings of NIPS, 2015.
-
(2015)
Proceedings of NIPS
-
-
Chorowski, J.1
Bahdanau, D.2
Serdyuk, D.3
Cho, K.4
Bengio, Y.5
-
12
-
-
80053558787
-
Natural language processing (almost) from scratch
-
Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. Natural Language Processing (almost) from Scratch. Journal of Machine Learning Research, 12:2493-2537, 2011.
-
(2011)
Journal of Machine Learning Research
, vol.12
, pp. 2493-2537
-
-
Collobert, R.1
Weston, J.2
Bottou, L.3
Karlen, M.4
Kavukcuoglu, K.5
Kuksa, P.6
-
14
-
-
80052913933
-
Parameter learning with truncated message-passing
-
Justin Domke. Parameter Learning with Truncated Message-Passing. In Proceedings of CVPR, 2011.
-
(2011)
Proceedings of CVPR
-
-
Domke, J.1
-
15
-
-
84869036002
-
Generic methods for optimization-based modeling
-
Justin Domke. Generic methods for optimization-based modeling. In AISTATS, pp. 318-326, 2012.
-
(2012)
AISTATS
, pp. 318-326
-
-
Domke, J.1
-
16
-
-
80052250414
-
Adaptive subgradient methods for online learning and stochastic optimization
-
John Duchi, Elad Hazan, and Yoram Singer. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization. Journal of Machine Learning Research, 12:2121-2159, 2011.
-
(2011)
Journal of Machine Learning Research
, vol.12
, pp. 2121-2159
-
-
Duchi, J.1
Hazan, E.2
Singer, Y.3
-
18
-
-
4043070633
-
Three new probabilistic models for dependency parsing: An exploration
-
Jason M. Eisner. Three New Probabilistic Models for Dependency Parsing: An Exploration. In Proceedings of ACL, 1996.
-
(1996)
Proceedings of ACL
-
-
Eisner, J.M.1
-
20
-
-
85070943017
-
Approximation-aware dependency parsing by belief propagation
-
Matthew R. Gormley, Mark Dredze, and Jason Eisner. Approximation-Aware Dependency Parsing by Belief Propagation. In Proceedings of TACL, 2015.
-
(2015)
Proceedings of TACL
-
-
Gormley, M.R.1
Dredze, M.2
Eisner, J.3
-
22
-
-
84993949467
-
Hybrid computing using a neural network with dynamic external memory
-
October
-
Alex Graves, Greg Wayne, Malcolm Reynolds, Tim Harley, Ivo Danihelka, Agnieszka Grabska-Barwinska, Sergio Gomez Colmenarejo, Edward Grefenstette, Tiago Ramalho, John Agapiou, Adria Puigdomenech Badia, Karl Moritz Hermann, Yori Zwols, Georg Ostrovski, Adam Cain, Helen King, Christopher Summerfield, Phil Blunsom, Koray Kavukcuoglu, and Demis Hassabis. Hybrid Computing Using a Neural Network with Dynamic External Memory. Nature, October 2016.
-
(2016)
Nature
-
-
Graves, A.1
Wayne, G.2
Reynolds, M.3
Harley, T.4
Danihelka, I.5
Grabska-Barwinska, A.6
Colmenarejo, S.G.7
Grefenstette, E.8
Ramalho, T.9
Agapiou, J.10
Badia, A.P.11
Hermann, K.M.12
Zwols, Y.13
Ostrovski, G.14
Cain, A.15
King, H.16
Summerfield, C.17
Blunsom, P.18
Kavukcuoglu, K.19
Hassabis, D.20
-
24
-
-
84965139942
-
Teaching machines to read and comprehend
-
Karl Moritz Hermann, Tomas Kocisky, Edward Grefenstette, Lasse Espeholt, Will Kay, Mustafa Suleyman, and Phil Blunsom. Teaching Machines to Read and Comprehend. In Proceedings of NIPS, 2015.
-
(2015)
Proceedings of NIPS
-
-
Hermann, K.M.1
Kocisky, T.2
Grefenstette, E.3
Espeholt, L.4
Kay, W.5
Suleyman, M.6
Blunsom, P.7
-
26
-
-
85083951076
-
Adam: A method for stochastic optimization
-
Diederik Kingma and Jimmy Ba. Adam: A Method for Stochastic Optimization. In Proceedings of ICLR, 2015.
-
(2015)
Proceedings of ICLR
-
-
Kingma, D.1
Ba, J.2
-
27
-
-
85019553882
-
Simple and Accurate Dependency Parsing using Bidirectional LSTM Feature Representations
-
Eliyahu Kiperwasser and Yoav Goldberg. Simple and Accurate Dependency Parsing using Bidirectional LSTM Feature Representations. In TACL, 2016.
-
(2016)
TACL
-
-
Kiperwasser, E.1
Goldberg, Y.2
-
29
-
-
0142192295
-
Conditional random fields: Probabilistic models for segmenting and labeling sequence data
-
John Lafferty, Andrew McCallum, and Fernando Pereira. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In Proceedings of ICML, 2001.
-
(2001)
Proceedings of ICML
-
-
Lafferty, J.1
McCallum, A.2
Pereira, F.3
-
30
-
-
84994130883
-
Neural architectures for named entity recognition
-
Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, and Chris Dyer. Neural Architectures for Named Entity Recognition. In Proceedings of NAACL, 2016.
-
(2016)
Proceedings of NAACL
-
-
Lample, G.1
Ballesteros, M.2
Subramanian, S.3
Kawakami, K.4
Dyer, C.5
-
32
-
-
80053275857
-
First- and second-order expectation semirings with applications to minimum-risk training on translation forests
-
Zhifei Li and Jason Eisner. First- and Second-Order Expectation Semirings with Applications to Minimum-Risk Training on Translation Forests. In Proceedings of EMNLP 2009, 2009.
-
(2009)
Proceedings of EMNLP 2009
-
-
Li, Z.1
Eisner, J.2
-
33
-
-
84994242299
-
Segmental recurrent neural networks for end-to-end speech recognition
-
Liang Lu, Lingpeng Kong, Chris Dyer, Noah A. Smith, and Steve Renals. Segmental Recurrent Neural Networks for End-to-End Speech Recognition. In Proceedings of INTERSPEECH, 2016.
-
(2016)
Proceedings of INTERSPEECH
-
-
Lu, L.1
Kong, L.2
Dyer, C.3
Smith, N.A.4
Renals, S.5
-
34
-
-
84959874994
-
Effective Approaches to Attention-based Neural Machine Translation
-
Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. Effective Approaches to Attention-based Neural Machine Translation. In Proceedings of EMNLP, 2015.
-
(2015)
Proceedings of EMNLP
-
-
Luong, M.-T.1
Pham, H.2
Manning, C.D.3
-
35
-
-
84989338543
-
Gradient-based Hyperparameter Optimization through Reversible Learning
-
Dougal Maclaurin, David Duvenaud, and Ryan P. Adams. Gradient-based Hyperparameter Optimization through Reversible Learning. In Proceedings of ICML, 2015.
-
(2015)
Proceedings of ICML
-
-
Maclaurin, D.1
Duvenaud, D.2
Adams, R.P.3
-
36
-
-
85011819504
-
Natural language inference by tree-based convolution and heuristic matching
-
Lili Mou, Rui Men, Ge Li, Yan Xu, Lu Zhang, Rui Yan, and Zhi Jin. Natural language inference by tree-based convolution and heuristic matching. In Proceedings of ACL, 2016.
-
(2016)
Proceedings of ACL
-
-
Mou, L.1
Men, R.2
Li, G.3
Xu, Y.4
Zhang, L.5
Yan, R.6
Jin, Z.7
-
38
-
-
85031942138
-
ASPEC: Asian scientific paper excerpt corpus
-
Nicoletta Calzolari (Conference Chair), Khalid Choukri, Thierry Declerck, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis (eds.), Portorož, Slovenia, May 2016. European Language Resources Association (ELRA)
-
Toshiaki Nakazawa, Manabu Yaguchi, Kiyotaka Uchimoto, Masao Utiyama, Eiichiro Sumita, Sadao Kurohashi, and Hitoshi Isahara. ASPEC: Asian Scientific Paper Excerpt Corpus. In Nicoletta Calzolari (Conference Chair), Khalid Choukri, Thierry Declerck, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis (eds.), Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2016), pp. 2204-2208, Portorož, Slovenia, May 2016. European Language Resources Association (ELRA). ISBN 978-2-9517408-9-1.
-
(2016)
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2016)
, pp. 2204-2208
-
-
Nakazawa, T.1
Yaguchi, M.2
Uchimoto, K.3
Utiyama, M.4
Sumita, E.5
Kurohashi, S.6
Isahara, H.7
-
39
-
-
84859011635
-
Pointwise prediction for robust, adaptable Japanese morphological analysis
-
Graham Neubig, Yosuke Nakata, and Shinsuke Mori. Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis. In Proceedings of ACL, 2011.
-
(2011)
Proceedings of ACL
-
-
Neubig, G.1
Nakata, Y.2
Mori, S.3
-
43
-
-
85083950860
-
Reasoning about entailment with neural attention
-
Tim Rocktäschel, Edward Grefenstette, Karl Moritz Hermann, Tomas Kocisky, and Phil Blunsom. Reasoning about Entailment with Neural Attention. In Proceedings of ICLR, 2016.
-
(2016)
Proceedings of ICLR
-
-
Rocktäschel, T.1
Grefenstette, E.2
Hermann, K.M.3
Kocisky, T.4
Blunsom, P.5
-
44
-
-
84965157716
-
Gradient estimation using stochastic computation graphs
-
John Schulman, Nicolas Heess, Theophane Weber, and Pieter Abbeel. Gradient estimation using stochastic computation graphs. In Advances in Neural Information Processing Systems, pp. 3528-3536, 2015.
-
(2015)
Advances in Neural Information Processing Systems
, pp. 3528-3536
-
-
Schulman, J.1
Heess, N.2
Weber, T.3
Abbeel, P.4
-
46
-
-
84926143199
-
Minimum-Risk Training of Approximate CRF-based NLP Systems
-
Veselin Stoyanov and Jason Eisner. Minimum-Risk Training of Approximate CRF-based NLP Systems. In Proceedings of NAACL, 2012.
-
(2012)
Proceedings of NAACL
-
-
Stoyanov, V.1
Eisner, J.2
-
47
-
-
84883148756
-
Empirical risk minimization of graphical model parameters given approximate inference, decoding, and model structure
-
Veselin Stoyanov, Alexander Ropson, and Jason Eisner. Empirical Risk Minimization of Graphical Model Parameters Given Approximate Inference, Decoding, and Model Structure. In Proceedings of AISTATS, 2011.
-
(2011)
Proceedings of AISTATS
-
-
Stoyanov, V.1
Ropson, A.2
Eisner, J.3
-
51
-
-
84994156970
-
Learning natural language inference with LSTM
-
Shuohang Wang and Jing Jiang. Learning Natural Language Inference with LSTM. In Proceedings of NAACL, 2016.
-
(2016)
Proceedings of NAACL
-
-
Wang, S.1
Jiang, J.2
-
53
-
-
84930622674
-
-
arXiv preprint
-
Jason Weston, Antoine Bordes, Sumit Chopra, Alexander M. Rush, Bart van Merriënboer, Armand Joulin, and Tomas Mikolov. Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks. arXiv preprint arXiv:1502.05698, 2015.
-
(2015)
Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks
-
-
Weston, J.1
Bordes, A.2
Chopra, S.3
Rush, A.M.4
Van Merriënboer, B.5
Joulin, A.6
Mikolov, T.7
-
54
-
-
84970002232
-
Show, attend and tell: Neural image caption generation with visual attention
-
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard Zemel, and Yoshua Bengio. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. In Proceedings of ICML, 2015.
-
(2015)
Proceedings of ICML
-
-
Xu, K.1
Ba, J.2
Kiros, R.3
Cho, K.4
Courville, A.5
Salakhutdinov, R.6
Zemel, R.7
Bengio, Y.8
-
56
-
-
85088231409
-
The neural noisy channel
-
Lei Yu, Phil Blunsom, Chris Dyer, Edward Grefenstette, and Tomas Kocisky. The Neural Noisy Channel. In Proceedings of ICLR, 2017.
-
(2017)
Proceedings of ICLR
-
-
Yu, L.1
Blunsom, P.2
Dyer, C.3
Grefenstette, E.4
Kocisky, T.5
-
57
-
-
85050983881
-
Textual entailment with structured attentions and composition
-
Kai Zhao, Liang Huang, and Minbo Ma. Textual Entailment with Structured Attentions and Composition. In Proceedings of COLING, 2016.
-
(2016)
Proceedings of COLING
-
-
Zhao, K.1
Huang, L.2
Ma, M.3