-
1
-
-
27844439373
-
A framework for learning predictive structures from multiple tasks and unlabeled data
-
Rie Kubota Ando and Tong Zhang. 2005. A framework for learning predictive structures from multiple tasks and unlabeled data. The Journal of Machine Learning Research, 6:1817-1853.
-
(2005)
The Journal of Machine Learning Research
, vol.6
, pp. 1817-1853
-
-
Ando, R.K.1
Zhang, T.2
-
2
-
-
0028392483
-
Learning long-term dependencies with gradient descent is difficult
-
Yoshua Bengio, Patrice Simard, and Paolo Frasconi. 1994. Learning long-term dependencies with gradient descent is difficult. Neural Networks, IEEE Transactions on, 5(2):157-166.
-
(1994)
Neural Networks, IEEE Transactions on
, vol.5
, Issue.2
, pp. 157-166
-
-
Bengio, Y.1
Simard, P.2
Frasconi, P.3
-
3
-
-
84857819132
-
Theano: A CPU and GPU math expression compiler
-
Austin, TX
-
James Bergstra, Olivier Breuleux, Frédéric Bastien, Pascal Lamblin, Razvan Pascanu, Guillaume Desjardins, Joseph Turian, David Warde-Farley, and Yoshua Bengio. 2010. Theano: A CPU and GPU math expression compiler. In Proceedings of the Python for scientific computing conference (SciPy), volume 4, page 3. Austin, TX.
-
(2010)
Proceedings of the Python for Scientific Computing Conference (SciPy)
, vol.4
, pp. 3
-
-
Bergstra, J.1
Breuleux, O.2
Bastien, F.3
Lamblin, P.4
Pascanu, R.5
Desjardins, G.6
Turian, J.7
Warde-Farley, D.8
Bengio, Y.9
-
4
-
-
84951272941
-
A fast and accurate dependency parser using neural networks
-
Doha, Qatar, October
-
Danqi Chen and Christopher Manning. 2014. A fast and accurate dependency parser using neural networks. In Proceedings of EMNLP-2014, pages 740-750, Doha, Qatar, October.
-
(2014)
Proceedings of EMNLP-2014
, pp. 740-750
-
-
Chen, D.1
Manning, C.2
-
5
-
-
0345007542
-
Named entity recognition: A maximum entropy approach using global information
-
Hai Leong Chieu and Hwee Tou Ng. 2002. Named entity recognition: a maximum entropy approach using global information. In Proceedings of COLING-2002, pages 1-7.
-
(2002)
Proceedings of COLING-2002
, pp. 1-7
-
-
Chieu, H.L.1
Ng, H.T.2
-
7
-
-
85097641926
-
On the properties of neural machine translation: Encoder-decoder approaches
-
Kyunghyun Cho, Bart van Merriënboer, Dzmitry Bahdanau, and Yoshua Bengio. 2014. On the properties of neural machine translation: Encoder-decoder approaches. Syntax, Semantics and Structure in Statistical Translation, page 103.
-
(2014)
Syntax, Semantics and Structure in Statistical Translation
, pp. 103
-
-
Cho, K.1
Van Merriënboer, B.2
Bahdanau, D.3
Bengio, Y.4
-
8
-
-
80053558787
-
Natural language processing (almost) from scratch
-
Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. 2011. Natural language processing (almost) from scratch. The Journal of Machine Learning Research, 12:2493-2537.
-
(2011)
The Journal of Machine Learning Research
, vol.12
, pp. 2493-2537
-
-
Collobert, R.1
Weston, J.2
Bottou, L.3
Karlen, M.4
Kavukcuoglu, K.5
Kuksa, P.6
-
9
-
-
84925629091
-
Enhancing of chemical compound and drug name recognition using representative tag scheme and fine-grained tokenization
-
Hong-Jie Dai, Po-Ting Lai, Yung-Chun Chang, and Richard Tzong-Han Tsai. 2015. Enhancing of chemical compound and drug name recognition using representative tag scheme and fine-grained tokenization. Journal of cheminformatics, 7(S1):1-10.
-
(2015)
Journal of Cheminformatics
, vol.7
, Issue.S1
, pp. 1-10
-
-
Dai, H.-J.1
Lai, P.-T.2
Chang, Y.-C.3
Tsai, R.T.-H.4
-
12
-
-
84943742882
-
Transition-based dependency parsing with stack long short-term memory
-
Beijing, China, July
-
Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. In Proceedings of ACL-2015 (Volume 1: Long Papers), pages 334-343, Beijing, China, July.
-
(2015)
Proceedings of ACL-2015 (Volume 1: Long Papers)
, pp. 334-343
-
-
Dyer, C.1
Ballesteros, M.2
Ling, W.3
Matthews, A.4
Smith, N.A.5
-
14
-
-
0034293152
-
Learning to forget: Continual prediction with LSTM
-
Felix A Gers, Jürgen Schmidhuber, and Fred Cummins. 2000. Learning to forget: Continual prediction with LSTM. Neural computation, 12(10):2451-2471.
-
(2000)
Neural Computation
, vol.12
, Issue.10
, pp. 2451-2471
-
-
Gers, F.A.1
Schmidhuber, J.2
Cummins, F.3
-
16
-
-
84898932856
-
Overfitting in neural nets: Backpropagation, conjugate gradient, and early stopping
-
MIT Press
-
Rich Caruana, Steve Lawrence, and Lee Giles. 2001. Overfitting in neural nets: Backpropagation, conjugate gradient, and early stopping. In Advances in Neural Information Processing Systems 13: Proceedings of the 2000 Conference, volume 13, page 402. MIT Press.
-
(2001)
Advances in Neural Information Processing Systems 13: Proceedings of the 2000 Conference
, vol.13
, pp. 402
-
-
Caruana, R.1
Lawrence, S.2
Giles, L.3
-
17
-
-
85035364491
-
SVMTool: A general POS tagger generator based on support vector machines
-
Jesús Giménez and Lluís Màrquez. 2004. SVMTool: A general POS tagger generator based on support vector machines. In Proceedings of LREC-2004.
-
(2004)
Proceedings of LREC-2004
-
-
Giménez, J.1
Màrquez, L.2
-
19
-
-
0029727454
-
Learning task-dependent distributed representations by backpropagation through structure
-
IEEE
-
Christoph Goller and Andreas Kuchler. 1996. Learning task-dependent distributed representations by backpropagation through structure. In Neural Networks, 1996., IEEE International Conference on, volume 1, pages 347-352. IEEE.
-
(1996)
Neural Networks, 1996., IEEE International Conference on
, vol.1
, pp. 347-352
-
-
Goller, C.1
Kuchler, A.2
-
20
-
-
84890543083
-
Speech recognition with deep recurrent neural networks
-
IEEE
-
Alex Graves, Abdel-rahman Mohamed, and Geoffrey Hinton. 2013. Speech recognition with deep recurrent neural networks. In Proceedings of ICASSP-2013, pages 6645-6649. IEEE.
-
(2013)
Proceedings of ICASSP-2013
, pp. 6645-6649
-
-
Graves, A.1
Mohamed, A.2
Hinton, G.3
-
23
-
-
85012027098
-
Harnessing deep neural networks with logic rules
-
Berlin, Germany, August
-
Zhiting Hu, Xuezhe Ma, Zhengzhong Liu, Eduard H. Hovy, and Eric P. Xing. 2016. Harnessing deep neural networks with logic rules. In Proceedings of ACL-2016, Berlin, Germany, August.
-
(2016)
Proceedings of ACL-2016
-
-
Hu, Z.1
Ma, X.2
Liu, Z.3
Hovy, E.H.4
Xing, E.P.5
-
27
-
-
84859952372
-
Efficient third-order dependency parsers
-
Uppsala, Sweden, July
-
Terry Koo and Michael Collins. 2010. Efficient third-order dependency parsers. In Proceedings of ACL-2010, pages 1-11, Uppsala, Sweden, July.
-
(2010)
Proceedings of ACL-2010
, pp. 1-11
-
-
Koo, T.1
Collins, M.2
-
29
-
-
0142192295
-
Conditional random fields: Probabilistic models for segmenting and labeling sequence data
-
John Lafferty, Andrew McCallum, and Fernando CN Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of ICML-2001, volume 951, pages 282-289.
-
(2001)
Proceedings of ICML-2001
, vol.951
, pp. 282-289
-
-
Lafferty, J.1
McCallum, A.2
Pereira, F.C.N.3
-
30
-
-
84994130883
-
Neural architectures for named entity recognition
-
San Diego, California, USA, June
-
Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, and Chris Dyer. 2016. Neural architectures for named entity recognition. In Proceedings of NAACL-2016, San Diego, California, USA, June.
-
(2016)
Proceedings of NAACL-2016
-
-
Lample, G.1
Ballesteros, M.2
Subramanian, S.3
Kawakami, K.4
Dyer, C.5
-
31
-
-
0000359337
-
Backpropagation applied to handwritten zip code recognition
-
Yann LeCun, Bernhard Boser, John S Denker, Donnie Henderson, Richard E Howard, Wayne Hubbard, and Lawrence D Jackel. 1989. Backpropagation applied to handwritten zip code recognition. Neural computation, 1(4):541-551.
-
(1989)
Neural Computation
, vol.1
, Issue.4
, pp. 541-551
-
-
LeCun, Y.1
Boser, B.2
Denker, J.S.3
Henderson, D.4
Howard, R.E.5
Hubbard, W.6
Jackel, L.D.7
-
32
-
-
85185398851
-
Phrase clustering for discriminative learning
-
Dekang Lin and Xiaoyun Wu. 2009. Phrase clustering for discriminative learning. In Proceedings of ACL-2009, pages 1030-1038.
-
(2009)
Proceedings of ACL-2009
, pp. 1030-1038
-
-
Lin, D.1
Wu, X.2
-
33
-
-
84959892473
-
Finding function in form: Compositional character models for open vocabulary word representation
-
Lisbon, Portugal, September
-
Wang Ling, Chris Dyer, Alan W Black, Isabel Trancoso, Ramon Fermandez, Silvio Amir, Luis Marujo, and Tiago Luis. 2015. Finding function in form: Compositional character models for open vocabulary word representation. In Proceedings of EMNLP-2015, pages 1520-1530, Lisbon, Portugal, September.
-
(2015)
Proceedings of EMNLP-2015
, pp. 1520-1530
-
-
Ling, W.1
Dyer, C.2
Black, A.W.3
Trancoso, I.4
Fermandez, R.5
Amir, S.6
Marujo, L.7
Luis, T.8
-
34
-
-
84959884304
-
Joint entity recognition and disambiguation
-
Lisbon, Portugal, September
-
Gang Luo, Xiaojiang Huang, Chin-Yew Lin, and Zaiqing Nie. 2015. Joint entity recognition and disambiguation. In Proceedings of EMNLP-2015, pages 879-888, Lisbon, Portugal, September.
-
(2015)
Proceedings of EMNLP-2015
, pp. 879-888
-
-
Luo, G.1
Huang, X.2
Lin, C.-Y.3
Nie, Z.4
-
35
-
-
84959859456
-
Efficient inner-to-outer greedy algorithm for higher-order labeled dependency parsing
-
Lisbon, Portugal, September
-
Xuezhe Ma and Eduard Hovy. 2015. Efficient inner-to-outer greedy algorithm for higher-order labeled dependency parsing. In Proceedings of EMNLP-2015, pages 1322-1328, Lisbon, Portugal, September.
-
(2015)
Proceedings of EMNLP-2015
, pp. 1322-1328
-
-
Ma, X.1
Hovy, E.2
-
36
-
-
84906922946
-
Unsupervised dependency parsing with transferring distribution via parallel guidance and entropy regularization
-
Baltimore, Maryland, June
-
Xuezhe Ma and Fei Xia. 2014. Unsupervised dependency parsing with transferring distribution via parallel guidance and entropy regularization. In Proceedings of ACL-2014, pages 1337-1348, Baltimore, Maryland, June.
-
(2014)
Proceedings of ACL-2014
, pp. 1337-1348
-
-
Ma, X.1
Xia, F.2
-
37
-
-
84906924339
-
Fourth-order dependency parsing
-
Mumbai, India, December
-
Xuezhe Ma and Hai Zhao. 2012a. Fourth-order dependency parsing. In Proceedings of COLING 2012: Posters, pages 785-796, Mumbai, India, December.
-
(2012)
Proceedings of COLING 2012: Posters
, pp. 785-796
-
-
Ma, X.1
Zhao, H.2
-
39
-
-
84994173353
-
Unsupervised ranking model for entity coreference resolution
-
San Diego, California, USA, June
-
Xuezhe Ma, Zhengzhong Liu, and Eduard Hovy. 2016. Unsupervised ranking model for entity coreference resolution. In Proceedings of NAACL-2016, San Diego, California, USA, June.
-
(2016)
Proceedings of NAACL-2016
-
-
Ma, X.1
Liu, Z.2
Hovy, E.3
-
40
-
-
79952276191
-
Part-of-speech tagging from 97% to 100%: Is it time for some linguistics?
-
Springer
-
Christopher D Manning. 2011. Part-of-speech tagging from 97% to 100%: is it time for some linguistics? In Computational Linguistics and Intelligent Text Processing, pages 171-189. Springer.
-
(2011)
Computational Linguistics and Intelligent Text Processing
, pp. 171-189
-
-
Manning, C.D.1
-
41
-
-
34249852033
-
Building a large annotated corpus of English: The Penn Treebank
-
Mitchell Marcus, Beatrice Santorini, and Mary Ann Marcinkiewicz. 1993. Building a large annotated corpus of English: the Penn Treebank. Computational Linguistics, 19(2):313-330.
-
(1993)
Computational Linguistics
, vol.19
, Issue.2
, pp. 313-330
-
-
Marcus, M.1
Santorini, B.2
Marcinkiewicz, M.A.3
-
42
-
-
84859925125
-
Online large-margin training of dependency parsers
-
Ann Arbor, Michigan, USA, June 25-30
-
Ryan McDonald, Koby Crammer, and Fernando Pereira. 2005. Online large-margin training of dependency parsers. In Proceedings of ACL-2005, pages 91-98, Ann Arbor, Michigan, USA, June 25-30.
-
(2005)
Proceedings of ACL-2005
, pp. 91-98
-
-
McDonald, R.1
Crammer, K.2
Pereira, F.3
-
43
-
-
84898956512
-
Distributed representations of words and phrases and their compositionality
-
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems, pages 3111-3119.
-
(2013)
Advances in Neural Information Processing Systems
, pp. 3111-3119
-
-
Mikolov, T.1
Sutskever, I.2
Chen, K.3
Corrado, G.S.4
Dean, J.5
-
44
-
-
84860004149
-
Supervised noun phrase coreference research: The first fifteen years
-
Uppsala, Sweden, July. Association for Computational Linguistics
-
Vincent Ng. 2010. Supervised noun phrase coreference research: The first fifteen years. In Proceedings of ACL-2010, pages 1396-1411, Uppsala, Sweden, July. Association for Computational Linguistics.
-
(2010)
Proceedings of ACL-2010
, pp. 1396-1411
-
-
Ng, V.1
-
45
-
-
85119092123
-
Deterministic dependency parsing of English text
-
Geneva, Switzerland, August 23-27
-
Joakim Nivre and Mario Scholz. 2004. Deterministic dependency parsing of English text. In Proceedings of COLING-2004, pages 64-70, Geneva, Switzerland, August 23-27.
-
(2004)
Proceedings of COLING-2004
, pp. 64-70
-
-
Nivre, J.1
Scholz, M.2
-
47
-
-
85064069174
-
Lexicon infused phrase embeddings for named entity resolution
-
Ann Arbor, Michigan, June
-
Alexandre Passos, Vineet Kumar, and Andrew McCallum. 2014. Lexicon infused phrase embeddings for named entity resolution. In Proceedings of CoNLL-2014, pages 78-86, Ann Arbor, Michigan, June.
-
(2014)
Proceedings of CoNLL-2014
, pp. 78-86
-
-
Passos, A.1
Kumar, V.2
McCallum, A.3
-
48
-
-
84959875172
-
Named entity recognition for Chinese social media with jointly trained embeddings
-
Lisbon, Portugal, September
-
Nanyun Peng and Mark Dredze. 2015. Named entity recognition for Chinese social media with jointly trained embeddings. In Proceedings of EMNLP-2015, pages 548-554, Lisbon, Portugal, September.
-
(2015)
Proceedings of EMNLP-2015
, pp. 548-554
-
-
Peng, N.1
Dredze, M.2
-
49
-
-
85016571065
-
Improving named entity recognition for Chinese social media with word segmentation representation learning
-
Berlin, Germany, August
-
Nanyun Peng and Mark Dredze. 2016. Improving named entity recognition for Chinese social media with word segmentation representation learning. In Proceedings of ACL-2016, Berlin, Germany, August.
-
(2016)
Proceedings of ACL-2016
-
-
Peng, N.1
Dredze, M.2
-
50
-
-
84961289992
-
Glove: Global vectors for word representation
-
Doha, Qatar, October
-
Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. Glove: Global vectors for word representation. In Proceedings of EMNLP-2014, pages 1532-1543, Doha, Qatar, October.
-
(2014)
Proceedings of EMNLP-2014
, pp. 1532-1543
-
-
Pennington, J.1
Socher, R.2
Manning, C.3
-
51
-
-
84862300668
-
Design challenges and misconceptions in named entity recognition
-
Lev Ratinov and Dan Roth. 2009. Design challenges and misconceptions in named entity recognition. In Proceedings of CoNLL-2009, pages 147-155.
-
(2009)
Proceedings of CoNLL-2009
, pp. 147-155
-
-
Ratinov, L.1
Roth, D.2
-
52
-
-
85011857132
-
Learning character-level representations for part-of-speech tagging
-
Cicero D Santos and Bianca Zadrozny. 2014. Learning character-level representations for part-of-speech tagging. In Proceedings of ICML-2014, pages 1818-1826.
-
(2014)
Proceedings of ICML-2014
, pp. 1818-1826
-
-
Santos, C.D.1
Zadrozny, B.2
-
53
-
-
84860520429
-
Guided learning for bidirectional sequence classification
-
Libin Shen, Giorgio Satta, and Aravind Joshi. 2007. Guided learning for bidirectional sequence classification. In Proceedings of ACL-2007, volume 7, pages 760-767.
-
(2007)
Proceedings of ACL-2007
, vol.7
, pp. 760-767
-
-
Shen, L.1
Satta, G.2
Joshi, A.3
-
54
-
-
84859010772
-
Semi-supervised condensed nearest neighbor for part-of-speech tagging
-
Portland, Oregon, USA, June
-
Anders Søgaard. 2011. Semi-supervised condensed nearest neighbor for part-of-speech tagging. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 48-52, Portland, Oregon, USA, June.
-
(2011)
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
, pp. 48-52
-
-
Søgaard, A.1
-
55
-
-
84904163933
-
Dropout: A simple way to prevent neural networks from overfitting
-
Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research, 15(1):1929-1958.
-
(2014)
The Journal of Machine Learning Research
, vol.15
, Issue.1
, pp. 1929-1958
-
-
Srivastava, N.1
Hinton, G.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
57
-
-
85099019865
-
Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition
-
Stroudsburg, PA, USA
-
Erik F. Tjong Kim Sang and Fien De Meulder. 2003. Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition. In Proceedings of CoNLL-2003 - Volume 4, pages 142-147, Stroudsburg, PA, USA.
-
(2003)
Proceedings of CoNLL-2003
, vol.4
, pp. 142-147
-
-
Tjong Kim Sang, E.F.1
De Meulder, F.2
-
59
-
-
84983470508
-
Feature-rich part-of-speech tagging with a cyclic dependency network
-
Kristina Toutanova, Dan Klein, Christopher D Manning, and Yoram Singer. 2003. Feature-rich part-of-speech tagging with a cyclic dependency network. In Proceedings of NAACL-HLT-2003, Volume 1, pages 173-180.
-
(2003)
Proceedings of NAACL-HLT-2003
, vol.1
, pp. 173-180
-
-
Toutanova, K.1
Klein, D.2
Manning, C.D.3
Singer, Y.4