-
1
-
-
27844439373
-
A framework for learning predictive structures from multiple tasks and unlabeled data
-
Rie Kubota Ando and Tong Zhang. 2005. A framework for learning predictive structures from multiple tasks and unlabeled data. Journal of Machine Learning Research, 6:1817-1853.
-
(2005)
Journal of Machine Learning Research
, vol.6
, pp. 1817-1853
-
-
Kubota Ando, R.1
Zhang, T.2
-
2
-
-
84921977518
-
Neural machine translation by jointly learning to align and translate
-
abs/1409.0473
-
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. CoRR, abs/1409.0473.
-
(2014)
CoRR
-
-
Bahdanau, D.1
Cho, K.2
Bengio, Y.3
-
3
-
-
84897544737
-
Theano: New features and speed improvements
-
abs/1211.5590
-
Frédéric Bastien, Pascal Lamblin, Razvan Pascanu, James Bergstra, Ian J. Goodfellow, Arnaud Bergeron, Nicolas Bouchard, David Warde-Farley, and Yoshua Bengio. 2012. Theano: new features and speed improvements. CoRR, abs/1211.5590.
-
(2012)
CoRR
-
-
Bastien, F.1
Lamblin, P.2
Pascanu, R.3
Bergstra, J.4
Goodfellow, I.J.5
Bergeron, A.6
Bouchard, N.7
Warde-Farley, D.8
Bengio, Y.9
-
4
-
-
33847215211
-
Stochastic gradient learning in neural networks
-
Nimes, France. EC2
-
Léon Bottou. 1991. Stochastic gradient learning in neural networks. In Proceedings of Neuro-Nîmes 91, Nimes, France. EC2.
-
(1991)
Proceedings of Neuro-Nîmes
, vol.91
-
-
Bottou, L.1
-
5
-
-
84943772258
-
On the properties of neural machine translation: Encoderdecoder approaches
-
abs/1409.1259
-
Kyung Hyun Cho, Bart van Merrienboer, Dzmitry Bahdanau, and Yoshua Bengio. 2014. On the properties of neural machine translation: Encoderdecoder approaches. CoRR, abs/1409.1259.
-
(2014)
CoRR
-
-
Hyun Cho, K.1
Van Merrienboer, B.2
Bahdanau, D.3
Bengio, Y.4
-
6
-
-
84860539387
-
Machine translation by triangulation: Making effective use of multi-parallel corpora
-
Trevor Cohn and Mirella Lapata. 2007. Machine translation by triangulation: Making effective use of multi-parallel corpora. In Proc. ACL, pages 728-735.
-
(2007)
Proc. ACL
, pp. 728-735
-
-
Cohn, T.1
Lapata, M.2
-
7
-
-
80053558787
-
Natural language processing (almost) from scratch
-
Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel P. Kuksa. 2011. Natural language processing (almost) from scratch. Journal of Machine Learning Research, 12:2493-2537.
-
(2011)
Journal of Machine Learning Research
, vol.12
, pp. 2493-2537
-
-
Collobert, R.1
Weston, J.2
Bottou, L.3
Karlen, M.4
Kavukcuoglu, K.5
Kuksa, P.P.6
-
8
-
-
84906931795
-
Multi-domain adaptation for SMT using multi-Task learning
-
Lei Cui, Xilun Chen, Dongdong Zhang, Shujie Liu, Mu Li, and Ming Zhou. 2013. Multi-domain adaptation for SMT using multi-Task learning. In Proc. EMNLP, pages 1055-1065.
-
(2013)
Proc. EMNLP
, pp. 1055-1065
-
-
Cui, L.1
Chen, X.2
Zhang, D.3
Liu, S.4
Li, M.5
Zhou, M.6
-
9
-
-
84906921986
-
Fast and robust neural network joint models for statistical machine translation
-
Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard M. Schwartz, and John Makhoul. 2014. Fast and robust neural network joint models for statistical machine translation. In Proc. ACL, pages 1370-1380.
-
(2014)
Proc. ACL
, pp. 1370-1380
-
-
Devlin, J.1
Zbib, R.2
Huang, Z.3
Lamar, T.4
Schwartz, R.M.5
Makhoul, J.6
-
10
-
-
84906932220
-
Learning continuous phrase representations for translation modeling
-
Jianfeng Gao, Xiaodong He, Wen-Tau Yih, and Li Deng. 2014. Learning continuous phrase representations for translation modeling. In Proc. ACL, pages 699-709.
-
(2014)
Proc. ACL
, pp. 699-709
-
-
Gao, J.1
He, X.2
Yih, W.3
Deng, L.4
-
11
-
-
84875853611
-
Incremental joint approach to word segmentation, POS tagging, and dependency parsing in chinese
-
Jun Hatori, Takuya Matsuzaki, Yusuke Miyao, and Junichi Tsujii. 2012. Incremental joint approach to word segmentation, POS tagging, and dependency parsing in chinese. In Proc. ACL, pages 1045-1053.
-
(2012)
Proc. ACL
, pp. 1045-1053
-
-
Hatori, J.1
Matsuzaki, T.2
Miyao, Y.3
Tsujii, J.4
-
12
-
-
84926283798
-
Recurrent continuous translation models
-
Nal Kalchbrenner and Phil Blunsom. 2013. Recurrent continuous translation models. In Proc. EMNLP, pages 1700-1709.
-
(2013)
Proc. EMNLP
, pp. 1700-1709
-
-
Kalchbrenner, N.1
Blunsom, P.2
-
13
-
-
35048882514
-
Pharaoh: A beam search decoder for phrase-based statistical machine translation models
-
AMTA 2004, Washington, DC, USA, September 28-October 2, 2004, Proceedings
-
Philipp Koehn. 2004. Pharaoh: A beam search decoder for phrase-based statistical machine translation models. In Machine Translation: From Real Users to Research, 6th Conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, USA, September 28-October 2, 2004, Proceedings, pages 115-124.
-
(2004)
Machine Translation: From Real Users to Research, 6th Conference of the Association for Machine Translation in the Americas
, pp. 115-124
-
-
Koehn, P.1
-
14
-
-
84897952656
-
Joint optimization for chinese POS tagging and dependency parsing
-
Zhenghua Li, Min Zhang, Wanxiang Che, Ting Liu, and Wenliang Chen. 2014. Joint optimization for chinese POS tagging and dependency parsing. IEEE/ACM Transactions on Audio, Speech & Language Processing, 22(1):274-286.
-
(2014)
IEEE/ACM Transactions on Audio, Speech & Language Processing
, vol.22
, Issue.1
, pp. 274-286
-
-
Li, Z.1
Zhang, M.2
Che, W.3
Liu, T.4
Chen, W.5
-
15
-
-
84906923393
-
A recursive recurrent neural network for statistical machine translation
-
Shujie Liu, Nan Yang, Mu Li, and Ming Zhou. 2014. A recursive recurrent neural network for statistical machine translation. In Proc. ACL, pages 1491-1500.
-
(2014)
Proc. ACL
, pp. 1491-1500
-
-
Liu, S.1
Yang, N.2
Li, M.3
Zhou, M.4
-
16
-
-
85133336275
-
Bleu: A method for automatic evaluation of machine translation
-
Stroudsburg, PA, USA. Association for Computational Linguistics
-
Kishore Papineni, Salim Roukos, ToddWard, andWei-Jing Zhu. 2002. Bleu: A method for automatic evaluation of machine translation. In Proc. ACL, ACL 2002, pages 311-318, Stroudsburg, PA, USA. Association for Computational Linguistics.
-
(2002)
Proc. ACL, ACL 2002
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.4
-
17
-
-
84907312418
-
A multi-domain translation model framework for statistical machine translation
-
Rico Sennrich, Holger Schwenk, and Walid Aransa. 2013. A multi-domain translation model framework for statistical machine translation. In Proc. ACL, pages 832-840.
-
(2013)
Proc. ACL
, pp. 832-840
-
-
Sennrich, R.1
Schwenk, H.2
Aransa, W.3
-
18
-
-
84961291354
-
Translation modeling with bidirectional recurrent neural networks
-
Martin Sundermeyer, Tamer Alkhouli, Joern Wuebker, and Hermann Ney. 2014. Translation modeling with bidirectional recurrent neural networks. In Proc. EMNLP, pages 14-25.
-
(2014)
Proc. EMNLP
, pp. 14-25
-
-
Sundermeyer, M.1
Alkhouli, T.2
Wuebker, J.3
Ney, H.4
-
19
-
-
84928547704
-
-
Annual Conference on Neural Information Processing Systems 2014, December 8-13 2014, Montreal, Quebec, Canada
-
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8-13 2014, Montreal, Quebec, Canada, pages 3104-3112.
-
(2014)
Sequence to Sequence Learning With Neural Networks Advances In Neural Information Processing Systems
, vol.27
, pp. 3104-3112
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.3
-
20
-
-
54249155932
-
Pivot language approach for phrase-based statistical machine translation
-
Hua Wu and Haifeng Wang. 2007. Pivot language approach for phrase-based statistical machine translation. In Proc. ACL, pages 165-181.
-
(2007)
Proc. ACL
, pp. 165-181
-
-
Wu, H.1
Wang, H.2
-
21
-
-
84905272120
-
ADADELTA: An adaptive learning rate method
-
Matthew D. Zeiler. 2012. ADADELTA: an adaptive learning rate method. CoRR, abs/1212.5701.
-
(2012)
CoRR, abs/1212.5701
-
-
Zeiler, M.D.1
|