-
1
-
-
84926321124
-
Joint language and translation modeling with recurrent neural networks
-
Michael Auli, Michel Galley, Chris Quirk, and Geoffrey Zweig. 2013. Joint language and translation modeling with recurrent neural networks. In Proc. of EMNLP, pages 1044-1054.
-
(2013)
Proc. of EMNLP
, pp. 1044-1054
-
-
Auli, M.1
Galley, M.2
Quirk, C.3
Zweig, G.4
-
4
-
-
84961291190
-
Learning phrase representations using RNN encoder-decoder for statistical machine translation
-
Kyunghyun Cho, Bart van Merrienboer, Çaglar Gülçehre, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. Proc. of EMNLP.
-
(2014)
Proc. of EMNLP
-
-
Cho, K.1
Van Merrienboer, B.2
Aglar Gülçehre, C.3
Bougares, F.4
Schwenk, H.5
Bengio, Y.6
-
5
-
-
56449095373
-
A unified architecture for natural language processing: Deep neural networks with multitask learning
-
ACM
-
Ronan Collobert and Jason Weston. 2008. A unified architecture for natural language processing: Deep neural networks with multitask learning. In Proc. of ICML, pages 160-167. ACM.
-
(2008)
Proc. of ICML
, pp. 160-167
-
-
Collobert, R.1
Weston, J.2
-
6
-
-
84906921986
-
Fast and robust neural network joint models for statistical machine translation
-
Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard Schwartz, and John Makhoul. 2014. Fast and robust neural network joint models for statistical machine translation. In Proc. of ACL.
-
(2014)
Proc. of ACL
-
-
Devlin, J.1
Zbib, R.2
Huang, Z.3
Lamar, T.4
Schwartz, R.5
Makhoul, J.6
-
7
-
-
80052250414
-
Adaptive subgradient methods for online learning and stochastic optimization
-
John Duchi, Elad Hazan, and Yoram Singer. 2011. Adaptive subgradient methods for online learning and stochastic optimization. Journ. Mach. Learn. Res., 12:2121-2159.
-
(2011)
Journ. Mach. Learn. Res
, vol.12
, pp. 2121-2159
-
-
Duchi, J.1
Hazan, E.2
Singer, Y.3
-
8
-
-
84906932220
-
Learning continuous phrase representations for translation modeling
-
Jianfeng Gao, Xiaodong He, Wen tau Yih, and Li Deng. 2014a. Learning continuous phrase representations for translation modeling. In Proc. of ACL, pages 699-709.
-
(2014)
Proc. of ACL
, pp. 699-709
-
-
Gao, J.1
He, X.2
Tau Yih, W.3
Deng, L.4
-
9
-
-
85106623891
-
Modeling interestingness with deep neural networks
-
Jianfeng Gao, Patrick Pantel, Michael Gamon, Xiaodong He, and Li Deng. 2014b. Modeling interestingness with deep neural networks. In Proc. of EMNLP, pages 2-13.
-
(2014)
Proc. of EMNLP
, pp. 2-13
-
-
Gao, J.1
Pantel, P.2
Gamon, M.3
He, X.4
Deng, L.5
-
11
-
-
77956510865
-
Noisecontrastive estimation: A new estimation principle for unnormalized statistical models
-
Michael Gutmann and Aapo Hyvärinen. 2010. Noisecontrastive estimation: A new estimation principle for unnormalized statistical models. In Proc. of AISTATS, pages 297-304.
-
(2010)
Proc. of AISTATS
, pp. 297-304
-
-
Gutmann, M.1
Hyvärinen, A.2
-
12
-
-
84889566627
-
Learning deep structured semantic models for web search using clickthrough data
-
Po-Sen Huang, Xiaodong He, Jianfeng Gao, Li Deng, Alex Acero, and Larry Heck. 2013. Learning deep structured semantic models for web search using clickthrough data. In Proc. of CIKM, pages 2333-2338.
-
(2013)
Proc. of CIKM
, pp. 2333-2338
-
-
Huang, P.1
He, X.2
Gao, J.3
Deng, L.4
Acero, A.5
Heck, L.6
-
13
-
-
84926283798
-
Recurrent continuous translation models
-
Nal Kalchbrenner and Phil Blunsom. 2013. Recurrent continuous translation models. Proc. of EMNLP, pages 1700-1709.
-
(2013)
Proc. of EMNLP
, pp. 1700-1709
-
-
Kalchbrenner, N.1
Blunsom, P.2
-
14
-
-
0028996876
-
Improved backing-off for M-gram language modeling
-
May
-
Reinhard Kneser and Hermann Ney. 1995. Improved backing-off for M-gram language modeling. In Proc. of ICASSP, pages 181-184, May.
-
(1995)
Proc. of ICASSP
, pp. 181-184
-
-
Kneser, R.1
Ney, H.2
-
15
-
-
85110867932
-
Moses: Open source toolkit for statistical machine translation
-
Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, and Evan Herbst. 2007. Moses: Open Source Toolkit for Statistical Machine Translation. In Proc. of ACL Demo and Poster Sessions, pages 177-180.
-
(2007)
Proc. of ACL Demo and Poster Sessions
, pp. 177-180
-
-
Koehn, P.1
Hoang, H.2
Birch, A.3
Callison-Burch, C.4
Federico, M.5
Bertoldi, N.6
Cowan, B.7
Shen, W.8
Moran, C.9
Zens, R.10
Dyer, C.11
Bojar, O.12
Constantin, A.13
Herbst, E.14
-
17
-
-
79959829092
-
Recurrent neural network based language model
-
Tomas Mikolov, Martin Karafiát, Lukas Burget, Jan Cernocký, and Sanjeev Khudanpur. 2010. Recurrent neural network based language model. In Proc. of INTERSPEECH, pages 1045-1048.
-
(2010)
Proc. of INTERSPEECH
, pp. 1045-1048
-
-
Mikolov, T.1
Karafiát, M.2
Burget, L.3
Cernocký, J.4
Khudanpur, S.5
-
18
-
-
22944469345
-
The alignment template approach to machine translation
-
Franz Josef Och and Hermann Ney. 2004. The alignment template approach to machine translation. Comput. Linguist., 30(4):417-449.
-
(2004)
Comput. Linguist
, vol.30
, Issue.4
, pp. 417-449
-
-
Josef Och, F.1
Ney, H.2
-
19
-
-
22944447077
-
Minimum error rate training in statistical machine translation
-
Franz Josef Och. 2003. Minimum error rate training in statistical machine translation. In Proc. of ACL, pages 160-167.
-
(2003)
Proc. of ACL
, pp. 160-167
-
-
Josef Och, F.1
-
20
-
-
85133336275
-
BLEU: A method for automatic evaluation of machine translation
-
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proc. of ACL, pages 311-318.
-
(2002)
Proc. of ACL
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.4
-
21
-
-
84892982833
-
On the difficulty of training recurrent neural networks
-
Razvan Pascanu, Tomas Mikolov, and Yoshua Bengio. 2013. On the difficulty of training recurrent neural networks. Proc. of ICML, pages 1310-1318.
-
(2013)
Proc. of ICML
, pp. 1310-1318
-
-
Pascanu, R.1
Mikolov, T.2
Bengio, Y.3
-
22
-
-
80053292690
-
Data-driven response generation in social media
-
Alan Ritter, Colin Cherry, and William B. Dolan. 2011. Data-driven response generation in social media. In Proc. of EMNLP, pages 583-593.
-
(2011)
Proc. of EMNLP
, pp. 583-593
-
-
Ritter, A.1
Cherry, C.2
Dolan, W.B.3
-
24
-
-
0141853652
-
Learning representations by backpropagating errors
-
James A. Anderson and Edward Rosenfeld, editors MIT Press, Cambridge, MA, USA
-
David E. Rumelhart, Geoffrey E. Hinton, and Ronald J. Williams. 1988. Learning representations by backpropagating errors. In James A. Anderson and Edward Rosenfeld, editors, Neurocomputing: Foundations of Research, pages 696-699. MIT Press, Cambridge, MA, USA.
-
(1988)
Neurocomputing: Foundations of Research
, pp. 696-699
-
-
Rumelhart, D.E.1
Hinton, G.E.2
Williams, R.J.3
-
25
-
-
84928315948
-
A latent semantic model with convolutional-pooling structure for information retrieval
-
Yelong Shen, Xiaodong He, Jianfeng Gao, Li Deng, and Grégoire Mesnil. 2014. A latent semantic model with convolutional-pooling structure for information retrieval. In Proc. of CIKM, pages 101-110.
-
(2014)
Proc. of CIKM
, pp. 101-110
-
-
Shen, Y.1
He, X.2
Gao, J.3
Deng, L.4
Mesnil, G.5
-
28
-
-
84994101051
-
A trainable generator for recommendations in multimodal dialog
-
Marilyn A. Walker, Rashmi Prasad, and Amanda Stent. 2003. A trainable generator for recommendations in multimodal dialog. In Proc. of EUROSPEECH.
-
(2003)
Proc. of EUROSPEECH
-
-
Walker, M.A.1
Prasad, R.2
Stent, A.3
-
29
-
-
70349231178
-
The hidden information state model: A practical framework for pomdp-based spoken dialogue management
-
Steve Young, Milica Ga?sić, Simon Keizer, François Mairesse, Jost Schatzmann, Blaise Thomson, and Kai Yu. 2010. The hidden information state model: A practical framework for pomdp-based spoken dialogue management. Comput. Speech Lang., 24(2):150-174.
-
(2010)
Comput. Speech Lang
, vol.24
, Issue.2
, pp. 150-174
-
-
Young, S.1
Gasić, M.2
Keizer, S.3
Mairesse, F.4
Schatzmann, J.5
Thomson, B.6
Yu, K.7
-
30
-
-
85026948453
-
Talking to machines (statistically speaking)
-
Steve Young. 2002. Talking to machines (statistically speaking). In Proc. of INTERSPEECH.
-
(2002)
Proc. of INTERSPEECH
-
-
Young, S.1
|