-
1
-
-
84906923072
-
Decoder integration and expected bleu training for recurrent neural network language models
-
Baltimore, Maryland, June Association for Computational Linguistics
-
Michael Auli and Jianfeng Gao. Decoder integration and expected bleu training for recurrent neural network language models. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL '14), pages 136-142, Baltimore, Maryland, June 2014. Association for Computational Linguistics.
-
(2014)
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL '14)
, pp. 136-142
-
-
Auli, M.1
Gao, J.2
-
2
-
-
84926321124
-
Joint language and translation modeling with recurrent neural networks
-
Seattle, Washington, USA, October 2013. Association for Computational Linguistics
-
Michael Auli, Michel Galley, Chris Quirk, and Geoffrey Zweig. Joint language and translation modeling with recurrent neural networks. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 1044-1054, Seattle, Washington, USA, October 2013. Association for Computational Linguistics.
-
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
, pp. 1044-1054
-
-
Auli, M.1
Galley, M.2
Quirk, C.3
Zweig, G.4
-
3
-
-
84941201812
-
Oxlm: A neural language modelling framework for machine translation
-
October
-
Paul Baltescu, Phil Blunsom, and Hieu Hoang. Oxlm: A neural language modelling framework for machine translation. The Prague Bulletin of Mathematical Linguistics, 102(1):81-92, October 2014.
-
(2014)
The Prague Bulletin of Mathematical Linguistics
, vol.102
, Issue.1
, pp. 81-92
-
-
Baltescu, P.1
Blunsom, P.2
Hoang, H.3
-
4
-
-
42549142788
-
Adaptive importance sampling to accelerate training of a neural probabilistic language model
-
Yoshua Bengio and Jean-Sbastien Senecal. Adaptive importance sampling to accelerate training of a neural probabilistic language model. IEEE Transactions on Neural Networks, 19(4):713-722, 2008.
-
(2008)
IEEE Transactions on Neural Networks
, vol.19
, Issue.4
, pp. 713-722
-
-
Bengio, Y.1
Senecal, J.2
-
5
-
-
0142166851
-
A neural probabilistic language model
-
Yoshua Bengio, Réjean Ducharme, Pascal Vincent, and Christian Janvin. A neural probabilistic language model. Journal of Machine Learning Research, 3:1137-1155, 2003.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 1137-1155
-
-
Bengio, Y.1
Ducharme, R.2
Vincent, P.3
Janvin, C.4
-
7
-
-
85022919385
-
Class-based n-gram models of natural language
-
Peter F. Brown, Peter V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, and Jenifer C. Lai. Class-based n-gram models of natural language. Computational Linguistics, 18(4):467-479, 1992.
-
(1992)
Computational Linguistics
, vol.18
, Issue.4
, pp. 467-479
-
-
Brown, P.F.1
DeSouza, P.V.2
Mercer, R.L.3
Pietra Della, V.J.4
Lai, J.C.5
-
8
-
-
84943757899
-
One billion word benchmark for measuring progress in statistical language modeling
-
Ciprian Chelba, Tomas Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, and Phillipp Koehn. One billion word benchmark for measuring progress in statistical language modeling. CoRR, 2013.
-
(2013)
CoRR
-
-
Chelba, C.1
Mikolov, T.2
Schuster, M.3
Ge, Q.4
Brants, T.5
Koehn, P.6
-
9
-
-
0033329799
-
An empirical study of smoothing techniques for language modeling
-
Stanley F. Chen and Joshua Goodman. An empirical study of smoothing techniques for language modeling. Computer Speech & Language, 13(4): 359-393, 1999.
-
(1999)
Computer Speech & Language
, vol.13
, Issue.4
, pp. 359-393
-
-
Chen, S.F.1
Goodman, J.2
-
10
-
-
84960134362
-
On the properties of neural machine translation: Encoder-decoder approaches
-
KyungHyun Cho, Bart van Merrienboer, Dzmitry Bahdanau, and Yoshua Bengio. On the properties of neural machine translation: Encoder-decoder approaches. CoRR, 2014.
-
(2014)
CoRR
-
-
Cho, K.1
Van Merrienboer, B.2
Bahdanau, D.3
Bengio, Y.4
-
11
-
-
84906921986
-
Fast and robust neural network joint models for statistical machine translation
-
Baltimore, MD, USA, June
-
Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard M. Schwartz, and John Makhoul. Fast and robust neural network joint models for statistical machine translation. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14), Baltimore, MD, USA, June 2014.
-
(2014)
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14)
-
-
Devlin, J.1
Zbib, R.2
Huang, Z.3
Lamar, T.4
Schwartz, R.M.5
Makhoul, J.6
-
12
-
-
84989154066
-
Classes for fast maximum entropy training
-
Joshua Goodman. Classes for fast maximum entropy training. CoRR, 2001.
-
(2001)
CoRR
-
-
Goodman, J.1
-
13
-
-
84982842007
-
Kenlm: Faster and smaller language model queries
-
Edinburgh, Scotland, July. Association for Computational Linguistics
-
Kenneth Heafield. Kenlm: Faster and smaller language model queries. In Proceedings of the Sixth Workshop on Statistical Machine Translation (WMT '11), pages 187-197, Edinburgh, Scotland, July 2011. Association for Computational Linguistics.
-
(2011)
Proceedings of the Sixth Workshop on Statistical Machine Translation (WMT '11)
, pp. 187-197
-
-
Heafield, K.1
-
14
-
-
84938015047
-
A method for the construction of minimum-redundancy codes
-
September
-
David A. Huffman. A method for the construction of minimum-redundancy codes. Proceedings of the Institute of Radio Engineers, 40(9):1098-1101, September 1952.
-
(1952)
Proceedings of the Institute of Radio Engineers
, vol.40
, Issue.9
, pp. 1098-1101
-
-
Huffman, D.A.1
-
15
-
-
85110867932
-
Moses: Open source toolkit for statistical machine translation
-
Prague, Czech Republic, June. Association for Computational Linguistics
-
Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, and Evan Herbst. Moses: Open source toolkit for statistical machine translation. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL '07), pages 177-180, Prague, Czech Republic, June 2007. Association for Computational Linguistics.
-
(2007)
Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL '07)
, pp. 177-180
-
-
Koehn, P.1
Hoang, H.2
Birch, A.3
Callison-Burch, C.4
Federico, M.5
Bertoldi, N.6
Cowan, B.7
Shen, W.8
Moran, C.9
Zens, R.10
Dyer, C.11
Bojar, O.12
Constantin, A.13
Herbst, E.14
-
17
-
-
84858966958
-
Strategies for training large scale neural network language models
-
IEEE Signal Processing Society
-
Tomas Mikolov, Anoop Deoras, Daniel Povey, Lukas Burget, and Jan Cernocky. Strategies for training large scale neural network language models. In Proceedings of the 2011 Automatic Speech Recognition and Understanding Workshop, pages 196-201. IEEE Signal Processing Society, 2011a.
-
(2011)
Proceedings of the 2011 Automatic Speech Recognition and Understanding Workshop
, pp. 196-201
-
-
Mikolov, T.1
Deoras, A.2
Povey, D.3
Burget, L.4
Cernocky, J.5
-
18
-
-
80051643236
-
Jan ernock, and Sanjeev Khudanpur. Extensions of recurrent neural network language model
-
IEEE Signal Processing Society
-
Tom Mikolov, Stefan Kombrink, Luk Burget, Jan ernock, and Sanjeev Khudanpur. Extensions of recurrent neural network language model. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, pages 5528-5531. IEEE Signal Processing Society, 2011b.
-
(2011)
Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
, pp. 5528-5531
-
-
Mikolov, T.1
Kombrink, S.2
Burget, L.3
-
23
-
-
33847610331
-
Continuous space language models
-
Holger Schwenk. Continuous space language models. Computer Speech & Language, 21(3):492-518, 2007.
-
(2007)
Computer Speech & Language
, vol.21
, Issue.3
, pp. 492-518
-
-
Schwenk, H.1
-
24
-
-
85044798389
-
Continuous-space language models for statistical machine translation
-
Holger Schwenk. Continuous-space language models for statistical machine translation. Prague Bulletin of Mathematical Linguistics, 93:137-146, 2010.
-
(2010)
Prague Bulletin of Mathematical Linguistics
, vol.93
, pp. 137-146
-
-
Schwenk, H.1
-
25
-
-
84959277840
-
Sequence to sequence learning with neural networks
-
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. Sequence to sequence learning with neural networks. CoRR, 2014.
-
(2014)
CoRR
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.3
-
26
-
-
84926298172
-
Decoding with large-scale neural language models improves translation
-
Seattle, Washington, USA, October. Association for Computational Linguistics
-
Ashish Vaswani, Yinggong Zhao, Victoria Fossum, and David Chiang. Decoding with large-scale neural language models improves translation. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 1387-1392, Seattle, Washington, USA, October 2013. Association for Computational Linguistics.
-
(2013)
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
, pp. 1387-1392
-
-
Vaswani, A.1
Zhao, Y.2
Fossum, V.3
Chiang, D.4
-
27
-
-
84921414780
-
An investigation on statistical machine translation with neural language models
-
Wuhan, China, October
-
Yinggong Zhao, Shujian Huang, Huadong Chen, and Jiajun Chen. An investigation on statistical machine translation with neural language models. In Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data-13th China National Conference, CCL 2014, and Second International Symposium, NLP-NABD, pages 175-186,Wuhan, China, October 2014.
-
(2014)
Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data-13th China National Conference, CCL 2014, and Second International Symposium, NLP-NABD
, pp. 175-186
-
-
Zhao, Y.1
Huang, S.2
Chen, H.3
Chen, J.4
|