-
1
-
-
85075436378
-
-
Association for Computational Linguistics
-
Ebru Arisoy, Tara N. Sainath, Brian Kingsbury, and Bhuvana Ramabhadran. 2012. Deep neural network language models. In Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, pages 20-28. Association for Computational Linguistics.
-
(2012)
Deep Neural Network Language Models
, pp. 20-28
-
-
Arisoy, E.1
Sainath, T.N.2
Kingsbury, B.3
Ramabhadran, B.4
-
2
-
-
84926321124
-
Joint language and translation modeling with recurrent neural networks
-
Seattle, USA, October
-
Michael Auli, Michel Galley, Chris Quirk, and Geoffrey Zweig. 2013. Joint Language and Translation Modeling with Recurrent Neural Networks. In Conference on Empirical Methods in Natural Language Processing, pages 1044-1054, Seattle, USA, October.
-
(2013)
Conference on Empirical Methods in Natural Language Processing
, pp. 1044-1054
-
-
Auli, M.1
Galley, M.2
Quirk, C.3
Zweig, G.4
-
3
-
-
0028392483
-
Learning long-term dependencies with gradient descent is difficult
-
Yoshua Bengio, Patrice Simard, and Paolo Frasconi. 1994. Learning long-term dependencies with gradient descent is difficult. Neural Networks, IEEE Transactions on, 5(2):157-166.
-
(1994)
Neural Networks, IEEE Transactions on
, vol.5
, Issue.2
, pp. 157-166
-
-
Bengio, Y.1
Simard, P.2
Frasconi, P.3
-
5
-
-
84961291873
-
Text-to-text machine translation using the RECONTRA connectionist model
-
Alicante, Spain
-
Maria Asunción Castaño and Francisco Casacuberta. 1999. Text-to-text machine translation using the RECONTRA connectionist model. In Lecture Notes in Computer Science (IWANN 99), volume 1607, pages 683-692, Alicante, Spain.
-
(1999)
Lecture Notes in Computer Science (IWANN 99)
, vol.1607
, pp. 683-692
-
-
Castaño, M.A.1
Casacuberta, F.2
-
6
-
-
0005689727
-
Machine translation using neural networks and finite-state models
-
Santa Fe, USA
-
Maria Asunción Castaño, Francisco Casacuberta, and Enrique Vidal. 1997. Machine translation using neural networks and finite-state models. In 7th International Conference on Theoretical and Methodological Issues in Machine Translation. TMI'97, pages 160-167, Santa Fe, USA.
-
(1997)
7th International Conference on Theoretical and Methodological Issues in Machine Translation. TMI'97
, pp. 160-167
-
-
Castaño, M.A.1
Casacuberta, F.2
Vidal, E.3
-
7
-
-
0003396042
-
An empirical study of smoothing techniques for language modeling
-
Harvard University, Cambridge, MA, August
-
Stanley F. Chen and Joshua Goodman. 1998. An Empirical Study of Smoothing Techniques for Language Modeling. Technical Report TR-10-98, Computer Science Group, Harvard University, Cambridge, MA, August.
-
(1998)
Technical Report TR-10-98, Computer Science Group
-
-
Chen, S.F.1
Goodman, J.2
-
8
-
-
84906921986
-
Fast and robust neural network joint models for statistical machine translation
-
page to appear, Baltimore, MD, USA, June
-
Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard Schwartz, and John Makhoul. 2014. Fast and Robust Neural Network Joint Models for Statistical Machine Translation. In 52nd Annual Meeting of the Association for Computational Linguistics, page to appear, Baltimore, MD, USA, June.
-
(2014)
52nd Annual Meeting of the Association for Computational Linguistics
-
-
Devlin, J.1
Zbib, R.2
Huang, Z.3
Lamar, T.4
Schwartz, R.5
Makhoul, J.6
-
9
-
-
80053360939
-
A simple and effective hierarchical phrase reordering model
-
Stroudsburg, PA, USA. Association for Computational Linguistics
-
Michel Galley and Christopher D. Manning. 2008. A simple and effective hierarchical phrase reordering model. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '08, pages 848-856, Stroudsburg, PA, USA. Association for Computational Linguistics.
-
(2008)
Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '08
, pp. 848-856
-
-
Galley, M.1
Manning, C.D.2
-
10
-
-
0034293152
-
Learning to forget: Continual prediction with LSTM
-
Felix A. Gers, Jürgen Schmidhuber, and Fred Cummins. 2000. Learning to forget: Continual prediction with LSTM. Neural Computation, 12(10):2451-2471.
-
(2000)
Neural Computation
, vol.12
, Issue.10
, pp. 2451-2471
-
-
Gers, F.A.1
Schmidhuber, J.2
Cummins, F.3
-
12
-
-
0034856455
-
Classes for fast maximum entropy training
-
IEEE
-
Joshua Goodman. 2001. Classes for fast maximum entropy training. In Acoustics, Speech, and Signal Processing, 2001. Proceedings.(ICASSP'01). 2001 IEEE International Conference on, volume 1, pages 561-564. IEEE.
-
(2001)
Acoustics, Speech, and Signal Processing, 2001. Proceedings.(ICASSP'01). 2001 IEEE International Conference on
, vol.1
, pp. 561-564
-
-
Goodman, J.1
-
13
-
-
27744588611
-
Framewise phoneme classification with bidirectional LSTM and other neural network architectures
-
Alex Graves and Jürgen Schmidhuber. 2005. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks, 18(5):602-610.
-
(2005)
Neural Networks
, vol.18
, Issue.5
, pp. 602-610
-
-
Graves, A.1
Schmidhuber, J.2
-
16
-
-
84905705252
-
Minimum translation modeling with recurrent neural networks
-
Gothenburg, Sweden, April. Association for Computational Linguistics
-
Yuening Hu, Michael Auli, Qin Gao, and Jianfeng Gao. 2014. Minimum translation modeling with recurrent neural networks. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pages 20-29, Gothenburg, Sweden, April. Association for Computational Linguistics.
-
(2014)
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics
, pp. 20-29
-
-
Hu, Y.1
Auli, M.2
Gao, Q.3
Gao, J.4
-
17
-
-
85011919605
-
A phrase orientation model for hierarchical machine translation
-
Sofia, Bulgaria, August
-
Matthias Huck, Joern Wuebker, Felix Rietig, and Hermann Ney. 2013. A phrase orientation model for hierarchical machine translation. In ACL 2013 Eighth Workshop on Statistical Machine Translation, pages 452-463, Sofia, Bulgaria, August.
-
(2013)
ACL 2013 Eighth Workshop on Statistical Machine Translation
, pp. 452-463
-
-
Huck, M.1
Wuebker, J.2
Rietig, F.3
Ney, H.4
-
18
-
-
84926283798
-
Recurrent continuous translation models
-
Seattle, Washington, USA, October. Association for Computational Linguistics
-
Nal Kalchbrenner and Phil Blunsom. 2013. Recurrent continuous translation models. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 1700-1709, Seattle, Washington, USA, October. Association for Computational Linguistics.
-
(2013)
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
, pp. 1700-1709
-
-
Kalchbrenner, N.1
Blunsom, P.2
-
19
-
-
0028996876
-
Improved backing-off for m-gram language modeling
-
May
-
Reinhard Kneser and Hermann Ney. 1995. Improved backing-off for M-gram language modeling. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, volume 1, pages 181-184, May.
-
(1995)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
, vol.1
, pp. 181-184
-
-
Kneser, R.1
Ney, H.2
-
20
-
-
85118138826
-
Statistical phrase-based translation
-
Edmonton, Alberta
-
Philipp Koehn, Franz Josef Och, and Daniel Marcu. 2003. Statistical Phrase-Based Translation. In Proceedings of the 2003 Meeting of the North American chapter of the Association for Computational Linguistics (NAACL-03), pages 127-133, Edmonton, Alberta.
-
(2003)
Proceedings of the 2003 Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL-03)
, pp. 127-133
-
-
Koehn, P.1
Och, F.J.2
Marcu, D.3
-
21
-
-
84926203469
-
Continuous space translation models with neural networks
-
Montreal, Canada, June
-
Hai Son Le, Alexandre Allauzen, and François Yvon. 2012. Continuous Space Translation Models with Neural Networks. In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 39-48, Montreal, Canada, June.
-
(2012)
Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
, pp. 39-48
-
-
Le, H.S.1
Allauzen, A.2
Yvon, F.3
-
22
-
-
84905240726
-
Efficient lattice rescoring using recurrent neural network language models
-
IEEE
-
Xunying Liu, Yongqiang Wang, Xie Chen, Mark J. F. Gales, and Phil C. Woodland. 2014. Efficient lattice rescoring using recurrent neural network language models. In Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pages 4941-4945. IEEE.
-
(2014)
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
, pp. 4941-4945
-
-
Liu, X.1
Wang, Y.2
Chen, X.3
Gales, M.J.F.4
Woodland, P.C.5
-
23
-
-
84906237242
-
Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding
-
Grégoire Mesnil, Xiaodong He, Li Deng, and Yoshua Bengio. 2013. Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding. In Interspeech, pages 3771-3775.
-
(2013)
Interspeech
, pp. 3771-3775
-
-
Mesnil, G.1
He, X.2
Deng, L.3
Bengio, Y.4
-
24
-
-
80051643236
-
Extensions of recurrent neural network language model
-
IEEE
-
Tomas Mikolov, Stefan Kombrink, Lukas Burget, JH Cernocky, and Sanjeev Khudanpur. 2011. Extensions of recurrent neural network language model. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pages 5528-5531. IEEE.
-
(2011)
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
, pp. 5528-5531
-
-
Mikolov, T.1
Kombrink, S.2
Burget, L.3
Cernocky, J.H.4
Khudanpur, S.5
-
25
-
-
84859981825
-
Intelligent selection of language model training data
-
Short Papers, Uppsala, Sweden, July
-
Robert C. Moore and William Lewis. 2010. Intelligent Selection of Language Model Training Data. In ACL (Short Papers), pages 220-224, Uppsala, Sweden, July.
-
(2010)
ACL
, pp. 220-224
-
-
Moore, R.C.1
Lewis, W.2
-
28
-
-
85133336275
-
Bleu: A method for automatic evaluation of machine translation
-
Philadelphia, Pennsylvania, USA, July
-
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. Bleu: A Method for Automatic Evaluation of Machine Translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 311-318, Philadelphia, Pennsylvania, USA, July.
-
(2002)
Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.-J.4
-
29
-
-
0000646059
-
Learning internal representations by error propagation
-
J. L. McClelland, D. E. Rumelhart, and The PDP Research Group:, The MIT Press
-
David E. Rumelhart, Geoffrey E. Hinton, and Ronald J. Williams. 1986. Learning Internal Representations by Error Propagation. In: J. L. McClelland, D. E. Rumelhart, and The PDP Research Group: "Parallel Distributed Processing, Volume 1: Foundations". The MIT Press.
-
(1986)
Parallel Distributed Processing, Volume 1: Foundations
-
-
Rumelhart, D.E.1
Hinton, G.E.2
Williams, R.J.3
-
31
-
-
85075873581
-
Continuous space language models for statistical machine translation
-
Sydney, Australia, July
-
Holger Schwenk, Daniel Déchelotte, and Jean-Luc Gauvain. 2006. Continuous Space Language Models for Statistical Machine Translation. In Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, pages 723-730, Sydney, Australia, July.
-
(2006)
Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions
, pp. 723-730
-
-
Schwenk, H.1
Déchelotte, D.2
Gauvain, J.-L.3
-
32
-
-
84905273821
-
Continuous space translation models for phrase-based statistical machine translation
-
Mumbai, India, December
-
Holger Schwenk. 2012. Continuous Space Translation Models for Phrase-Based Statistical Machine Translation. In 25th International Conference on Computational Linguistics (COLING), pages 1071-1080, Mumbai, India, December.
-
(2012)
25th International Conference on Computational Linguistics (COLING)
, pp. 1071-1080
-
-
Schwenk, H.1
-
33
-
-
84857522507
-
A study of translation edit rate with targeted human annotation
-
Cambridge, Massachusetts, USA, August
-
Matthew Snover, Bonnie Dorr, Richard Schwartz, Linnea Micciulla, and John Makhoul. 2006. A Study of Translation Edit Rate with Targeted Human Annotation. In Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, pages 223-231, Cambridge, Massachusetts, USA, August.
-
(2006)
Proceedings of the 7th Conference of the Association for Machine Translation in the Americas
, pp. 223-231
-
-
Snover, M.1
Dorr, B.2
Schwartz, R.3
Micciulla, L.4
Makhoul, J.5
-
34
-
-
84891308106
-
SRILM - An extensible language modeling toolkit
-
Denver, CO, September
-
Andreas Stolcke. 2002. SRILM - An Extensible Language Modeling Toolkit. In Proc. of the Int. Conf. on Speech and Language Processing (ICSLP), volume 2, pages 901-904, Denver, CO, September.
-
(2002)
Proc. of the Int. Conf. on Speech and Language Processing (ICSLP)
, vol.2
, pp. 901-904
-
-
Stolcke, A.1
-
35
-
-
84878402147
-
LSTM neural networks for language modeling
-
Portland, OR, USA, September
-
Martin Sundermeyer, Ralf Schlüter, and Hermann Ney. 2012. LSTM neural networks for language modeling. In Interspeech, Portland, OR, USA, September.
-
(2012)
Interspeech
-
-
Sundermeyer, M.1
Schlüter, R.2
Ney, H.3
-
36
-
-
84890480734
-
Comparison of feedforward and recurrent neural network language models
-
Vancouver, Canada, May
-
Martin Sundermeyer, Ilya Oparin, Jean-Luc Gauvain, Ben Freiberg, Ralf Schlüter, and Hermann Ney. 2013. Comparison of feedforward and recurrent neural network language models. In IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 8430-8434, Vancouver, Canada, May.
-
(2013)
IEEE International Conference on Acoustics, Speech, and Signal Processing
, pp. 8430-8434
-
-
Sundermeyer, M.1
Oparin, I.2
Gauvain, J.-L.3
Freiberg, B.4
Schlüter, R.5
Ney, H.6
-
37
-
-
84926298172
-
Decoding with large-scale neural language models improves translation
-
Seattle, Washington, USA, October. Association for Computational Linguistics
-
Ashish Vaswani, Yinggong Zhao, Victoria Fossum, and David Chiang. 2013. Decoding with large-scale neural language models improves translation. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 1387-1392, Seattle, Washington, USA, October. Association for Computational Linguistics.
-
(2013)
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
, pp. 1387-1392
-
-
Vaswani, A.1
Zhao, Y.2
Fossum, V.3
Chiang, D.4
-
38
-
-
84857512571
-
Jane: Open source hierarchical translation, extended with reordering and lexicon models
-
Uppsala, Sweden, July
-
David Vilar, Daniel Stein, Matthias Huck, and Hermann Ney. 2010. Jane: Open source hierarchical translation, extended with reordering and lexicon models. In ACL 2010 Joint Fifth Workshop on Statistical Machine Translation and Metrics MATR, pages 262-270, Uppsala, Sweden, July.
-
(2010)
ACL 2010 Joint Fifth Workshop on Statistical Machine Translation and Metrics MATR
, pp. 262-270
-
-
Vilar, D.1
Stein, D.2
Huck, M.3
Ney, H.4
-
39
-
-
0025503558
-
Backpropagation through time: What it does and how to do it
-
Paul J. Werbos. 1990. Backpropagation through time: what it does and how to do it. Proceedings of the IEEE, 78(10):1550-1560.
-
(1990)
Proceedings of the IEEE
, vol.78
, Issue.10
, pp. 1550-1560
-
-
Werbos, P.J.1
-
40
-
-
0001765578
-
Gradient-based learning algorithms for recurrent networks and their computational complexity
-
Yves Chauvin and David E. Rumelhart, Lawrence Erlbaum Publishers
-
Ronald J. Williams and David Zipser. 1995. Gradient-Based Learning Algorithms for Recurrent Networks and Their Computational Complexity. In: Yves Chauvin and David E. Rumelhart: "Back-Propagation: Theory, Architectures and Applications". Lawrence Erlbaum Publishers.
-
(1995)
Back-Propagation: Theory, Architectures and Applications
-
-
Williams, R.J.1
Zipser, D.2
-
41
-
-
80053239127
-
Training phrase translation models with leaving-one-out
-
Uppsala, Sweden, July
-
Joern Wuebker, Arne Mauser, and Hermann Ney. 2010. Training phrase translation models with leaving-one-out. In Proceedings of the 48th Annual Meeting of the Assoc. for Computational Linguistics, pages 475-484, Uppsala, Sweden, July.
-
(2010)
Proceedings of the 48th Annual Meeting of the Assoc. for Computational Linguistics
, pp. 475-484
-
-
Wuebker, J.1
Mauser, A.2
Ney, H.3
-
42
-
-
84906932758
-
Jane 2: Open source phrase-based and hierarchical statistical machine translation
-
Mumbai, India, December
-
Joern Wuebker, Matthias Huck, Stephan Peitz, Malte Nuhn, Markus Freitag, Jan-Thorsten Peter, Saab Mansour, and Hermann Ney. 2012. Jane 2: Open source phrase-based and hierarchical statistical machine translation. In International Conference on Computational Linguistics, pages 483-491, Mumbai, India, December.
-
(2012)
International Conference on Computational Linguistics
, pp. 483-491
-
-
Wuebker, J.1
Huck, M.2
Peitz, S.3
Nuhn, M.4
Freitag, M.5
Peter, J.-T.6
Mansour, S.7
Ney, H.8
-
43
-
-
84921412917
-
Improving statistical machine translation with word class models
-
Seattle, USA, October
-
Joern Wuebker, Stephan Peitz, Felix Rietig, and Hermann Ney. 2013. Improving statistical machine translation with word class models. In Conference on Empirical Methods in Natural Language Processing, pages 1377-1381, Seattle, USA, October.
-
(2013)
Conference on Empirical Methods in Natural Language Processing
, pp. 1377-1381
-
-
Wuebker, J.1
Peitz, S.2
Rietig, F.3
Ney, H.4