-
1
-
-
33845260073
-
Neural probabilistic language models
-
Yoshua Bengio, Holger Schwenk, Jean-Sébastien Senécal, Fréderic Morin, and Jean-Luc Gauvain. 2006. Neural probabilistic language models. Innovations in Machine Learning, pages 137-186.
-
(2006)
Innovations in Machine Learning
, pp. 137-186
-
-
Bengio, Y.1
Schwenk, H.2
Senécal, J.-S.3
Morin, F.4
Gauvain, J.5
-
2
-
-
84864073449
-
Greedy layer-wise training of deep networks
-
Yoshua Bengio, Pascal Lamblin, Dan Popovici, and HugoLarochelle. 2007. Greedy layer-wise training of deep networks. Advances in neural information processing systems, 19:153.
-
(2007)
Advances in Neural Information Processing Systems
, vol.19
, pp. 153
-
-
Bengio, Y.1
Lamblin, P.2
Popovici, D.3
Larochelle, H.4
-
4
-
-
0002652285
-
A maximum entropy approach to natural language processing
-
March
-
Adam L. Berger, Vincent J. Della Pietra, and Stephen A. Della Pietra. 1996. A maximum entropy approach to natural language processing. Comput. Linguist, 22(1):39-71, March.
-
(1996)
Comput. Linguist
, vol.22
, Issue.1
, pp. 39-71
-
-
Berger, A.L.1
Della Pietra, V.J.2
Della Pietra, S.A.3
-
6
-
-
85044611587
-
The mathematics of statistical machine translation: Parameter estimation
-
Peter F Brown, Vincent J Della Pietra, Stephen A Della Pietra, and Robert L Mercer. 1993. The mathematics of statistical machine translation: Parameter estimation. Computational linguistics, 19(2):263-311.
-
(1993)
Computational Linguistics
, vol.19
, Issue.2
, pp. 263-311
-
-
Brown, P.F.1
Della Pietra, V.J.2
Della Pietra, S.A.3
Mercer, R.L.4
-
7
-
-
34347360650
-
Hierarchical phrase-based translation
-
David Chiang. 2007. Hierarchical phrase-based translation. computational linguistics, 33(2):201-228.
-
(2007)
Computational Linguistics
, vol.33
, Issue.2
, pp. 201-228
-
-
Chiang, D.1
-
8
-
-
80053558787
-
Natural language processing (almost) from scratch
-
Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. 2011. Natural language processing (almost) from scratch. The Journal of Machine Learning Research, 12:2493-2537.
-
(2011)
The Journal of Machine Learning Research
, vol.12
, pp. 2493-2537
-
-
Collobert, R.1
Weston, J.2
Bottou, L.3
Karlen, M.4
Kavukcuoglu, K.5
Kuksa, P.6
-
9
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
George E Dahl, Dong Yu, Li Deng, and Alex Acero. 2012. Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 20(1):30-42.
-
(2012)
Audio, Speech, and Language Processing, IEEE Transactions On
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
10
-
-
84859032829
-
Modelbased aligner combination using dual decomposition
-
John DeNero and Klaus Macherey. 2011. Modelbased aligner combination using dual decomposition. In Proc. ACL.
-
(2011)
Proc. ACL
-
-
DeNero, J.1
Macherey, K.2
-
12
-
-
84883191590
-
Better word alignments with supervised itg models
-
Association for Computational Linguistics
-
Aria Haghighi, John Blitzer, John DeNero, and Dan Klein. 2009. Better word alignments with supervised itg models. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2-Volume 2, pages 923-931. Association for Computational Linguistics.
-
(2009)
Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2
, vol.2
, pp. 923-931
-
-
Haghighi, A.1
Blitzer, J.2
DeNero, J.3
Klein, D.4
-
13
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
Geoffrey E Hinton, Simon Osindero, and Yee-Whye Teh. 2006. A fast learning algorithm for deep belief nets. Neural computation, 18(7):1527-1554.
-
(2006)
Neural Computation
, vol.18
, Issue.7
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.3
-
14
-
-
85162460675
-
Learning convolutional feature hierarchies for visual recognition
-
Koray Kavukcuoglu, Pierre Sermanet, Y-Lan Boureau, Karol Gregor, Michaël Mathieu, and Yann LeCun. 2010. Learning convolutional feature hierarchies for visual recognition. Advances in Neural Information Processing Systems, pages 1090-1098.
-
(2010)
Advances in Neural Information Processing Systems
, pp. 1090-1098
-
-
Kavukcuoglu, K.1
Sermanet, P.2
Boureau, Y.-L.3
Gregor, K.4
Mathieu, M.5
LeCun, Y.6
-
17
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. 1998. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
18
-
-
0000134812
-
A learning scheme for asymmetric threshold networks
-
Yann LeCun. 1985. A learning scheme for asymmetric threshold networks. Proceedings of Cognitiva, 85:599-604.
-
(1985)
Proceedings of Cognitiva
, vol.85
, pp. 599-604
-
-
LeCun, Y.1
-
19
-
-
84864036295
-
Efficient sparse coding algorithms
-
Honglak Lee, Alexis Battle, Rajat Raina, and Andrew Y Ng. 2007. Efficient sparse coding algorithms. Advances in neural information processing systems, 19:801.
-
(2007)
Advances in Neural Information Processing Systems
, vol.19
, pp. 801
-
-
Lee, H.1
Battle, A.2
Raina, R.3
Ng, A.Y.4
-
20
-
-
84859013607
-
Discriminative pruning for discriminative itg alignment
-
Shujie Liu, Chi-Ho Li, and Ming Zhou. 2010. Discriminative pruning for discriminative itg alignment. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL, volume 10, pages 316-324.
-
(2010)
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL
, vol.10
, pp. 316-324
-
-
Liu, S.1
Li, C.2
Zhou, M.3
-
25
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
Frank Seide, Gang Li, and Dong Yu. 2011. Conversational speech transcription using context-dependent deep neural networks. In Proc. Interspeech, pages 437-440.
-
(2011)
Proc. Interspeech
, pp. 437-440
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
27
-
-
80053438267
-
Parsing natural scenes and natural language with recursive neural networks
-
Richard Socher, Cliff C Lin, Andrew Y Ng, and Christopher D Manning. 2011. Parsing natural scenes and natural language with recursive neural networks. In Proceedings of the 26th International Conference on Machine Learning (ICML), volume 2, page 7.
-
(2011)
Proceedings of the 26th International Conference on Machine Learning (ICML)
, vol.2
, pp. 7
-
-
Socher, R.1
Lin, C.C.2
Ng, A.Y.3
Manning, C.D.4
-
31
-
-
84862285546
-
Word representations: A simple and general method for semi-supervised learning
-
Joseph Turian, Lev Ratinov, and Yoshua Bengio. 2010. Word representations: a simple and general method for semi-supervised learning. Urbana, 51:61801.
-
(2010)
Urbana
, vol.51
, pp. 61801
-
-
Turian, J.1
Ratinov, L.2
Bengio, Y.3
-
32
-
-
0004339720
-
Hmm-based word alignment in statistical translation
-
Association for Computational Linguistics
-
Stephan Vogel, Hermann Ney, and Christoph Tillmann. 1996. Hmm-based word alignment in statistical translation. In Proceedings of the 16th conference on Computational linguistics-Volume 2, pages 836-841. Association for Computational Linguistics.
-
(1996)
Proceedings of the 16th Conference on Computational linguistics
, vol.2
, pp. 836-841
-
-
Vogel, S.1
Ney, H.2
Tillmann, C.3
-
33
-
-
0000319590
-
Stochastic inversion transduction grammars and bilingual parsing of parallel corpora
-
Dekai Wu. 1997. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational linguistics, 23(3):377-403.
-
(1997)
Computational Linguistics
, vol.23
, Issue.3
, pp. 377-403
-
-
Wu, D.1
|