-
1
-
-
84872577736
-
Practical recommendations for gradient-based training of deep architectures
-
Springer
-
Yoshua Bengio. 2012. Practical recommendations for gradient-based training of deep architectures. In Neural Networks: Tricks of the Trade, pages 437-478. Springer.
-
(2012)
Neural Networks: Tricks of the Trade
, pp. 437-478
-
-
Bengio, Y.1
-
5
-
-
0031189914
-
Multitask learning
-
Rich Caruana. 1997. Multitask learning. Machine Learning, 28(1):41-75.
-
(1997)
Machine Learning
, vol.28
, Issue.1
, pp. 41-75
-
-
Caruana, R.1
-
6
-
-
84965158187
-
A recurrent latent variable model for sequential data
-
Montréal
-
Junyoung Chung, Kyle Kastner, Laurent Dinh, Kratarth Goel, Aaron Courville, and Yoshua Bengio. 2015. A recurrent latent variable model for sequential data. In Neural Information Processing Systems (NIPS), Montréal.
-
(2015)
Neural Information Processing Systems (NIPS)
-
-
Chung, J.1
Kastner, K.2
Dinh, L.3
Goel, K.4
Courville, A.5
Bengio, Y.6
-
7
-
-
80053558787
-
Natural language processing (almost) from scratch
-
R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu, and P. Kuksa. 2011. Natural language processing (almost) from scratch. Journal of Machine Learning Research, 12:2493-2537.
-
(2011)
Journal of Machine Learning Research
, vol.12
, pp. 2493-2537
-
-
Collobert, R.1
Weston, J.2
Bottou, L.3
Karlen, M.4
Kavukcuoglu, K.5
Kuksa, P.6
-
8
-
-
0000979403
-
Sequential Monte Carlo methods to train neural network models
-
João FG de Freitas, Mahesan Niranjan, Andrew H. Gee, and Arnaud Doucet. Sequential Monte Carlo methods to train neural network models. Neural Computation, 12(4):955-993.
-
Neural Computation
, vol.12
, Issue.4
, pp. 955-993
-
-
De Freitas, J.F.G.1
Niranjan, M.2
Gee, A.H.3
Doucet, A.4
-
9
-
-
80052250414
-
Adaptive subgradient methods for online learning and stochastic optimization
-
John Duchi, Elad Hazan, and Yoram Singer. 2011. Adaptive subgradient methods for online learning and stochastic optimization. The Journal of Machine Learning Research, 12:2121-2159.
-
(2011)
The Journal of Machine Learning Research
, vol.12
, pp. 2121-2159
-
-
Duchi, J.1
Hazan, E.2
Singer, Y.3
-
10
-
-
84943742882
-
Transition-based dependency parsing with stack long short-term memory
-
Beijing, China
-
Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. In Proceedings of the Association for Computational Linguistics (ACL), pages 334-343, Beijing, China.
-
(2015)
Proceedings of the Association for Computational Linguistics (ACL)
, pp. 334-343
-
-
Dyer, C.1
Ballesteros, M.2
Ling, W.3
Matthews, A.4
Smith, N.A.5
-
12
-
-
85016587886
-
Switchboard: Telephone speech corpus for research and development
-
IEEE
-
John J Godfrey, Edward C Holliman, and Jane McDaniel. 1992. Switchboard: Telephone speech corpus for research and development. In ICASSP, volume 1, pages 517-520. IEEE.
-
(1992)
ICASSP
, vol.1
, pp. 517-520
-
-
Godfrey, J.J.1
Holliman, E.C.2
McDaniel, J.3
-
17
-
-
84994157341
-
Document context language models
-
Poster Paper, volume
-
Yangfeng Ji, Trevor Cohn, Lingpeng Kong, Chris Dyer, and Jacob Eisenstein. 2015. Document context language models. In International Conference on Learning Representations, Poster Paper, volume abs/1511.03962.
-
(2015)
International Conference on Learning Representations
-
-
Ji, Y.1
Cohn, T.2
Kong, L.3
Dyer, C.4
Eisenstein, J.5
-
24
-
-
84959935599
-
Hierarchical recurrent neural network for document modeling
-
Lisbon, September
-
Rui Lin, Shujie Liu, Muyun Yang, Mu Li, Ming Zhou, and Sheng Li. 2015. Hierarchical recurrent neural network for document modeling. In Proceedings of Empirical Methods for Natural Language Processing (EMNLP), pages 899-907, Lisbon, September.
-
(2015)
Proceedings of Empirical Methods for Natural Language Processing (EMNLP)
, pp. 899-907
-
-
Lin, R.1
Liu, S.2
Yang, M.3
Li, M.4
Zhou, M.5
Li, S.6
-
25
-
-
84953734134
-
Computational linguistics and deep learning
-
Christopher D. Manning. 2016. Computational linguistics and deep learning. Computational Linguistics, 41(4).
-
(2016)
Computational Linguistics
, vol.41
, Issue.4
-
-
Manning, C.D.1
-
27
-
-
79959829092
-
Recurrent neural network based language model
-
Tomas Mikolov, Martin Karafiát, Lukas Burget, Jan Černocký, and Sanjeev Khudanpur. 2010. Recurrent neural network based language model. In INTERSPEECH, pages 1045-1048.
-
(2010)
INTERSPEECH
, pp. 1045-1048
-
-
Mikolov, T.1
Karafiát, M.2
Burget, L.3
Černocký, J.4
Khudanpur, S.5
-
29
-
-
84987948019
-
Improving implicit discourse relation recognition through feature set optimization
-
Seoul, South Korea, July. Association for Computational Linguistics
-
Joonsuk Park and Claire Cardie. Improving implicit discourse relation recognition through feature set optimization. In Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 108-112, Seoul, South Korea, July. Association for Computational Linguistics.
-
Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue
, pp. 108-112
-
-
Park, J.1
Cardie, C.2
-
31
-
-
84942258824
-
Dropout improves recurrent neural networks for handwriting recognition
-
IEEE
-
Vu Pham, Théodore Bluche, Christopher Kermorvant, and Jérôme Louradour. 2014. Dropout improves recurrent neural networks for handwriting recognition. In Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on, pages 285-290. IEEE.
-
(2014)
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
, pp. 285-290
-
-
Pham, V.1
Bluche, T.2
Kermorvant, C.3
Louradour, J.4
-
32
-
-
79959960319
-
Easily identifiable discourse relations
-
Manchester, UK
-
Emily Pitler, Mridhula Raghupathy, Hena Mehta, Ani Nenkova, Alan Lee, and Aravind Joshi. 2008. Easily identifiable discourse relations. In Proceedings of the International Conference on Computational Linguistics (COLING), pages 87-90, Manchester, UK.
-
(2008)
Proceedings of the International Conference on Computational Linguistics (COLING)
, pp. 87-90
-
-
Pitler, E.1
Raghupathy, M.2
Mehta, H.3
Nenkova, A.4
Lee, A.5
Joshi, A.6
-
34
-
-
85037060758
-
The Penn Discourse Treebank 2.0
-
Rashmi Prasad, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind Joshi, and Bonnie Webber. 2008. The Penn Discourse Treebank 2.0. In Proceedings of LREC.
-
(2008)
Proceedings of LREC
-
-
Prasad, R.1
Dinesh, N.2
Lee, A.3
Miltsakaki, E.4
Robaldo, L.5
Joshi, A.6
Webber, B.7
-
37
-
-
78650192154
-
Analysis of discourse structure with syntactic dependencies and data-driven shift-reduce parsing
-
Paris, France, October. Association for Computational Linguistics
-
Kenji Sagae. 2009. Analysis of discourse structure with syntactic dependencies and data-driven shift-reduce parsing. In Proceedings of the 11th International Conference on Parsing Technologies (IWPT'09), pages 81-84, Paris, France, October. Association for Computational Linguistics.
-
(2009)
Proceedings of the 11th International Conference on Parsing Technologies (IWPT'09)
, pp. 81-84
-
-
Sagae, K.1
-
38
-
-
84926358845
-
Recursive deep models for semantic compositionality over a sentiment treebank
-
Seattle, WA
-
Richard Socher, Alex Perelygin, Jean Y Wu, Jason Chuang, Christopher D Manning, Andrew Y Ng, and Christopher Potts. 2013. Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of Empirical Methods for Natural Language Processing (EMNLP), Seattle, WA.
-
(2013)
Proceedings of Empirical Methods for Natural Language Processing (EMNLP)
-
-
Socher, R.1
Perelygin, A.2
Wu, J.Y.3
Chuang, J.4
Manning, C.D.5
Ng, A.Y.6
Potts, C.7
-
39
-
-
84960121226
-
A neural network approach to context-sensitive generation of conversational responses
-
Denver, CO, May
-
Alessandro Sordoni, Michel Galley, Michael Auli, Chris Brockett, Yangfeng Ji, Meg Mitchell, Jian-Yun Nie, Jianfeng Gao, and Bill Dolan. 2015. A neural network approach to context-sensitive generation of conversational responses. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL), Denver, CO, May.
-
(2015)
Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL)
-
-
Sordoni, A.1
Galley, M.2
Auli, M.3
Brockett, C.4
Ji, Y.5
Mitchell, M.6
Nie, J.-Y.7
Gao, J.8
Dolan, B.9
-
40
-
-
84904163933
-
Dropout: A simple way to prevent neural networks from overfitting
-
Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research, 15(1):1929-1958.
-
(2014)
The Journal of Machine Learning Research
, vol.15
, Issue.1
, pp. 1929-1958
-
-
Srivastava, N.1
Hinton, G.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
41
-
-
0000023031
-
Dialogue act modeling for automatic tagging and recognition of conversational speech
-
Andreas Stolcke, Klaus Ries, Noah Coccaro, Elizabeth Shriberg, Rebecca Bates, Daniel Jurafsky, Paul Taylor, Rachel Martin, Carol Van Ess-Dykema, and Marie Meteer. 2000. Dialogue act modeling for automatic tagging and recognition of conversational speech. Computational Linguistics, 26(3):339-373.
-
(2000)
Computational Linguistics
, vol.26
, Issue.3
, pp. 339-373
-
-
Stolcke, A.1
Ries, K.2
Coccaro, N.3
Shriberg, E.4
Bates, R.5
Jurafsky, D.6
Taylor, P.7
Martin, R.8
Van Ess-Dykema, C.9
Meteer, M.10
-
46
-
-
33750703175
-
Partially observable Markov decision processes for spoken dialog systems
-
Jason D Williams and Steve Young. 2007. Partially observable Markov decision processes for spoken dialog systems. Computer Speech & Language, 21(2):393-422.
-
(2007)
Computer Speech & Language
, vol.21
, Issue.2
, pp. 393-422
-
-
Williams, J.D.1
Young, S.2
-
47
-
-
85072755120
-
The CoNLL-2015 shared task on shallow discourse parsing
-
Nianwen Xue, Hwee Tou Ng, Sameer Pradhan, Rashmi Prasad, Christopher Bryant, and Attapol T Rutherford. 2015. The CoNLL-2015 shared task on shallow discourse parsing. In Proceedings of the Conference on Natural Language Learning (CoNLL).
-
(2015)
Proceedings of the Conference on Natural Language Learning (CoNLL)
-
-
Xue, N.1
Ng, H.T.2
Pradhan, S.3
Prasad, R.4
Bryant, C.5
Rutherford, A.T.6
-
48
-
-
84959891977
-
Shallow convolutional neural network for implicit discourse relation recognition
-
Lisbon, September
-
Biao Zhang, Jinsong Su, Deyi Xiong, Yaojie Lu, Hong Duan, and Junfeng Yao. 2015. Shallow convolutional neural network for implicit discourse relation recognition. In Proceedings of Empirical Methods for Natural Language Processing (EMNLP), pages 2230-2235, Lisbon, September.
-
(2015)
Proceedings of Empirical Methods for Natural Language Processing (EMNLP)
, pp. 2230-2235
-
-
Zhang, B.1
Su, J.2
Xiong, D.3
Lu, Y.4
Duan, H.5
Yao, J.6