-
2
-
-
80051494649
-
Reinforcement learning for mapping instructions to actions
-
August
-
S. R. K. Branavan, H. Chen, L. Zettlemoyer, and R. Barzilay. 2009. Reinforcement learning for mapping instructions to actions. In Proc. of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th IJCNLP, pages 82-90, August.
-
(2009)
Proc. of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th IJCNLP
, pp. 82-90
-
-
Branavan, S.R.K.1
Chen, H.2
Zettlemoyer, L.3
Barzilay, R.4
-
4
-
-
56449095373
-
A unified architecture for natural language processing: Deep neural networks with multitask learning
-
ACM
-
R. Collobert and J. Weston. 2008. A unified architecture for natural language processing: Deep neural networks with multitask learning. In Proc. of the 25th International Conference on Machine learning, pages 160-167. ACM.
-
(2008)
Proc. of the 25th International Conference on Machine Learning
, pp. 160-167
-
-
Collobert, R.1
Weston, J.2
-
5
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
G. E. Dahl, D. Yu, L. Deng, and A. Acero. 2012. Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 20(1):30-42.
-
(2012)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
6
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. E. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath, and B. Kingsbury. 2012. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal Process. Mag., 29(6):82-97.
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
Kingsbury, B.11
-
7
-
-
84889566627
-
Learning deep structured semantic models for web search using clickthrough data
-
ACM
-
P-S. Huang, X. He, J. Gao, L. Deng, A. Acero, and L. Heck. 2013. Learning deep structured semantic models for web search using clickthrough data. In Proc. of the ACM International Conference on Information & Knowledge Management, pages 2333-2338. ACM.
-
(2013)
Proc. of the ACM International Conference on Information & Knowledge Management
, pp. 2333-2338
-
-
Huang, P.-S.1
He, X.2
Gao, J.3
Deng, L.4
Acero, A.5
Heck, L.6
-
8
-
-
84965153327
-
Skipthought vectors
-
R. Kiros, Y. Zhu, R. R. Salakhutdinov, R. Zemel, R. Urtasun, A. Torralba, and S. Fidler. 2015. Skipthought vectors. In Advances in Neural Information Processing Systems, pages 3276-3284.
-
(2015)
Advances in Neural Information Processing Systems
, pp. 3276-3284
-
-
Kiros, R.1
Zhu, Y.2
Salakhutdinov, R.R.3
Zemel, R.4
Urtasun, R.5
Torralba, A.6
Fidler, S.7
-
11
-
-
84930630277
-
Deep learning
-
Y. LeCun, Y. Bengio, and G. Hinton. 2015. Deep learning. Nature, 521(7553):436-444.
-
(2015)
Nature
, vol.521
, Issue.7553
, pp. 436-444
-
-
LeCun, Y.1
Bengio, Y.2
Hinton, G.3
-
13
-
-
85083953657
-
Continuous control with deep reinforcement learning
-
T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra. 2016. Continuous control with deep reinforcement learning. In International Conference on Learning Representations.
-
(2016)
International Conference on Learning Representations
-
-
Lillicrap, T.P.1
Hunt, J.J.2
Pritzel, A.3
Heess, N.4
Erez, T.5
Tassa, Y.6
Silver, D.7
Wierstra, D.8
-
16
-
-
85011904626
-
-
NIPS Deep Learning Workshop, December
-
V. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, and M. Riedmiller. 2013. Playing Atari with Deep Reinforcement Learning. NIPS Deep Learning Workshop, December.
-
(2013)
Playing Atari with Deep Reinforcement Learning
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Graves, A.4
Antonoglou, I.5
Wierstra, D.6
Riedmiller, M.7
-
17
-
-
84924051598
-
Humanlevel control through deep reinforcement learning
-
V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, et al. 2015. Humanlevel control through deep reinforcement learning. Nature, 518(7540):529-533.
-
(2015)
Nature
, vol.518
, Issue.7540
, pp. 529-533
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Rusu, A.A.4
Veness, J.5
Bellemare, M.G.6
Graves, A.7
Riedmiller, M.8
Fidjeland, A.K.9
Ostrovski, G.10
-
23
-
-
0029276036
-
Temporal difference learning and td-gammon
-
G. Tesauro. 1995. Temporal difference learning and td-gammon. Communications of the ACM, 38(3):58-68.
-
(1995)
Communications of the ACM
, vol.38
, Issue.3
, pp. 58-68
-
-
Tesauro, G.1
-
25
-
-
84876682878
-
Pomdp-based statistical spoken dialog systems: A review
-
S. Young, M. Gasic, B. Thomson, and J. D. Williams. 2013. Pomdp-based statistical spoken dialog systems: A review. Proceedings of the IEEE, 101(5):1160-1179.
-
(2013)
Proceedings of the IEEE
, vol.101
, Issue.5
, pp. 1160-1179
-
-
Young, S.1
Gasic, M.2
Thomson, B.3
Williams, J.D.4
|