-
2
-
-
85153940465
-
Generalization in reinforcement learning: Safely approximating the value function
-
J. A. Boyan and A. W. Moore. 1995. Generalization in reinforcement learning: Safely approximating the value function. In Advances in NIPS, pages 369-376.
-
(1995)
Advances in NIPS
, pp. 369-376
-
-
Boyan, J.A.1
Moore, A.W.2
-
4
-
-
0000595242
-
Note on learning rate schedules for stochastic optimization
-
Christian Darken and John Moody. 1990. Note on learning rate schedules for stochastic optimization. In Advances in NIPS, pages 832-838.
-
(1990)
Advances in NIPS
, pp. 832-838
-
-
Darken, C.1
Moody, J.2
-
5
-
-
84859963400
-
On the interpretation of natural language instructions
-
Barbara Di Eugenio and Michael White. 1992. On the interpretation of natural language instructions. In Proceedings of COLING, pages 1147-1151.
-
(1992)
Proceedings of COLING
, pp. 1147-1151
-
-
Di Eugenio, B.1
White, M.2
-
6
-
-
0005000120
-
Understanding natural language instructions: The case of purpose clauses
-
Barbara Di Eugenio. 1992. Understanding natural language instructions: the case of purpose clauses. In Proceedings of ACL, pages 120-127.
-
(1992)
Proceedings of ACL
, pp. 120-127
-
-
Di Eugenio, B.1
-
8
-
-
84862272102
-
Intentional context in situated natural language learning
-
Michael Fleischman and Deb Roy. 2005. Intentional context in situated natural language learning. In Proceedings of CoNLL, pages 104-111.
-
(2005)
Proceedings of CoNLL
, pp. 104-111
-
-
Fleischman, M.1
Roy, D.2
-
9
-
-
60349084848
-
Model-based function approximation in reinforcement learning
-
Nicholas K. Jong and Peter Stone. 2007. Model-based function approximation in reinforcement learning. In Proceedings of AAMAS, pages 670-677.
-
(2007)
Proceedings of AAMAS
, pp. 670-677
-
-
Jong, N.K.1
Stone, P.2
-
10
-
-
84859996318
-
Wikido
-
Nate Kushman, Micah Brodsky, S.R.K. Branavan, Dina Katabi, Regina Barzilay, and Martin Rinard. 2009. Wikido. In Proceedings of HotNets-VIII.
-
(2009)
Proceedings of HotNets-VIII
-
-
Kushman, N.1
Brodsky, M.2
Branavan, S.R.K.3
Katabi, D.4
Barzilay, R.5
Rinard, M.6
-
12
-
-
70450125031
-
User simulations for context-sensitive speech recognition in spoken dialogue systems
-
Oliver Lemon and Ioannis Konstas. 2009. User simulations for context-sensitive speech recognition in spoken dialogue systems. In Proceedings of EACL, pages 505-513.
-
(2009)
Proceedings of EACL
, pp. 505-513
-
-
Lemon, O.1
Konstas, I.2
-
13
-
-
80053391310
-
Learning semantic correspondences with less supervision
-
Percy Liang, Michael I. Jordan, and Dan Klein. 2009. Learning semantic correspondences with less supervision. In Proceedings of ACL, pages 91-99.
-
(2009)
Proceedings of ACL
, pp. 91-99
-
-
Liang, P.1
Jordan, M.I.2
Klein, D.3
-
14
-
-
36348992104
-
Walk the talk: Connecting language, knowledge, and action in route instructions
-
Matt MacMahon, Brian Stankiewicz, and Benjamin Kuipers. 2006. Walk the talk: connecting language, knowledge, and action in route instructions. In Proceedings of AAAI, pages 1475-1482.
-
(2006)
Proceedings of AAAI
, pp. 1475-1482
-
-
MacMahon, M.1
Stankiewicz, B.2
Kuipers, B.3
-
16
-
-
57749107432
-
Learning to connect language and perception
-
Raymond J. Mooney. 2008. Learning to connect language and perception. In Proceedings of AAAI, pages 1598-1601.
-
(2008)
Proceedings of AAAI
, pp. 1598-1601
-
-
Mooney, R.J.1
-
20
-
-
0037841376
-
Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system
-
Satinder Singh, Diane Litman, Michael Kearns, and Marilyn Walker. 2002. Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system. Journal of Artificial Intelligence Research, 16:105-133.
-
(2002)
Journal of Artificial Intelligence Research
, vol.16
, pp. 105-133
-
-
Singh, S.1
Litman, D.2
Kearns, M.3
Walker, M.4
-
21
-
-
0002480013
-
Grounding the lexical semantics of verbs in visual perception using force dynamics and event logic
-
Jeffrey Mark Siskind. 2001. Grounding the lexical semantics of verbs in visual perception using force dynamics and event logic. Journal of Artificial Intelligence Research, 15:31-90.
-
(2001)
Journal of Artificial Intelligence Research
, vol.15
, pp. 31-90
-
-
Siskind, J.M.1
-
23
-
-
84898939480
-
Policy gradient methods for reinforcement learning with function approximation
-
Richard S. Sutton, David McAllester, Satinder Singh, and Yishay Mansour. 2000. Policy gradient methods for reinforcement learning with function approximation. In Advances in NIPS, pages 1057-1063.
-
(2000)
Advances in NIPS
, pp. 1057-1063
-
-
Sutton, R.S.1
McAllester, D.2
Singh, S.3
Mansour, Y.4
-
24
-
-
0029244438
-
Instructions, intentions and expectations
-
Bonnie Webber, Norman Badler, Barbara Di Eugenio, Libby Levison, Chris Geib, and Michael Moore. 1995. Instructions, intentions and expectations. Artificial Intelligence, 73(1-2).
-
(1995)
Artificial Intelligence
, vol.73
, Issue.1-2
-
-
Webber, B.1
Badler, N.2
Di Eugenio, B.3
Levison, L.4
Geib, C.5
Moore, M.6
-
26
-
-
9444259165
-
On the integration of grounding language and learning objects
-
Chen Yu and Dana H. Ballard. 2004. On the integration of grounding language and learning objects. In Proceedings of AAAI, pages 488-493.
-
(2004)
Proceedings of AAAI
, pp. 488-493
-
-
Yu, C.1
Ballard, D.H.2