-
1
-
-
0000985504
-
TD-Gammon, a self-teaching backgammon program, achieves master-level play
-
G. Tesauro, "TD-Gammon, a self-teaching backgammon program, achieves master-level play," Neural Computation, vol. 6, no. 2, pp. 215-219, 1994.
-
(1994)
Neural Computation
, vol.6
, Issue.2
, pp. 215-219
-
-
Tesauro, G.1
-
3
-
-
27544506565
-
Reinforcement learning for RoboCup-soccer keepaway
-
P. Stone, R. S. Sutton, and G. Kuhlmann, "Reinforcement learning for RoboCup-soccer keepaway," Adaptive Behavior, vol. 13, no. 3, pp. 165-188, 2005.
-
(2005)
Adaptive Behavior
, vol.13
, Issue.3
, pp. 165-188
-
-
Stone, P.1
Sutton, R.S.2
Kuhlmann, G.3
-
5
-
-
0029732210
-
Creating advice-taking reinforcement learners
-
R. Maclin and J. W. Shavlik, "Creating advice-taking reinforcement learners," Machine Learning, vol. 22, pp. 251-282, 1996.
-
(1996)
Machine Learning
, vol.22
, pp. 251-282
-
-
Maclin, R.1
Shavlik, J.W.2
-
6
-
-
67650110473
-
Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance
-
A. Thomaz and C. Breazeal, "Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance," AAAI-2006.
-
AAAI-2006
-
-
Thomaz, A.1
Breazeal, C.2
-
8
-
-
33748432461
-
Cobot in LambdaMOO: An adaptive social statistics agent
-
November
-
C. L. Isbell, Jr., M. Kearns, S. Singh, C. Shelton, P. Stone, and D. Kormann, "Cobot in LambdaMOO: An adaptive social statistics agent," Autonomous Agents and Multiagent Systems, vol. 13, no. 3, November 2006.
-
(2006)
Autonomous Agents and Multiagent Systems
, vol.13
, Issue.3
-
-
Isbell Jr., C.L.1
Kearns, M.2
Singh, S.3
Shelton, C.4
Stone, P.5
Kormann, D.6
-
10
-
-
33745885802
-
Using prior knowledge to improve reinforcement learning in mobile robotics
-
UK
-
D. Moreno, C. Regueiro, R. Iglesias, and S. Barro, "Using prior knowledge to improve reinforcement learning in mobile robotics," Proc. Towards Autonomous Robotics Systems. Univ. of Essex, UK, 2004.
-
(2004)
Proc. Towards Autonomous Robotics Systems. Univ. of Essex
-
-
Moreno, D.1
Regueiro, C.2
Iglesias, R.3
Barro, S.4
-
11
-
-
32144462307
-
Guiding a reinforcement learner with natural language advice: Initial results in RoboCup soccer
-
July
-
G. Kuhlmann, P. Stone, R. Mooney, and J. Shavlik, "Guiding a reinforcement learner with natural language advice: Initial results in RoboCup soccer," in The AAAI-2004 Workshop on Supervisory Control of Learning and Adaptive Systems, July 2004.
-
(2004)
The AAAI-2004 Workshop on Supervisory Control of Learning and Adaptive Systems
-
-
Kuhlmann, G.1
Stone, P.2
Mooney, R.3
Shavlik, J.4
-
12
-
-
0033151712
-
Is imitation learning the route to humanoid robots?
-
S. Schaal, "Is imitation learning the route to humanoid robots?" Trends in Cognitive Sciences, vol. 3, no. 6, 1999.
-
(1999)
Trends in Cognitive Sciences
, vol.3
, Issue.6
-
-
Schaal, S.1
-
13
-
-
0037204457
-
Robotic clicker training
-
F. Kaplan, P. Oudeyer, E. Kubinyi, and A. Mikl osi, "Robotic clicker training," Robotics and Autonomous Systems, vol. 38, no. 3-4, 2002.
-
(2002)
Robotics and Autonomous Systems
, vol.38
, Issue.3-4
-
-
Kaplan, F.1
Oudeyer, P.2
Kubinyi, E.3
Mikl osi, A.4
-
14
-
-
77953852622
-
Integrated learning for interactive synthetic characters
-
B. Blumberg, M. Downie, Y. Ivanov, M. Berlin, M. Johnson, and B. Tomlinson, "Integrated learning for interactive synthetic characters," Proc. of the 29th annual conference on Computer graphics and interactive techniques, 2002.
-
(2002)
Proc. of the 29th annual conference on Computer graphics and interactive techniques
-
-
Blumberg, B.1
Downie, M.2
Ivanov, Y.3
Berlin, M.4
Johnson, M.5
Tomlinson, B.6
-
16
-
-
33845344721
-
Learning Tetris Using the Noisy Cross-Entropy Method
-
I. Szita and A. Lorincz, "Learning Tetris Using the Noisy Cross-Entropy Method," Neural Computation, vol. 18, no. 12, 2006.
-
(2006)
Neural Computation
, vol.18
, Issue.12
-
-
Szita, I.1
Lorincz, A.2
-
17
-
-
67650123910
-
-
J. Ramon and K. Driessens, On the numeric stability of gaussian processes regression for relational reinforcement learning, ICML-2004 Workshop on Relational Reinforcement Learning, pp. 10-14, 2004.
-
J. Ramon and K. Driessens, "On the numeric stability of gaussian processes regression for relational reinforcement learning," ICML-2004 Workshop on Relational Reinforcement Learning, pp. 10-14, 2004.
-
-
-
-
18
-
-
33845339117
-
Evolving a heuristic function for the game of Tetris
-
N. Bohm, G. Kokai, and S. Mandl, "Evolving a heuristic function for the game of Tetris," Proc. Lernen, Wissensentdeckung und Adaptivitat LWA, 2004.
-
(2004)
Proc. Lernen, Wissensentdeckung und Adaptivitat LWA
-
-
Bohm, N.1
Kokai, G.2
Mandl, S.3
|