-
1
-
-
63149159130
-
A survey of robot learning from demonstration
-
B. Argali, S. Chernova, M. Vcloso, and B. Browning. A survey of robot learning from demonstration. Robotics and Autonomous Systems, 57(5):469-483, 2009.
-
(2009)
Robotics and Autonomous Systems
, vol.57
, Issue.5
, pp. 469-483
-
-
Argali, B.1
Chernova, S.2
Vcloso, M.3
Browning, B.4
-
2
-
-
77953852622
-
Integrated learning for interactive synthetic characters
-
B. Blumberg, M. Downie, Y. Ivanov, M. Berlin, M. Johnson, and B. Tomlinson. Integrated learning for interactive synthetic characters. SIGGRAPH, 2002.
-
(2002)
SIGGRAPH
-
-
Blumberg, B.1
Downie, M.2
Ivanov, Y.3
Berlin, M.4
Johnson, M.5
Tomlinson, B.6
-
3
-
-
0001133021
-
Generalization in reinforcement learning: Safely approximating the value function
-
J. Boyan and A. Moore. Generalization in reinforcement learning: Safely approximating the value function. NIPS, 1995.
-
(1995)
NIPS
-
-
Boyan, J.1
Moore, A.2
-
4
-
-
0028739953
-
Robot shaping: Developing situated agents through learning
-
M. Dorigo and M. Colombctti. Robot shaping: Developing situated agents through learning. Artificial Intelligence, 70(2):321-370, 1994.
-
(1994)
Artificial Intelligence
, vol.70
, Issue.2
, pp. 321-370
-
-
Dorigo, M.1
Colombctti, M.2
-
5
-
-
77955023200
-
Probabilistic policy reuse in a reinforcement learning agent
-
F. Fernandez and M. Veloso. Probabilistic policy reuse in a reinforcement learning agent. AAMAS, 2006.
-
(2006)
AAMAS
-
-
Fernandez, F.1
Veloso, M.2
-
6
-
-
33748432461
-
Cobot in LambdaMOO: An adaptive social statistics agent
-
C. Isbell, M. Kcarns, S. Singh, C. Shelton, P. Stone, and D. Kormann. Cobot in LambdaMOO: An Adaptive Social Statistics Agent. AAMAS, 2006.
-
(2006)
AAMAS
-
-
Isbell, C.1
Kcarns, M.2
Singh, S.3
Shelton, C.4
Stone, P.5
Kormann, D.6
-
7
-
-
0037204457
-
Robotic clicker training
-
F. Kaplan, P. Oudcycr, E. Kubinyi, and A. Miklósi. Robotic clicker training. Robotics and Autonomous Systems, 38(3-4), 2002.
-
(2002)
Robotics and Autonomous Systems
, vol.38
, Issue.3-4
-
-
Kaplan, F.1
Oudcycr, P.2
Kubinyi, E.3
Miklósi, A.4
-
8
-
-
70350475295
-
Design principles for creating human-shapable agents
-
March
-
W. B. Knox, I. Fasel, and P. Stone. Design principles for creating human-shapable agents. In AAAI Spring 2009 Symposium, on Agents that Learn from, Human Teachers, March 2009.
-
(2009)
AAAI Spring 2009 Symposium, on Agents that Learn From, Human Teachers
-
-
Knox, W.B.1
Fasel, I.2
Stone, P.3
-
11
-
-
84957895797
-
Reward functions for accelerated learning
-
M. Mataric. Reward functions for accelerated learning. ICML, 1994.
-
(1994)
ICML
-
-
Mataric, M.1
-
12
-
-
0141596576
-
Policy invariance under reward transformations: Theory and application to reward shaping
-
A. Ng, D. Harada, and S. Russell. Policy invariance under reward transformations: Theory and application to reward shaping. ICML, 1999.
-
(1999)
ICML
-
-
Ng, A.1
Harada, D.2
Russell, S.3
-
13
-
-
27344432348
-
Accelerating reinforcement learning through implicit imitation
-
B. Price and C. Boutilier. Accelerating reinforcement learning through implicit imitation. JAIR, 19:569-629, 2003.
-
(2003)
JAIR
, vol.19
, pp. 569-629
-
-
Price, B.1
Boutilier, C.2
-
15
-
-
70449370276
-
Rl-glue: Language-independent software for reinforcement-learning experiments
-
B. Tanner and A. White. Rl-glue: Language-independent software for reinforcement-learning experiments. JMLR, 10, 2009.
-
JMLR
, vol.10
, pp. 2009
-
-
Tanner, B.1
White, A.2
-
16
-
-
84922201091
-
Transferring instances for model-based reinforcement learning
-
M. E. Taylor, N. K. Jong, and P. Stone. Transferring instances for model-based reinforcement learning. ECML PKDD, 2008.
-
(2008)
ECML PKDD
-
-
Taylor, M.E.1
Jong, N.K.2
Stone, P.3
-
17
-
-
38349005230
-
Cross-domain transfer for reinforcement learning
-
M. E. Taylor and P. Stone. Cross-domain transfer for reinforcement learning. ICML, 2007.
-
(2007)
ICML
-
-
Taylor, M.E.1
Stone, P.2
-
18
-
-
34848816477
-
Transfer learning via inter-task mappings for temporal difference learning
-
M. E. Taylor, P. Stone, and Y. Liu. Transfer learning via inter-task mappings for temporal difference learning. JMLR, 8(1):2125-2167, 2007.
-
(2007)
JMLR
, vol.8
, Issue.1
, pp. 2125-2167
-
-
Taylor, M.E.1
Stone, P.2
Liu, Y.3
-
19
-
-
70350460438
-
Reinforcement learning with human teachers: Evidence of feedback and guidance with implications for learning performance
-
A. Thomaz and C. Brcazcal. Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance. AAAI, 2006.
-
(2006)
AAAI
-
-
Thomaz, A.1
Brcazcal, C.2
|