-
1
-
-
84899465277
-
Heuristically accelerated Q-learning: A new approach to speed up reinforcement learning
-
R. Bianchi, C. Ribeiro, and A. Costa. Heuristically Accelerated Q-Learning: a new approach to speed up Reinforcement Learning. Advances in AI - SBIA, 2004.
-
(2004)
Advances in AI - SBIA
-
-
Bianchi, R.1
Ribeiro, C.2
Costa, A.3
-
3
-
-
0028739953
-
Robot shaping: Developing situated agents through learning
-
M. Dorigo and M. Colombetti. Robot shaping: Developing situated agents through learning. Artificial Intelligence, 1994.
-
(1994)
Artificial Intelligence
-
-
Dorigo, M.1
Colombetti, M.2
-
4
-
-
77955023200
-
Probabilistic policy reuse in a reinforcement learning agent
-
F. Fernández and M. Veloso. Probabilistic policy reuse in a reinforcement learning agent. AAMAS, 2006.
-
(2006)
AAMAS
-
-
Fernández, F.1
Veloso, M.2
-
5
-
-
33748432461
-
Cobot in LambdaMOO: An adaptive social statistics agent
-
C. Isbell, M. Kearns, S. Singh, C. Shelton, P. Stone, and D. Kormann. Cobot in LambdaMOO: An Adaptive Social Statistics Agent. AAMAS, 2006.
-
(2006)
AAMAS
-
-
Isbell, C.1
Kearns, M.2
Singh, S.3
Shelton, C.4
Stone, P.5
Kormann, D.6
-
6
-
-
80053038345
-
Reinforcement learning via practice and critique advice
-
K. Judah, S. Roy, A. Fern, and T. Dietterich. Reinforcement Learning Via Practice and Critique Advice. AAAI, 2010.
-
(2010)
AAAI
-
-
Judah, K.1
Roy, S.2
Fern, A.3
Dietterich, T.4
-
7
-
-
70449629717
-
Interactively shaping agents via human reinforcement: The TAMER framework
-
W. Knox and P. Stone. Interactively shaping agents via human reinforcement: The TAMER framework. K-CAP, 2009.
-
(2009)
K-CAP
-
-
Knox, W.1
Stone, P.2
-
8
-
-
84884357468
-
Combining manual feedback with subsequent MDP reward signals for reinforcement learning
-
W. Knox and P. Stone. Combining manual feedback with subsequent MDP reward signals for reinforcement learning. AAMAS, 2010.
-
(2010)
AAMAS
-
-
Knox, W.1
Stone, P.2
-
10
-
-
84957895797
-
Reward functions for accelerated learning
-
M. Mataric. Reward functions for accelerated learning. ICML, 1994.
-
(1994)
ICML
-
-
Mataric, M.1
-
11
-
-
27344432348
-
Accelerating reinforcement learning through implicit imitation
-
B. Price and C. Boutilier. Accelerating reinforcement learning through implicit imitation. JAIR, 19:569-629, 2003.
-
(2003)
JAIR
, vol.19
, pp. 569-629
-
-
Price, B.1
Boutilier, C.2
-
12
-
-
0001898381
-
Practical reinforcement learning in continuous spaces
-
W. Smart and L. Kaelbling. Practical reinforcement learning in continuous spaces. ICML, 2000.
-
(2000)
ICML
-
-
Smart, W.1
Kaelbling, L.2
-
16
-
-
70449370276
-
RL-glue: Language-independent software for reinforcement-learning experiments
-
B. Tanner and A. White. RL-Glue: Language-independent software for reinforcement-learning experiments. JMLR, 10, 2009.
-
JMLR
, vol.10
, pp. 2009
-
-
Tanner, B.1
White, A.2
-
17
-
-
84872375255
-
Integrating reinforcement learning with human demonstrations of varying ability
-
M. Taylor, H. Suay, and S. Chernova. Integrating reinforcement learning with human demonstrations of varying ability. AAMAS, 2011.
-
(2011)
AAMAS
-
-
Taylor, M.1
Suay, H.2
Chernova, S.3
-
19
-
-
70350460438
-
Reinforcement learning with human teachers: Evidence of feedback and guidance with implications for learning performance
-
A. Thomaz and C. Breazeal. Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance. AAAI, 2006.
-
(2006)
AAAI
-
-
Thomaz, A.1
Breazeal, C.2
-
20
-
-
1942484477
-
Principled methods for advising reinforcement learning agents
-
E. Wiewiora, G. Cottrell, and C. Elkan. Principled methods for advising reinforcement learning agents. ICML, 2003.
-
(2003)
ICML
-
-
Wiewiora, E.1
Cottrell, G.2
Elkan, C.3
|