SCOPUS Information Search Platform

11th International Conference on Autonomous Agents and Multiagent Systems 2012, AAMAS 2012: Innovative Applications Track

Volume 1, 2012, Pages 528-535

Reinforcement learning from simultaneous human and MDP reward

(2) Knox, W. Bradley ᵃ; Stone, Peter ᵃ

ᵃ University of Texas at Austin (United States)

Author keywords

Human teachers; Human agent interaction; Interactive learning; Reinforcement learning; Shaping

Indexed keywords

AUTONOMOUS AGENTS; LEARNING ALGORITHMS; MARKOV PROCESSES; MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING;

COMPLEX ENVIRONMENTS; COMPUTATIONAL AGENTS; HUMAN TEACHERS; HUMAN-AGENT INTERACTION; INTERACTIVE LEARNING; MARKOV DECISION PROCESSES; SHAPING; TRADITIONAL REINFORCEMENTS;

PERSONNEL TRAINING;

EID: 84899434166 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (111)

References (20)

1
- 84899465277
- Heuristically accelerated Q-learning: A new approach to speed up reinforcement learning
- R. Bianchi, C. Ribeiro, and A. Costa. Heuristically Accelerated Q-Learning: a new approach to speed up Reinforcement Learning. Advances in AI - SBIA, 2004.
- (2004) Advances in AI - SBIA
- Bianchi, R.¹ Ribeiro, C.² Costa, A.³

2
- 84881076324
- Automatic state abstraction from demonstration
- L. Cobo, P. Zang, C. Isbell Jr, and A. Thomaz. Automatic state abstraction from demonstration. In IJCAI, 2011.
- (2011) IJCAI
- Cobo, L.¹ Zang, P.² Isbell Jr., C.³ Thomaz, A.⁴

3
- 0028739953
- Robot shaping: Developing situated agents through learning
- M. Dorigo and M. Colombetti. Robot shaping: Developing situated agents through learning. Artificial Intelligence, 1994.
- (1994) Artificial Intelligence
- Dorigo, M.¹ Colombetti, M.²

4
- 77955023200
- Probabilistic policy reuse in a reinforcement learning agent
- F. Fernández and M. Veloso. Probabilistic policy reuse in a reinforcement learning agent. AAMAS, 2006.
- (2006) AAMAS
- Fernández, F.¹ Veloso, M.²

5
- 33748432461
- Cobot in LambdaMOO: An adaptive social statistics agent
- C. Isbell, M. Kearns, S. Singh, C. Shelton, P. Stone, and D. Kormann. Cobot in LambdaMOO: An Adaptive Social Statistics Agent. AAMAS, 2006.
- (2006) AAMAS
- Isbell, C.¹ Kearns, M.² Singh, S.³ Shelton, C.⁴ Stone, P.⁵ Kormann, D.⁶

6
- 80053038345
- Reinforcement learning via practice and critique advice
- K. Judah, S. Roy, A. Fern, and T. Dietterich. Reinforcement Learning Via Practice and Critique Advice. AAAI, 2010.
- (2010) AAAI
- Judah, K.¹ Roy, S.² Fern, A.³ Dietterich, T.⁴

7
- 70449629717
- Interactively shaping agents via human reinforcement: The TAMER framework
- W. Knox and P. Stone. Interactively shaping agents via human reinforcement: The TAMER framework. K-CAP, 2009.
- (2009) K-CAP
- Knox, W.¹ Stone, P.²

8
- 84884357468
- Combining manual feedback with subsequent MDP reward signals for reinforcement learning
- W. Knox and P. Stone. Combining manual feedback with subsequent MDP reward signals for reinforcement learning. AAMAS, 2010.
- (2010) AAMAS
- Knox, W.¹ Stone, P.²

9
- 32144462307
- Guiding a reinforcement learner with natural language advice: Initial results in RoboCup soccer
- G. Kuhlmann, P. Stone, R. Mooney, and J. Shavlik. Guiding a reinforcement learner with natural language advice: Initial results in RoboCup soccer. In The AAAI-2004 Workshop on Supervisory Control of Learning and Adaptive Systems, July 2004.
- (2004) The AAAI-2004 Workshop on Supervisory Control of Learning and Adaptive Systems
- Kuhlmann, G.¹ Stone, P.² Mooney, R.³ Shavlik, J.⁴

10
- 84957895797
- Reward functions for accelerated learning
- M. Mataric. Reward functions for accelerated learning. ICML, 1994.
- (1994) ICML
- Mataric, M.¹

11
- 27344432348
- Accelerating reinforcement learning through implicit imitation
- B. Price and C. Boutilier. Accelerating reinforcement learning through implicit imitation. JAIR, 19:569-629, 2003.
- (2003) JAIR , vol.19 , pp. 569-629
- Price, B.¹ Boutilier, C.²

12
- 0001898381
- Practical reinforcement learning in continuous spaces
- W. Smart and L. Kaelbling. Practical reinforcement learning in continuous spaces. ICML, 2000.
- (2000) ICML
- Smart, W.¹ Kaelbling, L.²

13
- 84857858120
- Augmented reinforcement learning for interaction with non-expert humans in agent domains
- M. Sridharan. Augmented reinforcement learning for interaction with non-expert humans in agent domains. In Proceedings of IEEE International Conference on Machine Learning Applications, 2011.
- (2011) Proceedings of IEEE International Conference on Machine Learning Applications
- Sridharan, M.¹

14
- 84910004081
- Learning options through human interaction
- K. Subramanian, C. Isbell, and A. Thomaz. Learning options through human interaction. In 2011 IJCAI Workshop on Agents Learning Interactively from Human Teachers (ALIHT), 2011.
- (2011) 2011 IJCAI Workshop on Agents Learning Interactively from Human Teachers (ALIHT)
- Subramanian, K.¹ Isbell, C.² Thomaz, A.³

15
- 0004102479
- Reinforcement learning: An introduction
- R. Sutton and A. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

16
- 70449370276
- RL-glue: Language-independent software for reinforcement-learning experiments
- B. Tanner and A. White. RL-Glue: Language-independent software for reinforcement-learning experiments. JMLR, 10, 2009.
- (2009) JMLR , vol.10
- Tanner, B.¹ White, A.²

17
- 84872375255
- Integrating reinforcement learning with human demonstrations of varying ability
- M. Taylor, H. Suay, and S. Chernova. Integrating reinforcement learning with human demonstrations of varying ability. AAMAS, 2011.
- (2011) AAMAS
- Taylor, M.¹ Suay, H.² Chernova, S.³

18
- 78650024215
- Dynamic reward shaping: Training a robot by voice
- A. Tenorio-Gonzalez, E. Morales, and L. Villaseñor-Pineda. Dynamic reward shaping: training a robot by voice. Advances in Artificial Intelligence - IBERAMIA 2010, pages 483-492, 2010.
- (2010) Advances in Artificial Intelligence - IBERAMIA 2010 , pp. 483-492
- Tenorio-Gonzalez, A.¹ Morales, E.² Villaseñor-Pineda, L.³

19
- 70350460438
- Reinforcement learning with human teachers: Evidence of feedback and guidance with implications for learning performance
- A. Thomaz and C. Breazeal. Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance. AAAI, 2006.
- (2006) AAAI
- Thomaz, A.¹ Breazeal, C.²

20
- 1942484477
- Principled methods for advising reinforcement learning agents
- E. Wiewiora, G. Cottrell, and C. Elkan. Principled methods for advising reinforcement learning agents. ICML, 2003.
- (2003) ICML
- Wiewiora, E.¹ Cottrell, G.² Elkan, C.³

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.