Volume 1, 2010, Pages 5-12

Combining manual feedback with subsequent MDP reward signals for reinforcement learning

Author keywords

Human teachers; Human agent interaction; Reinforcement learning; Shaping

Indexed keywords

FEEDBACK; INTELLIGENT AGENTS; LEARNING ALGORITHMS; MARKOV PROCESSES; MULTI AGENT SYSTEMS; PERSONNEL TRAINING; REINFORCEMENT LEARNING; TEACHING;

EID: 84884357468     PISSN: 1548-8403     EISSN: 1558-2914     Source Type: Conference Proceeding
DOI: None     Document Type: Conference Paper
Times cited: 200

References (19)
  • 3
    • J. Boyan and A. Moore. Generalization in reinforcement learning: Safely approximating the value function. NIPS, 1995.
  • 4
    • M. Dorigo and M. Colombetti. Robot shaping: Developing situated agents through learning. Artificial Intelligence, 70(2):321-370, 1994.
  • 5
    • F. Fernandez and M. Veloso. Probabilistic policy reuse in a reinforcement learning agent. AAMAS, 2006.
  • 11
    • M. Mataric. Reward functions for accelerated learning. ICML, 1994.
  • 12
    • A. Ng, D. Harada, and S. Russell. Policy invariance under reward transformations: Theory and application to reward shaping. ICML, 1999.
  • 13
    • B. Price and C. Boutilier. Accelerating reinforcement learning through implicit imitation. JAIR, 19:569-629, 2003.
  • 15
    • B. Tanner and A. White. RL-Glue: Language-independent software for reinforcement-learning experiments. JMLR, 10, 2009.
  • 16
    • M. E. Taylor, N. K. Jong, and P. Stone. Transferring instances for model-based reinforcement learning. ECML PKDD, 2008.
  • 17
    • M. E. Taylor and P. Stone. Cross-domain transfer for reinforcement learning. ICML, 2007.
  • 18
    • M. E. Taylor, P. Stone, and Y. Liu. Transfer learning via inter-task mappings for temporal difference learning. JMLR, 8(1):2125-2167, 2007.
  • 19
    • A. Thomaz and C. Breazeal. Reinforcement learning with human teachers: Evidence of feedback and guidance with implications for learning performance. AAAI, 2006.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.