SCOPUS Information Search Platform

2008 IEEE 7th International Conference on Development and Learning, ICDL

Volume , Issue , 2008, Pages 292-297

TAMER: Training an agent manually via evaluative reinforcement

(2) Knox, W. Bradleyᵃ; Stone, Peterᵃ

ᵃ University of Texas at Austin (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AGENT MODEL; AUTONOMOUS LEARNING; COMPLEX TASK; HUMAN EXPERTISE; LEARNING AGENTS; LEARNING PROCESS; NOVEL ALGORITHM; ORDER OF MAGNITUDE; REWARD FUNCTION; SEQUENTIAL DECISION MAKING;

ALGORITHMS; AUTONOMOUS AGENTS; GAME THEORY; INTELLIGENT AGENTS; REINFORCEMENT;

EDUCATION;

EID: 67650154279 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/DEVLRN.2008.4640845 Document Type: Conference Paper

Times cited : (169)

References (18)

1
- 0000985504
- TD-Gammon, a self-teaching backgammon program, achieves master-level play
- G. Tesauro, "TD-Gammon, a self-teaching backgammon program, achieves master-level play," Neural Computation, vol. 6, no. 2, pp. 215-219, 1994.
- (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
- Tesauro, G.¹

2
- 33947387070
- Apprenticeship learning via inverse reinforcement learning
- P. Abbeel and A. Ng, "Apprenticeship learning via inverse reinforcement learning," ACM International Conference Proceeding Series, 2004.
- (2004) ACM International Conference Proceeding Series
- Abbeel, P.¹ Ng, A.²

3
- 27544506565
- Reinforcement learning for RoboCup-soccer keepaway
- P. Stone, R. S. Sutton, and G. Kuhlmann, "Reinforcement learning for RoboCup-soccer keepaway," Adaptive Behavior, vol. 13, no. 3, pp. 165-188, 2005.
- (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
- Stone, P.¹ Sutton, R.S.² Kuhlmann, G.³

4
- 9444275934
- Machine learning for fast quadrupedal locomotion
- N. Kohl and P. Stone, "Machine learning for fast quadrupedal locomotion," in The Nineteenth National Conference on Artificial Intelligence, July 2004, pp. 611-616.
- (2004) The Nineteenth National Conference on Artificial Intelligence , pp. 611-616
- Kohl, N.¹ Stone, P.²

5
- 0029732210
- Creating advice-taking reinforcement learners
- R. Maclin and J. W. Shavlik, "Creating advice-taking reinforcement learners," Machine Learning, vol. 22, pp. 251-282, 1996.
- (1996) Machine Learning , vol.22 , pp. 251-282
- Maclin, R.¹ Shavlik, J.W.²

6
- 67650110473
- Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance
- A. Thomaz and C. Breazeal, "Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance," AAAI-2006.
- AAAI-2006
- Thomaz, A.¹ Breazeal, C.²

7
- 0004102479
- Reinforcement Learning: An Introduction
- R. Sutton and A. Barto, Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

8
- 33748432461
- Cobot in LambdaMOO: An adaptive social statistics agent
- C. L. Isbell, Jr., M. Kearns, S. Singh, C. Shelton, P. Stone, and D. Kormann, "Cobot in LambdaMOO: An adaptive social statistics agent," Autonomous Agents and Multiagent Systems, vol. 13, no. 3, November 2006.
- (2006) Autonomous Agents and Multiagent Systems , vol.13 , Issue.3
- Isbell Jr., C.L.¹ Kearns, M.² Singh, S.³ Shelton, C.⁴ Stone, P.⁵ Kormann, D.⁶

9
- 33947411394
- Learning and Behavior: A Contemporary Synthesis
- M. Bouton, Learning and Behavior: A Contemporary Synthesis. Sinauer Associates, 2007.
- (2007) Learning and Behavior: A Contemporary Synthesis
- Bouton, M.¹

10
- 33745885802
- Using prior knowledge to improve reinforcement learning in mobile robotics
- D. Moreno, C. Regueiro, R. Iglesias, and S. Barro, "Using prior knowledge to improve reinforcement learning in mobile robotics," Proc. Towards Autonomous Robotics Systems. Univ. of Essex, UK, 2004.
- (2004) Proc. Towards Autonomous Robotics Systems. Univ. of Essex
- Moreno, D.¹ Regueiro, C.² Iglesias, R.³ Barro, S.⁴

11
- 32144462307
- Guiding a reinforcement learner with natural language advice: Initial results in RoboCup soccer
- G. Kuhlmann, P. Stone, R. Mooney, and J. Shavlik, "Guiding a reinforcement learner with natural language advice: Initial results in RoboCup soccer," in The AAAI-2004 Workshop on Supervisory Control of Learning and Adaptive Systems, July 2004.
- (2004) The AAAI-2004 Workshop on Supervisory Control of Learning and Adaptive Systems
- Kuhlmann, G.¹ Stone, P.² Mooney, R.³ Shavlik, J.⁴

12
- 0033151712
- Is imitation learning the route to humanoid robots?
- S. Schaal, "Is imitation learning the route to humanoid robots?" Trends in Cognitive Sciences, vol. 3, no. 6, 1999.
- (1999) Trends in Cognitive Sciences , vol.3 , Issue.6
- Schaal, S.¹

13
- 0037204457
- Robotic clicker training
- F. Kaplan, P. Oudeyer, E. Kubinyi, and A. Miklósi, "Robotic clicker training," Robotics and Autonomous Systems, vol. 38, no. 3-4, 2002.
- (2002) Robotics and Autonomous Systems , vol.38 , Issue.3-4
- Kaplan, F.¹ Oudeyer, P.² Kubinyi, E.³ Miklósi, A.⁴

14
- 77953852622
- Integrated learning for interactive synthetic characters
- B. Blumberg, M. Downie, Y. Ivanov, M. Berlin, M. Johnson, and B. Tomlinson, "Integrated learning for interactive synthetic characters," Proc. of the 29th annual conference on Computer graphics and interactive techniques, 2002.
- (2002) Proc. of the 29th annual conference on Computer graphics and interactive techniques
- Blumberg, B.¹ Downie, M.² Ivanov, Y.³ Berlin, M.⁴ Johnson, M.⁵ Tomlinson, B.⁶

15
- 0003487482
- Neuro-Dynamic Programming
- D. Bertsekas and J. Tsitsiklis, Neuro-Dynamic Programming. Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

16
- 33845344721
- Learning Tetris Using the Noisy Cross-Entropy Method
- I. Szita and A. Lőrincz, "Learning Tetris Using the Noisy Cross-Entropy Method," Neural Computation, vol. 18, no. 12, 2006.
- (2006) Neural Computation , vol.18 , Issue.12
- Szita, I.¹ Lőrincz, A.²

17
- 67650123910
- On the numeric stability of gaussian processes regression for relational reinforcement learning
- J. Ramon and K. Driessens, "On the numeric stability of gaussian processes regression for relational reinforcement learning," ICML-2004 Workshop on Relational Reinforcement Learning, pp. 10-14, 2004.

18
- 33845339117
- Evolving a heuristic function for the game of Tetris
- N. Böhm, G. Kókai, and S. Mandl, "Evolving a heuristic function for the game of Tetris," Proc. Lernen, Wissensentdeckung und Adaptivität LWA, 2004.
- (2004) Proc. Lernen, Wissensentdeckung und Adaptivität LWA
- Böhm, N.¹ Kókai, G.² Mandl, S.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.