메뉴 건너뛰기




Volumn , Issue , 2008, Pages 292-297

TAMER: Training an agent manually via evaluative reinforcement

Author keywords

[No Author keywords available]

Indexed keywords

AGENT MODEL; AUTONOMOUS LEARNING; COMPLEX TASK; HUMAN EXPERTISE; LEARNING AGENTS; LEARNING PROCESS; NOVEL ALGORITHM; ORDER OF MAGNITUDE; REWARD FUNCTION; SEQUENTIAL DECISION MAKING;

EID: 67650154279     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/DEVLRN.2008.4640845     Document Type: Conference Paper
Times cited : (169)

References (18)
  • 1
    • 0000985504 scopus 로고
    • TD-Gammon, a self-teaching backgammon program, achieves master-level play
    • G. Tesauro, "TD-Gammon, a self-teaching backgammon program, achieves master-level play," Neural Computation, vol. 6, no. 2, pp. 215-219, 1994.
    • (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
    • Tesauro, G.1
  • 3
    • 27544506565 scopus 로고    scopus 로고
    • Reinforcement learning for RoboCup-soccer keepaway
    • P. Stone, R. S. Sutton, and G. Kuhlmann, "Reinforcement learning for RoboCup-soccer keepaway," Adaptive Behavior, vol. 13, no. 3, pp. 165-188, 2005.
    • (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
    • Stone, P.1    Sutton, R.S.2    Kuhlmann, G.3
  • 5
    • 0029732210 scopus 로고    scopus 로고
    • Creating advice-taking reinforcement learners
    • R. Maclin and J. W. Shavlik, "Creating advice-taking reinforcement learners," Machine Learning, vol. 22, pp. 251-282, 1996.
    • (1996) Machine Learning , vol.22 , pp. 251-282
    • Maclin, R.1    Shavlik, J.W.2
  • 6
    • 67650110473 scopus 로고    scopus 로고
    • Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance
    • A. Thomaz and C. Breazeal, "Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance," AAAI-2006.
    • AAAI-2006
    • Thomaz, A.1    Breazeal, C.2
  • 12
    • 0033151712 scopus 로고    scopus 로고
    • Is imitation learning the route to humanoid robots?
    • S. Schaal, "Is imitation learning the route to humanoid robots?" Trends in Cognitive Sciences, vol. 3, no. 6, 1999.
    • (1999) Trends in Cognitive Sciences , vol.3 , Issue.6
    • Schaal, S.1
  • 16
    • 33845344721 scopus 로고    scopus 로고
    • Learning Tetris Using the Noisy Cross-Entropy Method
    • I. Szita and A. Lorincz, "Learning Tetris Using the Noisy Cross-Entropy Method," Neural Computation, vol. 18, no. 12, 2006.
    • (2006) Neural Computation , vol.18 , Issue.12
    • Szita, I.1    Lorincz, A.2
  • 17
    • 67650123910 scopus 로고    scopus 로고
    • J. Ramon and K. Driessens, On the numeric stability of gaussian processes regression for relational reinforcement learning, ICML-2004 Workshop on Relational Reinforcement Learning, pp. 10-14, 2004.
    • J. Ramon and K. Driessens, "On the numeric stability of gaussian processes regression for relational reinforcement learning," ICML-2004 Workshop on Relational Reinforcement Learning, pp. 10-14, 2004.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.