메뉴 건너뛰기




Volumn , Issue , 2008, Pages 154-161

A study of reinforcement learning in a new multiagent domain

Author keywords

[No Author keywords available]

Indexed keywords

EDUCATION; MULTI AGENT SYSTEMS; REINFORCEMENT; REINFORCEMENT LEARNING;

EID: 62949148941     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/WIIAT.2008.114     Document Type: Conference Paper
Times cited : (5)

References (12)
  • 1
    • 37249034293 scopus 로고    scopus 로고
    • Keepaway soccer: From machine learning testbed to benchmark
    • I. Noda, A. Jacoff, A. Bredenfeld, and Y. Takahashi, eds, Berlin, pp, Springer Verlag
    • P. Stone, G. Kuhlmann, M. E. Taylor, and Y. Liu, "Keepaway soccer: From machine learning testbed to benchmark," in RoboCup-2005: Robot Soccer World Cup IX (I. Noda, A. Jacoff, A. Bredenfeld, and Y. Takahashi, eds.), vol. 4020, (Berlin), pp. 93-105, Springer Verlag, 2006.
    • (2006) RoboCup-2005: Robot Soccer World Cup IX , vol.4020 , pp. 93-105
    • Stone, P.1    Kuhlmann, G.2    Taylor, M.E.3    Liu, Y.4
  • 2
    • 27544506565 scopus 로고    scopus 로고
    • Reinforcement learning for RoboCup-soccer keepaway
    • P. Stone, R. S. Sutton, and G. Kuhlmann, "Reinforcement learning for RoboCup-soccer keepaway," Adaptive Behavior, vol. 13, no. 3, pp. 165-188, 2005.
    • (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
    • Stone, P.1    Sutton, R.S.2    Kuhlmann, G.3
  • 4
    • 0028733775 scopus 로고
    • Reinforcement learning in continuous time: Advantage updating
    • June
    • L.C. Baird, "Reinforcement learning in continuous time: advantage updating," in IEEE World Congress on Computational Intelligence, vol. 4, pp. 2448-2453, June 1994.
    • (1994) IEEE World Congress on Computational Intelligence , vol.4 , pp. 2448-2453
    • Baird, L.C.1
  • 8
    • 17444414191 scopus 로고    scopus 로고
    • Basis function adaptation in temporal difference reinforcement learning
    • I. Menache, S. Mannor, and N. Shimkin, "Basis function adaptation in temporal difference reinforcement learning," Annals of Operations Research, vol. 134, pp. 215-238, 2005.
    • (2005) Annals of Operations Research , vol.134 , pp. 215-238
    • Menache, I.1    Mannor, S.2    Shimkin, N.3
  • 9
    • 29344433509 scopus 로고    scopus 로고
    • Samuel meets amarel: Automating value function approximation using global state space analysis
    • S. Mahadevan, "Samuel meets amarel: Automating value function approximation using global state space analysis," in Proceedings of AAAI, pp. 1000-1005, 2005.
    • (2005) Proceedings of AAAI , pp. 1000-1005
    • Mahadevan, S.1
  • 11
    • 0024680419 scopus 로고
    • Adaptive aggregation methods for infinite horizon dynamic programming
    • D. Bertsekas and D. non, "Adaptive aggregation methods for infinite horizon dynamic programming," IEEE Transactions on Automatic Control, vol. 34, pp. 589-598, 1989.
    • (1989) IEEE Transactions on Automatic Control , vol.34 , pp. 589-598
    • Bertsekas, D.1    non, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.