메뉴 건너뛰기




Volumn 3, Issue , 2004, Pages 1334-1335

Hierarchical reinforcement learning in communication-mediated multiagent coordination

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER ARCHITECTURE; COMPUTER PROGRAMMING LANGUAGES; DECISION THEORY; HIERARCHICAL SYSTEMS; HUMAN COMPUTER INTERACTION; KNOWLEDGE ENGINEERING; LEARNING SYSTEMS; MARKOV PROCESSES;

EID: 4544220380     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (27)

References (10)
  • 1
    • 0141988716 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • A. G. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13(4):41-77, 2003.
    • (2003) Discrete Event Dynamic Systems , vol.13 , Issue.4 , pp. 41-77
    • Barto, A.G.1    Mahadevan, S.2
  • 2
    • 85150714688 scopus 로고
    • Reinforcement learning methods for continuous-time Markov decision problems
    • G. Tesauro, D. Touretzky, and T. Leen, editors. The MIT Press
    • S. J. Bradtke and M. O. Duff. Reinforcement learning methods for continuous-time Markov decision problems. In G. Tesauro, D. Touretzky, and T. Leen, editors, Advances in Neural Information Processing Systems, volume 7, pages 393-400. The MIT Press, 1995.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 393-400
    • Bradtke, S.J.1    Duff, M.O.2
  • 8
    • 4544279348 scopus 로고    scopus 로고
    • Multi-agent reinforcement learning: A critical survey
    • Stanford University
    • Y. Shoham, R. Powers, and T. Grenager. Multi-agent reinforcement learning: a critical survey. Technical report, Stanford University, 2003.
    • (2003) Technical Report
    • Shoham, Y.1    Powers, R.2    Grenager, T.3
  • 10
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • R. S. Sutton, D. Precup, and S. P. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2): 181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.