Volume 2, 2014, Pages 1273-1280

Planning with macro-actions in decentralized POMDPs

Author keywords

[No Author keywords available]

EID: 84911422338     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 75

References (34)
  • 4
    • 0141988716
    • Recent advances in hierarchical reinforcement learning
    • A. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13:41-77, 2003.
    • (2003) Discrete Event Dynamic Systems , vol.13 , pp. 41-77
    • Barto, A.1    Mahadevan, S.2
  • 12
    • 0002278788
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • T. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.1
  • 15
    • 27844487453
    • A survey of multi-agent organizational paradigms
    • B. Horling and V. Lesser. A survey of multi-agent organizational paradigms. The Knowledge Engineering Review, 19(4):281-316, 2004.
    • (2004) The Knowledge Engineering Review , vol.19 , Issue.4 , pp. 281-316
    • Horling, B.1    Lesser, V.2
  • 28
    • 85122663910
    • Navigation among movable obstacles: Real-time reasoning in complex environments
    • M. Stilman and J. Kuffner. Navigation among movable obstacles: Real-time reasoning in complex environments. International Journal of Humanoid Robotics, 2(4):479-504, 2005.
    • (2005) International Journal of Humanoid Robotics , vol.2 , Issue.4 , pp. 479-504
    • Stilman, M.1    Kuffner, J.2
  • 29
    • 27544506565
    • Reinforcement learning for robocup soccer keepaway
    • P. Stone, R. Sutton, and G. Kuhlmann. Reinforcement learning for robocup soccer keepaway. Adaptive Behavior, 13(3):165-188, 2005.
    • (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
    • Stone, P.1    Sutton, R.2    Kuhlmann, G.3
  • 30
    • 0033170372
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • R. S. Sutton, D. Precup, and S. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1):181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , Issue.1 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.