메뉴 건너뛰기




Volumn , Issue , 2011, Pages

Monte Carlo Value Iteration with macro-actions

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE;

EID: 85162333012     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (36)

References (18)
  • 3
    • 0037288370 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • 2003
    • Andrew G. Barto and Sridhar Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13:2003, 2003.
    • (2003) Discrete Event Dynamic Systems , vol.13
    • Barto, A.G.1    Mahadevan, S.2
  • 4
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • T. G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. J. Artificial Intelligence Research, 13:227-303, 2000.
    • (2000) J. Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 8
    • 79952126758 scopus 로고    scopus 로고
    • Motion planning under uncertainty for robotic tasks with long time horizons
    • H. Kurniawati, Y. Du, D. Hsu, and W. S. Lee. Motion planning under uncertainty for robotic tasks with long time horizons. Int. J. Robotics Research, 30(3):308-323, 2010.
    • (2010) Int. J. Robotics Research , vol.30 , Issue.3 , pp. 308-323
    • Kurniawati, H.1    Du, Y.2    Hsu, D.3    Lee, W.S.4
  • 9
    • 70349645087 scopus 로고    scopus 로고
    • SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces
    • H. Kurniawati, D. Hsu, and W.S. Lee. SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces. In Proc. Robotics: Science & Systems, 2008.
    • (2008) Proc. Robotics: Science & Systems
    • Kurniawati, H.1    Hsu, D.2    Lee, W.S.3
  • 10
  • 13
    • 84899444044 scopus 로고    scopus 로고
    • UAV swarm coordination using cooperative control for establishing a wireless communications backbone
    • A. Sivakumar and C.K.Y. Tan. UAV swarm coordination using cooperative control for establishing a wireless communications backbone. In Proc. Int. Conf. on Autonomous Agents & Multiagent Systems, pages 1157-1164, 2010.
    • (2010) Proc. Int. Conf. on Autonomous Agents & Multiagent Systems , pp. 1157-1164
    • Sivakumar, A.1    Tan, C.K.Y.2
  • 15
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • R.S. Sutton, D. Precup, and S. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1):181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , Issue.1 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 18
    • 0016928374 scopus 로고
    • Procedures for the solution of a finite-horizon, partially observed, semi-Markov optimization problem
    • C.C. White. Procedures for the solution of a finite-horizon, partially observed, semi-Markov optimization problem.Operations Research, 24(2):348-358, 1976.
    • (1976) Operations Research , vol.24 , Issue.2 , pp. 348-358
    • White, C.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.