메뉴 건너뛰기




Volumn 7, Issue , 2006, Pages 2259-2301

Causal graph based decomposition of factored MDPs

Author keywords

Hierarchical decomposition; Markov decision processes; State abstraction

Indexed keywords

ABSTRACTING; ALGORITHMS; DECISION MAKING; GRAPH THEORY; HIERARCHICAL SYSTEMS; MATHEMATICAL MODELS;

EID: 33750705246     PISSN: 15337928     EISSN: 15337928     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (75)

References (40)
  • 4
    • 85150714688 scopus 로고
    • Reinforcement learning methods for continuous-time Markov decision problems
    • S. Bradtke and M. Duff. Reinforcement learning methods for continuous-time Markov decision problems. Advances in Neural Information Processing Systems, 7:393-400, 1995.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 393-400
    • Bradtke, S.1    Duff, M.2
  • 8
    • 84990553353 scopus 로고
    • A model for reasoning about persistence and causation
    • T. Dean and K. Kanazawa. A model for reasoning about persistence and causation. Computational Intelligence, 5(3): 142-150, 1989.
    • (1989) Computational Intelligence , vol.5 , Issue.3 , pp. 142-150
    • Dean, T.1    Kanazawa, K.2
  • 10
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • T. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000a.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.1
  • 12
    • 0007907759 scopus 로고    scopus 로고
    • Emergent hierarchical control structures: Learning reactive/hierarchical relationships in reinforcement environments
    • B. Digney. Emergent hierarchical control structures: Learning reactive/hierarchical relationships in reinforcement environments. From animals to animals, 4:363-372, 1996.
    • (1996) From Animals to Animals , vol.4 , pp. 363-372
    • Digney, B.1
  • 15
    • 2842560201 scopus 로고
    • Strips: A new approach to the application of theorem proving to problem solving
    • R. Fikes and N. Nilsson. Strips: A New Approach to the Application of Theorem Proving to Problem Solving. Artificial Intelligence, 2:189-208, 1971.
    • (1971) Artificial Intelligence , vol.2 , pp. 189-208
    • Fikes, R.1    Nilsson, N.2
  • 18
    • 0023365727 scopus 로고
    • Statecharts: A visual formalism for complex systems
    • D. Harel. Statecharts: A visual formalism for complex systems. Science of Computer Programming, 8:231-274, 1987.
    • (1987) Science of Computer Programming , vol.8 , pp. 231-274
    • Harel, D.1
  • 23
  • 38
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and Semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • R. Sutton, D. Precup, and S. Singh. Between MDPs and Semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112:181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.1    Precup, D.2    Singh, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.