



Volume 2371, 2002, Pages 196-211

Model minimization in hierarchical reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ABSTRACTING; MACHINE LEARNING; MARKOV PROCESSES; REDUNDANCY

EID: 84956854078     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/3-540-45622-8_15     Document Type: Conference Paper
Times cited: 64

References (19)
  • 1
    • 0028572333
    • Using abstractions for decision theoretic planning with time constraints
    • AAAI
    • C. Boutilier and R. Dearden. Using abstractions for decision theoretic planning with time constraints. In Proceedings of the AAAI-94, pages 1016-1022. AAAI, 1994.
    • (1994) Proceedings of the AAAI-94 , pp. 1016-1022
    • Boutilier, C.1    Dearden, R.2
  • 4
    • 0031370386
    • Model minimization in Markov decision processes
    • AAAI
    • Thomas Dean and Robert Givan. Model minimization in Markov decision processes. In Proceedings of AAAI-97, pages 106-111. AAAI, 1997.
    • (1997) Proceedings of AAAI-97 , pp. 106-111
    • Dean, T.1    Givan, R.2
  • 6
    • 84956854241
    • Equivalence notions and model minimization in Markov decision processes
    • Robert Givan, Thomas Dean, and Matthew Greig. Equivalence notions and model minimization in Markov decision processes. Submitted to Artificial Intelligence, 2001.
    • (2001) Submitted to Artificial Intelligence
    • Givan, R.1    Dean, T.2    Greig, M.3
  • 7
    • 0034272032
    • Bounded-parameter Markov decision processes
    • Robert Givan, Sonia Leach, and Thomas Dean. Bounded-parameter Markov decision processes. Artificial Intelligence, 122:71-109, 2000.
    • (2000) Artificial Intelligence , vol.122 , pp. 71-109
    • Givan, R.1    Leach, S.2    Dean, T.3
  • 8
    • 0141763163
    • Symmetry groups and translation invariant representations of Markov processes
    • J. Glover. Symmetry groups and translation invariant representations of Markov processes. The Annals of Probability, 19(2):562-586, 1991.
    • (1991) The Annals of Probability , vol.19 , Issue.2 , pp. 562-586
    • Glover, J.1
  • 10
    • 0000148778
    • A heuristic approach to the discovery of macro-operators
    • Glenn A. Iba. A heuristic approach to the discovery of macro-operators. Machine Learning, 3:285-317, 1989.
    • (1989) Machine Learning , vol.3 , pp. 285-317
    • Iba, G.A.1
  • 11
    • 0014604028
    • A note on the iterative decomposition of finite automata
    • J. R. Jump. A note on the iterative decomposition of finite automata. Information and Control, 15:424-435, 1969.
    • (1969) Information and Control , vol.15 , pp. 424-435
    • Jump, J.R.1
  • 12
    • 0026222347
    • Bisimulation through probabilistic testing
    • K. G. Larsen and A. Skou. Bisimulation through probabilistic testing. Information and Computation, 94(1):1-28, 1991.
    • (1991) Information and Computation , vol.94 , Issue.1 , pp. 1-28
    • Larsen, K.G.1    Skou, A.2
  • 16
    • 33745919581
    • Reinforcement Learning: An Introduction
    • MIT Press, Cambridge, MA
    • Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
    • (1998) Reinforcement Learning: An Introduction
    • Sutton, R.S.1    Barto, A.G.2
  • 18
    • 0004049893
    • Learning from delayed rewards
    • PhD thesis, Cambridge University, Cambridge, England
    • C. J. C. H. Watkins. Learning from delayed rewards. PhD thesis, Cambridge University, Cambridge, England, 1989.
    • (1989) Learning from Delayed Rewards
    • Watkins, C.J.C.H.1
  • 19
    • 33645390656
    • Symmetry in Markov decision processes and its implications for single agent and multi agent learning
    • San Francisco, CA, Morgan Kaufmann
    • M. Zinkevich and T. Balch. Symmetry in Markov decision processes and its implications for single agent and multi agent learning. In Proceedings of the 18th International Conference on Machine Learning, pages 632-640, San Francisco, CA, 2001. Morgan Kaufmann.
    • (2001) Proceedings of the 18th International Conference on Machine Learning , pp. 632-640
    • Zinkevich, M.1    Balch, T.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.