메뉴 건너뛰기




Volumn 2, Issue , 1999, Pages 1332-1339

Computing factored value functions for policies in structured MDPs

Author keywords

[No Author keywords available]

Indexed keywords

ADDITIVE FUNCTION; ADDITIVE VALUE FUNCTIONS; COMPACT REPRESENTATION; DYNAMIC BAYESIAN NETWORKS; LINEAR FUNCTIONS; MARKOV DECISION PROCESSES; PROCESS DESCRIPTIONS; VALUE DETERMINATION;

EID: 84880688552     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (124)

References (17)
  • 1
    • 0002546896 scopus 로고    scopus 로고
    • Graphical models for preference and utility
    • F. Bacchus and A. Grove. Graphical models for preference and utility. In Proc. UAI, 1995.
    • Proc. UAI, 1995
    • Bacchus, F.1    Grove, A.2
  • 2
    • 0012352653 scopus 로고    scopus 로고
    • Approximating value trees in structured dynamic programming
    • C. Boutilier and R. Dearden. Approximating value trees in structured dynamic programming. In Proc. ICML, pages 54-62, 1996.
    • (1996) Proc. ICML , pp. 54-62
    • Boutilier, C.1    Dearden, R.2
  • 3
    • 0001811022 scopus 로고    scopus 로고
    • Prioritized goal decomposition of Markov decision processes: Towards a synthesis of classical and decision theoretic planning
    • C. Boutilier, R.I. Brafman, and C. Geib. Prioritized goal decomposition of Markov decision processes: Towards a synthesis of classical and decision theoretic planning. In Proc. UAI, pages 24-32, 1998.
    • (1998) Proc. UAI , pp. 24-32
    • Boutilier, C.1    Brafman, R.I.2    Geib, C.3
  • 5
    • 84880686930 scopus 로고    scopus 로고
    • Tractable inference for complex stochastic processes
    • X. Boyen and D. Koller. Tractable inference for complex stochastic processes. In Proc. UAI, 1998.
    • Proc. UAI, 1998
    • Boyen, X.1    Koller, D.2
  • 6
    • 0000746330 scopus 로고    scopus 로고
    • Model reduction techniques for computing approximately optimal solutions for Markov decision processes
    • T. Dean, R. Givan, and S. Leach. Model reduction techniques for computing approximately optimal solutions for Markov decision processes. In Proc. UAI, 1997.
    • Proc. UAI, 1997
    • Dean, T.1    Givan, R.2    Leach, S.3
  • 7
    • 84880694195 scopus 로고
    • Stable function approximation in dynamic programming
    • G J. Gordon. Stable function approximation in dynamic programming. In Proc. ICML, pages 261-268, 1995.
    • (1995) Proc. ICML , pp. 261-268
    • Gordon, G.J.1
  • 12
    • 84899022377 scopus 로고    scopus 로고
    • How to dynamically merge Markov decision processes
    • S.P. Singh and D. Cohn. How to dynamically merge Markov decision processes. In NIPS 10, pages 1057-1063, 1998.
    • (1998) NIPS 10 , pp. 1057-1063
    • Singh, S.P.1    Cohn, D.2
  • 14
    • 0000672258 scopus 로고    scopus 로고
    • Improved switching among temporally abstract actions
    • To appear
    • R.S. Sutton, S. Singh, D. Precup, and B. Ravindran. Improved switching among temporally abstract actions. In NIPS 12, 1999. To appear.
    • (1999) NIPS 12
    • Sutton, R.S.1    Singh, S.2    Precup, D.3    Ravindran, B.4
  • 15
    • 0002313852 scopus 로고    scopus 로고
    • Scaling up average reward reinforcement learning by approximating the domain models and the value function
    • P. Tadepalli and D. Ok. Scaling up average reward reinforcement learning by approximating the domain models and the value function. In Proc. ICML, 1996.
    • Proc. ICML, 1996
    • Tadepalli, P.1    Ok, D.2
  • 16
    • 0029752470 scopus 로고    scopus 로고
    • Feature-based methods for large scale dynamic programming
    • J. D. Tsitsiklis and B. Van Roy. Feature-based methods for large scale dynamic programming. Machine Learning, 22(1):59-94, January 1996. (Pubitemid 126724363)
    • (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 59-94
    • Tsitsiklis, J.N.1    Van Roy, B.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.