메뉴 건너뛰기




Volumn , Issue , 2002, Pages 1547-1554

Value-Directed Compression of POMDPs

Author keywords

[No Author keywords available]

Indexed keywords

ACCURATE PREDICTION; CONDITION; DECISION QUALITY; LINEAR COMPRESSION; LOSSLESS; LOSSY COMPRESSIONS; MATHEMATICAL PROPERTIES; POLICY EVALUATION; SPACE COMPRESSION; STATE-SPACE;

EID: 85156266716     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (24)

References (17)
  • 1
    • 0034248853 scopus 로고    scopus 로고
    • Stochastic dynamic programming with factored representations
    • C. Boutilier, R. Dearden, and M. Goldszmidt. Stochastic dynamic programming with factored representations. Artificial Intelligence, 121:49-107, 2000.
    • (2000) Artificial Intelligence , vol.121 , pp. 49-107
    • Boutilier, C.1    Dearden, R.2    Goldszmidt, M.3
  • 2
    • 0030349220 scopus 로고    scopus 로고
    • Computing optimal policies for partially observable decision processes using compact representations
    • Portland, OR
    • C. Boutilier and D. Poole. Computing optimal policies for partially observable decision processes using compact representations. Proc. AAAI-96, pp.1168-1175, Portland, OR, 1996.
    • (1996) Proc. AAAI-96 , pp. 1168-1175
    • Boutilier, C.1    Poole, D.2
  • 3
    • 84898943953 scopus 로고    scopus 로고
    • Equivalence notions and model minimization in Markov decision processes
    • to appear
    • R. Givan, T. Dean, and M. Greig. Equivalence notions and model minimization in Markov decision processes. Artificial Intelligence, to appear, 2002.
    • (2002) Artificial Intelligence
    • Givan, R.1    Dean, T.2    Greig, M.3
  • 4
    • 84880898477 scopus 로고    scopus 로고
    • Max-norm projections for factored MDPs
    • Seattle, WA
    • C. Guestrin, D. Koller, and R. Parr. Max-norm projections for factored MDPs. Proc. IJCAI-01, pp.673-680, Seattle, WA, 2001.
    • (2001) Proc. IJCAI-01 , pp. 673-680
    • Guestrin, C.1    Koller, D.2    Parr, R.3
  • 7
    • 0002956570 scopus 로고    scopus 로고
    • SPUDD: Stochastic planning using decision diagrams
    • Stockholm
    • J. Hoey, R. St-Aubin, A. Hu, and C. Boutilier. SPUDD: Stochastic planning using decision diagrams. Proc. UAI-99, pp.279-288, Stockholm, 1999.
    • (1999) Proc. UAI-99 , pp. 279-288
    • Hoey, J.1    St-Aubin, R.2    Hu, A.3    Boutilier, C.4
  • 8
    • 0003272035 scopus 로고
    • Memoryless policies: theoretical limitations and practical results
    • D. Cliff, P. Husbands, J. Meyer, S. W. Wilson, eds., Cambridge, MIT Press
    • M. L. Littman. Memoryless policies: theoretical limitations and practical results. In D. Cliff, P. Husbands, J. Meyer, S. W. Wilson, eds., Proc. 3rd Intl. Conf. Sim. of Adaptive Behavior, Cambridge, 1994. MIT Press.
    • (1994) Proc. 3rd Intl. Conf. Sim. of Adaptive Behavior
    • Littman, M. L.1
  • 10
    • 0030168518 scopus 로고    scopus 로고
    • Hidden state and reinforcement learning with instance-based state identification
    • R. A. McCallum. Hidden state and reinforcement learning with instance-based state identification. IEEE Transations on Systems, Man, and Cybernetics, 26(3):464-473, 1996.
    • (1996) IEEE Transations on Systems, Man, and Cybernetics , vol.26 , Issue.3 , pp. 464-473
    • McCallum, R. A.1
  • 12
    • 0036927202 scopus 로고    scopus 로고
    • Greedy linear value-approximation for factored Markov decision processes
    • Edmonton
    • R. Patrascu, P. Poupart, D. Schuurmans, C. Boutilier, C. Guestrin. Greedy linear value-approximation for factored Markov decision processes. AAAI-02, pp.285-291, Edmonton, 2002.
    • (2002) AAAI-02 , pp. 285-291
    • Patrascu, R.1    Poupart, P.2    Schuurmans, D.3    Boutilier, C.4    Guestrin, C.5
  • 13
    • 60349109508 scopus 로고    scopus 로고
    • Sufficiency, separability and temporal probabilistic models
    • Seattle, WA
    • A. Pfeffer. Sufficiency, separability and temporal probabilistic models. Proc. UAI-01, pp.421-428, Seattle, WA, 2001.
    • (2001) Proc. UAI-01 , pp. 421-428
    • Pfeffer, A.1
  • 14
    • 0036923210 scopus 로고    scopus 로고
    • Piecewise linear value function approximation for factored MDPs
    • Edmonton
    • P. Poupart, C. Boutilier, R. Patrascu, and D. Schuurmans. Piecewise linear value function approximation for factored MDPs. AAAI-02, pp.292-299, Edmonton, 2002.
    • (2002) AAAI-02 , pp. 292-299
    • Poupart, P.1    Boutilier, C.2    Patrascu, R.3    Schuurmans, D.4
  • 16
    • 1542342765 scopus 로고    scopus 로고
    • Direct value-approximation for factored MDPs
    • Vancouver
    • D. Schuurmans and R. Patrascu. Direct value-approximation for factored MDPs. Proc. NIPS-01, Vancouver, 2001.
    • (2001) Proc. NIPS-01
    • Schuurmans, D.1    Patrascu, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.