SCOPUS 정보 검색 플랫폼

NIPS 2002: Proceedings of the 15th International Conference on Neural Information Processing Systems

Volumn , Issue , 2002, Pages 1547-1554

Value-Directed Compression of POMDPs

(2) Poupart, Pascal a Boutilier, Craig a

Author keywords

[No Author keywords available]

Indexed keywords

ACCURATE PREDICTION; CONDITION; DECISION QUALITY; LINEAR COMPRESSION; LOSSLESS; LOSSY COMPRESSIONS; MATHEMATICAL PROPERTIES; POLICY EVALUATION; SPACE COMPRESSION; STATE-SPACE;

EID: 85156266716 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (24)

References (17)

1
- 0034248853
- Stochastic dynamic programming with factored representations
- C. Boutilier, R. Dearden, and M. Goldszmidt. Stochastic dynamic programming with factored representations. Artificial Intelligence, 121:49-107, 2000.
- (2000) Artificial Intelligence , vol.121 , pp. 49-107
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

2
- 0030349220
- Computing optimal policies for partially observable decision processes using compact representations
- Portland, OR
- C. Boutilier and D. Poole. Computing optimal policies for partially observable decision processes using compact representations. Proc. AAAI-96, pp.1168-1175, Portland, OR, 1996.
- (1996) Proc. AAAI-96 , pp. 1168-1175
- Boutilier, C.¹ Poole, D.²

3
- 84898943953
- Equivalence notions and model minimization in Markov decision processes
- to appear
- R. Givan, T. Dean, and M. Greig. Equivalence notions and model minimization in Markov decision processes. Artificial Intelligence, to appear, 2002.
- (2002) Artificial Intelligence
- Givan, R.¹ Dean, T.² Greig, M.³

4
- 84880898477
- Max-norm projections for factored MDPs
- Seattle, WA
- C. Guestrin, D. Koller, and R. Parr. Max-norm projections for factored MDPs. Proc. IJCAI-01, pp.673-680, Seattle, WA, 2001.
- (2001) Proc. IJCAI-01 , pp. 673-680
- Guestrin, C.¹ Koller, D.² Parr, R.³

5
- 27344440966
- Solving factored POMDPs with linear value functions
- Seattle, WA
- C. Guestrin, D. Koller, and R. Parr. Solving factored POMDPs with linear value functions. IJCAI-01 Worksh. on Planning under Uncertainty and Inc. Info., Seattle, WA, 2001.
- (2001) IJCAI-01 Worksh. on Planning under Uncertainty and Inc. Info
- Guestrin, C.¹ Koller, D.² Parr, R.³

6
- 84899025633
- Unpublished manuscript
- C. Guestrin and D. Ormoneit. Information-theoretic features for reinforcement learning. Unpublished manuscript.
- Information-theoretic features for reinforcement learning
- Guestrin, C.¹ Ormoneit, D.²

7
- 0002956570
- SPUDD: Stochastic planning using decision diagrams
- Stockholm
- J. Hoey, R. St-Aubin, A. Hu, and C. Boutilier. SPUDD: Stochastic planning using decision diagrams. Proc. UAI-99, pp.279-288, Stockholm, 1999.
- (1999) Proc. UAI-99 , pp. 279-288
- Hoey, J.¹ St-Aubin, R.² Hu, A.³ Boutilier, C.⁴

8
- 0003272035
- Memoryless policies: theoretical limitations and practical results
- D. Cliff, P. Husbands, J. Meyer, S. W. Wilson, eds., Cambridge, MIT Press
- M. L. Littman. Memoryless policies: theoretical limitations and practical results. In D. Cliff, P. Husbands, J. Meyer, S. W. Wilson, eds., Proc. 3rd Intl. Conf. Sim. of Adaptive Behavior, Cambridge, 1994. MIT Press.
- (1994) Proc. 3rd Intl. Conf. Sim. of Adaptive Behavior
- Littman, M. L.¹

9
- 84898982129
- Predictive representations of state
- Vancouver
- M. L. Littman, R. S. Sutton, and S. Singh. Predictive representations of state. Proc.NIPS-02, Vancouver, 2001.
- (2001) Proc.NIPS-02
- Littman, M. L.¹ Sutton, R. S.² Singh, S.³

10
- 0030168518
- Hidden state and reinforcement learning with instance-based state identification
- R. A. McCallum. Hidden state and reinforcement learning with instance-based state identification. IEEE Transations on Systems, Man, and Cybernetics, 26(3):464-473, 1996.
- (1996) IEEE Transations on Systems, Man, and Cybernetics , vol.26 , Issue.3 , pp. 464-473
- McCallum, R. A.¹

11
- 34247264897
- Technical Report, U.C. Berkeley
- K. Murphy. A survey of POMDP solution techniques. Technical Report, U.C. Berkeley, 2000.
- (2000) A survey of POMDP solution techniques
- Murphy, K.¹

12
- 0036927202
- Greedy linear value-approximation for factored Markov decision processes
- Edmonton
- R. Patrascu, P. Poupart, D. Schuurmans, C. Boutilier, C. Guestrin. Greedy linear value-approximation for factored Markov decision processes. AAAI-02, pp.285-291, Edmonton, 2002.
- (2002) AAAI-02 , pp. 285-291
- Patrascu, R.¹ Poupart, P.² Schuurmans, D.³ Boutilier, C.⁴ Guestrin, C.⁵

13
- 60349109508
- Sufficiency, separability and temporal probabilistic models
- Seattle, WA
- A. Pfeffer. Sufficiency, separability and temporal probabilistic models. Proc. UAI-01, pp.421-428, Seattle, WA, 2001.
- (2001) Proc. UAI-01 , pp. 421-428
- Pfeffer, A.¹

14
- 0036923210
- Piecewise linear value function approximation for factored MDPs
- Edmonton
- P. Poupart, C. Boutilier, R. Patrascu, and D. Schuurmans. Piecewise linear value function approximation for factored MDPs. AAAI-02, pp.292-299, Edmonton, 2002.
- (2002) AAAI-02 , pp. 292-299
- Poupart, P.¹ Boutilier, C.² Patrascu, R.³ Schuurmans, D.⁴

15
- 0003554096
- PWS, Boston
- Y. Saad. Iterative Methods for Sparse Linear Systems. PWS, Boston, 1996.
- (1996) Iterative Methods for Sparse Linear Systems
- Saad, Y.¹

16
- 1542342765
- Direct value-approximation for factored MDPs
- Vancouver
- D. Schuurmans and R. Patrascu. Direct value-approximation for factored MDPs. Proc. NIPS-01, Vancouver, 2001.
- (2001) Proc. NIPS-01
- Schuurmans, D.¹ Patrascu, R.²

17
- 0001808038
- The information bottleneck method
- N. Tishby, F. C. Pereira, and W. Bialek. The information bottleneck method. 37th Annual Allerton Conf. on Comm., Contr. and Computing, pp.368-377, 1999.
- (1999) 37th Annual Allerton Conf. on Comm., Contr. and Computing , pp. 368-377
- Tishby, N.¹ Pereira, F. C.² Bialek, W.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.