-
2
-
-
0030349220
-
Computing optimal policies for partially observable decision processes using compact representations
-
Portland, OR
-
C. Boutilier and D. Poole. Computing optimal policies for partially observable decision processes using compact representations. Proc. AAAI-96, pp.1168-1175, Portland, OR, 1996.
-
(1996)
Proc. AAAI-96
, pp. 1168-1175
-
-
Boutilier, C.1
Poole, D.2
-
3
-
-
84898943953
-
Equivalence notions and model minimization in Markov decision processes
-
to appear
-
R. Givan, T. Dean, and M. Greig. Equivalence notions and model minimization in Markov decision processes. Artificial Intelligence, to appear, 2002.
-
(2002)
Artificial Intelligence
-
-
Givan, R.1
Dean, T.2
Greig, M.3
-
4
-
-
84880898477
-
Max-norm projections for factored MDPs
-
Seattle, WA
-
C. Guestrin, D. Koller, and R. Parr. Max-norm projections for factored MDPs. Proc. IJCAI-01, pp.673-680, Seattle, WA, 2001.
-
(2001)
Proc. IJCAI-01
, pp. 673-680
-
-
Guestrin, C.1
Koller, D.2
Parr, R.3
-
7
-
-
0002956570
-
SPUDD: Stochastic planning using decision diagrams
-
Stockholm
-
J. Hoey, R. St-Aubin, A. Hu, and C. Boutilier. SPUDD: Stochastic planning using decision diagrams. Proc. UAI-99, pp.279-288, Stockholm, 1999.
-
(1999)
Proc. UAI-99
, pp. 279-288
-
-
Hoey, J.1
St-Aubin, R.2
Hu, A.3
Boutilier, C.4
-
8
-
-
0003272035
-
Memoryless policies: theoretical limitations and practical results
-
D. Cliff, P. Husbands, J. Meyer, S. W. Wilson, eds., Cambridge, MIT Press
-
M. L. Littman. Memoryless policies: theoretical limitations and practical results. In D. Cliff, P. Husbands, J. Meyer, S. W. Wilson, eds., Proc. 3rd Intl. Conf. Sim. of Adaptive Behavior, Cambridge, 1994. MIT Press.
-
(1994)
Proc. 3rd Intl. Conf. Sim. of Adaptive Behavior
-
-
Littman, M. L.1
-
10
-
-
0030168518
-
Hidden state and reinforcement learning with instance-based state identification
-
R. A. McCallum. Hidden state and reinforcement learning with instance-based state identification. IEEE Transations on Systems, Man, and Cybernetics, 26(3):464-473, 1996.
-
(1996)
IEEE Transations on Systems, Man, and Cybernetics
, vol.26
, Issue.3
, pp. 464-473
-
-
McCallum, R. A.1
-
12
-
-
0036927202
-
Greedy linear value-approximation for factored Markov decision processes
-
Edmonton
-
R. Patrascu, P. Poupart, D. Schuurmans, C. Boutilier, C. Guestrin. Greedy linear value-approximation for factored Markov decision processes. AAAI-02, pp.285-291, Edmonton, 2002.
-
(2002)
AAAI-02
, pp. 285-291
-
-
Patrascu, R.1
Poupart, P.2
Schuurmans, D.3
Boutilier, C.4
Guestrin, C.5
-
13
-
-
60349109508
-
Sufficiency, separability and temporal probabilistic models
-
Seattle, WA
-
A. Pfeffer. Sufficiency, separability and temporal probabilistic models. Proc. UAI-01, pp.421-428, Seattle, WA, 2001.
-
(2001)
Proc. UAI-01
, pp. 421-428
-
-
Pfeffer, A.1
-
14
-
-
0036923210
-
Piecewise linear value function approximation for factored MDPs
-
Edmonton
-
P. Poupart, C. Boutilier, R. Patrascu, and D. Schuurmans. Piecewise linear value function approximation for factored MDPs. AAAI-02, pp.292-299, Edmonton, 2002.
-
(2002)
AAAI-02
, pp. 292-299
-
-
Poupart, P.1
Boutilier, C.2
Patrascu, R.3
Schuurmans, D.4
-
16
-
-
1542342765
-
Direct value-approximation for factored MDPs
-
Vancouver
-
D. Schuurmans and R. Patrascu. Direct value-approximation for factored MDPs. Proc. NIPS-01, Vancouver, 2001.
-
(2001)
Proc. NIPS-01
-
-
Schuurmans, D.1
Patrascu, R.2
-
17
-
-
0001808038
-
The information bottleneck method
-
N. Tishby, F. C. Pereira, and W. Bialek. The information bottleneck method. 37th Annual Allerton Conf. on Comm., Contr. and Computing, pp.368-377, 1999.
-
(1999)
37th Annual Allerton Conf. on Comm., Contr. and Computing
, pp. 368-377
-
-
Tishby, N.1
Pereira, F. C.2
Bialek, W.3
|