1. Altman, E.: Constrained Markov decision processes with total cost criteria: occupation measures and primal LP. Methods Models Oper. Res. 43(1), 45-72 (1996)
2. Altman, E.: Constrained Markov decision processes with total cost criteria: Lagrange approach and dual LP. Methods Models Oper. Res. 48, 387-417 (1998)
3. Altman, E., Shwartz, A.: Adaptive control of constrained Markov chains: criteria and policies. Ann. Oper. Res., special issue on Markov Decision Processes 28, 101-134 (1991)
9. Boutilier, C., Dearden, R., Goldszmidt, M.: Exploiting structure in policy construction. In: Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI-95), pp. 1104-1111 (1995)
10. Boutilier, C., Dearden, R., Goldszmidt, M.: Stochastic dynamic programming with factored representations. Artif. Intell. 121(1,2), 49-107 (2000)
11. de Farias, D.P., Van Roy, B.: The linear programming approach to approximate dynamic programming. Oper. Res. 51(6), 850-856 (2003)
12. de Farias, D.P., Van Roy, B.: On constraint sampling in the linear programming approach to approximate dynamic programming. Math. Oper. Res. 29(3), 462-478 (2004)
13. Dean, T., Kanazawa, K.: A model for reasoning about persistence and causation. Comput. Intell. 5(3), 142-150 (1989)
16. Guestrin, C., Koller, D., Parr, R., Venkataraman, S.: Efficient solution algorithms for factored MDPs. J. Artif. Intell. Res. 19, 399-468 (2003)
20. Patrascu, R., Poupart, P., Schuurmans, D., Boutilier, C., Guestrin, C.: Greedy linear value-approximation for factored Markov decision processes. In: Eighteenth National Conference on Artificial Intelligence, pp. 285-291. American Association for Artificial Intelligence, Menlo Park, CA (2002)
21. Poupart, P., Boutilier, C., Patrascu, R., Schuurmans, D.: Piecewise linear value function approximation for factored MDPs. In: Eighteenth National Conference on Artificial Intelligence, pp. 292-299. American Association for Artificial Intelligence, Menlo Park, CA (2002)
24. Schweitzer, P., Seidmann, A.: Generalized polynomial approximations in Markovian decision processes. J. Math. Anal. Appl. 110, 568-582 (1985)