SCOPUS Information Retrieval Platform

Annals of Mathematics and Artificial Intelligence

Volume 47, Issue 3-4, 2006, Pages 273-293

Symmetric approximate linear programming for factored MDPs with application to constrained problems

(2) Dolgov, Dmitri A. a; Durfee, Edmund H. b

a TOYOTA TECHNICAL CENTER (United States)

b UNIVERSITY OF MICHIGAN (United States)

Author keywords

Approximate linear programming; Constrained Markov problems; Dual LP; Markov decision processes; Primal LP formulation

Indexed keywords

EID: 33847310802 PISSN: 10122443 EISSN: None Source Type: Journal
DOI: 10.1007/s10472-006-9038-x Document Type: Conference Paper

Times cited : (7)

References (25)

1
- 0009459044
- Constrained Markov decision processes with total cost criteria: Occupation measures and primal LP
- Altman, E.: Constrained Markov decision processes with total cost criteria: occupation measures and primal LP. Methods Models Oper. Res. 43(1), 45-72 (1996)
- (1996) Methods Models Oper. Res , vol.43 , Issue.1 , pp. 45-72
- Altman, E.¹

2
- 1942424978
- Constrained Markov decision processes with total cost criteria: Lagrange approach and dual LP
- Altman, E.: Constrained Markov decision processes with total cost criteria: Lagrange approach and dual LP. Methods Models Oper. Res. 48, 387-417 (1998)
- (1998) Methods Models Oper. Res , vol.48 , pp. 387-417
- Altman, E.¹

3
- 0000235370
- Altman, E., Shwartz, A.: Adaptive control of constrained Markov chains: criteria and policies. Ann. Oper. Res., special issue on Markov Decision Processes 28, 101-134 (1991)

4
- 0003989208
- Chapman & Hall, London, UK
- Altman, E.: Constrained Markov Decision Processes. Chapman & Hall, London, UK (1999)
- (1999) Constrained Markov Decision Processes
- Altman, E.¹

5
- 0004020376
- Princeton University Press, Princeton, NJ
- Bellman, R.: Adaptive Control Processes: A Guided Tour. Princeton University Press, Princeton, NJ (1961)
- (1961) Adaptive Control Processes: A Guided Tour
- Bellman, R.¹

6
- 0004089406
- Academic, New York
- Bertele, U., Brioschi, F.: Nonserial Dynamic Programming. Academic, New York (1972)
- (1972) Nonserial Dynamic Programming
- Bertele, U.¹ Brioschi, F.²

7
- 0003487482
- Athena Scientific, Belmont, MA
- Bertsekas, D.P., Tsitsiklis, J. N.: Neuro-dynamic Programming. Athena Scientific, Belmont, MA (1996)
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

8
- 0003850196
- Athena Scientific, Belmont, MA
- Bertsimas, D., Tsitsiklis, J.N.: Introduction to Linear Optimization. Athena Scientific, Belmont, MA (1997)
- (1997) Introduction to Linear Optimization
- Bertsimas, D.¹ Tsitsiklis, J.N.²

9
- 85166207010
- Exploiting structure in policy construction
- Boutilier, C., Dearden, R., Goldszmidt, M.: Exploiting structure in policy construction. In: Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI-95), pp. 1104-1111 (1995)
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI-95) , pp. 1104-1111
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

10
- 0034248853
- Boutilier, C., Dearden, R., Goldszmidt, M.: Stochastic dynamic programming with factored representations. Artif. Intell. 121(1,2), 49-107 (2000)

11
- 0348090400
- The linear programming approach to approximate dynamic programming
- de Farias, D.P., Van Roy, B.: The linear programming approach to approximate dynamic programming. Oper. Res. 51(6), 850-856 (2003)
- (2003) Oper. Res , vol.51 , Issue.6 , pp. 850-856
- de Farias, D.P.¹ Van Roy, B.²

12
- 5544258192
- On constraint sampling in the linear programming approach to approximate dynamic programming
- de Farias, D.P., Van Roy, B.: On constraint sampling in the linear programming approach to approximate dynamic programming. Math. Oper. Res. 29(3), 462-478 (2004)
- (2004) Math. Oper. Res , vol.29 , Issue.3 , pp. 462-478
- de Farias, D.P.¹ Van Roy, B.²

13
- 84990553353
- A model for reasoning about persistence and causation
- Dean, T., Kanazawa, K.: A model for reasoning about persistence and causation. Comput. Intell. 5(3), 142-150 (1989)
- (1989) Comput. Intell , vol.5 , Issue.3 , pp. 142-150
- Dean, T.¹ Kanazawa, K.²

14
- 4544299571
- Graphical models in local, asymmetric multi-agent Markov decision processes
- Dolgov, D.A., Durfee, E.H.: Graphical models in local, asymmetric multi-agent Markov decision processes. In: Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-04), pp. 956-963 (2004a)
- (2004) Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-04) , pp. 956-963
- Dolgov, D.A.¹ Durfee, E.H.²

15
- 13444251629
- Optimal resource allocation and policy formulation in loosely-coupled Markov decision processes
- Dolgov, D.A., Durfee, E.H.: Optimal resource allocation and policy formulation in loosely-coupled Markov decision processes. In: Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (ICAPS-04), pp. 315-324 (2004b)
- (2004) Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (ICAPS-04) , pp. 315-324
- Dolgov, D.A.¹ Durfee, E.H.²

16
- 4544318426
- Efficient solution algorithms for factored MDPs
- Guestrin, C., Koller, D., Parr, R., Venkataraman, S.: Efficient solution algorithms for factored MDPs. J. Artif. Intell. Res. 19, 399-468 (2003)
- (2003) J. Artif. Intell. Res , vol.19 , pp. 399-468
- Guestrin, C.¹ Koller, D.² Parr, R.³ Venkataraman, S.⁴

17
- 14344256227
- Ph.D. thesis, Computer Science Department, Stanford University
- Guestrin, C.: Planning Under Uncertainty in Complex Structured Environments. Ph.D. thesis, Computer Science Department, Stanford University (2003)
- (2003) Planning Under Uncertainty in Complex Structured Environments
- Guestrin, C.¹

18
- 0003759935
- Math. Centrum, Amsterdam, Holland
- Kallenberg, L.: Linear Programming and Finite Markovian Control Problems. Math. Centrum, Amsterdam, Holland (1983)
- (1983) Linear Programming and Finite Markovian Control Problems
- Kallenberg, L.¹

19
- 84880688552
- Computing factored value functions for policies in structured MDPs
- Koller, D., Parr, R.: Computing factored value functions for policies in structured MDPs. In: Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence IJCAI-99, pp. 1332-1339 (1999)
- (1999) Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence IJCAI-99 , pp. 1332-1339
- Koller, D.¹ Parr, R.²

20
- 0036927202
- Greedy linear value-approximation for factored Markov decision processes
- American Association for Artificial Intelligence, Menlo Park, CA
- Patrascu, R., Poupart, P., Schuurmans, D., Boutilier, C., Guestrin, C.: Greedy linear value-approximation for factored Markov decision processes. In: Eighteenth National Conference on Artificial Intelligence, pp. 285-291. American Association for Artificial Intelligence, Menlo Park, CA (2002)
- (2002) Eighteenth National Conference on Artificial Intelligence , pp. 285-291
- Patrascu, R.¹ Poupart, P.² Schuurmans, D.³ Boutilier, C.⁴ Guestrin, C.⁵

21
- 0036923210
- Piecewise linear value function approximation for factored MDPs
- American Association for Artificial Intelligence, Menlo Park, CA
- Poupart, P., Boutilier, C., Patrascu, R., Schuurmans, D.: Piecewise linear value function approximation for factored MDPs. In: Eighteenth national conference on Artificial Intelligence, pp. 292-299. American Association for Artificial Intelligence, Menlo Park, CA (2002)
- (2002) Eighteenth national conference on Artificial Intelligence , pp. 292-299
- Poupart, P.¹ Boutilier, C.² Patrascu, R.³ Schuurmans, D.⁴

22
- 0003998452
- Wiley, New York
- Puterman, M. L.: Markov Decision Processes. Wiley, New York (1994)
- (1994) Markov Decision Processes
- Puterman, M.L.¹

23
- 1542342765
- Direct value-approximation for factored MDPs
- Schuurmans, D., Patrascu, R.: Direct value-approximation for factored MDPs. In: Proceedings of the Fourteenth Neural Information Processing Systems (NIPS) (2001)
- (2001) Proceedings of the Fourteenth Neural Information Processing Systems (NIPS)
- Schuurmans, D.¹ Patrascu, R.²

24
- 0000273218
- Generalized polynomial approximations in Markovian decision processes
- Schweitzer, P., Seidmann, A.: Generalized polynomial approximations in Markovian decision processes. J. Math. Anal. Appl. 110, 568-582 (1985)
- (1985) J. Math. Anal. Appl , vol.110 , pp. 568-582
- Schweitzer, P.¹ Seidmann, A.²

25
- 0004102479
- MIT Press, Cambridge, MA
- Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA (1998)
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.