SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Proceedings of the National Conference on Artificial Intelligence

Volumn , Issue , 2002, Pages 292-299

Piecewise linear value function approximation for factored MDPs

(4) Poupart, Pascal a Boutilier, Craig a Patrascu, Relu b Schuurmans, Dale b

a UNIVERSITY OF TORONTO (Canada)

b UNIVERSITY OF WATERLOO (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION THEORY; DECISION THEORY; OPTIMAL CONTROL SYSTEMS; PIECEWISE LINEAR TECHNIQUES; TREES (MATHEMATICS);

DECISION TREES;

MARKOV PROCESSES;

EID: 0036923210 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (27)

References (18)

1
- 0003487482
- Athena, Belmont, MA
- D. P. Bertsekas and J. N. Tsitsiklis. Neuro-dynamic Programming. Athena, Belmont, MA, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

2
- 84880685295
- Prioritized goal decomposition of Markov decision processes: Toward a synthesis of classical and decision theoretic planning
- Nagoya
- C. Boutilier, R. I. Brafman, and C. Geib. Prioritized goal decomposition of Markov decision processes: Toward a synthesis of classical and decision theoretic planning. In Proc. Fifteenth International Joint Conf. on AI, pp.1156-1162, Nagoya, 1997.
- (1997) Proc. Fifteenth International Joint Conf. on AI , pp. 1156-1162
- Boutilier, C.¹ Brafman, R.I.² Geib, C.³

3
- 85166207010
- Exploiting structure in policy construction
- Montreal
- C. Boutilier, R. Dearden, and M. Goldszmidt. Exploiting structure in policy construction. In Proc. Fourteenth International Joint Conf. on AI, pp.1104-1111, Montreal, 1995.
- (1995) Proc. Fourteenth International Joint Conf. on AI , pp. 1104-1111
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

4
- 0002192119
- Input generalization in delayed reinforcement learning: An algorithm and performance comparisons
- Sydney
- D. Chapman and L. P. Kaelbling. Input generalization in delayed reinforcement learning: An algorithm and performance comparisons. In Proc. Twelfth International Joint Conf. on AI, pp.726-731, Sydney, 1991.
- (1991) Proc. Twelfth International Joint Conf. on AI , pp. 726-731
- Chapman, D.¹ Kaelbling, L.P.²

5
- 0000746330
- Model reduction techniques for computing approximately optimal solutions for Markov decision processes
- Providence, RI
- T. Dean, R. Givan, and S. Leach. Model reduction techniques for computing approximately optimal solutions for Markov decision processes. In Proc. Thirteenth Conf. on Uncertainty in AI, pp.124-131, Providence, RI, 1997.
- (1997) Proc. Thirteenth Conf. on Uncertainty in AI , pp. 124-131
- Dean, T.¹ Givan, R.² Leach, S.³

6
- 84990553353
- A model for reasoning about persistence and causation
- T. Dean and K. Kanazawa. A model for reasoning about persistence and causation. Comput. Intel, 5(3): 142-150, 1989.
- (1989) Comput. Intel , vol.5 , Issue.3 , pp. 142-150
- Dean, T.¹ Kanazawa, K.²

7
- 0030697013
- Abstraction and approximate decision theoretic planning
- R. Dearden and C. Boutilier. Abstraction and approximate decision theoretic planning. Artif. Intel, 89:219-283, 1997.
- (1997) Artif. Intel , vol.89 , pp. 219-283
- Dearden, R.¹ Boutilier, C.²

8
- 84880898477
- Max-norm projections for factored MDPs
- Seattle
- C. Guestrin, D. Koller, and R. Parr. Max-norm projections for factored MDPs. In Proc. Seventeenth International Joint Conf. on AI, pp.673-680, Seattle, 2001.
- (2001) Proc. Seventeenth International Joint Conf. on AI , pp. 673-680
- Guestrin, C.¹ Koller, D.² Parr, R.³

9
- 0012296128
- Multiagent planning with factored MDPs
- Vancouver
- C. Guestrin, D. Koller, and R. Parr. Multiagent planning with factored MDPs. In Advances in Neural Info. Processing Sys. 14 (NIPS-2001), Vancouver, 2001.
- (2001) Advances in Neural Info. Processing Sys. 14 (NIPS-2001)
- Guestrin, C.¹ Koller, D.² Parr, R.³

10
- 0002956570
- SPUDD: Stochastic planning using decision diagrams
- Stockholm
- J. Hoey, R. St-Aubin, A. Hu, and C. Boutilier. SPUDD: Stochastic planning using decision diagrams. In Proc. Fifteenth Conf. on Uncertainty in AI, pp.279-288, Stockholm, 1999.
- (1999) Proc. Fifteenth Conf. on Uncertainty in AI , pp. 279-288
- Hoey, J.¹ St-Aubin, R.² Hu, A.³ Boutilier, C.⁴

11
- 0003644124
- MIT Press, Cambridge
- R. A. Howard. Dynamic Programming and Markov Processes. MIT Press, Cambridge, 1960.
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

12
- 0031632806
- Solving very large weakly coupled Markov decision processes
- Madison, WI
- N. Meuleau, M. Hauskrecht, K. Kim, L. Peshkin, L. P. Kaelbling, T. Dean, and C. Boutilier. Solving very large weakly coupled Markov decision processes. In Proc. Fifteenth National Conf. on AI, pp.165-172, Madison, WI, 1998.
- (1998) Proc. Fifteenth National Conf. on AI , pp. 165-172
- Meuleau, N.¹ Hauskrecht, M.² Kim, K.³ Peshkin, L.⁴ Kaelbling, L.P.⁵ Dean, T.⁶ Boutilier, C.⁷

13
- 0029514510
- The parti-game algorithm for variable resolution reinforcement learning in multidimensional state spaces
- A. W. Moore and C. G. Atkeson. The parti-game algorithm for variable resolution reinforcement learning in multidimensional state spaces. Mach. Learn., 21:199-234, 1995.
- (1995) Mach. Learn. , vol.21 , pp. 199-234
- Moore, A.W.¹ Atkeson, C.G.²

14
- 0036923210
- Piecewise linear value function approximation for factored MDPs
- Edmonton. to appear
- P. Poupart, C. Boutilier, R. Patrascu, and D. Schuurmans. Piecewise linear value function approximation for factored MDPs. In Proc. Eighteenth National Conf. on AI, Edmonton, 2002. to appear.
- (2002) Proc. Eighteenth National Conf. on AI
- Poupart, P.¹ Boutilier, C.² Patrascu, R.³ Schuurmans, D.⁴

15
- 85102627959
- Wiley, New York
- M. L. Puterman. Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

16
- 1542342765
- Direct value approximation for factored MDPs
- Vancouver
- D. Schuurmans and R. Patrascu. Direct value approximation for factored MDPs. In Advances in Neural Info. Processing Sys. 14 (NIPS-2001), Vancouver, 2001.
- (2001) Advances in Neural Info. Processing Sys. 14 (NIPS-2001)
- Schuurmans, D.¹ Patrascu, R.²

17
- 84899022377
- How to dynamically merge Markov decision processes
- MIT Press, Cambridge
- S. P. Singh and D. Cohn. How to dynamically merge Markov decision processes. In Advances in Neural Info. Processing Sys. 10, pp.1057-1063. MIT Press, Cambridge, 1998.
- (1998) Advances in Neural Info. Processing Sys. , vol.10 , pp. 1057-1063
- Singh, S.P.¹ Cohn, D.²

18
- 0029752470
- Feature-based methods for large scale dynamic programming
- J. Tsitsiklis and B. Van Roy. Feature-based methods for large scale dynamic programming. Mach. Learn., 22:59-94, 1996.
- (1996) Mach. Learn. , vol.22 , pp. 59-94
- Tsitsiklis, J.¹ Van Roy, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.