SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Advances in Neural Information Processing Systems

Volumn , Issue , 2004, Pages

Approximate policy iteration with a policy language bias

(3) Fern, Alan a Yoon, SungWook a Givan, Robert a

a Purdue University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

MARKOV PROCESSES;

CLASSICAL PLANNING; DOMAIN SPECIFIC; HIGH QUALITY; MARKOV DECISION PROCESSES; POLICY ITERATION; POLICY LANGUAGE;

ITERATIVE METHODS;

EID: 22944468731 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (48)

References (28)

1
- 0036784224
- Using genetic programming to learn and improve control knowledge
- Ricardo Aler, Daniel Borrajo, and Pedro Isasi. Using genetic programming to learn and improve control knowledge. AIJ, 141(1-2):29-56, 2002.
- (2002) AIJ , vol.141 , Issue.1-2 , pp. 29-56
- Aler, R.¹ Borrajo, D.² Isasi, P.³

2
- 0035442648
- The AIPS '00 planning competition
- 3
- Fahiem Bacchus. The AIPS '00 planning competition. AI Magazine, 22(3)(3):57-62, 2001.
- (2001) AI Magazine , vol.22 , Issue.3 , pp. 57-62
- Bacchus, F.¹

3
- 0033897011
- Using temporal logics to express search control knowledge for planning
- Fahiem Bacchus and Froduald Kabanza. Using temporal logics to express search control knowledge for planning. AIJ, 16:123-191, 2000.
- (2000) AIJ , vol.16 , pp. 123-191
- Bacchus, F.¹ Kabanza, F.²

4
- 0003787146
- Princeton University Press
- R. Bellman. Dynamic Programming. Princeton University Press, 1957.
- (1957) Dynamic Programming
- Bellman, R.¹

5
- 0003487482
- Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

6
- 0012352653
- Approximating value trees in structured dynamic programming
- Lorenza Saitta, editor
- Craig Boutilier and Richard Dearden. Approximating value trees in structured dynamic programming. In Lorenza Saitta, editor, ICML, 1996.
- (1996) ICML
- Boutilier, C.¹ Dearden, R.²

7
- 0034248853
- Stochastic dynamic programming with factored representations
- Craig Boutilier, Richard Dearden, and Moises Goldszmidt. Stochastic dynamic programming with factored representations. AIJ, 121(1-2):49-107, 2000.
- (2000) AIJ , vol.121 , Issue.1-2 , pp. 49-107
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

8
- 84880891360
- Symbolic dynamic programming for firstorder MDPs
- Craig Boutilier, Raymond Reiter, and Bob Price. Symbolic dynamic programming for firstorder MDPs. In IJCAI, 2001.
- (2001) IJCAI
- Boutilier, C.¹ Reiter, R.² Price, B.³

9
- 0035312760
- Relational reinforcement learning
- S. Dzeroski, L. De Raedt & K. Driessens. Relational reinforcement learning. MLJ, 43:7-52, 2001.
- (2001) MLJ , vol.43 , pp. 7-52
- Dzeroski, S.¹ De Raedt, L.² Driessens, K.³

10
- 33845987323
- Multi-strategy learning of search control for partialorder planning
- Tara A. Estlin and Raymond J. Mooney. Multi-strategy learning of search control for partialorder planning. In AAAI, 1996.
- (1996) AAAI
- Estlin, T.A.¹ Mooney, R.J.²

11
- 0038517214
- Equivalence notions and model minimization in markov decision processes
- Robert Givan, Thomas Dean, and Matt Greig. Equivalence notions and model minimization in Markov decision processes. AIJ, 147(1-2):163-223, 2003.
- (2003) AIJ , vol.147 , Issue.1-2 , pp. 163-223
- Givan, R.¹ Dean, T.² Greig, M.³

12
- 84880898477
- Max-norm projections for factored MDPs
- Carlos Guestrin, Daphne Koller, and Ronald Parr. Max-norm projections for factored MDPs. In IJCAI, pages 673-680, 2001.
- (2001) IJCAI , pp. 673-680
- Guestrin, C.¹ Koller, D.² Parr, R.³

13
- 0036377352
- The FF planning system: Fast plan generation through heuristic search
- Jorg Hoffmann and Bernhard Nebel. The FF planning system: Fast plan generation through heuristic search. JAIR, 14:263-302, 2001.
- (2001) JAIR , vol.14 , pp. 263-302
- Hoffmann, J.¹ Nebel, B.²

14
- 0003644124
- MIT Press
- R. Howard. Dynamic Programming and Markov Decision Processes. MIT Press, 1960.
- (1960) Dynamic Programming and Markov Decision Processes
- Howard, R.¹

15
- 8344223155
- Learning declarative control rules for constraint-based planning
- Yi-Cheng Huang, Bart Selman, and Henry Kautz. Learning declarative control rules for constraint-based planning. In ICML, pages 415-422, 2000.
- (2000) ICML , pp. 415-422
- Huang, Y.-C.¹ Selman, B.² Kautz, H.³

16
- 0036832951
- A sparse sampling algorithm for nearoptimal planning in large markov decision processes
- Michael J. Kearns, Yishay Mansour, and Andrew Y. Ng. A sparse sampling algorithm for nearoptimal planning in large markov decision processes. MLJ, 49(2-3):193-208, 2002.
- (2002) MLJ , vol.49 , Issue.2-3 , pp. 193-208
- Kearns, M.J.¹ Mansour, Y.² Ng, A.Y.³

17
- 0033189384
- Learning action strategies for planning domains
- Roni Khardon. Learning action strategies for planning domains. AIJ, 113(1-2):125-148, 1999.
- (1999) AIJ , vol.113 , Issue.1-2 , pp. 125-148
- Khardon, R.¹

18
- 1942420814
- Reinforcement learning as classification: Leveraging modern classifiers
- M. Lagoudakis and R. Parr. Reinforcement learning as classification: Leveraging modern classifiers. In ICML, 2003.
- (2003) ICML
- Lagoudakis, M.¹ Parr, R.²

19
- 0038362668
- Learning generalized policies in planning domains using concept languages
- Mario Martin and Hector Geffner. Learning generalized policies in planning domains using concept languages. In KRR, 2000.
- (2000) KRR
- Martin, M.¹ Geffner, H.²

20
- 0027574520
- Taxonomic syntax for 1st-order inference
- D. McAllester & R. Givan. Taxonomic syntax for 1st-order inference. JACM, 40:246-83, 1993.
- (1993) JACM , vol.40 , pp. 246-283
- Mcallester, D.¹ Givan, R.²

21
- 84990622495
- Quantitative results on the utility of explanation-based learning
- S. Minton. Quantitative results on the utility of explanation-based learning. In AAAI, 1988.
- (1988) AAAI
- Minton, S.¹

22
- 0004319350
- MorganKaufmann
- S. Minton, editor. Machine Learning Methods for Planning. MorganKaufmann, 1993.
- (1993) Machine Learning Methods for Planning
- Minton, S.¹

23
- 0024733810
- Explanationbased learning: A problem solving perspective
- S. Minton, J. Carbonell, C. A. Knoblock, D. R. Kuokka, O. Etzioni, and Y. Gil. Explanationbased learning: A problem solving perspective. AIJ, 40:63-118, 1989.
- (1989) AIJ , vol.40 , pp. 63-118
- Minton, S.¹ Carbonell, J.² Knoblock, C.A.³ Kuokka, D.R.⁴ Etzioni, O.⁵ Gil, Y.⁶

24
- 0001046225
- Practical issues in temporal difference learning
- G. Tesauro. Practical issues in temporal difference learning. MLJ, 8:257-277, 1992.
- (1992) MLJ , vol.8 , pp. 257-277
- Tesauro, G.¹

25
- 0001332415
- Online policy improvement via monte-carlo search
- G. Tesauro & G. Galperin. Online policy improvement via monte-carlo search. In NIPS, 1996.
- (1996) NIPS
- Tesauro, G.¹ Galperin, G.²

26
- 0029752470
- Feature-based methods for large scale DP
- J. Tsitsiklis and B. Van Roy. Feature-based methods for large scale DP. MLJ, 22:59-94, 1996.
- (1996) MLJ , vol.22 , pp. 59-94
- Tsitsiklis, J.¹ Van Roy, B.²

27
- 32144443210
- Integrating planning and learning: The PRODIGY architecture
- M. Veloso, J. Carbonell, A. Perez, D. Borrajo, E. Fink, and J. Blythe. Integrating planning and learning: The PRODIGY architecture. Journal of Experimental and Theoretical AI, 7(1), 1995.
- (1995) Journal of Experimental and Theoretical AI , vol.7 , Issue.1
- Veloso, M.¹ Carbonell, J.² Perez, A.³ Borrajo, D.⁴ Fink, E.⁵ Blythe, J.⁶

28
- 13444310066
- Inductive policy selection for first-order MDPs
- S. Yoon, A. Fern, and R. Givan. Inductive policy selection for first-order MDPs. In UAI, 2002.
- (2002) UAI
- Yoon, S.¹ Fern, A.² Givan, R.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.