SCOPUS 정보 검색 플랫폼

ICAPS 2007, 17th International Conference on Automated Planning and Scheduling

Volumn , Issue , 2007, Pages 42-48

FF+FPG: Guiding a Policy-Gradient planner

(2) Buffet, Olivier a Aberdeen, Douglas b

a UNIVERSITÉ DE TOULOUSE (France)

b AUSTRALIAN NATIONAL UNIVERSITY (Australia)

Author keywords

[No Author keywords available]

Indexed keywords

IMPORTANCE SAMPLING; STOCHASTIC SYSTEMS; TEACHING;

ABERDEEN; INTERNATIONAL PLANNING COMPETITIONS; LEARNING TIME; PLANNING DOMAINS; POLICY GRADIENT; PROBABILISTICS; RE-PLANNING; REINFORCEMENT LEARNINGS; STOCHASTIC LOCAL SEARCHES; TEACHERS';

REINFORCEMENT LEARNING;

EID: 57749179024 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (17)

References (20)

1
- 85163512494
- Temporal probabilistic planning with policy-gradients
- Aberdeen, D., and Buffet, O. 2007. Temporal probabilistic planning with policy-gradients. In Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling (ICAPS'07).
- (2007) Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling (ICAPS'07)
- Aberdeen, D.¹ Buffet, O.²

2
- 0013495368
- Experiments with infinite-horizon, policy-gradient estimation
- Baxter, J.; Bartlett, P.; and Weaver, L. 2001. Experiments with infinite-horizon, policy-gradient estimation. Journal of Artificial Intelligence Research 15:351-381.
- (2001) Journal of Artificial Intelligence Research , vol.15 , pp. 351-381
- Baxter, J.¹ Bartlett, P.² Weaver, L.³

3
- 78751686224
- The factored policy gradient planner (ipc'06 version)
- Buffet, O., and Aberdeen, D. 2006. The factored policy gradient planner (ipc'06 version). In Proceedings of the Fifth International Planning Competition (IPCS).
- (2006) Proceedings of the Fifth International Planning Competition (IPCS)
- Buffet, O.¹ Aberdeen, D.²

4
- 58349113822
- Approximate policy iteration with a policy language bias
- NIPS'03
- Fern, A.; Yoon, S.; and Givan, R. 2003. Approximate policy iteration with a policy language bias. In Advances in Neural Information Processing Systems 15 (NIPS'03).
- (2003) Advances in Neural Information Processing Systems , vol.15
- Fern, A.¹ Yoon, S.² Givan, R.³

5
- 0001240715
- Importance sampling for stochastic simulations
- Glynn, P., and Iglehart, D. 1989. Importance sampling for stochastic simulations. Management Science 35(11):1367-1392.
- (1989) Management Science , vol.35 , Issue.11 , pp. 1367-1392
- Glynn, P.¹ Iglehart, D.²

6
- 0036377352
- The FF planning system: Fast plan generation through heuristic search
- Hoffmann, J., and Nebel, B. 2001. The FF planning system: Fast plan generation through heuristic search. Journal of Artificial Intelligence Research 14:253-302.
- (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 253-302
- Hoffmann, J.¹ Nebel, B.²

7
- 0035441926
- FF: The fast-forward planning system
- Hoffmann, J. 2001. FF: The fast-forward planning system. AI Magazine 22(3):57-62.
- (2001) AI Magazine , vol.22 , Issue.3 , pp. 57-62
- Hoffmann, J.¹

8
- 58349087845
- Paragraph: A graphplan-based probabilistic planner
- Little, I. 2006. Paragraph: A graphplan-based probabilistic planner. In Proceedings of the Fifth International Planning Competition (IPCS).
- (2006) Proceedings of the Fifth International Planning Competition (IPCS)
- Little, I.¹

9
- 80052601300
- A hybridized planner for stochastic domains
- Mausam; Bertoli, P.; and Weld, D. S. 2007. A hybridized planner for stochastic domains. In Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI'07).
- (2007) Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI'07)
- Mausam¹ Bertoli, P.² Weld, D.S.³

10
- 33746878798
- Exploration in gradient-based reinforcement learning
- Memo 2001-003, MIT, AI lab
- Meuleau, N.; Peshkin, L.; and Kim, K. 2001. Exploration in gradient-based reinforcement learning. Technical Report AI Memo 2001-003, MIT - AI lab.
- (2001) Technical Report AI
- Meuleau, N.¹ Peshkin, L.² Kim, K.³

11
- 18544382314
- Learning from scarce experience
- Peshkin, L., and Shelton, C. 2002. Learning from scarce experience. In Proceedings of the Nineteenth International Conference on Machine Learning (ICML'02).
- (2002) Proceedings of the Nineteenth International Conference on Machine Learning (ICML'02)
- Peshkin, L.¹ Shelton, C.²

12
- 0004080531
- John Wiley & Sons, Inc. New York, NY, USA
- Rubinstein, R. 1981. Simulation and the Monte Carlo Method. John Wiley & Sons, Inc. New York, NY, USA.
- (1981) Simulation and the Monte Carlo Method
- Rubinstein, R.¹

13
- 85163521193
- Probabilistic planning via linear value-approximation of first-order MDPs
- Sanner, S., and Boutilier, C. 2006. Probabilistic planning via linear value-approximation of first-order MDPs. In Proceedings of the Fifth International Planning Competition (IPCS).
- (2006) Proceedings of the Fifth International Planning Competition (IPCS)
- Sanner, S.¹ Boutilier, C.²

14
- 0005942760
- Importance sampling for reinforcement learning with multiple objectives
- Memo 2001-003, MIT AI Lab
- Shelton, C. 2001. Importance sampling for reinforcement learning with multiple objectives. Technical Report AI Memo 2001-003, MIT AI Lab.
- (2001) Technical Report AI
- Shelton, C.¹

15
- 85163528115
- Symbolic stochastic focused dynamic programming with decision diagrams
- Teichteil-Konigsbuch, F., and Fabiani, P. 2006. Symbolic stochastic focused dynamic programming with decision diagrams. In Proceedings of the Fifth International Planning Competition (IPCS).
- (2006) Proceedings of the Fifth International Planning Competition (IPCS)
- Teichteil-Konigsbuch, F.¹ Fabiani, P.²

16
- 0000337576
- Simple statistical gradient-following algorithms for connectionnist reinforcement learning
- Williams, R. 1992. Simple statistical gradient-following algorithms for connectionnist reinforcement learning. Machine Learning 8(3):229-256.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 229-256
- Williams, R.¹

17
- 84880862552
- Discriminative learning of beam-search heuristics for planning
- Xu, Y; Fern, A.; and Yoon, S. 2007. Discriminative learning of beam-search heuristics for planning. In Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI'07).
- (2007) Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI'07)
- Xu, Y.¹ Fern, A.² Yoon, S.³

18
- 85163543018
- Yoon, S.; Fern, A.; and Givan, R. 2004. FF-rePlan. http://www.ecn.purdue. edu/sy/ffreplan.html.
- (2004) FF-rePlan
- Yoon, S.¹ Fern, A.² Givan, R.³

19
- 58349118462
- FF-Replan: A baseline for probabilistic planning
- Yoon, S.; Fern, A.; and Givan, B. 2007. FF-Replan: a baseline for probabilistic planning. In Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling (ICAPS'07).
- (2007) Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling (ICAPS'07)
- Yoon, S.¹ Fern, A.² Givan, B.³

20
- 31144453572
- The first probabilistic track of the international planning competition
- Younes, H. L. S.; Liftman, M. L.; Weissman, D.; and Asmuth, J. 2005. The first probabilistic track of the international planning competition. Journal of Artificial Intelligence Research 24:851-887.
- (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 851-887
- Younes, H.L.S.¹ Liftman, M.L.² Weissman, D.³ Asmuth, J.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.