SCOPUS Information Retrieval Platform

ICAPS 2012 - Proceedings of the 22nd International Conference on Automated Planning and Scheduling

Volume , Issue , 2012, Pages 146-154

Reverse iterative deepening for finite-horizon MDPs with large branching factors

(4) Kolobov, Andrey (a); Dai, Peng (a, b); Mausam (a); Weld, Daniel S. (a)

(a) University of Washington (United States)

(b) Google Inc. (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BRANCHING FACTORS; DETERMINIZATION; GOAL-ORIENTED; ITERATIVE DEEPENING; MAXIMIZATION PROBLEM; NATURAL DYNAMICS; OPTIMAL ALGORITHM; PROBABILISTIC PLANNING; TERMINAL STATE; TRANSITION FUNCTIONS;

OPTIMIZATION;

ALGORITHMS;

EID: 84866455769
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (33)

References (16)

1
- 0029210635
- Learning to act using real-time dynamic programming
- Barto, A.; Bradtke, S.; and Singh, S. 1995. Learning to act using real-time dynamic programming. Artificial Intelligence 72:81-138.
- (1995) Artificial Intelligence , vol.72 , pp. 81-138
- Barto, A.¹ Bradtke, S.² Singh, S.³

2
- 85012688561
- Princeton University Press
- Bellman, R. 1957. Dynamic Programming. Princeton University Press.
- (1957) Dynamic Programming
- Bellman, R.¹

3
- 0003565783
- Athena Scientific
- Bertsekas, D. 1995. Dynamic Programming and Optimal Control. Athena Scientific.
- (1995) Dynamic Programming and Optimal Control
- Bertsekas, D.¹

4
- 9444233135
- Labeled RTDP: Improving the convergence of real-time dynamic programming
- Bonet, B., and Geffner, H. 2003. Labeled RTDP: Improving the convergence of real-time dynamic programming. In ICAPS'03, 12-21.
- (2003) ICAPS'03 , pp. 12-21
- Bonet, B.¹ Geffner, H.²

5
- 84863056882
- Bryce, D., and Buffet, O. 2008. International planning competition, uncertainty part: Benchmarks and results. In http://ippc-2008.loria.fr/wiki/images/0703/Results.pdf.
- (2008) International Planning Competition, Uncertainty Part: Benchmarks and Results
- Bryce, D.¹ Buffet, O.²

6
- 33744500784
- Symbolic generalization for on-line planning
- Feng, Z.; Hansen, E. A.; and Zilberstein, S. 2003. Symbolic generalization for on-line planning. In UAI, 109-116.
- (2003) UAI , pp. 109-116
- Feng, Z.¹ Hansen, E.A.² Zilberstein, S.³

7
- 0036377352
- The FF planning system: Fast plan generation through heuristic search
- Hoffmann, J., and Nebel, B. 2001. The FF planning system: Fast plan generation through heuristic search. Journal of Artificial Intelligence Research 14:253-302.
- (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 253-302
- Hoffmann, J.¹ Nebel, B.²

8
- 84866455160
- PROST: Probabilistic Planning Based on UCT
- Keller, T., and Eyerich, P. 2012. PROST: Probabilistic Planning Based on UCT. In ICAPS'12.
- (2012) ICAPS'12
- Keller, T.¹ Eyerich, P.²

9
- 32144451758
- Solving concurrent Markov decision processes
- Mausam, and Weld, D. S. 2004. Solving concurrent Markov decision processes. In AAAI'04.
- (2004) AAAI'04
- Mausam¹ Weld, D.S.²

10
- 0004219017
- Tioga Publishing
- Nilsson, N. 1980. Principles of Artificial Intelligence. Tioga Publishing.
- (1980) Principles of Artificial Intelligence
- Nilsson, N.¹

11
- 33750296380
- Scaling model-based average-reward reinforcement learning for product delivery
- Proper, S., and Tadepalli, P. 2006. Scaling model-based average-reward reinforcement learning for product delivery. In ECML, 735-742.
- (2006) ECML , pp. 735-742
- Proper, S.¹ Tadepalli, P.²

12
- 0003998452
- John Wiley & Sons
- Puterman, M. 1994. Markov Decision Processes. John Wiley & Sons.
- (1994) Markov Decision Processes
- Puterman, M.¹

13
- 84861364904
- Sanner, S. 2010. Relational dynamic influence diagram language (RDDL): Language description. http://users.cecs.anu.edu.au/ssanner/IPPC-2011/RDDL.pdf.
- (2010) Relational Dynamic Influence Diagram Language (RDDL): Language Description
- Sanner, S.¹

14
- 84863844639
- Sanner, S. 2011. ICAPS 2011 international probabilistic planning competition. http://users.cecs.anu.edu.au/ssanner/IPPC-2011/.
- (2011) ICAPS 2011 International Probabilistic Planning Competition
- Sanner, S.¹

15
- 77958580068
- RFF: A robust, FF-based MDP planning algorithm for generating policies with low probability of failure
- Teichteil-Koenigsbuch, F.; Infantes, G.; and Kuter, U. 2008. RFF: A robust, FF-based MDP planning algorithm for generating policies with low probability of failure. In Sixth International Planning Competition at ICAPS'08.
- (2008) Sixth International Planning Competition at ICAPS'08
- Teichteil-Koenigsbuch, F.¹ Infantes, G.² Kuter, U.³

16
- 58349118462
- FF-Replan: A baseline for probabilistic planning
- Yoon, S.; Fern, A.; and Givan, R. 2007. FF-Replan: A baseline for probabilistic planning. In ICAPS'07, 352-359.
- (2007) ICAPS'07 , pp. 352-359
- Yoon, S.¹ Fern, A.² Givan, R.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.