2012, Pages 146-154

Reverse iterative deepening for finite-horizon MDPs with large branching factors

Author keywords

[No Author keywords available]

Indexed keywords

BRANCHING FACTORS; DETERMINIZATION; GOAL-ORIENTED; ITERATIVE DEEPENING; MAXIMIZATION PROBLEM; NATURAL DYNAMICS; OPTIMAL ALGORITHM; PROBABILISTIC PLANNING; TERMINAL STATE; TRANSITION FUNCTIONS;

EID: 84866455769     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 33

References (16)
  • 1
    • 0029210635
    • Learning to act using real-time dynamic programming
    • Barto, A.; Bradtke, S.; and Singh, S. 1995. Learning to act using real-time dynamic programming. Artificial Intelligence 72:81-138.
    • (1995) Artificial Intelligence , vol.72 , pp. 81-138
    • Barto, A.1    Bradtke, S.2    Singh, S.3
  • 4
    • 9444233135
    • Labeled RTDP: Improving the convergence of real-time dynamic programming
    • Bonet, B., and Geffner, H. 2003. Labeled RTDP: Improving the convergence of real-time dynamic programming. In ICAPS'03, 12-21.
    • (2003) ICAPS'03 , pp. 12-21
    • Bonet, B.1    Geffner, H.2
  • 6
    • 33744500784
    • Symbolic generalization for on-line planning
    • Feng, Z.; Hansen, E. A.; and Zilberstein, S. 2003. Symbolic generalization for on-line planning. In UAI, 109-116.
    • (2003) UAI , pp. 109-116
    • Feng, Z.1    Hansen, E.A.2    Zilberstein, S.3
  • 7
    • 0036377352
    • The FF planning system: Fast plan generation through heuristic search
    • Hoffmann, J., and Nebel, B. 2001. The FF planning system: Fast plan generation through heuristic search. Journal of Artificial Intelligence Research 14:253-302.
    • (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 253-302
    • Hoffmann, J.1    Nebel, B.2
  • 8
    • 84866455160
    • PROST: Probabilistic Planning Based on UCT
    • Keller, T., and Eyerich, P. 2012. PROST: Probabilistic Planning Based on UCT. In ICAPS'12.
    • (2012) ICAPS'12
    • Keller, T.1    Eyerich, P.2
  • 9
    • 32144451758
    • Solving concurrent Markov decision processes
    • Mausam, and Weld, D. S. 2004. Solving concurrent Markov decision processes. In AAAI'04.
    • (2004) AAAI'04
    • Mausam1    Weld, D.S.2
  • 11
    • 33750296380
    • Scaling model-based average-reward reinforcement learning for product delivery
    • Proper, S., and Tadepalli, P. 2006. Scaling model-based average-reward reinforcement learning for product delivery. In ECML, 735-742.
    • (2006) ECML , pp. 735-742
    • Proper, S.1    Tadepalli, P.2
  • 16
    • 58349118462
    • FF-Replan: A baseline for probabilistic planning
    • Yoon, S.; Fern, A.; and Givan, R. 2007. FF-Replan: A baseline for probabilistic planning. In ICAPS'07, 352-359.
    • (2007) ICAPS'07 , pp. 352-359
    • Yoon, S.1    Fern, A.2    Givan, R.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.