Volume , Issue , 2007, Pages 1860-1865

Topological value iteration algorithm for Markov decision processes

Author keywords

[No Author keywords available]

Indexed keywords

BACKING-UP; GRAPHICAL FEATURES; MARKOV DECISION PROCESSES; REACHABILITY ANALYSIS; TOPOLOGICAL SEQUENCE; VALUE ITERATION; VALUE ITERATION ALGORITHM

EID: 84880882345     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (51)

References (18)
  • 1
    • 0029210635
    • Learning to act using real-time dynamic programming
    • Barto, A.; Bradtke, S.; and Singh, S. 1995. Learning to act using real-time dynamic programming. JAI 72:81-138.
    • (1995) JAI , vol.72 , pp. 81-138
    • Barto, A.1    Bradtke, S.2    Singh, S.3
  • 2
    • 0003787146
    • Princeton, NJ: Princeton University Press
    • Bellman, R. 1957. Dynamic Programming. Princeton, NJ: Princeton University Press.
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 3
    • 84880851659
    • Faster heuristic search algorithms for planning with uncertainty and full feedback
    • Gottlob, G., ed., Acapulco, Mexico: Morgan Kaufmann
    • Bonet, B., and Geffner, H. 2003a. Faster heuristic search algorithms for planning with uncertainty and full feedback. In Gottlob, G., ed., Proc. of IJCAI-03, 1233-1238. Acapulco, Mexico: Morgan Kaufmann.
    • (2003) Proc. of IJCAI-03 , pp. 1233-1238
    • Bonet, B.1    Geffner, H.2
  • 4
    • 9444233135
    • Labeled RTDP: Improving the convergence of real-time dynamic programming
    • Giunchiglia, E.; Muscettola, N.; and Nau, D., eds., Trento, Italy: AAAI Press
    • Bonet, B., and Geffner, H. 2003b. Labeled RTDP: Improving the convergence of real-time dynamic programming. In Giunchiglia, E.; Muscettola, N.; and Nau, D., eds., Proc. 13th ICAPS, 12-21. Trento, Italy: AAAI Press.
    • (2003) Proc. 13th ICAPS , pp. 12-21
    • Bonet, B.1    Geffner, H.2
  • 7
    • 33750713477
    • Symbolic heuristic search for factored Markov decision processes
    • Feng, Z., and Hansen, E. A. 2002. Symbolic heuristic search for factored Markov decision processes. In Proc. of AAAI-02.
    • (2002) Proc. of AAAI-02
    • Feng, Z.1    Hansen, E.A.2
  • 8
    • 33744500784
    • Symbolic generalization for on-line planning
    • Feng, Z.; Hansen, E. A.; and Zilberstein, S. 2003. Symbolic generalization for on-line planning. In Proc. of UAI-03, 209-216.
    • (2003) Proc. of UAI-03 , pp. 209-216
    • Feng, Z.1    Hansen, E.A.2    Zilberstein, S.3
  • 11
    • 0035369425
    • LAO*: A heuristic search algorithm that finds solutions with loops
    • Hansen, E., and Zilberstein, S. 2001. LAO*: A heuristic search algorithm that finds solutions with loops. JAI 129:35-62.
    • (2001) JAI , vol.129 , pp. 35-62
    • Hansen, E.1    Zilberstein, S.2
  • 12
    • 0002956570
    • SPUDD: Stochastic planning using decision diagrams
    • Hoey, J.; St-Aubin, R.; Hu, A.; and Boutilier, C. 1999. SPUDD: Stochastic planning using decision diagrams. In Proc. of UAI-99, 279-288.
    • (1999) Proc. of UAI-99 , pp. 279-288
    • Hoey, J.1    St-Aubin, R.2    Hu, A.3    Boutilier, C.4
  • 14
    • 0002290970
    • On the complexity of solving Markov decision problems
    • Littman, M. L.; Dean, T.; and Kaelbling, L. P. 1995. On the complexity of solving Markov decision problems. In Proc. of UAI-95, 394-402.
    • (1995) Proc. of UAI-95 , pp. 394-402
    • Littman, M.L.1    Dean, T.2    Kaelbling, L.P.3
  • 15
    • 84890267871
    • Fast exact planning in Markov decision processes
    • McMahan, H. B., and Gordon, G. J. 2005. Fast exact planning in Markov decision processes. In Proc. of ICAPS-05.
    • (2005) Proc. of ICAPS-05
    • McMahan, H.B.1    Gordon, G.J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.