메뉴 건너뛰기




Volumn 32, Issue 2, 2007, Pages 365-373

On the speed of convergence of value iteration on stochastic shortest-path problems

Author keywords

Markov decision processes; Stochastic shortest path problems; Value iteration

Indexed keywords

ALGORITHMS; CONVERGENCE OF NUMERICAL METHODS; POLYNOMIALS; RANDOM PROCESSES; VECTORS;

EID: 38549129333     PISSN: 0364765X     EISSN: 15265471     Source Type: Journal    
DOI: 10.1287/moor.1060.0238     Document Type: Article
Times cited : (28)

References (14)
  • 1
    • 0029210635 scopus 로고
    • Learning to act using real-time dynamic programming
    • Barto, A., S. Bradtke, S. Singh. 1995. Learning to act using real-time dynamic programming. Artificial Intelligence 72 81-138.
    • (1995) Artificial Intelligence , vol.72 , pp. 81-138
    • Barto, A.1    Bradtke, S.2    Singh, S.3
  • 2
    • 0003787146 scopus 로고
    • Princeton University Press, Princeton, NJ
    • Bellman, R. 1957. Dynamic Programming. Princeton University Press, Princeton, NJ.
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 5
    • 0003161907 scopus 로고
    • An analysis of stochastic shortest-path problems
    • Bertsekas, D., J. Tsitsiklis. 1991. An analysis of stochastic shortest-path problems. Math. Oper. Res. 16 580-595.
    • (1991) Math. Oper. Res , vol.16 , pp. 580-595
    • Bertsekas, D.1    Tsitsiklis, J.2
  • 7
    • 0022094206 scopus 로고
    • Generalized best-first search strategies and the optimality of A*
    • Mach
    • Dechter, R., J. Pearl. 1985. Generalized best-first search strategies and the optimality of A*. J. Assoc. Comput. Mach. 32(3) 505-536.
    • (1985) J. Assoc. Comput , vol.32 , Issue.3 , pp. 505-536
    • Dechter, R.1    Pearl, J.2
  • 8
    • 84959258302 scopus 로고
    • Optimal pursuit strategies in discrete state probabilistic systems
    • Eaton, J. H., L. A. Zadeh. 1962. Optimal pursuit strategies in discrete state probabilistic systems. Trans. ASME, Series D, J. Basic Engrg. 84 23-29.
    • (1962) Trans. ASME, Series D, J. Basic Engrg , vol.84 , pp. 23-29
    • Eaton, J.H.1    Zadeh, L.A.2
  • 9
    • 84899829959 scopus 로고
    • A formal basis for the heuristic determination of minimum cost paths
    • Hart, P., N. Nilsson, B. Raphael. 1968. A formal basis for the heuristic determination of minimum cost paths. IEEE Trans. Systems Sci. Cybernetics 4 100-107.
    • (1968) IEEE Trans. Systems Sci. Cybernetics , vol.4 , pp. 100-107
    • Hart, P.1    Nilsson, N.2    Raphael, B.3
  • 10
    • 0035369426 scopus 로고    scopus 로고
    • Minimax real-time heuristic search
    • Koenig, S. 2001. Minimax real-time heuristic search. Artificial Intelligence 129 165-197.
    • (2001) Artificial Intelligence , vol.129 , pp. 165-197
    • Koenig, S.1
  • 11
    • 0022129301 scopus 로고
    • Depth-first iterative-deepening: An optimal admissible tree search
    • Korf, R. 1985. Depth-first iterative-deepening: An optimal admissible tree search. Artificial Intelligence 27(1) 97-109.
    • (1985) Artificial Intelligence , vol.27 , Issue.1 , pp. 97-109
    • Korf, R.1
  • 12
    • 0004286550 scopus 로고
    • Morgan Kaufmann, San Francisco, CA
    • Pearl, J. 1983. Heuristics. Morgan Kaufmann, San Francisco, CA.
    • (1983) Heuristics
    • Pearl, J.1
  • 14
    • 0025491302 scopus 로고
    • Solving H-horizon, stationary Markov decision problems in time proportional to log(H)
    • Tseng, P. 1990. Solving H-horizon, stationary Markov decision problems in time proportional to log(H). Oper. Res. Lett. 9 289-297.
    • (1990) Oper. Res. Lett , vol.9 , pp. 289-297
    • Tseng, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.