메뉴 건너뛰기




Volumn , Issue , 2002, Pages 273-278

On policy iteration as a Newton's method and polynomial policy iteration algorithms

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; ITERATIVE METHODS; LINEAR PROGRAMMING; MARKOV PROCESSES;

EID: 0036927574     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (15)

References (19)
  • 1
    • 0346942368 scopus 로고    scopus 로고
    • Decision theoretic planning: Structural assumptions and computational leverage
    • [BDH99]
    • [BDH99] C. Boutilier, T. Dean, and S. Hanks. Decision theoretic planning: Structural assumptions and computational leverage. JAIR, pages 157-171, 1999.
    • (1999) JAIR , pp. 157-171
    • Boutilier, C.1    Dean, T.2    Hanks, S.3
  • 3
  • 4
    • 84880898477 scopus 로고    scopus 로고
    • Max-norm projections for factored MDPs
    • [GKP01]
    • [GKP01] C. Guestrin, D. Koller, and R. Parr. Max-norm projections for factored MDPs. In AAAI, pages 673-679, 2001.
    • (2001) AAAI , pp. 673-679
    • Guestrin, C.1    Koller, D.2    Parr, R.3
  • 6
    • 0012251278 scopus 로고
    • On nonterminating stochastic games
    • [HK66]
    • [HK66] A. Hoffman and R. Karp. On nonterminating stochastic games. Management Science, 12(5), 1966.
    • (1966) Management Science , vol.12 , Issue.5
    • Hoffman, A.1    Karp, R.2
  • 7
    • 0028730912 scopus 로고
    • Simple and fast algorithms for linear and integer programs with two variables per inequality
    • [HN94] Dec
    • [HN94] D. S. Hochbaum and J. Naor. Simple and fast algorithms for linear and integer programs with two variables per inequality. SICOMP, 23(6):1179-1192, Dec 1994.
    • (1994) SICOMP , vol.23 , Issue.6 , pp. 1179-1192
    • Hochbaum, D.S.1    Naor, J.2
  • 8
    • 0035369425 scopus 로고    scopus 로고
    • LAO*: A heuristic search algorithm that finds solutions with loops
    • [HZ01]
    • [HZ01] E. Hansen and S. Zilberstein. LAO*: a heuristic search algorithm that finds solutions with loops. Artificial Intelligence, 129:35-62, 2001.
    • (2001) Artificial Intelligence , vol.129 , pp. 35-62
    • Hansen, E.1    Zilberstein, S.2
  • 9
    • 0001391363 scopus 로고
    • A characterization of the minimum cycle mean in a digraph
    • [Kar78]
    • [Kar78] R. M. Karp. A characterization of the minimum cycle mean in a digraph. Discrete Mathematics, 23:309-311, 1978.
    • (1978) Discrete Mathematics , vol.23 , pp. 309-311
    • Karp, R.M.1
  • 13
    • 0012315622 scopus 로고
    • On the complexity of the policy improvement algorithm for Markov decision processes
    • [MC94]
    • [MC94] M. Melekopoglou and A. Condon. On the complexity of the policy improvement algorithm for Markov decision processes. ORSA Journal on Computing, 6(2), 1994.
    • (1994) ORSA Journal on Computing , vol.6 , Issue.2
    • Melekopoglou, M.1    Condon, A.2
  • 14
    • 0000423550 scopus 로고
    • Towards a genuinely polynomial algorithm for linear programming
    • [Meg83]
    • [Meg83] N. Megiddo. Towards a genuinely polynomial algorithm for linear programming. SICOMP, 12(2):347-353,1983.
    • (1983) SICOMP , vol.12 , Issue.2 , pp. 347-353
    • Megiddo, N.1
  • 15
    • 8744295510 scopus 로고    scopus 로고
    • On the complexity of policy iteration
    • [MS99]
    • [MS99] Y. Mansour and S. Singh. On the complexity of policy iteration. In Uncertainty in AI, pages 401-408, 1999.
    • (1999) Uncertainty in AI , pp. 401-408
    • Mansour, Y.1    Singh, S.2
  • 16
    • 85065713623 scopus 로고
    • Newton's method for fractional combinatorial optimization
    • [Rad92]
    • [Rad92] T. Radzik. Newton's method for fractional combinatorial optimization. In 33rd FOCS, pages 659-669, 1992.
    • (1992) 33rd FOCS , pp. 659-669
    • Radzik, T.1
  • 18
    • 0019621772 scopus 로고
    • Deciding linear inequalities by computing loop residues
    • [Sho81]
    • [Sho81] R. Shostack. Deciding linear inequalities by computing loop residues. J. ACM, 28:769-779, 1981.
    • (1981) J. ACM , vol.28 , pp. 769-779
    • Shostack, R.1
  • 19
    • 0032654630 scopus 로고    scopus 로고
    • A polynomial combinatorial algorithm for generalized minimum cost flow
    • [Way99]
    • [Way99] K.D. Wayne. A polynomial combinatorial algorithm for generalized minimum cost flow. In 31st STOC, 1999.
    • (1999) 31st STOC
    • Wayne, K.D.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.