메뉴 건너뛰기




Volumn , Issue , 1998, Pages 1015-1021

An improved policy iteration algorithm for partially observable MDPs

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; CONTROLLERS;

EID: 84898987770     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (51)

References (10)
  • 1
    • 0001432658 scopus 로고
    • Discounted dynamic programming
    • Blackwell, D. (1965) Discounted dynamic programming. Ann. Math. Stat. 36:226- 235.
    • (1965) Ann. Math. Stat. , vol.36 , pp. 226-235
    • Blackwell, D.1
  • 3
    • 0001909869 scopus 로고    scopus 로고
    • Incremental pruning: A simple, fast, exact algorithm for partially observable Markov decision processes
    • Cassandra, A.; Littman, M.L.; & Zhang, N.L. (1997) Incremental pruning: A simple, fast, exact algorithm for partially observable Markov decision processes. In Proc. 13th Annual Conf. on Uncertainty in AL.
    • (1997) Proc. 13th Annual Conf. on Uncertainty in AL
    • Cassandra, A.1    Littman, M.L.2    Zhang, N.L.3
  • 5
    • 0000624333 scopus 로고
    • Reinforcement learning algorithm for partially observable Markov decision problems
    • Jaakkola, T.; Singh, S.P.; &: Jordan, M.I. (1995) Reinforcement learning algorithm for partially observable Markov decision problems. In NIPS-7.
    • (1995) NIPS-7
    • Jaakkola, T.1    Singh, S.P.2    Jordan, M.I.3
  • 7
    • 0019909899 scopus 로고
    • A survey of partially observable Markov decision processes: Theory, models, and algorithms
    • Monahan, G.E. (1982) A survey of partially observable Markov decision processes: Theory, models, and algorithms. Management Science 28:1-16.
    • (1982) Management Science , vol.28 , pp. 1-16
    • Monahan, G.E.1
  • 8
    • 0015658957 scopus 로고
    • The optimal control of partially observable Markov processes over a finite horizon
    • Small wood, R.D. & Sondik, E.J. (1973) The optimal control of partially observable Markov processes over a finite horizon. Operations Research 21:1071-1088.
    • (1973) Operations Research , vol.21 , pp. 1071-1088
    • Small Wood, R.D.1    Sondik, E.J.2
  • 10
    • 0017943242 scopus 로고
    • The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs
    • Sondik, E.J. (1978) The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs. Operations Research 26:282-304.
    • (1978) Operations Research , vol.26 , pp. 282-304
    • Sondik, E.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.