SCOPUS Information Search Platform

Volume , Issue , 1998, Pages 1015-1021

An improved policy iteration algorithm for partially observable MDPs

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; CONTROLLERS;

DECISION PROCESS; FINITE-STATE CONTROLLERS; INFINITE HORIZONS; POLICY EVALUATION; POLICY ITERATION ALGORITHMS; VALUE ITERATION;

ITERATIVE METHODS;

EID: 84898987770 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (51)

References (10)

1
- 0001432658
- Discounted dynamic programming
- Blackwell, D. (1965) Discounted dynamic programming. Ann. Math. Stat. 36:226-235.
- (1965) Ann. Math. Stat. , vol.36 , pp. 226-235
- Blackwell, D.¹

2
- 0028564629
- Acting optimally in partially observable stochastic domains
- Cassandra, A.; Kaelbling, L.P.; Littman, M.L. (1994) Acting optimally in partially observable stochastic domains. In Proc. 13th National Conf. on AI, 1023-1028.
- (1994) Proc. 13th National Conf. on AI , pp. 1023-1028
- Cassandra, A.¹ Kaelbling, L.P.² Littman, M.L.³

3
- 0001909869
- Incremental pruning: A simple, fast, exact algorithm for partially observable Markov decision processes
- Cassandra, A.; Littman, M.L.; & Zhang, N.L. (1997) Incremental pruning: A simple, fast, exact algorithm for partially observable Markov decision processes. In Proc. 13th Annual Conf. on Uncertainty in AI.
- (1997) Proc. 13th Annual Conf. on Uncertainty in AI
- Cassandra, A.¹ Littman, M.L.² Zhang, N.L.³

4
- 0003659747
- PhD thesis, Department of Computer Science, University of Massachusetts at Amherst
- Hansen, E.A. (1998). Finite-Memory Control of Partially Observable Systems. PhD thesis, Department of Computer Science, University of Massachusetts at Amherst.
- (1998) Finite-Memory Control of Partially Observable Systems
- Hansen, E.A.¹

5
- 0000624333
- Reinforcement learning algorithm for partially observable Markov decision problems
- Jaakkola, T.; Singh, S.P.; & Jordan, M.I. (1995) Reinforcement learning algorithm for partially observable Markov decision problems. In NIPS-7.
- (1995) NIPS-7
- Jaakkola, T.¹ Singh, S.P.² Jordan, M.I.³

7
- 0019909899
- A survey of partially observable Markov decision processes: Theory, models, and algorithms
- Monahan, G.E. (1982) A survey of partially observable Markov decision processes: Theory, models, and algorithms. Management Science 28:1-16.
- (1982) Management Science , vol.28 , pp. 1-16
- Monahan, G.E.¹

8
- 0015658957
- The optimal control of partially observable Markov processes over a finite horizon
- Smallwood, R.D. & Sondik, E.J. (1973) The optimal control of partially observable Markov processes over a finite horizon. Operations Research 21:1071-1088.
- (1973) Operations Research , vol.21 , pp. 1071-1088
- Smallwood, R.D.¹ Sondik, E.J.²

9
- 0003871607
- PhD thesis, Department of Electrical Engineering, Stanford University
- Sondik, E.J. (1971) The Optimal Control of Partially Observable Markov Processes. PhD thesis, Department of Electrical Engineering, Stanford University.
- (1971) The Optimal Control of Partially Observable Markov Processes
- Sondik, E.J.¹

10
- 0017943242
- The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs
- Sondik, E.J. (1978) The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs. Operations Research 26:282-304.
- (1978) Operations Research , vol.26 , pp. 282-304
- Sondik, E.J.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.