SCOPUS 정보 검색 플랫폼

Volumn 52, Issue 4, 2007, Pages 677-681

Partially observable markov decision processes with reward information: Basic ideas and models

Author keywords

Partially observable Markov decision process (POMDP); Reward information policy

Indexed keywords

DECISION THEORY; MARKOV PROCESSES; MATHEMATICAL MODELS; PROBABILITY DISTRIBUTIONS; STATE ESTIMATION;

PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; REWARD-INFORMATION POLICY;

CONTROL THEORY;

EID: 34247229563 PISSN: 00189286 EISSN: None Source Type: Journal
DOI: 10.1109/TAC.2007.894520 Document Type: Article

Times cited : (14)

References (15)

2
- 0027557742
- Discrete-time controlled Markov processes with average cost criterion: A survey
- A. Arapostathis, V. S. Borkar, E. Fernandez-Gaucherand, M. K. Ghosh, and S. I. Markus, "Discrete-time controlled Markov processes with average cost criterion: A survey," SIAM J. Control Optim., vol. 31, pp. 282-344, 1993.
- (1993) SIAM J. Control Optim , vol.31 , pp. 282-344
- Arapostathis, A.¹ Borkar, V.S.² Fernandez-Gaucherand, E.³ Ghosh, M.K.⁴ Markus, S.I.⁵

3
- 0034437507
- Average cost dynamic programming equations for controlled Markov chains with partial observations
- V. S. Borkar, "Average cost dynamic programming equations for controlled Markov chains with partial observations," SIAM J. Control Optim., vol. 39, pp. 673-681, 2001.
- (2001) SIAM J. Control Optim , vol.39 , pp. 673-681
- Borkar, V.S.¹

4
- 3843150404
- A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: Multichain cases
- X.-R. Cao and X. P. Guo, "A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: Multichain cases," Automatica, vol. 40, pp. 1749-1759, 2004.
- (2004) Automatica , vol.40 , pp. 1749-1759
- Cao, X.-R.¹ Guo, X.P.²

5
- 33244489385
- Optimal control of ergodic continuous-time Markov chains with average sample-path rewards
- X. P. Guo and X.-R Cao, "Optimal control of ergodic continuous-time Markov chains with average sample-path rewards," SIAM J. Control Optim., vol. 44, pp. 29-48, 2005.
- (2005) SIAM J. Control Optim , vol.44 , pp. 29-48
- Guo, X.P.¹ Cao, X.-R.²

6
- 0003952176
- New York: Springer-Verlag
- O. Hernández-Lerma and J. B. Lasserre, Further Topics on Discrete-Time Markov Control Processes. New York: Springer-Verlag, 1999.
- (1999) Further Topics on Discrete-Time Markov Control Processes
- Hernández-Lerma, O.¹ Lasserre, J.B.²

7
- 0036112835
- Limiting discounted-cost control of partially observable stochastic systems
- O. Hernández-Lerma and R. Romera, "Limiting discounted-cost control of partially observable stochastic systems," SIAM J. Control Optim., vol. 40, pp. 348-369, 2001.
- (2001) SIAM J. Control Optim , vol.40 , pp. 348-369
- Hernández-Lerma, O.¹ Romera, R.²

9
- 0002679852
- A survey of algorithmic results for partially observable Markov decision processes
- W. S. Lovejoy, "A survey of algorithmic results for partially observable Markov decision processes," Ann. Oper. Res., vol. 35, pp. 47-66, 1991.
- (1991) Ann. Oper. Res , vol.35 , pp. 47-66
- Lovejoy, W.S.¹

10
- 0006034218
- An optimal inspection and replacement policy under incomplete state information
- M. Ohnish, H. Kawai, and H. Mine, "An optimal inspection and replacement policy under incomplete state information," Eur. J. Oper. Res., vol. 27, pp. 117-128, 1986.
- (1986) Eur. J. Oper. Res , vol.27 , pp. 117-128
- Ohnish, M.¹ Kawai, H.² Mine, H.³

11
- 85102627959
- New York: Wiley
- M. L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming. New York: Wiley, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

12
- 33646420905
- Constrained ordinal optimization - A feasibility based approach
- C. Song, X. H. Guan, and Y. C. Ho, "Constrained ordinal optimization - A feasibility based approach," Discrete Event Dyna. Syst.: Theory Appl., vol. 16, pp. 279-299, 2006.
- (2006) Discrete Event Dyna. Syst.: Theory Appl , vol.16 , pp. 279-299
- Song, C.¹ Guan, X.H.² Ho, Y.C.³

13
- 14244259416
- Bonds on optimal cost for a replacement problem with partial observation
- C. C. White, "Bonds on optimal cost for a replacement problem with partial observation," Naval Res. Logist. Quart., vol. 26, pp. 415-422, 1979.
- (1979) Naval Res. Logist. Quart , vol.26 , pp. 415-422
- White, C.C.¹

14
- 0008632494
- Discrete-time Markovian decision processes with incomplete state observation
- S. Yoshikazu and Y. Tsuneo, "Discrete-time Markovian decision processes with incomplete state observation," Ann. Math. Statist., vol. 41, pp.78-86, 1970.
- (1970) Ann. Math. Statist , vol.41 , pp. 78-86
- Yoshikazu, S.¹ Tsuneo, Y.²

15
- 17644387624
- Vector ordinal optimization
- Q. C. Zhao, Y. C. Ho, and Q. S. Jia, "Vector ordinal optimization," J. Optim. Theory Appl., vol. 125, pp. 259-274, 2005.
- (2005) J. Optim. Theory Appl , vol.125 , pp. 259-274
- Zhao, Q.C.¹ Ho, Y.C.² Jia, Q.S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.