SCOPUS Information Retrieval Platform

Volume 3, Issue 3, 2007, Pages 429-444

Learning algorithms for finite horizon constrained Markov decision processes

Author keywords

Control of queues; Markov decision processes; Risk management; Supply chains

EID: 56649109020
PISSN: 15475816
EISSN: 1553166X
Source Type: Journal
DOI: 10.3934/jimo.2007.3.429
Document Type: Article

Times cited: 5

References (20)

1
- 0003989208
- Chapman & Hall, Boca Raton
- E. Altman, "Constrained Markov Decision Processes," Chapman & Hall, Boca Raton, 1999.
- (1999) Constrained Markov Decision Processes
- Altman, E.¹

2
- 0003565783
- Athena Scientific, Belmont
- D. P. Bertsekas, "Dynamic Programming and Optimal Control," Athena Scientific, Belmont, 1995.
- (1995) Dynamic Programming and Optimal Control
- Bertsekas, D.P.¹

3
- 0003487482
- Athena Scientific, Belmont
- D. P. Bertsekas and J. N. Tsitsiklis, "Neuro-Dynamic Programming," Athena Scientific, Belmont, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

4
- 0346902105
- Two-time scale simultaneous perturbation stochastic approximation using deterministic perturbation sequences
- S. Bhatnagar, M. C. Fu, S. I. Marcus, and I-J Wang, Two-time scale simultaneous perturbation stochastic approximation using deterministic perturbation sequences, ACM Transactions on Modeling and Computer Simulation, 13, (2003), 180-209.
- (2003) ACM Transactions on Modeling and Computer Simulation , vol.13 , pp. 180-209
- Bhatnagar, S.¹ Fu, M.C.² Marcus, S.I.³ Wang, I.-J.⁴

6
- 13244278201
- An Actor-Critic Algorithm for Constrained Markov Decision Processes
- V. S. Borkar, An Actor-Critic Algorithm for Constrained Markov Decision Processes, Systems & Control Letters, 54 (2005), 207-213.
- (2005) Systems & Control Letters , vol.54 , pp. 207-213
- Borkar, V.S.¹

7
- 0031076413
- Stochastic Approximation with Two Time Scales
- V. S. Borkar, Stochastic Approximation with Two Time Scales, Systems & Control Letters, 29 (1997), 291-294.
- (1997) Systems & Control Letters , vol.29 , pp. 291-294
- Borkar, V.S.¹

9
- 57249098720
- Research Report, ISE Dept. University of Florida, 2003
- A. Chekhlov, S. Uryasev and M. Zabarankin, Drawdown Measure in Portfolio Optimization, Research Report 2003-15, ISE Dept. University of Florida, 2003.
- (2003) Drawdown Measure in Portfolio Optimization
- Chekhlov, A.¹ Uryasev, S.² Zabarankin, M.³

10
- 0009990403
- Some Remarks on Finite Horizon Markovian Decision Models
- C. Derman and M. Klein, Some Remarks on Finite Horizon Markovian Decision Models, Operations Research, 13 (1965), 272-278.
- (1965) Operations Research , vol.13 , pp. 272-278
- Derman, C.¹ Klein, M.²

11
- 0041648459
- E. A. Feinberg and A. Schwartz Editors, Kluwer Academic Publishers, Dordrecht
- E. A. Feinberg and A. Schwartz (Editors), "Handbook of Markov Decision Processes," Kluwer Academic Publishers, Dordrecht, 2001.
- (2001) Handbook of Markov Decision Processes

14
- 15044360747
- IEW, Working Papers iewwp122, Institute for Empirical Research in Economics, IEW
- E. D. Giorgi, A Note on Portfolio Selection under Various Risk Measures, IEW - Working Papers iewwp122, Institute for Empirical Research in Economics, IEW.
- A Note on Portfolio Selection under Various Risk Measures
- Giorgi, E.D.¹

15
- 79960013704
- A Geometric Approach to Multi-Criterion Reinforcement Learning
- S. Mannor and N. Shimkin, A Geometric Approach to Multi-Criterion Reinforcement Learning, Journal of Machine Learning Research, 5 (2004), 325-360.
- (2004) Journal of Machine Learning Research , vol.5 , pp. 325-360
- Mannor, S.¹ Shimkin, N.²

17
- 0003998452
- Wiley, New York
- M. L. Puterman, "Markov Decision Processes," Wiley, New York, 1994.
- (1994) Markov Decision Processes
- Puterman, M.L.¹

18
- 0031131261
- Constrained Optimization via Stochastic Approximation with a Simultaneous Perturbation Gradient Approximation
- P. Sadegh, Constrained Optimization via Stochastic Approximation with a Simultaneous Perturbation Gradient Approximation, Automatica, 33 (1997), 889-892.
- (1997) Automatica , vol.33 , pp. 889-892
- Sadegh, P.¹

19
- 0030737152
- A One-Measurement Form of Simultaneous Perturbation Stochastic Approximation
- J. C. Spall, A One-Measurement Form of Simultaneous Perturbation Stochastic Approximation, Automatica, 33 (1997), 109-112.
- (1997) Automatica , vol.33 , pp. 109-112
- Spall, J.C.¹

20
- 0013025914
- Wiley, New York
- J. C. Spall, "Introduction to Stochastic Search and Optimization," Wiley, New York, 2003.
- (2003) Introduction to Stochastic Search and Optimization
- Spall, J.C.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.