메뉴 건너뛰기




Volumn 3, Issue 3, 2007, Pages 429-444

Learning algorithms for finite horizon constrained Markov decision processes

Author keywords

Control of queues; Markov decision processes; Risk management; Supply chains

Indexed keywords


EID: 56649109020     PISSN: 15475816     EISSN: 1553166X     Source Type: Journal    
DOI: 10.3934/jimo.2007.3.429     Document Type: Article
Times cited : (5)

References (20)
  • 4
    • 0346902105 scopus 로고    scopus 로고
    • Two-time scale simultaneous perturbation stochastic approximation using deterministic perturbation sequences
    • S. Bhatnagar, M. C. Fu, S. I. Marcus, and I-J Wang, Two-time scale simultaneous perturbation stochastic approximation using deterministic perturbation sequences, ACM Transactions on Modeling and Computer Simulation, 13, (2003), 180-209.
    • (2003) ACM Transactions on Modeling and Computer Simulation , vol.13 , pp. 180-209
    • Bhatnagar, S.1    Fu, M.C.2    Marcus, S.I.3    Wang, I.-J.4
  • 5
    • 57249094700 scopus 로고    scopus 로고
    • Simulation-Based Optimization Algorithms for Finite Horizon Markov Decision Processes
    • Submitted
    • S. Bhatnagar and M. S. Abdulla, Simulation-Based Optimization Algorithms for Finite Horizon Markov Decision Processes, Submitted, 2006.
    • (2006)
    • Bhatnagar, S.1    Abdulla, M.S.2
  • 6
    • 13244278201 scopus 로고    scopus 로고
    • An Actor-Critic Algorithm for Constrained Markov Decision Processes
    • V. S. Borkar, An Actor-Critic Algorithm for Constrained Markov Decision Processes, Systems & Control Letters, 54 (2005), 207-213.
    • (2005) Systems & Control Letters , vol.54 , pp. 207-213
    • Borkar, V.S.1
  • 7
    • 0031076413 scopus 로고    scopus 로고
    • Stochastic Approximation with Two Time Scales
    • V. S. Borkar, Stochastic Approximation with Two Time Scales, Systems & Control Letters, 29 (1997), 291-294.
    • (1997) Systems & Control Letters , vol.29 , pp. 291-294
    • Borkar, V.S.1
  • 8
    • 13244262450 scopus 로고    scopus 로고
    • Convex Analytic Methods in Markov Decision Processes Analysis
    • eds. E. A. Feinberg and A. Schwartz, Kluwer Academic Publishers, Dordrecht
    • V. S. Borkar, Convex Analytic Methods in Markov Decision Processes Analysis, in "Handbook of Markov Decision Processes" (eds. E. A. Feinberg and A. Schwartz), Kluwer Academic Publishers, Dordrecht, 2001.
    • (2001) Handbook of Markov Decision Processes
    • Borkar, V.S.1
  • 10
    • 0009990403 scopus 로고
    • Some Remarks on Finite Horizon Markovian Decision Models
    • C. Derman and M. Klein, Some Remarks on Finite Horizon Markovian Decision Models, Operations Research, 13 (1965), 272-278.
    • (1965) Operations Research , vol.13 , pp. 272-278
    • Derman, C.1    Klein, M.2
  • 11
    • 0041648459 scopus 로고    scopus 로고
    • E. A. Feinberg and A. Schwartz Editors, Kluwer Academic Publishers, Dordrecht
    • E. A. Feinberg and A. Schwartz (Editors), "Handbook of Markov Decision Processes," Kluwer Academic Publishers, Dordrecht, 2001.
    • (2001) Handbook of Markov Decision Processes
  • 15
    • 79960013704 scopus 로고    scopus 로고
    • A Geometric Approach to Multi-Criterion Reinforcement Learning
    • S. Mannor and N. Shimkin, A Geometric Approach to Multi-Criterion Reinforcement Learning, Journal of Machine Learning Research, 5 (2004), 325-360.
    • (2004) Journal of Machine Learning Research , vol.5 , pp. 325-360
    • Mannor, S.1    Shimkin, N.2
  • 16
    • 57249117063 scopus 로고    scopus 로고
    • Learning Algorithms for Risk Management,
    • M. Tech. Thesis, IEOR Interdisciplinary Programme, IIT Bombay
    • A. K. Mittal, "Learning Algorithms for Risk Management," M. Tech. Thesis, IEOR Interdisciplinary Programme, IIT Bombay, 2005.
    • (2005)
    • Mittal, A.K.1
  • 18
    • 0031131261 scopus 로고    scopus 로고
    • Constrained Optimization via Stochastic Approximation with a Simultaneous Perturbation Gradient Approximation
    • P. Sadhegh, Constrained Optimization via Stochastic Approximation with a Simultaneous Perturbation Gradient Approximation, Automatica, 33 (1997), 889-892.
    • (1997) Automatica , vol.33 , pp. 889-892
    • Sadhegh, P.1
  • 19
    • 0030737152 scopus 로고    scopus 로고
    • A One-Measurement Form of Simultaneous Perturbation Stochastic Approximation
    • J. C. Spall, A One-Measurement Form of Simultaneous Perturbation Stochastic Approximation, Automatica, 33 (1997), 109-112.
    • (1997) Automatica , vol.33 , pp. 109-112
    • Spall, J.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.