-
4
-
-
0346902105
-
Two-time scale simultaneous perturbation stochastic approximation using deterministic perturbation sequences
-
S. Bhatnagar, M. C. Fu, S. I. Marcus, and I-J Wang, Two-time scale simultaneous perturbation stochastic approximation using deterministic perturbation sequences, ACM Transactions on Modeling and Computer Simulation, 13, (2003), 180-209.
-
(2003)
ACM Transactions on Modeling and Computer Simulation
, vol.13
, pp. 180-209
-
-
Bhatnagar, S.1
Fu, M.C.2
Marcus, S.I.3
Wang, I.-J.4
-
5
-
-
57249094700
-
Simulation-Based Optimization Algorithms for Finite Horizon Markov Decision Processes
-
Submitted
-
S. Bhatnagar and M. S. Abdulla, Simulation-Based Optimization Algorithms for Finite Horizon Markov Decision Processes, Submitted, 2006.
-
(2006)
-
-
Bhatnagar, S.1
Abdulla, M.S.2
-
6
-
-
13244278201
-
An Actor-Critic Algorithm for Constrained Markov Decision Processes
-
V. S. Borkar, An Actor-Critic Algorithm for Constrained Markov Decision Processes, Systems & Control Letters, 54 (2005), 207-213.
-
(2005)
Systems & Control Letters
, vol.54
, pp. 207-213
-
-
Borkar, V.S.1
-
7
-
-
0031076413
-
Stochastic Approximation with Two Time Scales
-
V. S. Borkar, Stochastic Approximation with Two Time Scales, Systems & Control Letters, 29 (1997), 291-294.
-
(1997)
Systems & Control Letters
, vol.29
, pp. 291-294
-
-
Borkar, V.S.1
-
8
-
-
13244262450
-
Convex Analytic Methods in Markov Decision Processes Analysis
-
eds. E. A. Feinberg and A. Schwartz, Kluwer Academic Publishers, Dordrecht
-
V. S. Borkar, Convex Analytic Methods in Markov Decision Processes Analysis, in "Handbook of Markov Decision Processes" (eds. E. A. Feinberg and A. Schwartz), Kluwer Academic Publishers, Dordrecht, 2001.
-
(2001)
Handbook of Markov Decision Processes
-
-
Borkar, V.S.1
-
9
-
-
57249098720
-
-
Research Report, ISE Dept. University of Florida, 2003
-
A. Chekhlov, S. Uryasev and M. Zabarankin, Drawdown Measure in Portfolio Optimization, Research Report 2003-15, ISE Dept. University of Florida, 2003.
-
(1915)
Drawdown Measure in Portfolio Optimization
-
-
Chekhlov, A.1
Uryasev, S.2
Zabarankin, M.3
-
10
-
-
0009990403
-
Some Remarks on Finite Horizon Markovian Decision Models
-
C. Derman and M. Klein, Some Remarks on Finite Horizon Markovian Decision Models, Operations Research, 13 (1965), 272-278.
-
(1965)
Operations Research
, vol.13
, pp. 272-278
-
-
Derman, C.1
Klein, M.2
-
11
-
-
0041648459
-
-
E. A. Feinberg and A. Schwartz Editors, Kluwer Academic Publishers, Dordrecht
-
E. A. Feinberg and A. Schwartz (Editors), "Handbook of Markov Decision Processes," Kluwer Academic Publishers, Dordrecht, 2001.
-
(2001)
Handbook of Markov Decision Processes
-
-
-
12
-
-
0343860991
-
Multi-Criteria Reinforcement Learning
-
Madison, WI
-
Z. Gabor, Z. Kalmar and C. Szepesvari, Multi-Criteria Reinforcement Learning, Proceedings of the Fifteenth International Conference on Machine Learning, Madison, WI, (1998), 197-205.
-
(1998)
Proceedings of the Fifteenth International Conference on Machine Learning
, pp. 197-205
-
-
Gabor, Z.1
Kalmar, Z.2
Szepesvari, C.3
-
13
-
-
39649113100
-
A Learning Rate Analysis of Reinforcement Learning Algorithms in Finite-Horizon
-
Madison, WI
-
F. Garcia and S. M. Nadiaye, A Learning Rate Analysis of Reinforcement Learning Algorithms in Finite-Horizon, Proceedings of the Fifteenth International Conference on Machine Learning, Madison, WI, (1998), 215-223.
-
(1998)
Proceedings of the Fifteenth International Conference on Machine Learning
, pp. 215-223
-
-
Garcia, F.1
Nadiaye, S.M.2
-
15
-
-
79960013704
-
A Geometric Approach to Multi-Criterion Reinforcement Learning
-
S. Mannor and N. Shimkin, A Geometric Approach to Multi-Criterion Reinforcement Learning, Journal of Machine Learning Research, 5 (2004), 325-360.
-
(2004)
Journal of Machine Learning Research
, vol.5
, pp. 325-360
-
-
Mannor, S.1
Shimkin, N.2
-
16
-
-
57249117063
-
Learning Algorithms for Risk Management,
-
M. Tech. Thesis, IEOR Interdisciplinary Programme, IIT Bombay
-
A. K. Mittal, "Learning Algorithms for Risk Management," M. Tech. Thesis, IEOR Interdisciplinary Programme, IIT Bombay, 2005.
-
(2005)
-
-
Mittal, A.K.1
-
18
-
-
0031131261
-
Constrained Optimization via Stochastic Approximation with a Simultaneous Perturbation Gradient Approximation
-
P. Sadhegh, Constrained Optimization via Stochastic Approximation with a Simultaneous Perturbation Gradient Approximation, Automatica, 33 (1997), 889-892.
-
(1997)
Automatica
, vol.33
, pp. 889-892
-
-
Sadhegh, P.1
-
19
-
-
0030737152
-
A One-Measurement Form of Simultaneous Perturbation Stochastic Approximation
-
J. C. Spall, A One-Measurement Form of Simultaneous Perturbation Stochastic Approximation, Automatica, 33 (1997), 109-112.
-
(1997)
Automatica
, vol.33
, pp. 109-112
-
-
Spall, J.C.1
|