-
1
-
-
0003787146
-
-
Princeton University Press Princeton
-
Bellman R. Dynamic programming 1957 Princeton University Press Princeton
-
(1957)
Dynamic Programming
-
-
Bellman, R.1
-
4
-
-
79960439729
-
Approximate policy iteration: A survey and some new methods
-
Bertsekas D.P. Approximate policy iteration: A survey and some new methods Journal of Control Theory and Application 9 3 2011 310 335
-
(2011)
Journal of Control Theory and Application
, vol.9
, Issue.3
, pp. 310-335
-
-
Bertsekas, D.P.1
-
9
-
-
0000879068
-
Optimal policies for multi-echelon inventory problems
-
Clark A.J., Scarf H. Optimal policies for multi-echelon inventory problems Management Science 6 1960 475 490
-
(1960)
Management Science
, vol.6
, pp. 475-490
-
-
Clark, A.J.1
Scarf, H.2
-
11
-
-
0034466408
-
Extended kanban control system: Combining kanban and base stock
-
Dallery Y., Liberopoulos G. Extended kanban control system: Combining kanban and base stock IIE Transactions 32 2000 369 386
-
(2000)
IIE Transactions
, vol.32
, pp. 369-386
-
-
Dallery, Y.1
Liberopoulos, G.2
-
12
-
-
0032643313
-
Solving semi-Markov decision problems using average reward reinforcement learning
-
Das T.K., Gosavi A., Mahadevan S., Marchalleck N. Solving semi-Markov decision problems using average reward reinforcement learning Management Science 45 1999 560 574
-
(1999)
Management Science
, vol.45
, pp. 560-574
-
-
Das, T.K.1
Gosavi, A.2
Mahadevan, S.3
Marchalleck, N.4
-
13
-
-
84864704233
-
Approximate dynamic programming via a smoothed linear program
-
Desai V.V., Farias V.F., Moallemi C.C. Approximate dynamic programming via a smoothed linear program Operations Research 60 2012 655 674
-
(2012)
Operations Research
, vol.60
, pp. 655-674
-
-
Desai, V.V.1
Farias, V.F.2
Moallemi, C.C.3
-
14
-
-
0036722536
-
A reinforcement learning approach to a single-leg airline revenue management problem with multiple fare classes and overbooking
-
Gosavi A., Bandla N., Das T.K. A reinforcement learning approach to a single-leg airline revenue management problem with multiple fare classes and overbooking IIE Transactions 34 2002 729 742
-
(2002)
IIE Transactions
, vol.34
, pp. 729-742
-
-
Gosavi, A.1
Bandla, N.2
Das, T.K.3
-
16
-
-
25144479690
-
A simulation- based policy iteration algorithm for average cost unichain Markov decision processes
-
M. Laguna J.L.G. Velarde Kluwer Academic
-
He Y., Fu M.C., Marcus S.I. A simulation- based policy iteration algorithm for average cost unichain Markov decision processes M. Laguna J.L.G. Velarde Computing tools for modeling, optimization and simulation 2000 Kluwer Academic 161 182
-
(2000)
Computing Tools for Modeling, Optimization and Simulation
, pp. 161-182
-
-
He, Y.1
Fu, M.C.2
Marcus, S.I.3
-
19
-
-
84948710300
-
Modified policy iteration algorithm with nonoptimality tests for undiscounted Markov decision process
-
Department of Information System and Management Science, Konan University, Japan
-
Ohno K. Modified policy iteration algorithm with nonoptimality tests for undiscounted Markov decision process Working Paper 1985 Department of Information System and Management Science, Konan University, Japan
-
(1985)
Working Paper
-
-
Ohno, K.1
-
20
-
-
0023170103
-
Computing optimal policies for controlled tandem queueing systems
-
Ohno K., Ichiki K. Computing optimal policies for controlled tandem queueing systems Operations Research 35 1 1987 121 126
-
(1987)
Operations Research
, vol.35
, Issue.1
, pp. 121-126
-
-
Ohno, K.1
Ichiki, K.2
-
22
-
-
79955605478
-
Neuro-dynamic programming algorithms for computing optimal control of production lines
-
(in Japanese)
-
Ohno K., Yashima K., Ito T. Neuro-dynamic programming algorithms for computing optimal control of production lines Journal of Japan Industrial Management Association 54 5 2003 316 325 (in Japanese)
-
(2003)
Journal of Japan Industrial Management Association
, vol.54
, Issue.5
, pp. 316-325
-
-
Ohno, K.1
Yashima, K.2
Ito, T.3
-
23
-
-
25144466638
-
An optimal control of a production and distribution system by neuro-dynamic programming and a comparison of pull systems
-
(in Japanese)
-
Ohno K., Ito T. An optimal control of a production and distribution system by neuro-dynamic programming and a comparison of pull systems Journal of Japan Industrial Management Association 55 4 2004 179 188 (in Japanese)
-
(2004)
Journal of Japan Industrial Management Association
, vol.55
, Issue.4
, pp. 179-188
-
-
Ohno, K.1
Ito, T.2
-
24
-
-
79955642707
-
The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems
-
Ohno K. The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems European Journal of Operational Research 213 2011 124 133
-
(2011)
European Journal of Operational Research
, vol.213
, pp. 124-133
-
-
Ohno, K.1
-
26
-
-
79960463702
-
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
-
Powell W.B., Ma J. A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications Journal of Control Theory and Applications 9 3 2011 336 352
-
(2011)
Journal of Control Theory and Applications
, vol.9
, Issue.3
, pp. 336-352
-
-
Powell, W.B.1
Ma, J.2
|