SCOPUS Information Search Platform

European Journal of Operational Research

Volume 249, Issue 1, 2016, Pages 22-31

New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system

(4) Ohno, Katsuhisa a Boh, Toshitaka b Nakade, Koichi c Tamura, Takayoshi a

a AICHI INSTITUTE OF TECHNOLOGY (Japan)

b PANASONIC CORPORATION (Japan)

c NAGOYA INSTITUTE OF TECHNOLOGY (Japan)

Author keywords

Approximate dynamic programming algorithms; JIT based production and distribution system; Optimal control; The curses of dimensionality; Undiscounted Markov decision processes

Indexed keywords

COMPUTER SYSTEMS PROGRAMMING; ITERATIVE METHODS; JUST IN TIME PRODUCTION; MARKOV PROCESSES; OPTIMAL CONTROL SYSTEMS; POISSON DISTRIBUTION; STOCHASTIC CONTROL SYSTEMS; STOCHASTIC SYSTEMS;

APPROXIMATE DYNAMIC PROGRAMMING; DISTRIBUTION SYSTEMS; MARKOV DECISION PROCESSES; OPTIMAL CONTROLS; THE CURSES OF DIMENSIONALITY;

DYNAMIC PROGRAMMING;

EID: 84948715868 PISSN: 03772217 EISSN: None Source Type: Journal
DOI: 10.1016/j.ejor.2015.07.026 Document Type: Article

Times cited : (21)

References (30)

1
- 0003787146
- Princeton University Press Princeton
- Bellman R. Dynamic programming 1957 Princeton University Press Princeton
- (1957) Dynamic Programming
- Bellman, R.¹

2
- 0003487482
- Athena Scientific
- Bertsekas D.P., Tsitsiklis J.N. Neuro-dynamic programming 1996 Athena Scientific
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

3
- 79953145727
- Pathologies of temporal difference methods in approximate dynamic programming
- December 2010
- Bertsekas D.P. Pathologies of temporal difference methods in approximate dynamic programming Proceedings of 2010 conference on decision and control, Atlanta, GA, 2010 December 2010
- (2010) Proceedings of 2010 Conference on Decision and Control, Atlanta, GA
- Bertsekas, D.P.¹

4
- 79960439729
- Approximate policy iteration: A survey and some new methods
- Bertsekas D.P. Approximate policy iteration: A survey and some new methods Journal of Control Theory and Applications 9 3 2011 310 335
- (2011) Journal of Control Theory and Applications , vol.9 , Issue.3 , pp. 310-335
- Bertsekas, D.P.¹

5
- 0031095866
- A comparison of production-line control mechanisms
- Bonvik A.M., Couch C.E., Gershwin S.B. A comparison of production-line control mechanisms International Journal of Production Research 35 3 1997 789 804
- (1997) International Journal of Production Research , vol.35 , Issue.3 , pp. 789-804
- Bonvik, A.M.¹ Couch, C.E.² Gershwin, S.B.³

6
- 85046476577
- CRC Press Boca Raton
- Buşoniu L., Babuška R., Schutter B.D., Ernst D. Reinforcement learning and dynamic programming using function approximators 2010 CRC Press Boca Raton
- (2010) Reinforcement Learning and Dynamic Programming Using Function Approximators
- Buşoniu, L.¹ Babuška, R.² Schutter, B.D.³ Ernst, D.⁴

7
- 84889784415
- Springer NY
- Cao X.-R. Stochastic learning and optimization - A sensitivity-based approach 2007 Springer NY
- (2007) Stochastic Learning and Optimization - A Sensitivity-based Approach
- Cao, X.-R.¹

8
- 34547120053
- Springer-Verlag
- Chang H.S., Fu M.C., Hu J., Marcus S.I. Simulation-based algorithms for Markov decision processes 2007 Springer-Verlag
- (2007) Simulation-based Algorithms for Markov Decision Processes
- Chang, H.S.¹ Fu, M.C.² Hu, J.³ Marcus, S.I.⁴

9
- 0000879068
- Optimal policies for multi-echelon inventory problems
- Clark A.J., Scarf H. Optimal policies for multi-echelon inventory problems Management Science 6 1960 475 490
- (1960) Management Science , vol.6 , pp. 475-490
- Clark, A.J.¹ Scarf, H.²

10
- 0038380746
- Convergence of simulation-based policy iteration
- Cooper W.L., Henderson S.G., Lewis M.E. Convergence of simulation-based policy iteration Probability in the Engineering and Informational Sciences 17 2003 213 234
- (2003) Probability in the Engineering and Informational Sciences , vol.17 , pp. 213-234
- Cooper, W.L.¹ Henderson, S.G.² Lewis, M.E.³

11
- 0034466408
- Extended kanban control system: Combining kanban and base stock
- Dallery Y., Liberopoulos G. Extended kanban control system: Combining kanban and base stock IIE Transactions 32 2000 369 386
- (2000) IIE Transactions , vol.32 , pp. 369-386
- Dallery, Y.¹ Liberopoulos, G.²

12
- 0032643313
- Solving semi-Markov decision problems using average reward reinforcement learning
- Das T.K., Gosavi A., Mahadevan S., Marchalleck N. Solving semi-Markov decision problems using average reward reinforcement learning Management Science 45 1999 560 574
- (1999) Management Science , vol.45 , pp. 560-574
- Das, T.K.¹ Gosavi, A.² Mahadevan, S.³ Marchalleck, N.⁴

13
- 84864704233
- Approximate dynamic programming via a smoothed linear program
- Desai V.V., Farias V.F., Moallemi C.C. Approximate dynamic programming via a smoothed linear program Operations Research 60 2012 655 674
- (2012) Operations Research , vol.60 , pp. 655-674
- Desai, V.V.¹ Farias, V.F.² Moallemi, C.C.³

14
- 0036722536
- A reinforcement learning approach to a single-leg airline revenue management problem with multiple fare classes and overbooking
- Gosavi A., Bandla N., Das T.K. A reinforcement learning approach to a single-leg airline revenue management problem with multiple fare classes and overbooking IIE Transactions 34 2002 729 742
- (2002) IIE Transactions , vol.34 , pp. 729-742
- Gosavi, A.¹ Bandla, N.² Das, T.K.³

15
- 84888630832
- Kluwer Academic
- Gosavi A. Simulation-based optimization: Parametric optimization techniques and reinforcement learning 2003 Kluwer Academic
- (2003) Simulation-based Optimization: Parametric Optimization Techniques and Reinforcement Learning
- Gosavi, A.¹

16
- 25144479690
- A simulation-based policy iteration algorithm for average cost unichain Markov decision processes
- M. Laguna J.L.G. Velarde Kluwer Academic
- He Y., Fu M.C., Marcus S.I. A simulation-based policy iteration algorithm for average cost unichain Markov decision processes M. Laguna J.L.G. Velarde Computing tools for modeling, optimization and simulation 2000 Kluwer Academic 161 182
- (2000) Computing Tools for Modeling, Optimization and Simulation , pp. 161-182
- He, Y.¹ Fu, M.C.² Marcus, S.I.³

17
- 79952619478
- Approximate dynamic programming for an inventory problem: Empirical comparison
- Katanyukul T., Duff W.S., Chong E.K.P. Approximate dynamic programming for an inventory problem: Empirical comparison Computers & Industrial Engineering 60 2011 719 743
- (2011) Computers & Industrial Engineering , vol.60 , pp. 719-743
- Katanyukul, T.¹ Duff, W.S.² Chong, E.K.P.³

18
- 84865001599
- 5th ed. CRC Press New York
- Monden Y. Toyota production system 5th ed. 2012 CRC Press New York
- (2012) Toyota Production System
- Monden, Y.¹

19
- 84948710300
- Modified policy iteration algorithm with nonoptimality tests for undiscounted Markov decision process
- Department of Information System and Management Science, Konan University, Japan
- Ohno K. Modified policy iteration algorithm with nonoptimality tests for undiscounted Markov decision process Working Paper 1985 Department of Information System and Management Science, Konan University, Japan
- (1985) Working Paper
- Ohno, K.¹

20
- 0023170103
- Computing optimal policies for controlled tandem queueing systems
- Ohno K., Ichiki K. Computing optimal policies for controlled tandem queueing systems Operations Research 35 1 1987 121 126
- (1987) Operations Research , vol.35 , Issue.1 , pp. 121-126
- Ohno, K.¹ Ichiki, K.²

21
- 0000939673
- Optimal numbers of two kinds of kanbans in a JIT production system
- Ohno K., Nakashima K., Kojima M. Optimal numbers of two kinds of kanbans in a JIT production system International Journal of Production Research 33 5 1995 1387 1401
- (1995) International Journal of Production Research , vol.33 , Issue.5 , pp. 1387-1401
- Ohno, K.¹ Nakashima, K.² Kojima, M.³

22
- 79955605478
- Neuro-dynamic programming algorithms for computing optimal control of production lines
- (in Japanese)
- Ohno K., Yashima K., Ito T. Neuro-dynamic programming algorithms for computing optimal control of production lines Journal of Japan Industrial Management Association 54 5 2003 316 325 (in Japanese)
- (2003) Journal of Japan Industrial Management Association , vol.54 , Issue.5 , pp. 316-325
- Ohno, K.¹ Yashima, K.² Ito, T.³

23
- 25144466638
- An optimal control of a production and distribution system by neuro-dynamic programming and a comparison of pull systems
- (in Japanese)
- Ohno K., Ito T. An optimal control of a production and distribution system by neuro-dynamic programming and a comparison of pull systems Journal of Japan Industrial Management Association 55 4 2004 179 188 (in Japanese)
- (2004) Journal of Japan Industrial Management Association , vol.55 , Issue.4 , pp. 179-188
- Ohno, K.¹ Ito, T.²

24
- 79955642707
- The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems
- Ohno K. The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems European Journal of Operational Research 213 2011 124 133
- (2011) European Journal of Operational Research , vol.213 , pp. 124-133
- Ohno, K.¹

25
- 47349092417
- Wiley-Interscience New Jersey
- Powell W.B. Approximate dynamic programming - Solving the curses of dimensionality 2007 Wiley-Interscience New Jersey
- (2007) Approximate Dynamic Programming - Solving the Curses of Dimensionality
- Powell, W.B.¹

26
- 79960463702
- A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
- Powell W.B., Ma J. A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications Journal of Control Theory and Applications 9 3 2011 336 352
- (2011) Journal of Control Theory and Applications , vol.9 , Issue.3 , pp. 336-352
- Powell, W.B.¹ Ma, J.²

27
- 85102627959
- John Wiley & Sons New York
- Puterman M.L. Markov decision processes: Discrete stochastic dynamic programming 1994 John Wiley & Sons New York
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

28
- 84921399937
- IEEE Press
- Si J., Barto A., Powell W., Wunsch D. Handbook of learning and approximate dynamic programming 2004 IEEE Press
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.² Powell, W.³ Wunsch, D.⁴

29
- 0025430443
- CONWIP: A pull alternative to kanban
- Spearman M.L., Woodruff D.L., Hopp W.J. CONWIP: A pull alternative to kanban International Journal of Production Research 28 5 1990 879 894
- (1990) International Journal of Production Research , vol.28 , Issue.5 , pp. 879-894
- Spearman, M.L.¹ Woodruff, D.L.² Hopp, W.J.³

30
- 0004007508
- MIT Press
- Sutton R.S., Barto A.G. Reinforcement learning 1998 MIT Press
- (1998) Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.