



Volume 249, Issue 1, 2016, Pages 22-31

New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system

Author keywords

Approximate dynamic programming algorithms; JIT based production and distribution system; Optimal control; The curses of dimensionality; Undiscounted Markov decision processes

Indexed keywords

COMPUTER SYSTEMS PROGRAMMING; ITERATIVE METHODS; JUST IN TIME PRODUCTION; MARKOV PROCESSES; OPTIMAL CONTROL SYSTEMS; POISSON DISTRIBUTION; STOCHASTIC CONTROL SYSTEMS; STOCHASTIC SYSTEMS

EID: 84948715868     PISSN: 03772217     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.ejor.2015.07.026     Document Type: Article
Times cited : (21)

References (30)
  • 1
    • 0003787146
    • Princeton University Press Princeton
    • Bellman R. Dynamic programming 1957 Princeton University Press Princeton
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 4
    • 79960439729
    • Approximate policy iteration: A survey and some new methods
    • Bertsekas D.P. Approximate policy iteration: A survey and some new methods Journal of Control Theory and Applications 9 3 2011 310 335
    • (2011) Journal of Control Theory and Applications , vol.9 , Issue.3 , pp. 310-335
    • Bertsekas, D.P.1
  • 9
    • 0000879068
    • Optimal policies for multi-echelon inventory problems
    • Clark A.J., Scarf H. Optimal policies for multi-echelon inventory problems Management Science 6 1960 475 490
    • (1960) Management Science , vol.6 , pp. 475-490
    • Clark, A.J.1    Scarf, H.2
  • 11
    • 0034466408
    • Extended kanban control system: Combining kanban and base stock
    • Dallery Y., Liberopoulos G. Extended kanban control system: Combining kanban and base stock IIE Transactions 32 2000 369 386
    • (2000) IIE Transactions , vol.32 , pp. 369-386
    • Dallery, Y.1    Liberopoulos, G.2
  • 12
    • 0032643313
    • Solving semi-Markov decision problems using average reward reinforcement learning
    • Das T.K., Gosavi A., Mahadevan S., Marchalleck N. Solving semi-Markov decision problems using average reward reinforcement learning Management Science 45 1999 560 574
    • (1999) Management Science , vol.45 , pp. 560-574
    • Das, T.K.1    Gosavi, A.2    Mahadevan, S.3    Marchalleck, N.4
  • 13
    • 84864704233
    • Approximate dynamic programming via a smoothed linear program
    • Desai V.V., Farias V.F., Moallemi C.C. Approximate dynamic programming via a smoothed linear program Operations Research 60 2012 655 674
    • (2012) Operations Research , vol.60 , pp. 655-674
    • Desai, V.V.1    Farias, V.F.2    Moallemi, C.C.3
  • 14
    • 0036722536
    • A reinforcement learning approach to a single-leg airline revenue management problem with multiple fare classes and overbooking
    • Gosavi A., Bandla N., Das T.K. A reinforcement learning approach to a single-leg airline revenue management problem with multiple fare classes and overbooking IIE Transactions 34 2002 729 742
    • (2002) IIE Transactions , vol.34 , pp. 729-742
    • Gosavi, A.1    Bandla, N.2    Das, T.K.3
  • 16
    • 25144479690
    • A simulation-based policy iteration algorithm for average cost unichain Markov decision processes
    • M. Laguna J.L.G. Velarde Kluwer Academic
    • He Y., Fu M.C., Marcus S.I. A simulation-based policy iteration algorithm for average cost unichain Markov decision processes M. Laguna J.L.G. Velarde Computing tools for modeling, optimization and simulation 2000 Kluwer Academic 161 182
    • (2000) Computing Tools for Modeling, Optimization and Simulation , pp. 161-182
    • He, Y.1    Fu, M.C.2    Marcus, S.I.3
  • 17
    • 79952619478
    • Approximate dynamic programming for an inventory problem: Empirical comparison
    • Katanyukul T., Duff W.S., Chong E.K.P. Approximate dynamic programming for an inventory problem: Empirical comparison Computers & Industrial Engineering 60 2011 719 743
    • (2011) Computers & Industrial Engineering , vol.60 , pp. 719-743
    • Katanyukul, T.1    Duff, W.S.2    Chong, E.K.P.3
  • 19
    • 84948710300
    • Modified policy iteration algorithm with nonoptimality tests for undiscounted Markov decision process
    • Department of Information System and Management Science, Konan University, Japan
    • Ohno K. Modified policy iteration algorithm with nonoptimality tests for undiscounted Markov decision process Working Paper 1985 Department of Information System and Management Science, Konan University, Japan
    • (1985) Working Paper
    • Ohno, K.1
  • 20
    • 0023170103
    • Computing optimal policies for controlled tandem queueing systems
    • Ohno K., Ichiki K. Computing optimal policies for controlled tandem queueing systems Operations Research 35 1 1987 121 126
    • (1987) Operations Research , vol.35 , Issue.1 , pp. 121-126
    • Ohno, K.1    Ichiki, K.2
  • 22
    • 79955605478
    • Neuro-dynamic programming algorithms for computing optimal control of production lines
    • (in Japanese)
    • Ohno K., Yashima K., Ito T. Neuro-dynamic programming algorithms for computing optimal control of production lines Journal of Japan Industrial Management Association 54 5 2003 316 325 (in Japanese)
    • (2003) Journal of Japan Industrial Management Association , vol.54 , Issue.5 , pp. 316-325
    • Ohno, K.1    Yashima, K.2    Ito, T.3
  • 23
    • 25144466638
    • An optimal control of a production and distribution system by neuro-dynamic programming and a comparison of pull systems
    • (in Japanese)
    • Ohno K., Ito T. An optimal control of a production and distribution system by neuro-dynamic programming and a comparison of pull systems Journal of Japan Industrial Management Association 55 4 2004 179 188 (in Japanese)
    • (2004) Journal of Japan Industrial Management Association , vol.55 , Issue.4 , pp. 179-188
    • Ohno, K.1    Ito, T.2
  • 24
    • 79955642707
    • The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems
    • Ohno K. The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems European Journal of Operational Research 213 2011 124 133
    • (2011) European Journal of Operational Research , vol.213 , pp. 124-133
    • Ohno, K.1
  • 26
    • 79960463702
    • A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
    • Powell W.B., Ma J. A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications Journal of Control Theory and Applications 9 3 2011 336 352
    • (2011) Journal of Control Theory and Applications , vol.9 , Issue.3 , pp. 336-352
    • Powell, W.B.1    Ma, J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.