메뉴 건너뛰기




Volumn 34, Issue 12-13, 2003, Pages 717-730

A simulation-based approach to study stochastic inventory-planning games

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTER SIMULATION; DECISION MAKING; GAME THEORY; INTEGER PROGRAMMING; LEARNING SYSTEMS; LINEAR PROGRAMMING; MARKOV PROCESSES; MATHEMATICAL MODELS; MULTI AGENT SYSTEMS; RETAIL STORES; WAREHOUSES;

EID: 1642461607     PISSN: 00207721     EISSN: 14645319     Source Type: Journal    
DOI: 10.1080/00207720310001640755     Document Type: Article
Times cited : (12)

References (18)
  • 1
    • 0003874616 scopus 로고    scopus 로고
    • Learning algorithms for Markov decision processes with average cost
    • Laboratory for Information and Decision Systems, Cambridge, MA: MIT)
    • Abounadi, J., Bertsekas, D., and Borkar, V. S., 1998, Learning algorithms for Markov decision processes with average cost. LIDS-P-2434, Laboratory for Information and Decision Systems (Cambridge, MA: MIT).
    • (1998) LIDS-P-2434
    • Abounadi, J.1    Bertsekas, D.2    Borkar, V.S.3
  • 3
    • 0003787146 scopus 로고
    • Princeton, NJ: Princeton university Press)
    • Bellman, R. E., 1957, Dynamic Programming (Princeton, NJ: Princeton university Press).
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 4
    • 85156187730 scopus 로고    scopus 로고
    • Improving elevator performance using reinforcement learning
    • In D. S. Touretzky, M. C. Mozer, M. E. Hasselmo (eds.), Cambridge, MA: MIT
    • Crites, R., and Barto, A., 1996, Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, M. E. Hasselmo (eds.) Advances in Neural Information Processing Systems 8 (Cambridge, MA: MIT) pp. 1017-1023.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1017-1023
    • Crites, R.1    Barto, A.2
  • 6
    • 0032643313 scopus 로고    scopus 로고
    • Solving semi-Markov decision problems using average reward reinforcement learning
    • Das, T. K., Gosavi, A., Mahadevan, S., and Marchalleck N., 1999, Solving semi-Markov decision problems using average reward reinforcement learning. Management Science, 45, 560-574.
    • (1999) Management Science , vol.45 , pp. 560-574
    • Das, T.K.1    Gosavi, A.2    Mahadevan, S.3    Marchalleck, N.4
  • 7
    • 0038829878 scopus 로고    scopus 로고
    • Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria
    • Erev, I., and Roth, A. E., 1998, Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria. The American Economic Review, 88, 848-881.
    • (1998) The American Economic Review , vol.88 , pp. 848-881
    • Erev, I.1    Roth, A.E.2
  • 10
    • 0036722536 scopus 로고    scopus 로고
    • A reinforcement learning approach to airline seat allocation for multiple fare classes with overbooking
    • (, on Advances on Large Scale Optimization for Logistics, Production, and Manufacturing Systems)
    • Gosavi, A., Bandla, N., and Das, T. K., 2002, A reinforcement learning approach to airline seat allocation for multiple fare classes with overbooking. IIE Transactions on Operations Engineering (SpecialIssue on Advances on Large Scale Optimization for Logistics, Production, and Manufacturing Systems), 34, 729-742.
    • (2002) IIE Transactions on Operations Engineering , vol.34 , pp. 729-742
    • Gosavi, A.1    Bandla, N.2    Das, T.K.3
  • 11
    • 84995317030 scopus 로고    scopus 로고
    • A simulation-based learning automata framework for solving semi-markov decision problems under long-run average cost
    • Gosavi, A., Das, T. K., and Sarkar, S., In press. A simulation-based learning automata framework for solving semi-markov decision problems under long-run average cost. IIE Transactions on Operations Engineering.
    • IIE Transactions on Operations Engineering
    • Gosavi, A.1    Das, T.K.2    Sarkar, S.3
  • 14
    • 0035124331 scopus 로고    scopus 로고
    • Intelligent dynamic control policies for serial production lines
    • Paternina, C. D., and Das, T. K., 2000, Intelligent dynamic control policies for serial production lines. IIE Transactions, 33, pp. 65-77.
    • (2000) IIE Transactions , vol.33 , pp. 65-77
    • Paternina, C.D.1    Das, T.K.2
  • 18
    • 0344545422 scopus 로고    scopus 로고
    • Reinforcement learning for dynamic channel alloction in cellular telephone systems
    • Cambridge, MA: MIT Press)
    • Singh, S., and Bertsekas, D., 1996, Reinforcement learning for dynamic channel alloction in cellular telephone systems. Neural Information Processing Systems (Cambridge, MA: MIT Press).
    • (1996) Neural Information Processing Systems
    • Singh, S.1    Bertsekas, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.