1. Abounadi, J., Bertsekas, D., and Borkar, V. S., 1998, Learning algorithms for Markov decision processes with average cost. LIDS-P-2434, Laboratory for Information and Decision Systems (Cambridge, MA: MIT).
2. Anupindi, R., Bassok, Y., and Zemel, E., 2001, A general framework for the study of decentralized distribution systems. Journal of Manufacturing and Service Operations Management, 3, 349-368.
3. Bellman, R. E., 1957, Dynamic Programming (Princeton, NJ: Princeton University Press).
4. Crites, R., and Barto, A., 1996, Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo (eds.), Advances in Neural Information Processing Systems 8 (Cambridge, MA: MIT), pp. 1017-1023.
5. Darken, C., Chang, J., and Moody, J., 1992, Learning rate schedules for faster stochastic gradient search. In D. A. White and D. A. Sofge (eds.), Neural Networks for Signal Processing 2: Proceedings of the 1992 IEEE Workshop (Piscataway, NJ: IEEE Press).
6. Das, T. K., Gosavi, A., Mahadevan, S., and Marchalleck, N., 1999, Solving semi-Markov decision problems using average reward reinforcement learning. Management Science, 45, 560-574.
7. Erev, I., and Roth, A. E., 1998, Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria. The American Economic Review, 88, 848-881.
10. Gosavi, A., Bandla, N., and Das, T. K., 2002, A reinforcement learning approach to airline seat allocation for multiple fare classes with overbooking. IIE Transactions on Operations Engineering (Special Issue on Advances on Large Scale Optimization for Logistics, Production, and Manufacturing Systems), 34, 729-742.
11. Gosavi, A., Das, T. K., and Sarkar, S., in press, A simulation-based learning automata framework for solving semi-Markov decision problems under long-run average cost. IIE Transactions on Operations Engineering.
14. Paternina, C. D., and Das, T. K., 2000, Intelligent dynamic control policies for serial production lines. IIE Transactions, 33, 65-77.
17. Robbins, H., and Monro, S., 1951, A stochastic approximation method. The Annals of Mathematical Statistics, 22, 400-407.
18. Singh, S., and Bertsekas, D., 1996, Reinforcement learning for dynamic channel allocation in cellular telephone systems. Neural Information Processing Systems (Cambridge, MA: MIT Press).