메뉴 건너뛰기




Volumn 36, Issue 4, 2004, Pages 373-385

A reinforcement learning approach to stochastic business games

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPETITION; COMPUTER SIMULATION; DECISION MAKING; ELECTRONIC COMMERCE; INTERNET; INVENTORY CONTROL; LEARNING SYSTEMS; MARKOV PROCESSES; MATRIX ALGEBRA; PROBABILITY; STRATEGIC PLANNING;

EID: 1642315516     PISSN: 0740817X     EISSN: None     Source Type: Journal    
DOI: 10.1080/07408170490278698     Document Type: Article
Times cited : (25)

References (24)
  • 3
    • 0003787146 scopus 로고
    • Princeton University Press, Princeton, NJ
    • Bellman, R.E. (1957) Dynamic Programming, Princeton University Press, Princeton, NJ.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 6
    • 0032643313 scopus 로고    scopus 로고
    • Solving semi-Markov decision problems using average reward reinforcement learning
    • Das, T.K., Gosavi, A., Mahadevan, S. and Marchalleck, N. (1999) Solving semi-Markov decision problems using average reward reinforcement learning. Management Science, 45(4), 560-574.
    • (1999) Management Science , vol.45 , Issue.4 , pp. 560-574
    • Das, T.K.1    Gosavi, A.2    Mahadevan, S.3    Marchalleck, N.4
  • 7
    • 0038829878 scopus 로고    scopus 로고
    • Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria
    • Erev, I. and Roth, A.E. (1998) Predicting how people play games: reinforcement learning in experimental games with unique, mixed strategy equilibria. The American Economic Review, 88(4), 848-881.
    • (1998) The American Economic Review , vol.88 , Issue.4 , pp. 848-881
    • Erev, I.1    Roth, A.E.2
  • 10
    • 0036722536 scopus 로고    scopus 로고
    • A reinforcement learning approach to airline seat allocation for multiple fare classes with over-booking
    • Gosavi, A., Bandla, N. and Das, T.K. (2002) A reinforcement learning approach to airline seat allocation for multiple fare classes with over-booking. IIE Transactions, 34(9), 729-742.
    • (2002) IIE Transactions , vol.34 , Issue.9 , pp. 729-742
    • Gosavi, A.1    Bandla, N.2    Das, T.K.3
  • 12
    • 1642351771 scopus 로고    scopus 로고
    • Learning Nash equilibrium for average reward irreducible stochastic games
    • Department of Industrial and Management Systems Engineering, University of South Florida, Tampa, FL 33620
    • Li, J. and Das, T.K. (2003) Learning Nash equilibrium for average reward irreducible stochastic games. Working paper, Department of Industrial and Management Systems Engineering, University of South Florida, Tampa, FL 33620.
    • (2003) Working Paper
    • Li, J.1    Das, T.K.2
  • 14
    • 0001730497 scopus 로고
    • Non-cooperative games
    • Nash, J.F. (1951) Non-cooperative games. Annals of Mathematics, 54, 286-295.
    • (1951) Annals of Mathematics , vol.54 , pp. 286-295
    • Nash, J.F.1
  • 15
    • 0016594972 scopus 로고
    • On the core of linear production games
    • Owen, G. (1975) On the core of linear production games. Mathamatical Programming, 9, 358-370.
    • (1975) Mathamatical Programming , vol.9 , pp. 358-370
    • Owen, G.1
  • 16
    • 0035124331 scopus 로고    scopus 로고
    • Intelligent dynamic control policies for serial production lines
    • Paternina, C.D. and Das, T.K. (2000) Intelligent dynamic control policies for serial production lines. IIE Transactions, 33(1), 65-77.
    • (2000) IIE Transactions , vol.33 , Issue.1 , pp. 65-77
    • Paternina, C.D.1    Das, T.K.2
  • 20
    • 0346523383 scopus 로고
    • Competitive outcomes in the core of market games
    • The Rand Corporation
    • Shapley, L. and Shubik, M. (1975) Competitive outcomes in the core of market games. Technical report R-1692-NSF, The Rand Corporation.
    • (1975) Technical Report , vol.R-1692-NSF
    • Shapley, L.1    Shubik, M.2
  • 22
    • 0001081294 scopus 로고
    • Simplicial variable dimension algorithms for solving the nonlinear complimentary problem on a product of unit simplices using a general labeling
    • Van der Lann, G., Talman, A.J.J. and Van der Heyden, L. (1987) Simplicial variable dimension algorithms for solving the nonlinear complimentary problem on a product of unit simplices using a general labeling. Mathematics of Operations Research, 377-397.
    • (1987) Mathematics of Operations Research , pp. 377-397
    • Van der Lann, G.1    Talman, A.J.J.2    Van der Heyden, L.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.