메뉴 건너뛰기




Volumn 63, Issue 8, 2012, Pages 1165-1173

Comparing reinforcement learning approaches for solving game theoretic models: A dynamic airline pricing game example

Author keywords

air transport; artificial intelligence; game theory; reinforcement learning

Indexed keywords

ARTIFICIAL INTELLIGENCE; COSTS; DECISION MAKING; GAME THEORY; MACHINE LEARNING;

EID: 84863425031     PISSN: 01605682     EISSN: 14769360     Source Type: Journal    
DOI: 10.1057/jors.2011.94     Document Type: Article
Times cited : (16)

References (41)
  • 1
    • 40249113497 scopus 로고    scopus 로고
    • An overview of the issues in the airline industry and the role of optimization models and algorithms
    • DOI 10.1057/palgrave.jors.2602350, PII 2602350
    • Ahmed AH and Poojari CA (2008). An overview of the issues in the airline industry and the role of optimization models and algorithms. J Opl Res Soc 59: 267-277. (Pubitemid 351334690)
    • (2008) Journal of the Operational Research Society , vol.59 , Issue.3 , pp. 267-277
    • Ahmed, A.H.1    Poojari, C.A.2
  • 3
    • 0008556523 scopus 로고
    • On the theory of dynamic programming
    • Bellman R (1952). On the theory of dynamic programming. Proc Natl Acad Sci USA 38: 716-719.
    • (1952) Proc Natl Acad Sci USA , vol.38 , pp. 716-719
    • Bellman, R.1
  • 8
    • 0001699291 scopus 로고
    • Training stochastic model recognition algorithms as networks can lead to maximum mutual information estimation of parameters
    • In: Touretzky DS (ed) Morgan Kaufmann: San Mateo, USA
    • Bridle JS (1990). Training stochastic model recognition algorithms as networks can lead to maximum mutual information estimation of parameters. In: Touretzky DS (ed). Advances in Neural Information Processing Systems: Proceedings of the 1989 Conference. Morgan Kaufmann: San Mateo, USA, pp 211-217.
    • (1990) Advances in Neural Information Processing Systems: Proceedings of the 1989 Conference , pp. 211-217
    • Bridle, J.S.1
  • 9
    • 34247158133 scopus 로고    scopus 로고
    • Drama theory: Dispelling the myths
    • Bryant JW (2007). Drama theory: Dispelling the myths. J Opl Res Soc 58: 602-613.
    • (2007) J Opl Res Soc , vol.58 , pp. 602-613
    • Bryant, J.W.1
  • 10
    • 33748701543 scopus 로고    scopus 로고
    • Settling the complexity of 2-player Nash-Equilibrium
    • (TR05), accessed 1 June 2011
    • Chen X and Deng X (2005). Settling the complexity of 2-player Nash-Equilibrium. In Electronic Colloquium on Computational Complexity, 140(TR05), http://eccc.hpi-web.de/report/2005/140/accessed 1 June 2011.
    • (2005) Electronic Colloquium on Computational Complexity , vol.140
    • Chen, X.1    Deng, X.2
  • 11
    • 37149044142 scopus 로고    scopus 로고
    • Agent-based modelling and simulation of urban evacuation: Relative effectiveness of simultaneous and staged evacuation strategies
    • DOI 10.1057/palgrave.jors.2602321, PII 2602321
    • Chen X and Zhan FB (2008). Agent-based modelling and simulation of urban evacuation: Relative effectiveness of simultaneous and staged evacuation strategies. J Opl Res Soc 59: 25-33. (Pubitemid 350261770)
    • (2008) Journal of the Operational Research Society , vol.59 , Issue.1 , pp. 25-33
    • Chen, X.1    Zhan, F.B.2
  • 14
    • 47049125492 scopus 로고    scopus 로고
    • Dynamic pricing of airline tickets with competition
    • Currie C, Cheng RCH and Smith HK (2008). Dynamic pricing of airline tickets with competition. J Opl Res Soc 59: 1026-1037.
    • (2008) J Opl Res Soc , vol.59 , pp. 1026-1037
    • Currie, C.1    Cheng, R.C.H.2    Smith, H.K.3
  • 19
    • 34249076324 scopus 로고    scopus 로고
    • Game theoretic analysis of the bargaining process over a long-term replenishment contract
    • DOI 10.1057/palgrave.jors.2602183, PII 2602183
    • Kim JS and Kwak TC (2007). Game theoretic analysis of the bargaining process over a long-term replenishment contract. J Opl Res Soc 58: 769-778. (Pubitemid 46782021)
    • (2007) Journal of the Operational Research Society , vol.58 , Issue.6 , pp. 769-778
    • Kim, J.S.1    Kwak, T.C.2
  • 21
    • 0000176346 scopus 로고
    • Equilibrium points of bimatrix games
    • Lemke CE and Howson JJT (1964). Equilibrium points of bimatrix games. SIAM J Appl Math 12: 413-423.
    • (1964) SIAM J Appl Math , vol.12 , pp. 413-423
    • Lemke, C.E.1    Howson, J.J.T.2
  • 22
    • 0035536032 scopus 로고    scopus 로고
    • Learning: Association or computation? Introduction to a special section
    • Leslie AM (2001). Learning: Association or computation? Introduction to a special section. Curr Dir Psychol Sci 10(4): 124-127. (Pubitemid 33388536)
    • (2001) Current Directions in Psychological Science , vol.10 , Issue.4 , pp. 124-127
    • Leslie, A.M.1
  • 23
    • 33645029191 scopus 로고    scopus 로고
    • Individual q-learning in normal form games
    • Leslie DS and Collins EJ (2005). Individual q-learning in normal form games. SIAM J Control Optim 44: 495-414.
    • (2005) SIAM J Control Optim , vol.44 , pp. 495-414
    • Leslie, D.S.1    Collins, E.J.2
  • 25
    • 34249048200 scopus 로고    scopus 로고
    • Multi-agent learning for engineers
    • DOI 10.1016/j.artint.2007.01.003, PII S0004370207000070, Foundations of Multi-Agent Learning
    • Mannor S and Shamma J (2007). Multi-agent learning for engineers. Artificial Intelligence 171: 417-422. (Pubitemid 46802418)
    • (2007) Artificial Intelligence , vol.171 , Issue.7 , pp. 417-422
    • Mannor, S.1    Shamma, J.S.2
  • 26
    • 0002974509 scopus 로고
    • The structure of random utility models
    • Manski CF (1977). The structure of random utility models. Theory and Decision 8: 229-254.
    • (1977) Theory and Decision , vol.8 , pp. 229-254
    • Manski, C.F.1
  • 28
    • 0000827179 scopus 로고
    • BOXES: An experiment in adaptive control
    • In: Dale E and Michie D (eds) Oliver and Boyd: Edinburgh
    • Michie D and Chambers RA (1968). BOXES: An experiment in adaptive control. In: Dale E and Michie D (eds). Machine Intelligence. Vol. 2, Oliver and Boyd: Edinburgh, pp 137-152.
    • (1968) Machine Intelligence. , vol.2 , pp. 137-152
    • Michie, D.1    Chambers, R.A.2
  • 30
    • 0001730497 scopus 로고
    • Non-cooperative games
    • Nash J (1951). Non-cooperative games. Ann Math 54: 286-295.
    • (1951) Ann Math , vol.54 , pp. 286-295
    • Nash, J.1
  • 39
    • 67649370955 scopus 로고    scopus 로고
    • Computing equilibria for two-person games
    • In: Aumann RJ and Hart S (eds) Elsevier: Amsterdam
    • von Stengel B (2002). Computing equilibria for two-person games. In: Aumann RJ and Hart S (eds). Handbook of Game Theory with Economic Applications. Vol. 3, Elsevier: Amsterdam, pp 1723-1759.
    • (2002) Handbook of Game Theory with Economic Applications. , vol.3 , pp. 1723-1759
    • Von Stengel, B.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.