Volume FS-04-02, 2004, Pages 89-95

On the agenda(s) of research on multi-agent learning

Author keywords

[No Author keywords available]

Indexed keywords

MULTI-AGENT LEARNING; REINFORCEMENT LEARNING; STOCHASTIC GAMES; WELL-DEFINED PROBLEMS

EID: 26444543263     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 9

References (32)
  • 4
    • 0002672918
    • Iterative solution of games by fictitious play
    • New York: John Wiley and Sons
    • Brown, G. 1951. Iterative solution of games by fictitious play. In Activity Analysis of Production and Allocation. New York: John Wiley and Sons.
    • (1951) Activity Analysis of Production and Allocation
    • Brown, G.
  • 5
    • 0036268277
    • Sophisticated EWA learning and strategic teaching in repeated games
    • Camerer, C.; Ho, T.; and Chong, J. 2002. Sophisticated EWA learning and strategic teaching in repeated games. Journal of Economic Theory 104:137-188.
    • (2002) Journal of Economic Theory, vol. 104, pp. 137-188
    • Camerer, C.; Ho, T.; Chong, J.
  • 6
    • 27944508225
    • Playing is believing: The role of beliefs in multi-agent learning
    • Chang, Y.-H., and Kaelbling, L. P. 2001. Playing is believing: The role of beliefs in multi-agent learning. In Proceedings of NIPS.
    • (2001) Proceedings of NIPS
    • Chang, Y.-H.; Kaelbling, L.P.
  • 8
    • 0038829878
    • Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria
    • Erev, I., and Roth, A. E. 1998. Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria. The American Economic Review 88(4):848-881.
    • (1998) The American Economic Review, vol. 88, issue 4, pp. 848-881
    • Erev, I.; Roth, A.E.
  • 12
    • 0001976283
    • Approximation to Bayes risk in repeated plays
    • Hannan, J. F. 1959. Approximation to Bayes risk in repeated plays. Contributions to the Theory of Games 3:97-139.
    • (1959) Contributions to the Theory of Games, vol. 3, pp. 97-139
    • Hannan, J.F.
  • 14
    • 0002550841
    • Learning about other agents in a dynamic multiagent system
    • Hu, J., and Wellman, M. 2001. Learning about other agents in a dynamic multiagent system. Journal of Cognitive Systems Research 2:67-69.
    • (2001) Journal of Cognitive Systems Research, vol. 2, pp. 67-69
    • Hu, J.; Wellman, M.
  • 16
    • 1142305713
    • Learning to play games in extensive form by valuation
    • Jehiel, P., and Samet, D. 2001. Learning to play games in extensive form by valuation. NAJ Economics 3.
    • (2001) NAJ Economics, vol. 3
    • Jehiel, P.; Samet, D.
  • 17
    • 0000221289
    • Rational learning leads to Nash equilibrium
    • Kalai, E., and Lehrer, E. 1993. Rational learning leads to Nash equilibrium. Econometrica 61(5):1019-1045.
    • (1993) Econometrica, vol. 61, issue 5, pp. 1019-1045
    • Kalai, E.; Lehrer, E.
  • 22
    • 33845300407
    • Formulation of Bayesian analysis for games with incomplete information
    • Mertens, J.-F., and Zamir, S. 1985. Formulation of Bayesian analysis for games with incomplete information. International Journal of Game Theory 14:1-29.
    • (1985) International Journal of Game Theory, vol. 14, pp. 1-29
    • Mertens, J.-F.; Zamir, S.
  • 23
    • 0030306234
    • Non-computable strategies and discounted repeated games
    • Nachbar, J. H., and Zame, W. R. 1996. Non-computable strategies and discounted repeated games. Economic Theory 8:103-122.
    • (1996) Economic Theory, vol. 8, pp. 103-122
    • Nachbar, J.H.; Zame, W.R.
  • 24
    • 0000614213
    • Bounded complexity justifies cooperation in finitely repeated prisoner's dilemma
    • Neyman, A. 1985. Bounded complexity justifies cooperation in finitely repeated prisoner's dilemma. Economics Letters 227-229.
    • (1985) Economics Letters, pp. 227-229
    • Neyman, A.
  • 25
    • 0027928808
    • On complexity as bounded rationality
    • Papadimitriou, C., and Yannakakis, M. 1994. On complexity as bounded rationality. In STOC-94, 726-733.
    • (1994) STOC-94, pp. 726-733
    • Papadimitriou, C.; Yannakakis, M.
  • 31
    • 34249833101
    • Technical note: Q-learning
    • Watkins, C. J. C. H., and Dayan, P. 1992. Technical note: Q-learning. Machine Learning 8(3/4):279-292.
    • (1992) Machine Learning, vol. 8, issue 3-4, pp. 279-292
    • Watkins, C.J.C.H.; Dayan, P.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.