메뉴 건너뛰기




Volumn , Issue , 2005, Pages 817-822

Learning against opponents with bounded memory

Author keywords

[No Author keywords available]

Indexed keywords

BEST RESPONSE; BOUNDED MEMORY; EMPIRICAL TEST; MULTI-AGENT SETTING; SELF-PLAY;

EID: 33745609272     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (85)

References (25)
  • 2
    • 0036531878 scopus 로고    scopus 로고
    • Multiagent learning using a variable learning rate
    • Michael Bowling and Manuela Veloso. Multiagent learning using a variable learning rate. Artificial Intelligence, 136:215-250, 2002.
    • (2002) Artificial Intelligence , vol.136 , pp. 215-250
    • Bowling, M.1    Veloso, M.2
  • 4
    • 0002672918 scopus 로고
    • Iterative solution of games by fictitious play
    • John Wiley and Sons, New York
    • George Brown. Iterative solution of games by fictitious play. In Activity Analysis of Production and Allocation. John Wiley and Sons, New York, 1951.
    • (1951) Activity Analysis of Production and Allocation
    • Brown, G.1
  • 8
    • 1942421183 scopus 로고    scopus 로고
    • Awesome: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
    • Vincent Conitzer and Tuomas Sandholm. Awesome: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents. In Proceedings of the 20th International Conference on Machine Learning, pages 83-90, 2003.
    • (2003) Proceedings of the 20th International Conference on Machine Learning , pp. 83-90
    • Conitzer, V.1    Sandholm, T.2
  • 12
    • 0000908510 scopus 로고    scopus 로고
    • A simple adaptive procedure leading to correlated equilibrium
    • Sergiu Hart and Andreu Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127-1150, 2000.
    • (2000) Econometrica , vol.68 , pp. 1127-1150
    • Hart, S.1    Mas-Colell, A.2
  • 13
    • 0001069505 scopus 로고
    • On the distribution of the number of successes in independent trials
    • Wassily Hoeffding. On the distribution of the number of successes in independent trials. Annals of Mathematical Statistics, 27:713-721, 1956.
    • (1956) Annals of Mathematical Statistics , vol.27 , pp. 713-721
    • Hoeffding, W.1
  • 15
    • 0000221289 scopus 로고
    • Rational learning leads to nash equilibrium
    • Ehud Kalai and Ehud Lehrer. Rational learning leads to nash equilibrium. Econometrica, 61(5):1019-1045, 1993.
    • (1993) Econometrica , vol.61 , Issue.5 , pp. 1019-1045
    • Kalai, E.1    Lehrer, E.2
  • 19
    • 0000614213 scopus 로고
    • Bounded complexity justifies cooperation in finitely repeated prisoner's dilemma
    • Abraham Neyman. Bounded complexity justifies cooperation in finitely repeated prisoner's dilemma. Economic Letters, pages 227-229, 1985.
    • (1985) Economic Letters , pp. 227-229
    • Neyman, A.1
  • 21
    • 4544335718 scopus 로고    scopus 로고
    • Run the gamut: A comprehensive approach to evaluating game-theorectic algorithms
    • Eugene Nudelman, Jenn Wortman, Kevin Leyton-Brown, and Yoav Shoham. Run the gamut: A comprehensive approach to evaluating game-theorectic algorithms. AAMAS, 2004.
    • (2004) AAMAS
    • Nudelman, E.1    Wortman, J.2    Leyton-Brown, K.3    Shoham, Y.4
  • 22
    • 0027928808 scopus 로고
    • On complexity as bounded rationality
    • Christos H. Papadimitriou and Mihalis Yannakakis. On complexity as bounded rationality. In STOC-94, pages 726-733, 1994.
    • (1994) STOC-94 , pp. 726-733
    • Papadimitriou, C.H.1    Yannakakis, M.2
  • 25
    • 34249833101 scopus 로고
    • Technical note: Q-learning
    • May
    • Chris Watkins and Peter Dayan. Technical note: Q-learning. Machine Learning, 8(3/4):279-292, May 1992.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
    • Watkins, C.1    Dayan, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.