메뉴 건너뛰기




Volumn , Issue , 2005, Pages

New criteria and a new algorithm for learning in multi-agent systems

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS;

EID: 84898936075     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (60)

References (19)
  • 1
    • 0036531878 scopus 로고    scopus 로고
    • Multiagent learning using a variable learning rate
    • Bowling, M. & Veloso, M. (2002). Multiagent learning using a variable learning rate. In Artificial Intelligence, 136, pp. 215-250.
    • (2002) Artificial Intelligence , vol.136 , pp. 215-250
    • Bowling, M.1    Veloso, M.2
  • 3
    • 0002672918 scopus 로고
    • Iterative solution of games by fictitious play
    • New York: John Wiley and Sons
    • Brown, G. (1951). Iterative Solution of Games by Fictitious Play. In Activity Analysis of Production and Allocation. New York: John Wiley and Sons.
    • (1951) Activity Analysis of Production and Allocation.
    • Brown, G.1
  • 5
    • 1942421183 scopus 로고    scopus 로고
    • Awesome: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
    • Washington, DC
    • Conitzer, V. & Sandholm, T. (2003). AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Opponents. In Proceedings of the 20th International Conference on Machine Learning, pp. 83-90, Washington, DC.
    • (2003) Proceedings of the 20th International Conference on Machine Learning , pp. 83-90
    • Conitzer, V.1    Sandholm, T.2
  • 6
    • 0002476325 scopus 로고    scopus 로고
    • Regret in the on-line decision problem
    • Foster, D. & Vohra, R. (1999). Regret in the on-line decision problem. "Games and Economic Behavior" 29:7-36.
    • (1999) Games and Economic Behavior , vol.29 , pp. 7-36
    • Foster, D.1    Vohra, R.2
  • 9
    • 0001976283 scopus 로고
    • Approximation to bayes risk in repeated plays
    • Hannan, J. (1957) Approximation to Bayes risk in repeated plays. Contributions to the Theory of Games 3:97-139.
    • (1957) Contributions to the Theory Of, Games , vol.3 , pp. 97-139
    • Hannan, J.1
  • 10
    • 0000908510 scopus 로고    scopus 로고
    • A simple adaptive procedure leading to correlated equilibrium
    • Hart, S. & Mas-Colell, A. (2000). A simple adaptive procedure leading to correlated equilibrium. In Econometrica, Vol. 68, No. 5, pages 1127-1150.
    • (2000) Econometrica , vol.68 , Issue.5 , pp. 1127-1150
    • Hart, S.1    Mas-Colell, A.2
  • 11
    • 0001069505 scopus 로고
    • On the distribution of the number of successes in independent trials
    • Hoeffding, W. (1956). On the distribution of the number of successes in independent trials. Annals of Mathematical Statistics 27:713-721.
    • (1956) Annals of Mathematical Statistics , vol.27 , pp. 713-721
    • Hoeffding, W.1
  • 16
    • 0001644761 scopus 로고    scopus 로고
    • Nash convergence of gradient dynamics in generalsum games
    • Morgan Kaufman
    • Singh, S., Kearns, M., &Mansour, Y. (2000). Nash convergence of gradient dynamics in generalsum games. In Proceedings of UAI-2000, pp. 541-548, Morgan Kaufman.
    • (2000) Proceedings of UAI-2000 , pp. 541-548
    • Singh, S.1    Kearns, M.2    Mansour, Y.3
  • 17
    • 0034205975 scopus 로고    scopus 로고
    • Multiagent systems: A survey from a machine learning perspective
    • Stone, P. & Veloso, M. (2000). Multiagent systems: A survey from a machine learning perspective. Autonomous Robots, 8(3).
    • (2000) Autonomous, Robots , vol.8 , Issue.3
    • Stone, P.1    Veloso, M.2
  • 19
    • 34249833101 scopus 로고
    • Technical note: Q-learning
    • Watkins, C. & Dayan, P. (1992). Technical note: Q-learning. Machine Learning, 8(3):279-292.
    • (1992) Machine Learning , vol.8 , Issue.3 , pp. 279-292
    • Watkins, C.1    Dayan, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.