



Volume 1, 2005, Pages 41-46

Efficient no-regret multiagent learning

Author keywords

[No Author keywords available]

Indexed keywords

GAME MATRICES; POLYNOMIAL BOUNDS; STOCHASTIC SAMPLING

EID: 29344441750     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 23

References (20)
  • 1
    • 0042496192
    • Gambling in a rigged casino: The adversarial multi-armed bandit problem
    • NeuroCOLT2 Technical Report Series
    • Auer, P.; Cesa-Bianchi, N.; Freund, Y.; and Schapire, R. E. 1998. Gambling in a rigged casino: The adversarial multi-armed bandit problem. Technical Report NC2-TR-1998-025, NeuroCOLT2 Technical Report Series.
    • (1998) Technical Report , vol.NC2-TR-1998-025
    • Auer, P.1    Cesa-Bianchi, N.2    Freund, Y.3    Schapire, R.E.4
  • 3
    • 0036531878
    • Multiagent learning using a variable learning rate
    • Bowling, M., and Veloso, M. 2002. Multiagent learning using a variable learning rate. Artificial Intelligence 136:215-250.
    • (2002) Artificial Intelligence , vol.136 , pp. 215-250
    • Bowling, M.1    Veloso, M.2
  • 4
    • 84899027977
    • Convergence and no-regret in multiagent learning
    • Bowling, M. 2005. Convergence and no-regret in multiagent learning. In Proceedings of NIPS 2004/5.
    • (2005) Proceedings of NIPS 2004/5
    • Bowling, M.1
  • 5
    • 0031630561
    • The dynamics of reinforcement learning in cooperative multiagent systems
    • Menlo Park, CA: AAAI Press/MIT Press
    • Claus, C., and Boutilier, C. 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the 15th National Conference on Artificial Intelligence, 746-752. Menlo Park, CA: AAAI Press/MIT Press.
    • (1998) Proceedings of the 15th National Conference on Artificial Intelligence , pp. 746-752
    • Claus, C.1    Boutilier, C.2
  • 6
    • 1942421183
    • AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
    • Conitzer, V., and Sandholm, T. 2003. AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents. In Proceedings of the 20th International Conference on Machine Learning.
    • (2003) Proceedings of the 20th International Conference on Machine Learning
    • Conitzer, V.1    Sandholm, T.2
  • 8
    • 0002267135
    • Adaptive game playing using multiplicative weights
    • Freund, Y., and Schapire, R. E. 1999. Adaptive game playing using multiplicative weights. Games and Economic Behavior 29:79-103.
    • (1999) Games and Economic Behavior , vol.29 , pp. 79-103
    • Freund, Y.1    Schapire, R.E.2
  • 11
    • 0000929496
    • Multiagent reinforcement learning: Theoretical framework and an algorithm
    • San Francisco, CA: Morgan Kaufmann
    • Hu, J., and Wellman, M. P. 1998. Multiagent reinforcement learning: Theoretical framework and an algorithm. In Proc. of the 15th Int. Conf. on Machine Learning (ML'98), 242-250. San Francisco, CA: Morgan Kaufmann.
    • (1998) Proc. of the 15th Int. Conf. on Machine Learning (ML'98) , pp. 242-250
    • Hu, J.1    Wellman, M.P.2
  • 14
    • 85149834820
    • Markov games as a framework for multiagent reinforcement learning
    • San Mateo, CA: Morgan Kaufmann
    • Littman, M. L. 1994. Markov games as a framework for multiagent reinforcement learning. In Proc. of the 11th Int. Conf. on Machine Learning, 157-163. San Mateo, CA: Morgan Kaufmann.
    • (1994) Proc. of the 11th Int. Conf. on Machine Learning , pp. 157-163
    • Littman, M.L.1
  • 16
    • 84898936075
    • New criteria and a new algorithm for learning in multi-agent systems
    • Powers, R., and Shoham, Y. 2005. New criteria and a new algorithm for learning in multi-agent systems. In Proceedings of NIPS 2004/5.
    • (2005) Proceedings of NIPS 2004/5
    • Powers, R.1    Shoham, Y.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.