SCOPUS 정보 검색 플랫폼

Proceedings of the National Conference on Artificial Intelligence

Volumn , Issue , 2004, Pages 2-7

Performance bounded reinforcement learning in strategic interactions

(2) Banerjee, Bikramjit a Peng, Jing a

a TULANE UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AGENT TECHNOLOGIES; MUTLIAGENT LEARNING (MAL); REINFORCEMENT LEARNING; SUPPLY CHAIN MANAGEMENT;

AUTOMATION; GAME THEORY; INTELLIGENT AGENTS; LEARNING ALGORITHMS; MATRIX ALGEBRA; MULTI AGENT SYSTEMS; PERFORMANCE; USER INTERFACES;

LEARNING SYSTEMS;

EID: 9444299000 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (29)

References (30)

1
- 0029513526
- Gambling in a rigged casino: The adversarial multi-arm bandit problem
- Milwaukee, WI: IEEE Computer Society Press
- Auer, P.; Cesa-Bianchi, N.; Freund, Y.; and Schapire, R. E. 1995. Gambling in a rigged casino: The adversarial multi-arm bandit problem. In Proceedings of the 36th Annual Symposium on Foundations of Computer Science, 322 - 331. Milwaukee, WI: IEEE Computer Society Press.
- (1995) Proceedings of the 36th Annual Symposium on Foundations of Computer Science , pp. 322-331
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

2
- 84880865940
- Rational and convergent learning in stochastic games
- Bowling, M., and Veloso, M. 2001. Rational and convergent learning in stochastic games. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 1021 - 1026.
- (2001) Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence , pp. 1021-1026
- Bowling, M.¹ Veloso, M.²

3
- 0036531878
- Multiagent learning using a variable learning rate
- Bowling, M., and Veloso, M. 2002. Multiagent learning using a variable learning rate. Artificial Intelligence.
- (2002) Artificial Intelligence
- Bowling, M.¹ Veloso, M.²

4
- 34247193577
- Efficient learning equilibrium
- Brafman, R. I., and Tennenholtz, M. 2002a. Efficient learning equilibrium. In Proceedings of Neural Information Processing Systems.
- (2002) Proceedings of Neural Information Processing Systems
- Brafman, R.I.¹ Tennenholtz, M.²

5
- 0041965975
- R-max - A general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R. I., and Tennenholtz, M. 2002b. R-max - A general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research 3:213-231.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.I.¹ Tennenholtz, M.²

6
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Menlo Park, CA: AAAI Press/MIT Press
- Claus, C., and Boutilier, C. 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the 15th National Conference on Artificial Intelligence, 746-752. Menlo Park, CA: AAAI Press/MIT Press.
- (1998) Proceedings of the 15th National Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

7
- 1942421183
- AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
- Conitzer, V., and Sandholm, T. 2003a. AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents. In Proceedings of the 20th International Conference on Machine Learning.
- (2003) Proceedings of the 20th International Conference on Machine Learning
- Conitzer, V.¹ Sandholm, T.²

8
- 1942452777
- BL-WoLF: A framework for loss-bounded learnability in zero-sum games
- Conitzer, V., and Sandholm, T. 2003b. BL-WoLF: A framework for loss-bounded learnability in zero-sum games. In Proceedings of the 20th International Conference on Machine Learning.
- (2003) Proceedings of the 20th International Conference on Machine Learning
- Conitzer, V.¹ Sandholm, T.²

9
- 0002267135
- Adaptive game playing using multiplicative weights
- Freund, Y., and Schapire, R. E. 1999. Adaptive game playing using multiplicative weights. Games and Economic Behavior 29:79 - 103.
- (1999) Games and Economic Behavior , vol.29 , pp. 79-103
- Freund, Y.¹ Schapire, R.E.²

10
- 0000668347
- Consistency and cautious fictitious play
- Fudenberg, D., and Levine, D. 1995. Consistency and cautious fictitious play. Journal of Economic Dynamics and Control 19:1065- 1089.
- (1995) Journal of Economic Dynamics and Control , vol.19 , pp. 1065-1089
- Fudenberg, D.¹ Levine, D.²

11
- 0004247096
- Cambridge, MA: MIT Press
- Fudenberg, D., and Levine, K. 1998. The Theory of Learning in Games. Cambridge, MA: MIT Press.
- (1998) The Theory of Learning in Games
- Fudenberg, D.¹ Levine, K.²

12
- 9444291149
- Correlated q-learning
- Greenwald, A., and Hall, K. 2002. Correlated q-learning. In Proceedings of the AAAI Symposium on Collaborative Learning Agents.
- (2002) Proceedings of the AAAI Symposium on Collaborative Learning Agents
- Greenwald, A.¹ Hall, K.²

13
- 2942744741
- Uncoupled dynamics do not lead to nash equilibrium
- Hart, S., and Mas-Colell, A. 2003. Uncoupled dynamics do not lead to nash equilibrium. American Economic Review.
- (2003) American Economic Review
- Hart, S.¹ Mas-Colell, A.²

14
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- San Francisco, CA: Morgan Kaufmann
- Hu, J., and Wellman, M. P. 1998. Multiagent reinforcement learning: Theoretical framework and an algorithm. In Proc. of the 15th Int. Conf. on Machine Learning (ML'98), 242-250. San Francisco, CA: Morgan Kaufmann.
- (1998) Proc. of the 15th Int. Conf. on Machine Learning (ML'98) , pp. 242-250
- Hu, J.¹ Wellman, M.P.²

15
- 9444286839
- Multiagent Q-learning
- Hu, J., and Wellman, M. 2002. Multiagent Q-learning. Journal of Machine Learning.
- (2002) Journal of Machine Learning
- Hu, J.¹ Wellman, M.²

16
- 9444236608
- On no-regret learning, fictitious play, and nash equilibrium
- Jafari, A.; Greenwald, A.; Gondek, D.; and Ercal, G. 2001. On no-regret learning, fictitious play, and nash equilibrium. In Proceedings of the Eighteenth International Conference on Machine Learning, 226 - 223.
- (2001) Proceedings of the Eighteenth International Conference on Machine Learning , pp. 226-1223
- Jafari, A.¹ Greenwald, A.² Gondek, D.³ Ercal, G.⁴

17
- 35148838877
- The weighted majority algorithm
- Littlestone, N., and Warmuth, M. 1994. The weighted majority algorithm. Information and Computation 108:212 - 261.
- (1994) Information and Computation , vol.108 , pp. 212-261
- Littlestone, N.¹ Warmuth, M.²

18
- 0001961616
- A generalized reinforcement learning model: Convergence and applications
- Littman, M. L., and Szepesvari, C. 1996. A generalized reinforcement learning model: Convergence and applications. In Proceedings of the 13th International Conference on Machine Learning, 310-318.
- (1996) Proceedings of the 13th International Conference on Machine Learning , pp. 310-318
- Littman, M.L.¹ Szepesvari, C.²

19
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- San Mateo, CA: Morgan Kaufmann
- Littman, M. L. 1994. Markov games as a framework for multi-agent reinforcement learning. In Proc. of the 11th Int. Conf. on Machine Learning, 157-163. San Mateo, CA: Morgan Kaufmann.
- (1994) Proc. of the 11th Int. Conf. on Machine Learning , pp. 157-163
- Littman, M.L.¹

20
- 0242466944
- Friend-or-foe Q-learning in general-sum games
- Littman, M. L. 2001. Friend-or-foe Q-learning in general-sum games. In Proceedings of the Eighteenth International Conference on Machine Learnig.
- (2001) Proceedings of the Eighteenth International Conference on Machine Learnig
- Littman, M.L.¹

21
- 0001730497
- Non-cooperative games
- Nash, J.F. 1951. Non-cooperative games. Annals of Mathematics 54:286-295.
- (1951) Annals of Mathematics , vol.54 , pp. 286-295
- Nash, J.F.¹

22
- 0027336968
- A strategy of win-stay, lose-shift that outperforms tit-for-tat in the prisoner's dilemma game
- Nowak, M., and Sigmund, K. 1993. A strategy of win-stay, lose-shift that outperforms tit-for-tat in the prisoner's dilemma game. Nature 364:56 - 58.
- (1993) Nature , vol.364 , pp. 56-58
- Nowak, M.¹ Sigmund, K.²

23
- 9444277661
- Win-stay, lose-shift. A general learning rule for repeated normal form games
- Posch, M., and Brannath, W. 1997. Win-stay, lose-shift. A general learning rule for repeated normal form games. In Proceedings of the Third International Conference on Computing in Economics and Finance.
- (1997) Proceedings of the Third International Conference on Computing in Economics and Finance
- Posch, M.¹ Brannath, W.²

24
- 84949966897
- On multiagent Q-learning in a semi-competitive domain
- Weiß, G., and Sen, S., eds. Springer-Verlag
- Sandholm, T., and Crites, R. 1996. On multiagent Q-learning in a semi-competitive domain. In Weiß, G., and Sen, S., eds. Adaptation and Learning in Multi-Agent Systems. Springer-Verlag. 191-205.
- (1996) Adaptation and Learning in Multi-agent Systems , pp. 191-205
- Sandholm, T.¹ Crites, R.²

25
- 0028555752
- Learning to coordinate without sharing information
- Menlo Park, CA: AAAI Press/MIT Press
- Sen, S.; Sekaran, M.; and Hale, J. 1994. Learning to coordinate without sharing information. In National Conference on Artificial Intelligence, 426-431. Menlo Park, CA: AAAI Press/MIT Press.
- (1994) National Conference on Artificial Intelligence , pp. 426-431
- Sen, S.¹ Sekaran, M.² Hale, J.³

26
- 0001644761
- Nash convergence of gradient dynamics in general-sum games
- Singh, S.; Kearns, M.; and Mansour, Y. 2000. Nash convergence of gradient dynamics in general-sum games. In Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence, 541-548.
- (2000) Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence , pp. 541-548
- Singh, S.¹ Kearns, M.² Mansour, Y.³

27
- 0004102479
- MIT Press
- Sutton, R., and Burto, A. G. 1998. Reinforcement Learning: An Introduction. MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Burto, A.G.²

28
- 85152198941
- Multi-agent reinforcement learning: Independent vs. cooperative agents
- Tan, M. 1993. Multi-agent reinforcement learning: Independent vs. cooperative agents. In Proceedings of the Tenth International Conference on Machine Learning, 330-337.
- (1993) Proceedings of the Tenth International Conference on Machine Learning , pp. 330-337
- Tan, M.¹

29
- 67649405225
- Reinforcement learning to play an optimal nash equilibrium in team markov games
- Wang, X., and Sandholm, T. 2002. Reinforcement learning to play an optimal nash equilibrium in team markov games. In Advances in Neural Information Processing Systems 15, NIPS.
- (2002) Advances in Neural Information Processing Systems 15, NIPS
- Wang, X.¹ Sandholm, T.²

30
- 1942484421
- Online convex programming and generalized infinitesimal gradient ascent
- Zinkevich, M. 2003. Online convex programming and generalized infinitesimal gradient ascent. In Proceedings of the 20th International Conference on Machine Learning.
- (2003) Proceedings of the 20th International Conference on Machine Learning
- Zinkevich, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.