SCOPUS Information Retrieval Platform - View Paper

Proceedings, Twentieth International Conference on Machine Learning

Volume 1, 2003, Pages 91-98

BL-WoLF: A Framework For Loss-Bounded Learnability In Zero-Sum Games

Authors (2): Conitzer, Vincent (a); Sandholm, Thomas (a)

(a) Carnegie Mellon University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BOUNDED LOSSES; LOSS-BOUNDED LEARNABILITY; ZERO-SUM GAMES;

APPROXIMATION THEORY; BENCHMARKING; GAME THEORY; PROBABILISTIC LOGICS; STOCHASTIC CONTROL SYSTEMS;

LEARNING SYSTEMS;

EID: 1942452777
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (10)

References (23)

1
- 0029513526
- Gambling in a rigged casino: The adversarial multi-arm bandit problem
- Auer, P., Cesa-Bianchi, N., Freund, Y., & Schapire, R. E. (1995). Gambling in a rigged casino: The adversarial multi-arm bandit problem. FOCS (pp. 322-331).
- (1995) FOCS , pp. 322-331
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

2
- 0038623721
- On pseudo-games
- Banos, A. (1968). On pseudo-games. Annals of Mathematical Statistics, 39, 1932-1945.
- (1968) Annals of Mathematical Statistics , vol.39 , pp. 1932-1945
- Banos, A.¹

3
- 0036531878
- Multiagent learning using a variable learning rate
- Bowling, M., & Veloso, M. (2002). Multiagent learning using a variable learning rate. Artificial Intelligence, 136, 215-250.
- (2002) Artificial Intelligence , vol.136 , pp. 215-250
- Bowling, M.¹ Veloso, M.²

4
- 0034247018
- A near optimal polynomial time algorithm for learning in certain classes of stochastic games
- Brafman, R., & Tennenholtz, M. (2000). A near optimal polynomial time algorithm for learning in certain classes of stochastic games. Artificial Intelligence, 121, 31-47.
- (2000) Artificial Intelligence , vol.121 , pp. 31-47
- Brafman, R.¹ Tennenholtz, M.²

5
- 34247193577
- Efficient learning equilibrium
- Vancouver, Canada
- Brafman, R., & Tennenholtz, M. (2002). Efficient learning equilibrium. Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS). Vancouver, Canada.
- (2002) Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS)
- Brafman, R.¹ Tennenholtz, M.²

6
- 0031140246
- How to use expert advice
- Cesa-Bianchi, N., Freund, Y., Haussler, D., Helmbold, D. P., Schapire, R. E., & Warmuth, M. K. (1997). How to use expert advice. Journal of the ACM, 44, 427-485.
- (1997) Journal of the ACM , vol.44 , pp. 427-485
- Cesa-Bianchi, N.¹ Freund, Y.² Haussler, D.³ Helmbold, D.P.⁴ Schapire, R.E.⁵ Warmuth, M.K.⁶

7
- 84880852207
- Complexity results about Nash equilibria
- Acapulco, Mexico. Earlier version appeared as technical report CMU-CS-02-135
- Conitzer, V., & Sandholm, T. (2003). Complexity results about Nash equilibria. Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI). Acapulco, Mexico. Earlier version appeared as technical report CMU-CS-02-135.
- (2003) Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI)
- Conitzer, V.¹ Sandholm, T.²

8
- 0002095886
- A randomization rule for selecting forecasts
- Foster, D. P., & Vohra, R. V. (1993). A randomization rule for selecting forecasts. Operations Research, 41, 704-709.
- (1993) Operations Research , vol.41 , pp. 704-709
- Foster, D.P.¹ Vohra, R.V.²

9
- 0002267135
- Adaptive game playing using multiplicative weights
- Freund, Y., & Schapire, R. E. (1999). Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29, 79-103.
- (1999) Games and Economic Behavior , vol.29 , pp. 79-103
- Freund, Y.¹ Schapire, R.E.²

10
- 0004247096
- MIT Press
- Fudenberg, D., & Levine, D. (1998). The theory of learning in games. MIT Press.
- (1998) The Theory of Learning in Games
- Fudenberg, D.¹ Levine, D.²

11
- 0000668347
- Consistency and cautious fictitious play
- Fudenberg, D., & Levine, D. K. (1995). Consistency and cautious fictitious play. Journal of Economic Dynamics and Control, 19, 1065-1089.
- (1995) Journal of Economic Dynamics and Control , vol.19 , pp. 1065-1089
- Fudenberg, D.¹ Levine, D.K.²

12
- 0001976283
- Approximation to Bayes risk in repeated play
- Princeton University Press
- Hannan, J. (1957). Approximation to Bayes risk in repeated play. Vol. III of Contributions to the Theory of Games, 97-139. Princeton University Press.
- (1957) Contributions to the Theory of Games , vol.3 , pp. 97-139
- Hannan, J.¹

13
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- Hu, J., & Wellman, M. P. (1998). Multiagent reinforcement learning: Theoretical framework and an algorithm. International Conference on Machine Learning (pp. 242-250).
- (1998) International Conference on Machine Learning , pp. 242-250
- Hu, J.¹ Wellman, M.P.²

14
- 1142305713
- Learning to play games in extensive form by valuation
- Jehiel, P., & Samet, D. (2001). Learning to play games in extensive form by valuation. NAJ Economics, v3n1.
- (2001) NAJ Economics , vol.3 , Issue.1
- Jehiel, P.¹ Samet, D.²

15
- 0000221289
- Rational learning leads to Nash equilibrium
- Kalai, E., & Lehrer, E. (1993). Rational learning leads to Nash equilibrium. Econometrica, 61, 1019-1045.
- (1993) Econometrica , vol.61 , pp. 1019-1045
- Kalai, E.¹ Lehrer, E.²

16
- 0012257655
- Near-optimal reinforcement learning in polynomial time
- Kearns, M., & Singh, S. (1998). Near-optimal reinforcement learning in polynomial time. International Conference on Machine Learning.
- (1998) International Conference on Machine Learning
- Kearns, M.¹ Singh, S.²

17
- 0242456253
- A polynomial-time Nash equilibrium algorithm for repeated games
- San Diego, CA
- Littman, M., & Stone, P. (2003). A polynomial-time Nash equilibrium algorithm for repeated games. Proceedings of the ACM Conference on Electronic Commerce (ACM-EC). San Diego, CA.
- (2003) Proceedings of the ACM Conference on Electronic Commerce (ACM-EC)
- Littman, M.¹ Stone, P.²

18
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. International Conference on Machine Learning (pp. 157-163).
- (1994) International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

19
- 0004145762
- New York: John Wiley and Sons. Dover republication 1989
- Luce, R. D., & Raiffa, H. (1957). Games and decisions. New York: John Wiley and Sons. Dover republication 1989.
- (1957) Games and Decisions
- Luce, R.D.¹ Raiffa, H.²

20
- 0038675791
- On repeated games with incomplete information played by non-Bayesian players
- Megiddo, N. (1980). On repeated games with incomplete information played by non-Bayesian players. International Journal of Game Theory, 9, 157-167.
- (1980) International Journal of Game Theory , vol.9 , pp. 157-167
- Megiddo, N.¹

21
- 0041166779
- Dynamic non-Bayesian decision making
- Monderer, D., & Tennenholtz, M. (1997). Dynamic non-Bayesian decision making. Journal of Artificial Intelligence Research, 7, 231-248.
- (1997) Journal of Artificial Intelligence Research , vol.7 , pp. 231-248
- Monderer, D.¹ Tennenholtz, M.²

22
- 0034836562
- Algorithms, games and the Internet
- Papadimitriou, C. (2001). Algorithms, games and the Internet. STOC (pp. 749-753).
- (2001) STOC , pp. 749-753
- Papadimitriou, C.¹

23
- 0001644761
- Nash convergence of gradient dynamics in general-sum games
- Stanford, CA
- Singh, S., Kearns, M., & Mansour, Y. (2000). Nash convergence of gradient dynamics in general-sum games. Proceedings of the Uncertainty in Artificial Intelligence Conference (UAI) (pp. 541-548). Stanford, CA.
- (2000) Proceedings of the Uncertainty in Artificial Intelligence Conference (UAI) , pp. 541-548
- Singh, S.¹ Kearns, M.² Mansour, Y.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.