-
1
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaelbling, L.P., Littman, M.L., Moore, A.W.: Reinforcement learning: A survey. Journal of AI Research 4 (1996) 237-285
-
(1996)
Journal of AI Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
2
-
-
0038829878
-
Predicting how people play games: Reinforcement learning in games with unique strategy equilibrium
-
Erev, I., Roth, A.: Predicting how people play games: Reinforcement learning in games with unique strategy equilibrium. American Economic Review 88 (1998) 848-881
-
(1998)
American Economic Review
, vol.88
, pp. 848-881
-
-
Erev, I.1
Roth, A.2
-
3
-
-
0008614785
-
The dynamics of reinforcement learning in cooperative multi-agent systems
-
Claus, C., Boutilier, C.: The dynamics of reinforcement learning in cooperative multi-agent systems. In: Proc. Workshop on Multi-Agent Learning. (1997) 602-608
-
(1997)
Proc. Workshop on Multi-agent Learning
, pp. 602-608
-
-
Claus, C.1
Boutilier, C.2
-
5
-
-
85149834820
-
Markov games as a framework for multi-agent reinforcement learning
-
Littman, M.L.: Markov games as a framework for multi-agent reinforcement learning. In: Proc. 11th ICML. (1994) 157-163
-
(1994)
Proc. 11th ICML
, pp. 157-163
-
-
Littman, M.L.1
-
6
-
-
0000929496
-
Multi-agent reinforcement learning: Theoretical framework and an algorithms
-
Hu, J., Wellman, M.: Multi-agent reinforcement learning: Theoretical framework and an algorithms. In: Proc. 15th ICML. (1998)
-
(1998)
Proc. 15th ICML
-
-
Hu, J.1
Wellman, M.2
-
7
-
-
84880854156
-
R-max - A general polynomial time algorithm for near-optimal reinforcement learning
-
Brafman, R.I., Tennenholtz, M.: R-max - a general polynomial time algorithm for near-optimal reinforcement learning. In: IJCAI'01. (2001)
-
(2001)
IJCAI'01
-
-
Brafman, R.I.1
Tennenholtz, M.2
-
8
-
-
0033423368
-
Exploration strategies for model-based learning in multiagent systems
-
Carmel, D., Markovitch, S.: Exploration strategies for model-based learning in multiagent systems. Autonomous Agents and Multi-agent Systems 2(2) (1999) 141-172
-
(1999)
Autonomous Agents and Multi-agent Systems
, vol.2
, Issue.2
, pp. 141-172
-
-
Carmel, D.1
Markovitch, S.2
-
10
-
-
58149324992
-
Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term
-
Roth, A., Erev, I.: Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term. Games and Economic Behavior 8 (1995) 164-212
-
(1995)
Games and Economic Behavior
, vol.8
, pp. 164-212
-
-
Roth, A.1
Erev, I.2
-
11
-
-
4043136539
-
A cognitive hierarchy model of games
-
Camerer, C.F., Ho, TH., Chong, J.K.: A cognitive hierarchy model of games. The Quarterly Journal of Economics 119(3) (2004) 861-898
-
(2004)
The Quarterly Journal of Economics
, vol.119
, Issue.3
, pp. 861-898
-
-
Camerer, C.F.1
Ho, T.H.2
Chong, J.K.3
-
12
-
-
0001635606
-
Cognition and behavior in normal-form games: An experimental study
-
Costa-Gomes, M., Crawford, V.P., Broseta, B.: Cognition and behavior in normal-form games: An experimental study. Econometrica 69(5) (2001) 1193-1235
-
(2001)
Econometrica
, vol.69
, Issue.5
, pp. 1193-1235
-
-
Costa-Gomes, M.1
Crawford, V.P.2
Broseta, B.3
-
13
-
-
9444240310
-
Learning social preferences in games
-
Gal, Y., Pfeffer, A., Marzo, F., Grosz, B.J.: Learning social preferences in games. In: Proc. of AAAI-04. (2004) 226-231
-
(2004)
Proc. of AAAI-04
, pp. 226-231
-
-
Gal, Y.1
Pfeffer, A.2
Marzo, F.3
Grosz, B.J.4
-
14
-
-
33750295890
-
Population rule learning in symmetric normal-form games: Theory and evidence
-
Stahl, D.O.: Population rule learning in symmetric normal-form games: theory and evidence. Journal of Economic Behavior and Organization 1304 (2001) 1-14
-
(2001)
Journal of Economic Behavior and Organization
, vol.1304
, pp. 1-14
-
-
Stahl, D.O.1
-
17
-
-
4544335718
-
Run the gamut: A comprehensive approach to evaluating game-theoretic algorithms
-
Nudelman, E., Wortman, J., Shoham, Y., Leyton-Brown, K.: Run the gamut: A comprehensive approach to evaluating game-theoretic algorithms. In: AAMAS '04. (2004) 880-887
-
(2004)
AAMAS '04
, pp. 880-887
-
-
Nudelman, E.1
Wortman, J.2
Shoham, Y.3
Leyton-Brown, K.4
-
19
-
-
84941165004
-
-
Erev, I., Roth, A., Slonim, R., Barron, G.: Learning and equilibrium as useful approximations: accuracy of prediction on randomly selected constant sum games. (2006)
-
(2006)
Learning and Equilibrium as Useful Approximations: Accuracy of Prediction on Randomly Selected Constant Sum Games
-
-
Erev, I.1
Roth, A.2
Slonim, R.3
Barron, G.4
|