-
1
-
-
84948141808
-
The lagging anchor algorithm. Reinforcement learning in two-player zero-sum games with imperfect information
-
Dahl, F. A.: The lagging anchor algorithm. Reinforcement learning in two-player zero-sum games with imperfect information. Machine Learning (to appear).
-
Machine Learning
-
-
Dahl, F.A.1
-
2
-
-
0004260006
-
-
3rd ed. Academic Press, San Diego
-
Owen, G.: Game Theory. 3rd ed. Academic Press, San Diego (1995).
-
(1995)
Game Theory
-
-
Owen, G.1
-
3
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton, R. S.: Learning to predict by the methods of temporal differences. Machine Learning 3 (1988) 9–44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
5
-
-
0033570798
-
A unified analysis of value-function-based reinforcement-learning algorithms
-
Szepesvari, C., Littman, M. L.: A unified analysis of value-function-based reinforcement-learning algorithms. Neural Computation 11 (1999) 2017–2060.
-
(1999)
Neural Computation
, vol.11
, pp. 2017-2060
-
-
Szepesvari, C.1
Littman, M.L.2
-
6
-
-
0001046225
-
Practical issues in temporal difference learning
-
Tesauro, G. J.: Practical issues in temporal difference learning. Machine Learning 8 (1992) 257–277.
-
(1992)
Machine Learning
, vol.8
, pp. 257-277
-
-
Tesauro, G.J.1
-
7
-
-
85149834820
-
Markov games as a framework for multi-agent reinforcement learning
-
Morgan Kaufmann, New Brunswick
-
Littman, M. L.: Markov games as a framework for multi-agent reinforcement learning. In: Proceedings of the 11th International Conference on Machine Learning, Morgan Kaufmann, New Brunswick (1994) 157–163.
-
(1994)
Proceedings of the 11th International Conference on Machine Learning
, pp. 157-163
-
-
Littman, M.L.1
-
8
-
-
84974693459
-
Minimax TD-learning with neural nets in a Markov game
-
Lopez de Mantaras, R., Plaza, E. (eds.): ECML 2000, Springer-Verlag, Berlin–Heidelberg–New York
-
Dahl, F. A., Halck, O. M.: Minimax TD-learning with neural nets in a Markov game. In: Lopez de Mantaras, R., Plaza, E. (eds.): ECML 2000. Proceedings of the 11th European Conference on Machine Learning. Lecture Notes in Computer Science Vol. 1810, Springer-Verlag, Berlin–Heidelberg–New York (2000) 117–128.
-
(2000)
Proceedings of the 11th European Conference on Machine Learning. Lecture Notes in Computer Science
, vol.1810
, pp. 117-128
-
-
Dahl, F.A.1
Halck, O.M.2
-
9
-
-
0030170957
-
Efficient computation of equilibria for extensive two-person games
-
Koller, D., Megiddo, N., von Stengel, B.: Efficient computation of equilibria for extensive two-person games. Games and Economic Behavior 14 (1996) 247–259.
-
(1996)
Games and Economic Behavior
, vol.14
, pp. 247-259
-
-
Koller, D.1
Megiddo, N.2
Von Stengel, B.3
-
11
-
-
0031192989
-
Representations and solutions for game-theoretic problems
-
Koller, D., Pfeffer, A.: Representations and solutions for game-theoretic problems. Artificial Intelligence 94 (1997) 167–215.
-
(1997)
Artificial Intelligence
, vol.94
, pp. 167-215
-
-
Koller, D.1
Pfeffer, A.2
-
12
-
-
0008831290
-
Learning to play strong poker
-
Fürnkranz, J., Kubat, M. (eds.), Jozef Stefan Institute, Ljubljana
-
Schaeffer, J., Billings, D., Peña, L., Szafron, D.: Learning to play strong poker. In: Fürnkranz, J., Kubat, M. (eds.): Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing, Jozef Stefan Institute, Ljubljana (1999).
-
(1999)
Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing
-
-
Schaeffer, J.1
Billings, D.2
Peña, L.3
Szafron, D.4
-
14
-
-
0001826509
-
Anticipatory learning in two-person games
-
Selten, R. (ed.), Evolution and game dynamics, Springer-Verlag, Berlin
-
Selten, R.: Anticipatory learning in two-person games. In: Selten, R. (ed.): Game Equilibrium Models, Vol. I: Evolution and Game Dynamics, Springer-Verlag, Berlin (1991).
-
(1991)
Game Equilibrium Models
, vol.1
-
-
Selten, R.1
-
15
-
-
0008815685
-
On classification of games and evaluation of players – with some sweeping generalizations about the literature
-
Fürnkranz, J., Kubat, M. (eds.), Jozef Stefan Institute, Ljubljana
-
Halck, O. M., Dahl, F. A.: On classification of games and evaluation of players – with some sweeping generalizations about the literature. In: Fürnkranz, J., Kubat, M. (eds.): Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing, Jozef Stefan Institute, Ljubljana (1999).
-
(1999)
Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing
-
-
Halck, O.M.1
Dahl, F.A.2