-
1
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton, R.S.: Learning to predict by the methods of temporal differences, Machine Learning 3 (1988) 9-44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
2
-
-
0001046225
-
Practical issues in temporal difference learning
-
Tesauro, G.J.: Practical issues in temporal difference learning, Machine Learning 8 (1992) 257-277.
-
(1992)
Machine Learning
, vol.8
, pp. 257-277
-
-
Tesauro, G.J.1
-
3
-
-
0031192989
-
Representations and solutions for game-theoretic problems
-
Koller, D., Pfeffer, A.: Representations and solutions for game-theoretic problems. Artificial Intelligence 94(1) (1997) 167-215.
-
(1997)
Artificial Intelligence
, vol.94
, Issue.1
, pp. 167-215
-
-
Koller, D.1
Pfeffer, A.2
-
4
-
-
0008815685
-
On classification of games and evaluation of players - With some sweeping generalizations about the literature
-
Fürnkranz, J., Kubat, M. (eds.), Jozef Stefan Institute, Ljubljana
-
Halck, O.M., Dahl, F.A.: On classification of games and evaluation of players - with some sweeping generalizations about the literature. In: Fürnkranz, J., Kubat, M. (eds.): Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing, Jozef Stefan Institute, Ljubljana (1999).
-
(1999)
Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing
-
-
Halck, O.M.1
Dahl, F.A.2
-
5
-
-
84876767994
-
Poker as a testbed for machine intelligence research
-
Mercer, R., Neufeld, E. (eds.), Springer-Verlag, Berlin-Heidelberg-New York
-
Billings, D., Papp, D., Schaeffer, J., Szafron, D.: Poker as a testbed for machine intelligence research. In: Mercer, R., Neufeld, E. (eds.): Advances in Artificial Intelligence, Springer-Verlag, Berlin-Heidelberg-New York (1998) 228-238.
-
(1998)
Advances in Artificial Intelligence
, pp. 228-238
-
-
Billings, D.1
Papp, D.2
Schaeffer, J.3
Szafron, D.4
-
6
-
-
85149834820
-
Markov games as a framework for multi-agent reinforcement learning
-
Morgan Kaufmann, New Brunswick
-
Littman, M.L.: Markov games as a framework for multi-agent reinforcement learning. In: Proceedings of the 11th International Conference on Machine Learning, Morgan Kaufmann, New Brunswick (1994) 157-163.
-
(1994)
Proceedings of the 11th International Conference on Machine Learning
, pp. 157-163
-
-
Littman, M.L.1
-
9
-
-
84945311957
-
Three games designed for the study of human and automated decision making
-
FFI/RAPPORT-98/02799, Norwegian Defence Research Establishment (FFI), Kjeller, Norway
-
Dahl, F.A., Halck, O.M.: Three games designed for the study of human and automated decision making. Definition and properties of the games Campaign, Operation Lucid and Operation Opaque. FFI/RAPPORT-98/02799, Norwegian Defence Research Establishment (FFI), Kjeller, Norway (1998).
-
(1998)
Definition and Properties of the Games Campaign, Operation Lucid and Operation Opaque
-
-
Dahl, F.A.1
Halck, O.M.2
-
10
-
-
0008839797
-
The Tactical Air Game: A multimove game with mixed strategy solution
-
Grote, J.D. (ed.), Dordrecht, The Netherlands
-
Berkovitz, L.D.: The Tactical Air Game: A multimove game with mixed strategy solution. In: Grote, J.D. (ed.): The Theory and Application of Differential Games, Reidel Publishing Company, Dordrecht, The Netherlands (1975) 169-177.
-
(1975)
The Theory and Application of Differential Games, Reidel Publishing Company
, pp. 169-177
-
-
Berkovitz, L.D.1
-
11
-
-
0029210635
-
Learning to act using real-time dynamic programming
-
Barto, A.G., Bradtke, S.J., Singh, S.P.: Learning to act using real-time dynamic programming. Artificial Intelligence 72 (1995) 81-138.
-
(1995)
Artificial Intelligence
, vol.72
, pp. 81-138
-
-
Barto, A.G.1
Bradtke, S.J.2
Singh, S.P.3
-
12
-
-
0003474751
-
-
Cambridge University Press, Cambridge, UK
-
Press, W.H., Flannery, B.P., Teukolsky, S.A., Vetterling, W.T.: Numerical Recipes in C. The Art of Scientific Computing. Cambridge University Press, Cambridge, UK (1988).
-
(1988)
Numerical Recipes in C. The Art of Scientific Computing
-
-
Press, W.H.1
Flannery, B.P.2
Teukolsky, S.A.3
Vetterling, W.T.4
-
13
-
-
0033570798
-
A unified analysis of value-function-based reinforcement-learning algorithms
-
Szepesvari, C., Littman, M.L.: A unified analysis of value-function-based reinforcement-learning algorithms. Neural Computation 11 (1999) 2017-2060.
-
(1999)
Neural Computation
, vol.11
, pp. 2017-2060
-
-
Szepesvari, C.1
Littman, M.L.2