-
1
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
DOI 10.1023/A:1013689704352, Computational Learning Theory
-
Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-time analysis of the multi-armed bandit problem. Machine Learning, 47(2-3), 235-256. (Pubitemid 34126111)
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
4
-
-
84880840280
-
Approximating game-theoretic optimal strategies for full-scale poker
-
In Gottlob, G., & Walsh, T. (Eds.) Morgan Kaufmann
-
Billings, D., Burch, N., Davidson, A., Holte, R. C., Schaeffer, J., Schauenberg, T., & Szafron, D. (2003). Approximating game-theoretic optimal strategies for full-scale poker. In Gottlob, G., & Walsh, T. (Eds.), Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI-03), pp. 661-668. Morgan Kaufmann.
-
(2003)
Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI-03)
, pp. 661-668
-
-
Billings, D.1
Burch, N.2
Davidson, A.3
Holte, R.C.4
Schaeffer, J.5
Schauenberg, T.6
Szafron, D.7
-
6
-
-
45149127471
-
Monte-Carlo Go reinforcement learning experiments
-
Reno, USA
-
Bouzy, B., & Chaslot, G. (2006). Monte-Carlo Go reinforcement learning experiments. In IEEE 2006 Symposium on Computational Intelligence in Games, Reno, USA, pp. 187-194.
-
(2006)
IEEE 2006 Symposium on Computational Intelligence in Games
, pp. 187-194
-
-
Bouzy, B.1
Chaslot, G.2
-
7
-
-
84874005312
-
Monte-Carlo strategies for computer Go
-
In Schobbens, P.-Y., Vanhoof, W., & Schwanen, G. (Eds.)
-
Chaslot, G. M. J.-B., Saito, J.-T., Bouzy, B., Uiterwijk, J., & van den Herik, H. (2006). Monte-Carlo strategies for computer Go. In Schobbens, P.-Y., Vanhoof, W., & Schwanen, G. (Eds.), Proceedings of the 18th BeNeLux Conference on Artificial Intelligence, pp. 83-90.
-
(2006)
Proceedings of the 18th BeNeLux Conference on Artificial Intelligence
, pp. 83-90
-
-
Chaslot, G.M.J.-B.1
Saito, J.-T.2
Bouzy, B.3
Uiterwijk, J.4
Van Den Herik, H.5
-
8
-
-
55249127519
-
Progressive strategies for Monte-Carlo tree search
-
Chaslot, G. M. J.-B., Winands, M., Uiterwijk, J., van den Herik, H., & Bouzy, B. (2008). Progressive strategies for Monte-Carlo tree search. New Mathematics and Natural Computation, 4(3), 343-357.
-
(2008)
New Mathematics and Natural Computation
, vol.4
, Issue.3
, pp. 343-357
-
-
Chaslot, G.M.J.-B.1
Winands, M.2
Uiterwijk, J.3
Van Den Herik, H.4
Bouzy, B.5
-
9
-
-
38049037928
-
Efficient selectivity and backup operators in Monte-Carlo tree search
-
In van den Herik, H., Ciancarini, P., & Donkers, H. (Eds.) Springer-Verlag, Heidelberg, Germany
-
Coulom, R. (2006). Efficient selectivity and backup operators in Monte-Carlo tree search. In van den Herik, H., Ciancarini, P., & Donkers, H. (Eds.), Proceedings of the 5th International Conference on Computer and Games, Vol. 4630 of Lecture Notes in Computer Science (LNCS), pp. 72-83. Springer-Verlag, Heidelberg, Germany.
-
(2006)
Proceedings of the 5th International Conference on Computer and Games, Vol. 4630 of Lecture Notes in Computer Science (LNCS)
, pp. 72-83
-
-
Coulom, R.1
-
10
-
-
0007994645
-
Improved opponent modeling in poker
-
Davidson, A., Billings, D., Schaeffer, J., & Szafron, D. (2000). Improved opponent modeling in poker. In Proceedings of The 2000 International Conference on Artificial Intelligence (ICAI'2000), pp. 1467-1473.
-
(2000)
Proceedings of the 2000 International Conference on Artificial Intelligence (ICAI'2000)
, pp. 1467-1473
-
-
Davidson, A.1
Billings, D.2
Schaeffer, J.3
Szafron, D.4
-
13
-
-
0000908510
-
A simple adaptive procedure leading to correlated equilibrium
-
Hart, S., & Mas-Colell, A. (2000). A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68(5), 1127-1150.
-
(2000)
Econometrica
, vol.68
, Issue.5
, pp. 1127-1150
-
-
Hart, S.1
Mas-Colell, A.2
-
14
-
-
77953115320
-
Smoothing techniques for computing Nash equilibria of sequential games
-
Hoda, S., Gilpin, A., Peña, J., & Sandholm, T. (2010). Smoothing techniques for computing Nash equilibria of sequential games. Mathematics of Operations Research, 35(2), 494-512.
-
(2010)
Mathematics of Operations Research
, vol.35
, Issue.2
, pp. 494-512
-
-
Hoda, S.1
Gilpin, A.2
Peña, J.3
Sandholm, T.4
-
15
-
-
29344449759
-
Effective short-term opponent exploitation in simplified poker
-
Proceedings of the 20th National Conference on Artificial Intelligence and the 17th Innovative Applications of Artificial Intelligence Conference, AAAI-05/IAAI-05
-
Hoehn, B., Southey, F., & Holte, R. C. (2005). Effective short-term opponent exploitation in simplified poker. In Proceedings of the National Conference on Artificial Intelligence (AAAI), pp. 783-788. AAAI Press. (Pubitemid 43006704)
-
(2005)
Proceedings of the National Conference on Artificial Intelligence
, vol.2
, pp. 783-788
-
-
Hoehn, B.1
Southey, F.2
Holte, R.C.3
Bulitko, V.4
-
19
-
-
33750293964
-
Bandit based Monte-Carlo planning
-
Machine Learning: ECML 2006 - 17th European Conference on Machine Learning, Proceedings
-
Kocsis, L., & Szepesvári, C. (2006). Bandit Based Monte-Carlo Planning. In Fürnkranz, J., Scheffer, T., & Spiliopoulou, M. (Eds.), Machine Learning: ECML 2006, Vol. 4212 of Lecture Notes in Artificial Intelligence, pp. 282-293. (Pubitemid 44618839)
-
(2006)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
, vol.4212
, pp. 282-293
-
-
Kocsis, L.1
Szepesvari, C.2
-
21
-
-
79953227801
-
Monte Carlo sampling for regret minimization in extensive games
-
NIPS
-
Lanctot, M., Waugh, K., Zinkevich, M., & Bowling, M. (2009). Monte Carlo sampling for regret minimization in extensive games. In Advances in Neural Information Processing Systems 22 (NIPS), pp. 1078-1086.
-
(2009)
Advances in Neural Information Processing Systems
, vol.22
, pp. 1078-1086
-
-
Lanctot, M.1
Waugh, K.2
Zinkevich, M.3
Bowling, M.4
-
22
-
-
71249086073
-
The computational intelligence of MoGo revealed in Taiwan's computer Go tournaments
-
Lee, C.-S., Wang, M.-H., Chaslot, G.-B., Hoock, J.-B., Rimmel, A., Teytaud, O., Tsai, S.-R., Hsu, S.-C., & Hong, T.-P. (2010). The computational intelligence of MoGo revealed in Taiwan's computer Go tournaments. IEEE Transactions on Computational Intelligence and AI in Games, 1, 73-89.
-
(2010)
IEEE Transactions on Computational Intelligence and AI in Games
, vol.1
, pp. 73-89
-
-
Lee, C.-S.1
Wang, M.-H.2
Chaslot, G.-B.3
Hoock, J.-B.4
Rimmel, A.5
Teytaud, O.6
Tsai, S.-R.7
Hsu, S.-C.8
Hong, T.-P.9
-
23
-
-
0001730497
-
Non-cooperative games
-
Nash, J. (1951). Non-cooperative games. The Annals of Mathematics, 54(2), 286-295.
-
(1951)
The Annals of Mathematics
, vol.54
, Issue.2
, pp. 286-295
-
-
Nash, J.1
-
26
-
-
0002345896
-
Goofspiel - The game of pure strategy
-
Ross, S. M. (1971). Goofspiel - the game of pure strategy. Journal of Applied Probability, 8(3), 621-625.
-
(1971)
Journal of Applied Probability
, vol.8
, Issue.3
, pp. 621-625
-
-
Ross, S.M.1
-
27
-
-
79959727828
-
The state of solving large incomplete-information games, and application to poker
-
Sandholm, T. (2010). The state of solving large incomplete-information games, and application to poker. AI Magazine, 31(4), 13-32.
-
(2010)
AI Magazine
, vol.31
, Issue.4
, pp. 13-32
-
-
Sandholm, T.1
-
28
-
-
0035441917
-
A gamut of games
-
Schaeffer, J. (2001). A gamut of games. AI Magazine, 22(3), 29-46. (Pubitemid 32935356)
-
(2001)
AI Magazine
, vol.22
, Issue.3
, pp. 29-46
-
-
Schaeffer, J.1
-
31
-
-
80053237361
-
Bayes' bluff: Opponent modelling in poker
-
Southey, F., Bowling, M., Larson, B., Piccione, C., Burch, N., Billings, D., & Rayner, D. C. (2005). Bayes' bluff: Opponent modelling in poker. In Proceedings of the 21st Conference in Uncertainty in Artificial Intelligence (UAI '05), pp. 550-558.
-
(2005)
Proceedings of the 21st Conference in Uncertainty in Artificial Intelligence (UAI '05)
, pp. 550-558
-
-
Southey, F.1
Bowling, M.2
Larson, B.3
Piccione, C.4
Burch, N.5
Billings, D.6
Rayner, D.C.7
-
32
-
-
68049133318
-
An analysis of UCT in multi-player games
-
Sturtevant, N. R. (2008). An analysis of UCT in multi-player games. In Computers and Games.
-
(2008)
Computers and Games
-
-
Sturtevant, N.R.1
-
34
-
-
84890303245
-
A practical use of imperfect recall
-
Waugh, K., Zinkevich, M., Johanson, M., Kan, M., Schnizlein, D., & Bowling, M. (2009). A practical use of imperfect recall. In Proceedings of the 8th Symposium on Abstraction, Reformulation and Approximation (SARA).
-
(2009)
Proceedings of the 8th Symposium on Abstraction, Reformulation and Approximation (SARA)
-
-
Waugh, K.1
Zinkevich, M.2
Johanson, M.3
Kan, M.4
Schnizlein, D.5
Bowling, M.6
-
35
-
-
85162042235
-
Regret minimization in games with incomplete information
-
Zinkevich, M., Johanson, M., Bowling, M., & Piccione, C. (2008). Regret minimization in games with incomplete information. In Advances in Neural Information Processing Systems 20 (NIPS).
-
(2008)
Advances in Neural Information Processing Systems 20 (NIPS)
-
-
Zinkevich, M.1
Johanson, M.2
Bowling, M.3
Piccione, C.4