-
1
-
-
0023453626
-
Learning regular sets from queries and counterexamples
-
D. Angluin. "Learning regular sets from queries and counterexamples." Information and Computation, vol. 75 pp. 87-106, 1987.
-
(1987)
Information and Computation
, vol.75
, pp. 87-106
-
-
Angluin, D.1
-
3
-
-
0011471586
-
The complexity of computing a best response automaton in repeated games with mixed strategies
-
E. Ben-Porath. "The complexity of computing a best response automaton in repeated games with mixed strategies." Games and Economic Behavior, vol. 2 pp. 1-12, 1990.
-
(1990)
Games and Economic Behavior
, vol.2
, pp. 1-12
-
-
Ben-Porath, E.1
-
5
-
-
0030365402
-
Learning models of intelligent agents
-
Portland, Oregon, August
-
D. Carmel and S. Markovitch. "Learning models of intelligent agents," in Proceedings of Thirteenth National Conference on Artificial Intelligence (AAAI 96), Portland, Oregon, pp. 62-67, August 1996.
-
(1996)
Proceedings of Thirteenth National Conference on Artificial Intelligence (AAAI 96)
, pp. 62-67
-
-
Carmel, D.1
Markovitch, S.2
-
6
-
-
0042413243
-
Exploration and adaptation in multi-agent systems: A model-based approach
-
Nagoya, Japan, August
-
D. Carmel and S. Markovitch. "Exploration and adaptation in multi-agent systems: A model-based approach," in Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI-97), Nagoya, Japan, pp. 606-611, August 1997.
-
(1997)
Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI-97)
, pp. 606-611
-
-
Carmel, D.1
Markovitch, S.2
-
8
-
-
0003328374
-
Neural network exploration using optimal experimental design
-
J. D. Cowan, G. Tesauro, and J. Alspector, (Eds.), Morgan Kaufmann
-
D. A. Cohn. "Neural network exploration using optimal experimental design," in J. D. Cowan, G. Tesauro, and J. Alspector, (Eds.), Advances in Neural Information Processing Systems 6, Morgan Kaufmann: pp. 679-686, 1994.
-
(1994)
Advances in Neural Information Processing Systems 6
, pp. 679-686
-
-
Cohn, D.A.1
-
9
-
-
0030260201
-
Exploration bonuses and dual control
-
P. Dayan and T. J. Sejnowski. "Exploration bonuses and dual control." Machine Learning, vol. 25(1) pp. 5-22, 1996.
-
(1996)
Machine Learning
, vol.25
, Issue.1
, pp. 5-22
-
-
Dayan, P.1
Sejnowski, T.J.2
-
12
-
-
0027307379
-
Efficient learning of typical finite automata from random walks
-
Y. Freund, M. Kearns, D. Ron, R. Rubinfeld, R. E. Schapire, and Linda Sellie. "Efficient learning of typical finite automata from random walks," in Proceedings of the 25th Annual ACM Symposium on Theory and Computing, pp. 315-324, 1993.
-
(1993)
Proceedings of the 25th Annual ACM Symposium on Theory and Computing
, pp. 315-324
-
-
Freund, Y.1
Kearns, M.2
Ron, D.3
Rubinfeld, R.4
Schapire, R.E.5
Sellie, L.6
-
13
-
-
0029547692
-
Efficient algorithms for learning to play repeated games against computationally bounded adversaries
-
Y. Freund, M. Kearns, Y. Mansour, D. Ron, R. Rubinfeled, and R. E. Schapire. "Efficient algorithms for learning to play repeated games against computationally bounded adversaries," in Proceedings of the Annual Symposium on the Foundations of Computer Science, pp. 332-341, 1995.
-
(1995)
Proceedings of the Annual Symposium on the Foundations of Computer Science
, pp. 332-341
-
-
Freund, Y.1
Kearns, M.2
Mansour, Y.3
Ron, D.4
Rubinfeled, R.5
Schapire, R.E.6
-
14
-
-
0001536620
-
Steady state learning and nash equilibrium
-
D. Fudenberg and D. Levine. "Steady state learning and nash equilibrium." Econometrica, vol. 61 pp. 547-574, 1993.
-
(1993)
Econometrica
, vol.61
, pp. 547-574
-
-
Fudenberg, D.1
Levine, D.2
-
15
-
-
38249006045
-
Bounded versus unbounded rationality: The tyranny of the weak
-
I. Gilboa and D. Samet. "Bounded versus unbounded rationality: The tyranny of the weak." Games and Economic Behavior, vol. 1 pp. 213-221, 1989.
-
(1989)
Games and Economic Behavior
, vol.1
, pp. 213-221
-
-
Gilboa, I.1
Samet, D.2
-
16
-
-
38249029225
-
The complexity of computing best response automata in repeated games
-
I. Gilboa. "The complexity of computing best response automata in repeated games." Journal of Economic Theory, vol. 45 pp. 342-352, 1988.
-
(1988)
Journal of Economic Theory
, vol.45
, pp. 342-352
-
-
Gilboa, I.1
-
19
-
-
0002298153
-
Bayesian learning in normal form games
-
J. S. Jordan. "Bayesian learning in normal form games." Games and Economic Behavior, vol. 3 pp. 60-81, 1991.
-
(1991)
Games and Economic Behavior
, vol.3
, pp. 60-81
-
-
Jordan, J.S.1
-
20
-
-
38249015887
-
The exponential convergence of bayesian learning in normal form games
-
J. S. Jordan. "The exponential convergence of bayesian learning in normal form games." Games and Economic Behavior, vol. 4 pp. 202-217, 1991.
-
(1991)
Games and Economic Behavior
, vol.4
, pp. 202-217
-
-
Jordan, J.S.1
-
23
-
-
0000221289
-
Rational learning leads to Nash equilibrium
-
September
-
E. Kalai and E. Lehrer. "Rational learning leads to Nash equilibrium." Econometrica, vol. 61(5) pp. 1019-1045, September 1993.
-
(1993)
Econometrica
, vol.61
, Issue.5
, pp. 1019-1045
-
-
Kalai, E.1
Lehrer, E.2
-
24
-
-
0011473030
-
Bounded rationality and strategic complexity in repeated games
-
T Ichiishi, A. Neyman, and Y. Tauman, (Eds.), Academic Press: San Diego
-
E. Kalai. "Bounded rationality and strategic complexity in repeated games," in T Ichiishi, A. Neyman, and Y. Tauman, (Eds.), Game Theory and Applications, Academic Press: San Diego, pp. 131-157, 1990.
-
(1990)
Game Theory and Applications
, pp. 131-157
-
-
Kalai, E.1
-
28
-
-
0027684215
-
Prioritized sweeping: Reinforcement learning with less data and less time
-
A. W. Moore and C. G. Atkeson. "Prioritized sweeping: Reinforcement learning with less data and less time." Machine Learning, vol. 13(1), 1993.
-
(1993)
Machine Learning
, vol.13
, Issue.1
-
-
Moore, A.W.1
Atkeson, C.G.2
-
29
-
-
84949966497
-
Learn your opponent's strategy (in polynomial time)
-
G. Weiß and S. Sen, (Eds.), Springer-Verlag
-
Y. Mor, C. V. Goldman, and J. S. Rosenschein. "Learn your opponent's strategy (in polynomial time)," in G. Weiß and S. Sen, (Eds.), Adaptation and Learning in Multi-agent Systems, Lecture Notes in AI. Springer-Verlag: 1996.
-
(1996)
Adaptation and Learning in Multi-agent Systems, Lecture Notes in AI
-
-
Mor, Y.1
Goldman, C.V.2
Rosenschein, J.S.3
-
30
-
-
0042914184
-
Optimization and rational learning in games
-
J. H. Nachbar. "Optimization and rational learning in games." Econometrica vol. 65(2), 1997.
-
(1997)
Econometrica
, vol.65
, Issue.2
-
-
Nachbar, J.H.1
-
32
-
-
0000948830
-
On players with a bounded number of states
-
C. H. Papadimitriou. "On players with a bounded number of states." Games and Economic Behavior, vol. 4 pp. 122-131, 1992.
-
(1992)
Games and Economic Behavior
, vol.4
, pp. 122-131
-
-
Papadimitriou, C.H.1
-
34
-
-
46149134052
-
Finite automata play the repeated Prisoner's Dilemma
-
A. Rubinstein. "Finite automata play the repeated Prisoner's Dilemma." Journal of Economic Theory, vol. 39 pp. 83-96, 1986.
-
(1986)
Journal of Economic Theory
, vol.39
, pp. 83-96
-
-
Rubinstein, A.1
-
35
-
-
0030050933
-
Multiagent reinforcement learning and the iterated Prisoner's Dilemma
-
T. W. Sandholm and R. H. Crites. "Multiagent reinforcement learning and the iterated Prisoner's Dilemma." Biosystems Journal, vol. 37 pp. 147-166, 1995.
-
(1995)
Biosystems Journal
, vol.37
, pp. 147-166
-
-
Sandholm, T.W.1
Crites, R.H.2
-
36
-
-
0024079557
-
Learning control of finite Markov chains with an explicit trade-off between estimation and control
-
September
-
M. Sato, K. Abe, and H. Takeda. "Learning control of finite Markov chains with an explicit trade-off between estimation and control," in IEEE Transactions on Systems, Man and Cybernetics, vol. 18(5), September 1991.
-
(1991)
IEEE Transactions on Systems, Man and Cybernetics
, vol.18
, Issue.5
-
-
Sato, M.1
Abe, K.2
Takeda, H.3
-
37
-
-
0028555752
-
Learning to coordinate without sharing information
-
Seattle, Washington
-
S. Sen, M. Sekaran, and J. Hale. "Learning to coordinate without sharing information," in Proceeding of the Twelfth National Conference on Artificial Intelligence (AAAI-94), Seattle, Washington, pp. 426-431, 1994.
-
(1994)
Proceeding of the Twelfth National Conference on Artificial Intelligence (AAAI-94)
, pp. 426-431
-
-
Sen, S.1
Sekaran, M.2
Hale, J.3
-
39
-
-
85132026293
-
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
-
Morgan Kaufman: San Mateo, CA
-
R. S. Sutton. "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming," in Proceedings of the Seventh International Conference on Machine Learning, Morgan Kaufman: San Mateo, CA, pp. 216-224, 1990.
-
(1990)
Proceedings of the Seventh International Conference on Machine Learning
, pp. 216-224
-
-
Sutton, R.S.1
-
40
-
-
0002210775
-
The role of exploration in learning control
-
David A. White and Donald Sopfge, (Eds.), Multiscience Press Inc.
-
S. B. Thrun. "The role of exploration in learning control," in David A. White and Donald Sopfge, (Eds.), Handbook for Intelligent Control. Multiscience Press Inc.: 1992.
-
(1992)
Handbook for Intelligent Control
-
-
Thrun, S.B.1
-
41
-
-
34249833101
-
Technical notes: Q-learning
-
C. J. C. H. Watkins and P. Dayan. "Technical notes: Q-learning." Machine Learning, vol. 8 pp. 279-292, 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 279-292
-
-
Watkins, C.J.C.H.1
Dayan, P.2
|