-
1
-
-
33749864692
-
Optimal tuning of continual online exploration in reinforcement learning
-
Springer, Berlin / Heidelberg, S. Kollias, A. Stafylopatis, W. Duch, E. Oja (Eds.) Artificial Neural Networks-ICANN 2006
-
Achbany Y., Fouss F., Yen L., Pirotte A., Saerens M. Optimal tuning of continual online exploration in reinforcement learning. Lecture Notes in Computer Science 2006, vol. 4131:790-800. Springer, Berlin / Heidelberg. S. Kollias, A. Stafylopatis, W. Duch, E. Oja (Eds.).
-
(2006)
Lecture Notes in Computer Science
, vol.4131
, pp. 790-800
-
-
Achbany, Y.1
Fouss, F.2
Yen, L.3
Pirotte, A.4
Saerens, M.5
-
2
-
-
0003519462
-
-
The MIT Press, Cambridge, MA
-
Anderson S.P., de Palma A., Thisse J.-F. Discrete Choice Theory of Product Differentiation 1992, The MIT Press, Cambridge, MA.
-
(1992)
Discrete Choice Theory of Product Differentiation
-
-
Anderson, S.P.1
de Palma, A.2
Thisse, J.-F.3
-
3
-
-
76249088580
-
The evolution of reciprocity: social types or social incentives?
-
André J.-B. The evolution of reciprocity: social types or social incentives?. The American Naturalist 2010, 175:197-210.
-
(2010)
The American Naturalist
, vol.175
, pp. 197-210
-
-
André, J.-B.1
-
4
-
-
77957272663
-
Co-evolution of learning complexity and social foraging strategies
-
Arbilly M., Motro U., Feldman M.W., Lotem A. Co-evolution of learning complexity and social foraging strategies. Journal of Theoretical Biology 2010, 267:573-581.
-
(2010)
Journal of Theoretical Biology
, vol.267
, pp. 573-581
-
-
Arbilly, M.1
Motro, U.2
Feldman, M.W.3
Lotem, A.4
-
5
-
-
80054751337
-
Evolution of social learning when high expected payoffs are associated with high risk of failure
-
Arbilly M., Motro U., Feldman M.W., Lotem A. Evolution of social learning when high expected payoffs are associated with high risk of failure. Journal of The Royal Society Interface 2011, 8:1604-1615.
-
(2011)
Journal of The Royal Society Interface
, vol.8
, pp. 1604-1615
-
-
Arbilly, M.1
Motro, U.2
Feldman, M.W.3
Lotem, A.4
-
6
-
-
81155139598
-
Recombination and the evolution of coordinated phenotypic expression in a frequency-dependent game
-
Arbilly M., Motro U., Feldman M.W., Lotem A. Recombination and the evolution of coordinated phenotypic expression in a frequency-dependent game. Theoretical Population Biology 2011, 80:244-255.
-
(2011)
Theoretical Population Biology
, vol.80
, pp. 244-255
-
-
Arbilly, M.1
Motro, U.2
Feldman, M.W.3
Lotem, A.4
-
7
-
-
84949231290
-
Effective choice in the prisoner's dilemma
-
Axelrod R. Effective choice in the prisoner's dilemma. The Journal of Conflict Resolution 1980, 24:3-25.
-
(1980)
The Journal of Conflict Resolution
, vol.24
, pp. 3-25
-
-
Axelrod, R.1
-
8
-
-
0019480612
-
The evolution of cooperation
-
Axelrod R., Hamilton W.D. The evolution of cooperation. Science 1981, 211:1390-1396.
-
(1981)
Science
, vol.211
, pp. 1390-1396
-
-
Axelrod, R.1
Hamilton, W.D.2
-
9
-
-
0001793657
-
Dynamics of stochastic approximation algorithms
-
Springer, Berlin, J. Azéma (Ed.)
-
Benaïm M. Dynamics of stochastic approximation algorithms. Séminaire de Probabilités XXXIII, vol. 1709 1999, 1-68. Springer, Berlin. J. Azéma (Ed.).
-
(1999)
Séminaire de Probabilités XXXIII, vol. 1709
, pp. 1-68
-
-
Benaïm, M.1
-
10
-
-
84891163591
-
-
Promenade Aléatoire: Chaînes de Markov et Simulations; Martingales et Stratégies. Editions de l'Ecole Polytechnique, Palaiseau, France
-
Benaïm, M., El Karoui, N., 2005. Promenade Aléatoire: Chaînes de Markov et Simulations; Martingales et Stratégies. Editions de l'Ecole Polytechnique, Palaiseau, France.
-
(2005)
-
-
Benaïm, M.1
El Karoui, N.2
-
11
-
-
0002277539
-
Mixed equilibria and dynamical systems arising from fictitious play in perturbed games
-
Benaïm M., Hirsch M.W. Mixed equilibria and dynamical systems arising from fictitious play in perturbed games. Games and Economic Behavior 1999, 29:36-72.
-
(1999)
Games and Economic Behavior
, vol.29
, pp. 36-72
-
-
Benaïm, M.1
Hirsch, M.W.2
-
12
-
-
0039521654
-
Stochastic approximation algorithms with constant step size whose average is cooperative
-
Benaïm M., Hirsch M.W. Stochastic approximation algorithms with constant step size whose average is cooperative. The Annals of Applied Probability 1999, 9:216-241.
-
(1999)
The Annals of Applied Probability
, vol.9
, pp. 216-241
-
-
Benaïm, M.1
Hirsch, M.W.2
-
14
-
-
0024234018
-
Individual decisions and the distribution of predators in a patchy environment
-
Bernstein C., Kacelnik A., Krebs J.R. Individual decisions and the distribution of predators in a patchy environment. Journal of Animal Ecology 1988, 57:1007-1026.
-
(1988)
Journal of Animal Ecology
, vol.57
, pp. 1007-1026
-
-
Bernstein, C.1
Kacelnik, A.2
Krebs, J.R.3
-
15
-
-
44049110303
-
Evolutionary stability in repeated games played by finite automata
-
Binmore K.G., Samuelson L. Evolutionary stability in repeated games played by finite automata. Journal of Economic Theory 1992, 57:278-305.
-
(1992)
Journal of Economic Theory
, vol.57
, pp. 278-305
-
-
Binmore, K.G.1
Samuelson, L.2
-
16
-
-
0031281590
-
Learning through reinforcement and replicator dynamics
-
Börgers T., Sarin R. Learning through reinforcement and replicator dynamics. Journal of Economic Theory 1997, 77:1-14.
-
(1997)
Journal of Economic Theory
, vol.77
, pp. 1-14
-
-
Börgers, T.1
Sarin, R.2
-
19
-
-
0002672918
-
Iterative solution of games by fictitious play
-
Wiley, New York
-
Brown G.W. Iterative solution of games by fictitious play. Activity Analysis of Production and Allocation 1951, 374-376. Wiley, New York.
-
(1951)
Activity Analysis of Production and Allocation
, pp. 374-376
-
-
Brown, G.W.1
-
20
-
-
0001491619
-
A mathematical model for simple learning
-
Bush R.R., Mostelller F. A mathematical model for simple learning. Psychological Review 1951, 58:313-323.
-
(1951)
Psychological Review
, vol.58
, pp. 313-323
-
-
Bush, R.R.1
Mostelller, F.2
-
22
-
-
18644365144
-
Experienced-weighted attraction learning in normal form games
-
Camerer C., Ho T.H. Experienced-weighted attraction learning in normal form games. Econometrica 1999, 67:827-874.
-
(1999)
Econometrica
, vol.67
, pp. 827-874
-
-
Camerer, C.1
Ho, T.H.2
-
24
-
-
0000076619
-
Do chimpanzees cooperate in a learning task?
-
Chalmeau R. Do chimpanzees cooperate in a learning task?. Primates 1994, 35:385-392.
-
(1994)
Primates
, vol.35
, pp. 385-392
-
-
Chalmeau, R.1
-
25
-
-
79953131782
-
Aspiration learning in coordination games
-
Chasparis, G., Shamma, J., Arapostathis, A., 2010. Aspiration learning in coordination games. In 49th IEEE Conference on Decision and Control (CDC), pages 5756-5761.
-
(2010)
In 49th IEEE Conference on Decision and Control (CDC)
, pp. 5756-5761
-
-
Chasparis, G.1
Shamma, J.2
Arapostathis, A.3
-
28
-
-
79955607687
-
Physical constraints on the evolution of cooperation
-
Dijker A. Physical constraints on the evolution of cooperation. Evolutionary Biology 2011, 38:124-143.
-
(2011)
Evolutionary Biology
, vol.38
, pp. 124-143
-
-
Dijker, A.1
-
31
-
-
10344252314
-
The mentality of crows: Convergent evolution of intelligence in corvids and apes
-
Emery N.J., Clayton N.S. The mentality of crows: Convergent evolution of intelligence in corvids and apes. Science 2004, 306:1903-1907.
-
(2004)
Science
, vol.306
, pp. 1903-1907
-
-
Emery, N.J.1
Clayton, N.S.2
-
33
-
-
0038829878
-
Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria
-
Erev I., Roth A.E. Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria. American Economic Review 1998, 88:848-881.
-
(1998)
American Economic Review
, vol.88
, pp. 848-881
-
-
Erev, I.1
Roth, A.E.2
-
34
-
-
84871188711
-
Exposing the behavioral gambit: the evolution of learning and decision rules
-
Fawcett T.W., Hamblin S., Giraldeau L.-A. Exposing the behavioral gambit: the evolution of learning and decision rules. Behavioral Ecology 2013, 24:2-11.
-
(2013)
Behavioral Ecology
, vol.24
, pp. 2-11
-
-
Fawcett, T.W.1
Hamblin, S.2
Giraldeau, L.-A.3
-
35
-
-
0030186388
-
Individual versus social learning: evolutionary analysis in a fluctuating environment
-
Feldman M., Aoki K., Kumm J. Individual versus social learning: evolutionary analysis in a fluctuating environment. Anthropological Science 1996, 104:209-232.
-
(1996)
Anthropological Science
, vol.104
, pp. 209-232
-
-
Feldman, M.1
Aoki, K.2
Kumm, J.3
-
36
-
-
0141838158
-
Learning, hypothesis testing, and nash equilibrium
-
Foster D.P., Young H. Learning, hypothesis testing, and nash equilibrium. Games and Economic Behavior 2003, 45:73-96.
-
(2003)
Games and Economic Behavior
, vol.45
, pp. 73-96
-
-
Foster, D.P.1
Young, H.2
-
38
-
-
78650868877
-
Heterogeneous beliefs and local information in stochastic fictitious play
-
Fudenberg D., Takahashi S. Heterogeneous beliefs and local information in stochastic fictitious play. Games and Economic Behavior 2011, 71:100-120.
-
(2011)
Games and Economic Behavior
, vol.71
, pp. 100-120
-
-
Fudenberg, D.1
Takahashi, S.2
-
43
-
-
49949101671
-
Simple learning rules to cope with changing environments
-
Groß R., Houston A.I., Collins E.J., McNamara J.M., Dechaume-Moncharmont F.-X., Franks N.R. Simple learning rules to cope with changing environments. Journal of The Royal Society Interface 2008, 5:1193-1202.
-
(2008)
Journal of The Royal Society Interface
, vol.5
, pp. 1193-1202
-
-
Groß, R.1
Houston, A.I.2
Collins, E.J.3
McNamara, J.M.4
Dechaume-Moncharmont, F.-X.5
Franks, N.R.6
-
44
-
-
70649083163
-
Finding the evolutionarily stable learning rule for frequency-dependent foraging
-
Hamblin S., Giraldeau L.-A. Finding the evolutionarily stable learning rule for frequency-dependent foraging. Animal Behaviour 2009, 78:1343-1350.
-
(2009)
Animal Behaviour
, vol.78
, pp. 1343-1350
-
-
Hamblin, S.1
Giraldeau, L.-A.2
-
45
-
-
84878444206
-
-
MIT Press, Cambridge, MA, P. Hammerstein, J.R. Stevens (Eds.)
-
Evolution and the Mechanisms of Decision Making 2012, MIT Press, Cambridge, MA. P. Hammerstein, J.R. Stevens (Eds.).
-
(2012)
Evolution and the Mechanisms of Decision Making
-
-
-
46
-
-
0019885790
-
Learning the evolutionarily stable strategy
-
Harley C.B. Learning the evolutionarily stable strategy. Journal of Theoretical Biology 1981, 89:611-633.
-
(1981)
Journal of Theoretical Biology
, vol.89
, pp. 611-633
-
-
Harley, C.B.1
-
48
-
-
22144491187
-
-
Academic Press, San Diego, CA
-
Hirsch M.W., Smale S., Devaney R.L. Differential Equations, Dynamical Systems, and an Introduction to Chaos 2004, Academic Press, San Diego, CA.
-
(2004)
Differential Equations, Dynamical Systems, and an Introduction to Chaos
-
-
Hirsch, M.W.1
Smale, S.2
Devaney, R.L.3
-
49
-
-
33847042809
-
Self-tuning experience weighted attraction learning in games
-
Ho T.H., Camerer C.F., Chong J.-K. Self-tuning experience weighted attraction learning in games. Journal of Economic Theory 2007, 133:177-198.
-
(2007)
Journal of Economic Theory
, vol.133
, pp. 177-198
-
-
Ho, T.H.1
Camerer, C.F.2
Chong, J.-K.3
-
50
-
-
0036436650
-
On the global convergence of stochastic fictitious play
-
Hofbauer J., Sandholm W.H. On the global convergence of stochastic fictitious play. Econometrica 2002, 70:2265-2294.
-
(2002)
Econometrica
, vol.70
, pp. 2265-2294
-
-
Hofbauer, J.1
Sandholm, W.H.2
-
53
-
-
0000359838
-
Pavlovian conditioning of aggressive behavior in blue gourami fish (Trichogaster trichopterus): winners become winners and losers stay losers
-
Hollis K.L., Dumas M.J., Singh P., Fackelman P. Pavlovian conditioning of aggressive behavior in blue gourami fish (Trichogaster trichopterus): winners become winners and losers stay losers. Journal of Comparative Psychology 1995, 109:123-133.
-
(1995)
Journal of Comparative Psychology
, vol.109
, pp. 123-133
-
-
Hollis, K.L.1
Dumas, M.J.2
Singh, P.3
Fackelman, P.4
-
54
-
-
0036434064
-
Two competing models of how people learn in games
-
Hopkins E. Two competing models of how people learn in games. Econometrica 2002, 70:2141-2166.
-
(2002)
Econometrica
, vol.70
, pp. 2141-2166
-
-
Hopkins, E.1
-
55
-
-
0021097184
-
Comments on "Learning the evolutionarily stable strategy"
-
Houston A.I. Comments on "Learning the evolutionarily stable strategy". Journal of Theoretical Biology 1983, 105:175-178.
-
(1983)
Journal of Theoretical Biology
, vol.105
, pp. 175-178
-
-
Houston, A.I.1
-
57
-
-
35048853187
-
Transient and asymptotic dynamics of reinforcement learning in games
-
Izquierdo L.R., Izquierdo S.S., Gotts N.M., Polhill J.G. Transient and asymptotic dynamics of reinforcement learning in games. Games and Economic Behavior 2007, 61:259-276.
-
(2007)
Games and Economic Behavior
, vol.61
, pp. 259-276
-
-
Izquierdo, L.R.1
Izquierdo, S.S.2
Gotts, N.M.3
Polhill, J.G.4
-
58
-
-
0002298153
-
Bayesian learning in normal form games
-
Jordan J.S. Bayesian learning in normal form games. Games and Economic Behavior 1991, 3:60-81.
-
(1991)
Games and Economic Behavior
, vol.3
, pp. 60-81
-
-
Jordan, J.S.1
-
59
-
-
41949122499
-
A numerical analysis of the evolutionary stability of learning rules
-
Josephson J. A numerical analysis of the evolutionary stability of learning rules. Journal of Economic Dynamics and Control 2008, 32:1569-1599.
-
(2008)
Journal of Economic Dynamics and Control
, vol.32
, pp. 1569-1599
-
-
Josephson, J.1
-
65
-
-
0017526570
-
Analysis of recursive stochastic algorithms
-
Ljung L. Analysis of recursive stochastic algorithms. IEEE Transactions on Automatic Control 1977, 22:551-575.
-
(1977)
IEEE Transactions on Automatic Control
, vol.22
, pp. 551-575
-
-
Ljung, L.1
-
66
-
-
84871250172
-
Learning to avoid the behavioral gambit
-
13-13
-
Lotem A. Learning to avoid the behavioral gambit. Behavioral Ecology 2013, 24. 13-13.
-
(2013)
Behavioral Ecology
, vol.24
-
-
Lotem, A.1
-
70
-
-
34548719708
-
The logic of animal conflict
-
Maynard Smith J., Price G.R. The logic of animal conflict. Nature 1973, 246:15-18.
-
(1973)
Nature
, vol.246
, pp. 15-18
-
-
Maynard Smith, J.1
Price, G.R.2
-
71
-
-
84891148861
-
-
Baryplot 1.0
-
McElreath, R., 2010. Baryplot 1.0.
-
(2010)
-
-
McElreath, R.1
-
76
-
-
67349283062
-
Reinforcement learning in the brain
-
Niv Y. Reinforcement learning in the brain. Journal of Mathematical Psychology 2009, 53:139-154.
-
(2009)
Journal of Mathematical Psychology
, vol.53
, pp. 139-154
-
-
Niv, Y.1
-
77
-
-
0002710392
-
Some convergence theorems for stochastic learning models with distance diminishing operators
-
Norman M.F. Some convergence theorems for stochastic learning models with distance diminishing operators. Journal of Mathematical Psychology 1968, 5:61-101.
-
(1968)
Journal of Mathematical Psychology
, vol.5
, pp. 61-101
-
-
Norman, M.F.1
-
78
-
-
79953218067
-
Elephants know when they need a helping trunk in a cooperative task
-
Plotnik J.M., Lair R., Suphachoksahakun W., De Waal F.B.M. Elephants know when they need a helping trunk in a cooperative task. Proceedings of the National Academy of Sciences 2011, 108:5116-5121.
-
(2011)
Proceedings of the National Academy of Sciences
, vol.108
, pp. 5116-5121
-
-
Plotnik, J.M.1
Lair, R.2
Suphachoksahakun, W.3
De Waal, F.B.M.4
-
79
-
-
84884285374
-
R Development Core Team
-
R Foundation for Statistical Computing, Vienna, Austria
-
R Development Core Team 2011. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria.
-
(2011)
R: A Language and Environment for Statistical Computing
-
-
-
81
-
-
0002109138
-
A theory of pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
-
Appleton-Century-Crofts, New York (NY), A.H. Black, W.F. Prokasy (Eds.)
-
Rescorla R.A., Wagner A.R. A theory of pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. Classical Conditioning II: Current Research and Theory 1972, 64-99. Appleton-Century-Crofts, New York (NY). A.H. Black, W.F. Prokasy (Eds.).
-
(1972)
Classical Conditioning II: Current Research and Theory
, pp. 64-99
-
-
Rescorla, R.A.1
Wagner, A.R.2
-
82
-
-
0343329755
-
Density-dependent patch exploitation and acquisition of environmental information
-
Rodriguez-Gironés M.A., Vásquez R.A. Density-dependent patch exploitation and acquisition of environmental information. Theoretical Population Biology 1997, 52:32-42.
-
(1997)
Theoretical Population Biology
, vol.52
, pp. 32-42
-
-
Rodriguez-Gironés, M.A.1
Vásquez, R.A.2
-
83
-
-
84982364966
-
Does biology constrain culture
-
Rogers A.R. Does biology constrain culture. American Anthropologist 1988, 90:819-831.
-
(1988)
American Anthropologist
, vol.90
, pp. 819-831
-
-
Rogers, A.R.1
-
85
-
-
0000861816
-
Why imitate, and if so, how?
-
Schlag K.H. Why imitate, and if so, how?. Journal of Economic Theory 1998, 78:130-156.
-
(1998)
Journal of Economic Theory
, vol.78
, pp. 130-156
-
-
Schlag, K.H.1
-
87
-
-
0026300818
-
Change, regularity, and value in the evolution of animal learning
-
Stephens D.W. Change, regularity, and value in the evolution of animal learning. Behavioral Ecology 1991, 2:77-89.
-
(1991)
Behavioral Ecology
, vol.2
, pp. 77-89
-
-
Stephens, D.W.1
-
91
-
-
0029584479
-
Properties of evolutionarily stable learning rules
-
Tracy N.D., Seaman J.W. Properties of evolutionarily stable learning rules. Journal of Theoretical Biology 1995, 177:193-198.
-
(1995)
Journal of Theoretical Biology
, vol.177
, pp. 193-198
-
-
Tracy, N.D.1
Seaman, J.W.2
-
92
-
-
1142268235
-
A selection-mutation model for Q-learning in multi-agent systems
-
ACM, Melbourne, Australia
-
Tuyls K., Verbeeck K., Lenaerts T. A selection-mutation model for Q-learning in multi-agent systems. Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems 2003, 693-700. ACM, Melbourne, Australia.
-
(2003)
Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems
, pp. 693-700
-
-
Tuyls, K.1
Verbeeck, K.2
Lenaerts, T.3
-
93
-
-
77949873664
-
Analyzing behavior implied by EWA learning: an emphasis on distinguishing reinforcement from belief learning
-
van der Horst W., van Assen M., Snijders C. Analyzing behavior implied by EWA learning: an emphasis on distinguishing reinforcement from belief learning. Journal of Mathematical Psychology 2010, 54:222-229.
-
(2010)
Journal of Mathematical Psychology
, vol.54
, pp. 222-229
-
-
van der Horst, W.1
van Assen, M.2
Snijders, C.3
-
94
-
-
0032014621
-
Pavlovian conditioning of social affirmative behavior in the mongolian gerbil (Meriones unguiculatus)
-
Villarreal R., Domjan M. Pavlovian conditioning of social affirmative behavior in the mongolian gerbil (Meriones unguiculatus). Journal of Comparative Psychology 1998, 112:26-35.
-
(1998)
Journal of Comparative Psychology
, vol.112
, pp. 26-35
-
-
Villarreal, R.1
Domjan, M.2
-
96
-
-
0034951517
-
A simple learning strategy that realizes robust cooperation better than pavlov in iterated prisoners' dilemma
-
Wakano J.Y., Yamamura N. A simple learning strategy that realizes robust cooperation better than pavlov in iterated prisoners' dilemma. Journal of Ethology 2001, 19:1-8.
-
(2001)
Journal of Ethology
, vol.19
, pp. 1-8
-
-
Wakano, J.Y.1
Yamamura, N.2
-
97
-
-
80051590423
-
Individuality in nest building: do southern masked weaver (Ploceus velatus) males vary in their nest-building behaviour?
-
Walsh P.T., Hansell M., Borello W.D., Healy S.D. Individuality in nest building: do southern masked weaver (Ploceus velatus) males vary in their nest-building behaviour?. Behavioural Processes 2011, 88:1-6.
-
(2011)
Behavioural Processes
, vol.88
, pp. 1-6
-
-
Walsh, P.T.1
Hansell, M.2
Borello, W.D.3
Healy, S.D.4
-
99
-
-
84891160830
-
-
Wolfram Research, Inc. Mathematica, Version 8.0.4. Champaign, Illinois
-
Wolfram Research, Inc. 2011. Mathematica, Version 8.0.4. Champaign, Illinois.
-
(2011)
-
-
|