SCOPUS 정보 검색 플랫폼

IEEE Transactions on Neural Networks

Volumn 20, Issue 8, 2009, Pages 1368-1371

Simple artificial neural networks that match probability and exploit and explore when confronting a multiarmed bandit

(4) Dawson, Michael R W a Dupuis, Brian a Spetch, Marcia L a Kelly, Debbie M b

a UNIVERSITY OF ALBERTA (Canada)

b UNIVERSITY OF SASKATCHEWAN (Canada)

Author keywords

Instrumental learning; Multiarmed bandit; Operant conditioning; Perceptron; Probability matching

Indexed keywords

INSTRUMENTAL LEARNING; MULTIARMED BANDIT; OPERANT CONDITIONING; PERCEPTRON; PROBABILITY MATCHING;

BACKPROPAGATION; EDUCATION; NEURAL NETWORKS; PATTERN RECOGNITION SYSTEMS; PROBABILITY; REINFORCEMENT;

LEARNING ALGORITHMS;

ANIMAL; ARTICLE; ARTIFICIAL INTELLIGENCE; ARTIFICIAL NEURAL NETWORK; COMPUTER SIMULATION; DECISION MAKING; HUMAN; INSTRUMENTAL CONDITIONING; PROBABILITY; REINFORCEMENT;

ANIMALS; ARTIFICIAL INTELLIGENCE; CHOICE BEHAVIOR; COMPUTER SIMULATION; CONDITIONING, OPERANT; HUMANS; NEURAL NETWORKS (COMPUTER); PROBABILITY; REINFORCEMENT (PSYCHOLOGY); REINFORCEMENT SCHEDULE;

EID: 68949216971 PISSN: 10459227 EISSN: None Source Type: Journal
DOI: 10.1109/TNN.2009.2025588 Document Type: Article

Times cited : (20)

References (39)

1
- 27844539379
- Relative and absolute strength of response as a function of frequency of reinforcement
- R. J. Herrnstein, "Relative and absolute strength of response as a function of frequency of reinforcement," J. Exp. Anal. Behav., vol. 4, pp. 267-272, 1961.
- (1961) J. Exp. Anal. Behav , vol.4 , pp. 267-272
- Herrnstein, R.J.¹

2
- 0034014181
- An economist's perspective on probability matching
- Feb
- N. Vulkan, "An economist's perspective on probability matching," J. Econom. Surv., vol. 14, pp. 101-118, Feb. 2000.
- (2000) J. Econom. Surv , vol.14 , pp. 101-118
- Vulkan, N.¹

3
- 84891584370
- Chichester, U.K, Wiley
- J. C. Gittins, Multi-Armed Bandit Allocation Indices. Chichester, U.K.: Wiley, 1989.
- (1989) Multi-Armed Bandit Allocation Indices
- Gittins, J.C.¹

4
- 23444461158
- On the classic and modern theories of matching
- Jul
- J. J. McDowell, "On the classic and modern theories of matching," J. Exp. Anal. Behav., vol. 84, pp. 111-127, Jul. 2005.
- (2005) J. Exp. Anal. Behav , vol.84 , pp. 111-127
- McDowell, J.J.¹

5
- 0001730110
- Toward a law of response strength
- P. de Villiers and R. J. Herrnstein, "Toward a law of response strength," Psychol. Bull., vol. 83, pp. 1131-1153, 1976.
- (1976) Psychol. Bull , vol.83 , pp. 1131-1153
- de Villiers, P.¹ Herrnstein, R.J.²

6
- 0004129897
- Hillsdale, N.J, L. Erlbaum
- M. Davison and D. McCarthy, The Matching Law : A Research Review. Hillsdale, N.J.: L. Erlbaum, 1988.
- (1988) The Matching Law : A Research Review
- Davison, M.¹ McCarthy, D.²

7
- 85136499808
- Choice in concurrent schedules and a quantitative formulation of the law of effect
- W.K. Honig and J. E. R. Staddon, Eds. Englewood Cliffs, NJ: Prentice-Hall
- P. de Villiers, "Choice in concurrent schedules and a quantitative formulation of the law of effect," in Handbook of Operant Behavior W.K. Honig and J. E. R. Staddon, Eds. Englewood Cliffs, NJ: Prentice-Hall, 1977, pp. 233-287.
- (1977) Handbook of Operant Behavior , pp. 233-287
- de Villiers, P.¹

8
- 0003818321
- New York: Harvard Univ. Press
- R. J. Herrnstein, The Matching Law : Papers in Psychology and Economics. New York: Harvard Univ. Press, 1997.
- (1997) The Matching Law : Papers in Psychology and Economics
- Herrnstein, R.J.¹

9
- 84987278656
- Maximizing and matching on concurrent ratio schedules
- R. J. Herrnstein and D. H. Loveland, "Maximizing and matching on concurrent ratio schedules," J. Exp. Anal. Behav., vol. 24, pp. 107-116, 1975.
- (1975) J. Exp. Anal. Behav , vol.24 , pp. 107-116
- Herrnstein, R.J.¹ Loveland, D.H.²

10
- 11144273669
- The perceptron: A probabilistic model for information storage and organization in the brain
- F. Rosenblatt, "The perceptron: A probabilistic model for information storage and organization in the brain," Psychol. Rev., vol. 65, pp. 386-408, 1958.
- (1958) Psychol. Rev , vol.65 , pp. 386-408
- Rosenblatt, F.¹

11
- 0003952786
- Washington, DC: Spartan Books
- F. Rosenblatt, Principles of Neurodynamics. Washington, DC: Spartan Books, 1962.
- (1962) Principles of Neurodynamics
- Rosenblatt, F.¹

12
- 84889314058
- Malden, MA: Blackwell
- M. R. W. Dawson, Minds and Machines : Connectionism and Psychological Modeling. Malden, MA: Blackwell, 2004.
- (2004) Minds and Machines : Connectionism and Psychological Modeling
- Dawson, M.R.W.¹

13
- 68949215727
- M. R. W. Dawson, Connectionism and classical conditioning, Comparat. Cogn. Behav. Rev., 3, Monograph, pp. 1-115, 2008.
- M. R. W. Dawson, "Connectionism and classical conditioning," Comparat. Cogn. Behav. Rev., vol. 3, Monograph, pp. 1-115, 2008.

14
- 84889822618
- Connectionism
- 1st ed. Malden, MA: Blackwell
- M. R. W. Dawson, Connectionism : A Hands-on Approach, 1st ed. Malden, MA: Blackwell, 2005.
- (2005) A Hands-on Approach
- Dawson, M.R.W.¹

15
- 0027325820
- Choice in honeybees as a function of the probability of reward
- Aug
- M. E. Fischer, P. A. Couvillon, and M. E. Bitterman, "Choice in honeybees as a function of the probability of reward," Animal Learn. Behav., vol. 21, pp. 187-195, Aug. 1993.
- (1993) Animal Learn. Behav , vol.21 , pp. 187-195
- Fischer, M.E.¹ Couvillon, P.A.² Bitterman, M.E.³

16
- 0036862934
- Bees in two-armed bandit situations: Foraging choices and possible decision mechanisms
- Nov.-Dec
- T. Keasar, E. Rashkovich, D. Cohen, and A. Shmida, "Bees in two-armed bandit situations: Foraging choices and possible decision mechanisms," Behav. Ecol., vol. 13, pp. 757-765, Nov.-Dec. 2002.
- (2002) Behav. Ecol , vol.13 , pp. 757-765
- Keasar, T.¹ Rashkovich, E.² Cohen, D.³ Shmida, A.⁴

17
- 2642665866
- Probability-learning and habit-reversal in the cockroach
- N. Longo, "Probability-learning and habit-reversal in the cockroach," Amer. J. Psychol., vol. 77, pp. 29-41, 1964.
- (1964) Amer. J. Psychol , vol.77 , pp. 29-41
- Longo, N.¹

18
- 0036972336
- Evolution of reinforcement learning in uncertain environments: A simple explanation for complex foraging behaviors
- Y. Niv, D. Joel, I. Meilijson, and E. Ruppin, "Evolution of reinforcement learning in uncertain environments: A simple explanation for complex foraging behaviors," Adapt. Behav., vol. 10, pp. 5-24, 2002.
- (2002) Adapt. Behav , vol.10 , pp. 5-24
- Niv, Y.¹ Joel, D.² Meilijson, I.³ Ruppin, E.⁴

19
- 0003301122
- Probability-matching in the fish
- E. R. Behrend and M. E. Bitterman, "Probability-matching in the fish," Amer. J. Psychol., vol. 74, pp. 542-551, 1961.
- (1961) Amer. J. Psychol , vol.74 , pp. 542-551
- Behrend, E.R.¹ Bitterman, M.E.²

20
- 0003368011
- Probability-learning by the turtle
- K. L. Kirk and M. E. Bitterman, "Probability-learning by the turtle," Science, vol. 148, pp. 1484-1485, 1965.
- (1965) Science , vol.148 , pp. 1484-1485
- Kirk, K.L.¹ Bitterman, M.E.²

21
- 7344222429
- Further experiments on probability-matching in the pigeon
- V. Graf, D. H. Bullock, and M. E. Bitterman, "Further experiments on probability-matching in the pigeon," J. Exp. Anal. Behav., vol. 7, pp. 151-157, 1964.
- (1964) J. Exp. Anal. Behav , vol.7 , pp. 151-157
- Graf, V.¹ Bullock, D.H.² Bitterman, M.E.³

22
- 0001493642
- Analysis of a verbal conditioning situation in terms of statistical learning theory
- W. K. Estes and J. H. Straughan, "Analysis of a verbal conditioning situation in terms of statistical learning theory," J. Exp. Psychol. vol. 47, pp. 225-234, 1954.
- (1954) J. Exp. Psychol , vol.47 , pp. 225-234
- Estes, W.K.¹ Straughan, J.H.²

23
- 0021834552
- A new approach to the design of reinforcement schemes for learning automata
- Feb
- M. A. L. Thathachar and P. S. Sastry, "A new approach to the design of reinforcement schemes for learning automata," IEEE Trans. Syst. Man Cybern., vol. SMC-15, no. 1, pp. 168-175, Feb. 1985.
- (1985) IEEE Trans. Syst. Man Cybern , vol.SMC-15 , Issue.1 , pp. 168-175
- Thathachar, M.A.L.¹ Sastry, P.S.²

24
- 0029679044
- Reinforcement learning: A survey
- L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," J. Artif. Intell. Res., vol. 4, pp. 237-285, 1996.
- (1996) J. Artif. Intell. Res , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

25
- 0010790615
- Autonomous processing in PDP networks
- M. R. W. Dawson and D. P. Schopflocher, "Autonomous processing in PDP networks," Philosoph. Psychol., vol. 5, pp. 199-219, 1992.
- (1992) Philosoph. Psychol , vol.5 , pp. 199-219
- Dawson, M.R.W.¹ Schopflocher, D.P.²

26
- 4444227991
- Computational model of selection by consequences
- May
- J. J. McDowell, "Computational model of selection by consequences," J. Exp. Anal. Behav., vol. 81, pp. 297-317, May 2004.
- (2004) J. Exp. Anal. Behav , vol.81 , pp. 297-317
- McDowell, J.J.¹

27
- 33646035835
- The quantitative law of effect is a robust emergent property of an evolutionary algorithm for reinforcement learning
- Cambridge, MA:MIT Press
- J. J. McDowell and Z. Ansari, "The quantitative law of effect is a robust emergent property of an evolutionary algorithm for reinforcement learning," in Advances in Artificial Life. Cambridge, MA:MIT Press, 2005, vol. 3630, pp. 413-422.
- (2005) Advances in Artificial Life , vol.3630 , pp. 413-422
- McDowell, J.J.¹ Ansari, Z.²

28
- 34247149281
- Undermatching is an emergent property of selection by consequences
- Jun
- J. J. McDowell and M. L. Caron, "Undermatching is an emergent property of selection by consequences," Behav. Processes, vol. 75, pp. 97-106, Jun. 2007.
- (2007) Behav. Processes , vol.75 , pp. 97-106
- McDowell, J.J.¹ Caron, M.L.²

29
- 33750228991
- A computational theory of adaptive behavior based on an evolutionary reinforcement mechanism
- J. J. McDowell, P. L. Soto, J. Dallery, and S. Kulubekova, M. Keijzer, Ed, New York
- J. J. McDowell, P. L. Soto, J. Dallery, and S. Kulubekova, M. Keijzer, Ed., "A computational theory of adaptive behavior based on an evolutionary reinforcement mechanism," in Proc. Conf. Genetic Evol. Comput., New York, 2006, pp. 175-182.
- (2006) Proc. Conf. Genetic Evol. Comput , pp. 175-182

30
- 0024614579
- Evolution, selection and cognition: From "learning" to parameter setting in biology and in the study of language
- M. Piattelli-Palmarini, "Evolution, selection and cognition: From "learning" to parameter setting in biology and in the study of language," Cognition, vol. 31, pp. 1-44, 1989.
- (1989) Cognition , vol.31 , pp. 1-44
- Piattelli-Palmarini, M.¹

31
- 0003397519
- Boston, MA: Allyn and Bacon
- J. W. Donahoe and D. C. Palmer, Learning and Complex Behavior. Boston, MA: Allyn and Bacon, 1994.
- (1994) Learning and Complex Behavior
- Donahoe, J.W.¹ Palmer, D.C.²

32
- 68949203640
- Connectionist selectionism: A case study of parity
- R. B. T. Lowry and M. R. W. Dawson, "Connectionist selectionism: A case study of parity," Neural Inf. Process. - Lett. Rev., vol. 9, pp. 59-67, 2005.
- (2005) Neural Inf. Process. - Lett. Rev , vol.9 , pp. 59-67
- Lowry, R.B.T.¹ Dawson, M.R.W.²

33
- 0001594484
- Derivatives of matching
- R. J. Herrnstein, "Derivatives of matching," Psychol. Rev., vol. 86, pp. 486-495, 1979.
- (1979) Psychol. Rev , vol.86 , pp. 486-495
- Herrnstein, R.J.¹

34
- 0019537951
- Toward a modern theory of adaptive networks: Expectation and prediction
- R. S. Sutton and A. G. Barto, "Toward a modern theory of adaptive networks: Expectation and prediction," Psychol. Rev., vol. 88, pp. 135-170, 1981.
- (1981) Psychol. Rev , vol.88 , pp. 135-170
- Sutton, R.S.¹ Barto, A.G.²

35
- 0003497471
- Cambridge, U.K, Cambridge Univ. Press
- D. R. Shanks, The Psychology of Associative Learning. Cambridge, U.K.: Cambridge Univ. Press, 1995.
- (1995) The Psychology of Associative Learning
- Shanks, D.R.¹

36
- 0038172310
- Equilibria of the Rescorla-Wagner model
- Apr
- D. Danks, "Equilibria of the Rescorla-Wagner model," J. Math. Psychol., vol. 47, pp. 109-121, Apr. 2003.
- (2003) J. Math. Psychol , vol.47 , pp. 109-121
- Danks, D.¹

37
- 0004007508
- Cambridge, MA: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

38
- 0001304384
- On two types of deviation from the matching law: Bias and undermatching
- W. M. Baum, "On two types of deviation from the matching law: Bias and undermatching," J. Exp. Anal. Behav., vol. 22, pp. 231-242, 1974.
- (1974) J. Exp. Anal. Behav , vol.22 , pp. 231-242
- Baum, W.M.¹

39
- 21244442335
- Is there a geometric module for spatial orientation? Squaring theory and evidence
- Feb
- K. Cheng and N. S. Newcombe, "Is there a geometric module for spatial orientation? Squaring theory and evidence," Psychonom. Bull. Rev. vol. 12, pp. 1-23, Feb. 2005.
- (2005) Psychonom. Bull. Rev , vol.12 , pp. 1-23
- Cheng, K.¹ Newcombe, N.S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.