SCOPUS Information Search Platform - View Paper

Neural Networks

Volume 3, Issue 6, 1990, Pages 671-692

A stochastic reinforcement learning algorithm for learning real-valued functions

(1) Gullapalli, Vijaykumar a

a UNIVERSITY OF MASSACHUSETTS (United States)

Author keywords

Associative reinforcement learning; Learning algorithm; Neural networks; Neurocontrol; Real valued functions; Robotics; Shaping; Stochastic automata

Indexed keywords

COMPUTER PROGRAMMING--ALGORITHMS; LEARNING SYSTEMS; ROBOTS--MANIPULATORS;

LEARNING ALGORITHMS; REAL VALUED FUNCTIONS; REINFORCEMENT LEARNING; STOCHASTIC REINFORCEMENT LEARNING;

NEURAL NETWORKS;

EID: 0025600638 PISSN: 08936080 EISSN: None Source Type: Journal
DOI: 10.1016/0893-6080(90)90056-Q Document Type: Article

Times cited : (207)

References (31)

1
- 0001578518
- A learning algorithm for Boltzmann machines
- (1985) Cognitive Science , vol.9 , pp. 147-169
- Ackley¹ Hinton² Sejnowski³

2
- 0003942195
- BYTE Books, Peterborough, NH
- (1975) Brains, behavior, and robotics
- Albus¹

3
- 0023596261
- Stochastic learning networks and their electronic implementation
- Denver, Colorado
- (1987) Proceedings of the Conference on Neural Information Processing Systems
- Alspector¹ Allen² Hu³ Satyanarayana⁴

4
- 0003630733
- Learning and problem solving with multilayer connectionist systems
- University of Massachusetts, Amherst
- (1986) Ph.D. dissertation
- Anderson¹

5
- 0000960934
- Perceptual structures and distributed motor control
- V.B Brooks, American Physiological Society, Bethesda, MD
- (1981) Handbook of physiology—The nervous system II, motor control
- Arbib¹

6
- 0004014191
- Wiley, New York
- (1965) An introduction to mathematical learning theory
- Atkinson¹ Bower² Crothers³

7
- 0022213383
- Learning by statistical cooperation of self-interested neuron-like computing elements
- (1985) Human Neurobiology , vol.4 , pp. 229-256
- Barto¹

8
- 84927461265
- Pattern recognizing stochastic learning automata
- (1985) IEEE Transactions on Systems, Man, and Cybernetics , vol.15 , pp. 360-374
- Barto¹ Anandan²

9
- 0000683869
- Gradient following without back-propagation in layered networks
- San Diego, California
- (1987) Proceedings of the IEEE First Annual Conference on Neural Networks
- Barto¹ Jordan²

10
- 0019519039
- Associative search network: A reinforcement learning associative memory
- (1981) Biological Cybernetics , vol.40 , pp. 201-211
- Barto¹ Sutton² Brouwer³

11
- 84910776456
- Barto, A. G., Sutton, R. S., & Watkins, C. (to appear). Prediction, control, and learning. In M. Gabriel and J. W. Moore (Eds.), Learning and computational neuroscience. Cambridge, MA: The MIT Press.

12
- 0040300322
- Stanford University Press, Stanford, California
- (1959) Studies in mathematical learning theory
- Bush¹ Estes²

13
- 0003781528
- Wiley, New York
- (1955) Stochastic models for learning
- Bush¹ Mosteller²

14
- 84918304918
- Simulation of self-organizing systems by digital computer
- (1954) I.R.E. Transactions on Information Theory , vol.4 PGIT , pp. 76-84
- Farley¹ Clark²

15
- 0000562031
- A heuristic approach to reinforcement-learning control systems
- (1965) IEEE Transactions on Information Theory , vol.9 , pp. 390-398
- Fu¹ Waltz²

16
- 0012083286
- A stochastic algorithm for learning real-valued functions via reinforcement feedback
- Dept. of Computer and Info. Sciences, University of Massachusetts, Amherst
- (1988) COINS Technical Report 88–91
- Gullapalli¹

17
- 0016303884
- Alopex: A stochastic method for determining visual receptive fields
- (1974) Vision Research , vol.14 , pp. 1475-1482
- Harth¹ Tzanakou²

18
- 0004141541
- Connectionist learning procedures
- Department of Computer Science, Carnegie-Mellon University, Pittsburgh, PA 15213
- (1987) Technical Report CMU-CS-87-115
- Hinton¹

19
- 0003961888
- Yale University Press, New Haven
- (1952) A behavior system: An introduction to behavior theory concerning the individual organism
- Hull¹

20
- 0001887517
- Attractor dynamics and parallelism in a connectionist sequential machine
- Erlbaum, Hillsdale, NJ
- (1986) Proceedings of the Eighth Annual Conference of the Cognitive Science Society
- Jordan¹

21
- 84932306852
- Adaptive optimization procedures
- J.M Mendel, K.S Fu, Academic Press, New York and London
- (1970) Adaptive, learning and pattern recognition systems: Theory and applications
- McMurtry¹

22
- 0003799456
- Reinforcement-learning control and pattern recognition systems
- J.M Mendel, K.S Fu, Academic Press, New York and London
- (1970) Adaptive, learning and pattern recognition systems: Theory and applications
- Mendel¹ McLaren²

23
- 0017463231
- Learning automata—A critique
- (1977) Journal of Cybernetics and Information Science , vol.1 , pp. 53-65
- Narendra¹ Lakshmivarahan²

24
- 0003891507
- Prentice Hall, Englewood Cliffs, NJ
- (1989) Learning automata: An introduction
- Narendra¹ Thathachar²

25
- 0000646059
- Learning internal representations by error propagation
- D.E Rumelhart, J.L McClelland, MIT Press/Bradford Books, Cambridge
- (1986) Parallel distributed processing: Explorations in the microstructure of cognition. Vol. 1: Foundations
- Rumelhart¹ Hinton² Williams³

26
- 0003880401
- D. Appleton Century, New York
- (1938) The behavior of organisms: An experimental analysis
- Skinner¹

27
- 84910783121
- Unpublished working paper
- (1982) Ballistic bug
- Sutton¹

28
- 0003617454
- Temporal aspects of credit assignment in reinforcement learning
- University of Massachusetts, Amherst
- (1984) Ph.D. dissertation
- Sutton¹

29
- 0004162272
- Academic Press, New York
- (1973) Automata theory and modeling of biological systems
- Tsetlin¹

30
- 0002278965
- Adaptive switching circuits
- (1960) 1960 WESCON Convention Record Part IV , pp. 96-104
- Widrow¹ Hoff²

31
- 0012076854
- Reinforcement learning in connectionist networks: A mathematical analysis
- University of California, La Jolla, San Diego, Institute for Cognitive Science
- (1986) Technical Report 8605
- Williams¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.