SCOPUS 정보 검색 플랫폼

Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences

Volumn 361, Issue 1811, 2003, Pages 2225-2244

Isotropic-sequence-order learning in a closed-loop behavioural system

(2) Porr, Bernd a Wörgötter, Florentin a

a UNIVERSITY OF STIRLING (United Kingdom)

Author keywords

Autonomous behaviour; Control theory; Inverse controller; Reinforcement learning; Temporal sequence learning

Indexed keywords

EID: 0742301619 PISSN: 1364503X EISSN: None Source Type: Journal
DOI: 10.1098/rsta.2003.1273 Document Type: Article

Times cited : (20)

References (30)

1
- 0003685190
- Wiley
- Blinchikoff, H. J. 1976 Filtering in the time and frequency domain. Wiley.
- (1976) Filtering in the Time and Frequency Domain
- Blinchikoff, H.J.¹

2
- 0003147032
- How to build complete creatures rather than isolated cognitive simulators
- ed. K. VanLehn. Hillsdale, NJ: Erlbaum
- Brooks, R. A. 1989 How to build complete creatures rather than isolated cognitive simulators. In Architectures for intelligence (ed. K. VanLehn), pp. 225-239. Hillsdale, NJ: Erlbaum.
- (1989) Architectures for Intelligence , pp. 225-239
- Brooks, R.A.¹

3
- 33745200552
- True autonomy from self-organised adaptivity
- HP Bristol Labs, Bristol, UK, 14-16 August 2002 (ed. R. Damper & D. Cliff)
- Der, R. & Liebscher, R. 2002 True autonomy from self-organised adaptivity. In WGW'02, Proc. EPSRC/BBSRC Int. Biologically Inspired Robotics: the legacy of W. Grey Walter, HP Bristol Labs, Bristol, UK, 14-16 August 2002 (ed. R. Damper & D. Cliff).
- (2002) WGW'02, Proc. EPSRC/BBSRC Int. Biologically Inspired Robotics: The Legacy of W. Grey Walter
- Der, R.¹ Liebscher, R.²

4
- 0003405693
- London: Van Nostrand Reinhold
- Doetsch, G. 1961 Guide to the applications of the Laplace and z-transforms. London: Van Nostrand Reinhold.
- (1961) Guide to the Applications of the Laplace and z-Transforms
- Doetsch, G.¹

5
- 0033629916
- Reinforcement learning in continuous time and space
- Doya, K. 2000 Reinforcement learning in continuous time and space. Neural Netw. 12, 219-245.
- (2000) Neural Netw. , vol.12 , pp. 219-245
- Doya, K.¹

6
- 0142055195
- Agents as anticipatory systems
- Skokie, IL: IIIS
- Ekdahl, B. 2000 Agents as anticipatory systems. In Proc. 4th World Multiconference on Systemic, Cybernetics and Informatics (SCI 2000) and 6th Int. Conf. on Information Systems Analysis and Synthesis (ISAS 2000), Orlando, FL 23-26 July 2000, pp. 133-137. Skokie, IL: IIIS.
- (2000) Proc. 4th World Multiconference on Systemic, Cybernetics and Informatics (SCI 2000) and 6th Int. Conf. on Information Systems Analysis and Synthesis (ISAS 2000), Orlando, FL 23-26 July 2000 , pp. 133-137
- Ekdahl, B.¹

7
- 0142055194
- Darmstadt: Wissenschaftliche Buchgesellschaft
- Haken, H. 1995 Entstehung von Biologischer Information und Ordnung. Darmstadt: Wissenschaftliche Buchgesellschaft.
- (1995) Entstehung von Biologischer Information und Ordnung
- Haken, H.¹

8
- 0035487297
- Mosaic model for sensorimotor learning and control
- Haruno, M., Wolpert, D. M. & Kawato, M. 2001 Mosaic model for sensorimotor learning and control. Neural Comput. 13, 2201-2220.
- (2001) Neural Comput. , vol.13 , pp. 2201-2220
- Haruno, M.¹ Wolpert, D.M.² Kawato, M.³

9
- 0042276164
- A drive-reinforcement model of single neuron function
- (ed. J. S. Denker) New York: American Institute of Physics
- Klopf, A. H. 1986 A drive-reinforcement model of single neuron function. In Neural networks for computing: AIP Conf. Proc. (ed. J. S. Denker), vol. 151, pp. 265-270. New York: American Institute of Physics.
- (1986) Neural Networks for Computing: AIP Conf. Proc. , vol.151 , pp. 265-270
- Klopf, A.H.¹

10
- 0003821776
- Academic
- McFarland, D. J. 1971 Feedback mechanisms in animal behaviour. Academic.
- (1971) Feedback Mechanisms in Animal Behaviour
- McFarland, D.J.¹

11
- 0004231438
- Harlow: Longman
- McFarland, D. J. 1989 Problems of animal behaviour. Harlow: Longman.
- (1989) Problems of Animal Behaviour
- McFarland, D.J.¹

12
- 0036832960
- Continuous-action Q-learning
- Millán, J. D. R., Posentano, D. & Dedieu, E. 2002 Continuous-action Q-learning. Mach. Learn. 49, 247-265.
- (2002) Mach. Learn. , vol.49 , pp. 247-265
- Millán, J.D.R.¹ Posentano, D.² Dedieu, E.³

13
- 0003752758
- Wiley
- Palm, W. J. 2000 Modelling, analysis and control of dynamic systems. Wiley.
- (2000) Modelling, Analysis and Control of Dynamic Systems
- Palm, W.J.¹

14
- 0032124565
- Representation in natural and artificial agents: A perspective from embodied cognitive science
- Pfeifer, R. & Scheier, C. 1998 Representation in natural and artificial agents: a perspective from embodied cognitive science. Z. Naturforsch. C 53, 480-503.
- (1998) Z. Naturforsch. C , vol.53 , pp. 480-503
- Pfeifer, R.¹ Scheier, C.²

15
- 0004108039
- Cambridge, MA: MIT Press
- Pfeifer, R. & Scheier, C. 1999 Understanding intelligence. Cambridge, MA: MIT Press.
- (1999) Understanding Intelligence
- Pfeifer, R.¹ Scheier, C.²

16
- 0036813595
- Isotropic sequence order learning using a novel linear algorithm in a closed loop behavioural system
- Porr, B. & Wörgötter, F. 2002 Isotropic sequence order learning using a novel linear algorithm in a closed loop behavioural system. Biosystems 67, 195-202.
- (2002) Biosystems , vol.67 , pp. 195-202
- Porr, B.¹ Wörgötter, F.²

17
- 0002109138
- A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and non-reinforcement
- (ed. A. Black & W. Prokasy) New York: Appleton-Century-Crofts
- Rescorla, R. & Wagner, A. 1972 A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and non-reinforcement. In Classical conditioning 2. Current theory and research (ed. A. Black & W. Prokasy), pp. 64-99. New York: Appleton-Century-Crofts.
- (1972) Classical Conditioning 2. Current Theory and Research , pp. 64-99
- Rescorla, R.¹ Wagner, A.²

18
- 0032696609
- Temporally asymmetric learning rules. I. Differential Hebbian learning
- Roberts, P. D. 1999 Temporally asymmetric learning rules. I. Differential Hebbian learning. J. Comput. Neurosci. 7, 235-246.
- (1999) J. Comput. Neurosci. , vol.7 , pp. 235-246
- Roberts, P.D.¹

19
- 0032865893
- Exploration-tuned reinforcement function
- Santos, J. M. & Touzet, C. 1999 Exploration-tuned reinforcement function. Neurocomputing 28, 93-105.
- (1999) Neurocomputing , vol.28 , pp. 93-105
- Santos, J.M.¹ Touzet, C.²

20
- 0010121618
- Categorization in a real-world agent using haptic exploration and active perception
- Cape Cod, MA. Cambridge, MA: MIT Press
- Scheier, C. & Lambrosios, D. 1996 Categorization in a real-world agent using haptic exploration and active perception. In From animals to animals, 4th Int. Conf. on Simulation of Adaptive Behaviour, Cape Cod, MA. Cambridge, MA: MIT Press.
- (1996) From Animals to Animals, 4th Int. Conf. on Simulation of Adaptive Behaviour
- Scheier, C.¹ Lambrosios, D.²

21
- 0003663134
- Oxford University Press
- Shepherd, G. M. (ed.) 1990 The synaptic organisation of the brain. Oxford University Press.
- (1990) The Synaptic Organisation of the Brain
- Shepherd, G.M.¹

22
- 0042777297
- New York: McGraw-Hill
- Stewart, J. L. 1960 Fundamentals of signal theory. New York: McGraw-Hill.
- (1960) Fundamentals of Signal Theory
- Stewart, J.L.¹

23
- 33847202724
- Learning to predict by method of temporal differences
- Sutton, R. 1988 Learning to predict by method of temporal differences. Mach. Learn. 3, 9-44.
- (1988) Mach. Learn. , vol.3 , pp. 9-44
- Sutton, R.¹

24
- 0019537951
- Towards a modern theory of adaptive networks: Expectation and prediction
- Sutton, R. & Barto, A. 1981 Towards a modern theory of adaptive networks: expectation and prediction. Psychol. Rev. 88, 135-170.
- (1981) Psychol. Rev. , vol.88 , pp. 135-170
- Sutton, R.¹ Barto, A.²

25
- 0020076731
- Simulation of anticipatory responses in classical conditioning by a neuron-like adaptive element
- Sutton, R. S. & Barto, A. G. 1982 Simulation of anticipatory responses in classical conditioning by a neuron-like adaptive element. Behav. Brain Sci. 4, 221-235.
- (1982) Behav. Brain Sci. , vol.4 , pp. 221-235
- Sutton, R.S.¹ Barto, A.G.²

26
- 0000580224
- A temporal-difference model of classical conditioning
- Mahwah, NJ: Erlbaum
- Sutton, R. S. & Barto, A. G. 1987 A temporal-difference model of classical conditioning. In Proc. 9th Ann. Conf. Cognitive Sci. Soc., pp. 355-378. Mahwah, NJ: Erlbaum.
- (1987) Proc. 9th Ann. Conf. Cognitive Sci. Soc. , pp. 355-378
- Sutton, R.S.¹ Barto, A.G.²

27
- 0003066891
- Time-derivative models of Pavlovian reinforcement
- (ed. M. Gabriel & J. Moore) Cambridge, MA: MIT Press
- Sutton, R. S. & Barto, A. G. 1990 Time-derivative models of Pavlovian reinforcement. In Learning and computational neuroscience. Foundations of adaptive networks (ed. M. Gabriel & J. Moore), pp. 497-537. Cambridge, MA: MIT Press.
- (1990) Learning and Computational Neuroscience. Foundations of Adaptive Networks , pp. 497-537
- Sutton, R.S.¹ Barto, A.G.²

28
- 0032191778
- A bottom-up approach towards the acquisition, retention, and expression of sequential representations: Distributed adaptive control. III
- Verschure, P. & Voegtlin, T. 1998 A bottom-up approach towards the acquisition, retention, and expression of sequential representations: distributed adaptive control. III. Neural Netw. 11, 1531-1549.
- (1998) Neural Netw. , vol.11 , pp. 1531-1549
- Verschure, P.¹ Voegtlin, T.²

29
- 0042276153
- Learning and adaptation in constructivism
- (ed. L. Smith) London: Routledge
- von Glasersfeld, E. 1996 Learning and adaptation in constructivism. In Critical readings on Piaget (ed. L. Smith), pp. 22-27. London: Routledge.
- (1996) Critical Readings on Piaget , pp. 22-27
- Von Glasersfeld, E.¹

30
- 0033667260
- Computational principles of movement neuroscience
- Wolpert, D. M. & Ghahramani, Z. 2000 Computational principles of movement neuroscience. Nature Neurosci. Suppl. 3, 1212-1217.
- (2000) Nature Neurosci. Suppl. , vol.3 , pp. 1212-1217
- Wolpert, D.M.¹ Ghahramani, Z.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.