메뉴 건너뛰기




Volumn 361, Issue 1811, 2003, Pages 2225-2244

Isotropic-sequence-order learning in a closed-loop behavioural system

Author keywords

Autonomous behaviour; Control theory; Inverse controller; Reinforcement learning; Temporal sequence learning

Indexed keywords


EID: 0742301619     PISSN: 1364503X     EISSN: None     Source Type: Journal    
DOI: 10.1098/rsta.2003.1273     Document Type: Article
Times cited : (20)

References (30)
  • 2
    • 0003147032 scopus 로고
    • How to build complete creatures rather than isolated cognitive simulators
    • ed. K. VanLehn. Hillsdale, NJ: Erlbaum
    • Brooks, R. A. 1989 How to build complete creatures rather than isolated cognitive simulators. In Architectures for intelligence (ed. K. VanLehn), pp. 225-239. Hillsdale, NJ: Erlbaum.
    • (1989) Architectures for Intelligence , pp. 225-239
    • Brooks, R.A.1
  • 5
    • 0033629916 scopus 로고    scopus 로고
    • Reinforcement learning in continuous time and space
    • Doya, K. 2000 Reinforcement learning in continuous time and space. Neural Netw. 12, 219-245.
    • (2000) Neural Netw. , vol.12 , pp. 219-245
    • Doya, K.1
  • 8
    • 0035487297 scopus 로고    scopus 로고
    • Mosaic model for sensorimotor learning and control
    • Haruno, M., Wolpert, D. M. & Kawato, M. 2001 Mosaic model for sensorimotor learning and control. Neural Comput. 13, 2201-2220.
    • (2001) Neural Comput. , vol.13 , pp. 2201-2220
    • Haruno, M.1    Wolpert, D.M.2    Kawato, M.3
  • 9
    • 0042276164 scopus 로고
    • A drive-reinforcement model of single neuron function
    • (ed. J. S. Denker) New York: American Institute of Physics
    • Klopf, A. H. 1986 A drive-reinforcement model of single neuron function. In Neural networks for computing: AIP Conf. Proc. (ed. J. S. Denker), vol. 151, pp. 265-270. New York: American Institute of Physics.
    • (1986) Neural Networks for Computing: AIP Conf. Proc. , vol.151 , pp. 265-270
    • Klopf, A.H.1
  • 14
    • 0032124565 scopus 로고    scopus 로고
    • Representation in natural and artificial agents: A perspective from embodied cognitive science
    • Pfeifer, R. & Scheier, C. 1998 Representation in natural and artificial agents: a perspective from embodied cognitive science. Z. Naturforsch. C 53, 480-503.
    • (1998) Z. Naturforsch. C , vol.53 , pp. 480-503
    • Pfeifer, R.1    Scheier, C.2
  • 16
    • 0036813595 scopus 로고    scopus 로고
    • Isotropic sequence order learning using a novel linear algorithm in a closed loop behavioural system
    • Porr, B. & Wörgötter, F. 2002 Isotropic sequence order learning using a novel linear algorithm in a closed loop behavioural system. Biosystems 67, 195-202.
    • (2002) Biosystems , vol.67 , pp. 195-202
    • Porr, B.1    Wörgötter, F.2
  • 17
    • 0002109138 scopus 로고
    • A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and non-reinforcement
    • (ed. A. Black & W. Prokasy) New York: Appleton-Century-Crofts
    • Rescorla, R. & Wagner, A. 1972 A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and non-reinforcement. In Classical conditioning 2. Current theory and research (ed. A. Black & W. Prokasy), pp. 64-99. New York: Appleton-Century-Crofts.
    • (1972) Classical Conditioning 2. Current Theory and Research , pp. 64-99
    • Rescorla, R.1    Wagner, A.2
  • 18
    • 0032696609 scopus 로고    scopus 로고
    • Temporally asymmetric learning rules. I. Differential Hebbian learning
    • Roberts, P. D. 1999 Temporally asymmetric learning rules. I. Differential Hebbian learning. J. Comput. Neurosci. 7, 235-246.
    • (1999) J. Comput. Neurosci. , vol.7 , pp. 235-246
    • Roberts, P.D.1
  • 19
    • 0032865893 scopus 로고    scopus 로고
    • Exploration-tuned reinforcement function
    • Santos, J. M. & Touzet, C. 1999 Exploration-tuned reinforcement function. Neurocomputing 28, 93-105.
    • (1999) Neurocomputing , vol.28 , pp. 93-105
    • Santos, J.M.1    Touzet, C.2
  • 23
    • 33847202724 scopus 로고
    • Learning to predict by method of temporal differences
    • Sutton, R. 1988 Learning to predict by method of temporal differences. Mach. Learn. 3, 9-44.
    • (1988) Mach. Learn. , vol.3 , pp. 9-44
    • Sutton, R.1
  • 24
    • 0019537951 scopus 로고
    • Towards a modern theory of adaptive networks: Expectation and prediction
    • Sutton, R. & Barto, A. 1981 Towards a modern theory of adaptive networks: expectation and prediction. Psychol. Rev. 88, 135-170.
    • (1981) Psychol. Rev. , vol.88 , pp. 135-170
    • Sutton, R.1    Barto, A.2
  • 25
    • 0020076731 scopus 로고
    • Simulation of anticipatory responses in classical conditioning by a neuron-like adaptive element
    • Sutton, R. S. & Barto, A. G. 1982 Simulation of anticipatory responses in classical conditioning by a neuron-like adaptive element. Behav. Brain Sci. 4, 221-235.
    • (1982) Behav. Brain Sci. , vol.4 , pp. 221-235
    • Sutton, R.S.1    Barto, A.G.2
  • 26
    • 0000580224 scopus 로고
    • A temporal-difference model of classical conditioning
    • Mahwah, NJ: Erlbaum
    • Sutton, R. S. & Barto, A. G. 1987 A temporal-difference model of classical conditioning. In Proc. 9th Ann. Conf. Cognitive Sci. Soc., pp. 355-378. Mahwah, NJ: Erlbaum.
    • (1987) Proc. 9th Ann. Conf. Cognitive Sci. Soc. , pp. 355-378
    • Sutton, R.S.1    Barto, A.G.2
  • 28
    • 0032191778 scopus 로고    scopus 로고
    • A bottom-up approach towards the acquisition, retention, and expression of sequential representations: Distributed adaptive control. III
    • Verschure, P. & Voegtlin, T. 1998 A bottom-up approach towards the acquisition, retention, and expression of sequential representations: distributed adaptive control. III. Neural Netw. 11, 1531-1549.
    • (1998) Neural Netw. , vol.11 , pp. 1531-1549
    • Verschure, P.1    Voegtlin, T.2
  • 29
    • 0042276153 scopus 로고    scopus 로고
    • Learning and adaptation in constructivism
    • (ed. L. Smith) London: Routledge
    • von Glasersfeld, E. 1996 Learning and adaptation in constructivism. In Critical readings on Piaget (ed. L. Smith), pp. 22-27. London: Routledge.
    • (1996) Critical Readings on Piaget , pp. 22-27
    • Von Glasersfeld, E.1
  • 30
    • 0033667260 scopus 로고    scopus 로고
    • Computational principles of movement neuroscience
    • Wolpert, D. M. & Ghahramani, Z. 2000 Computational principles of movement neuroscience. Nature Neurosci. Suppl. 3, 1212-1217.
    • (2000) Nature Neurosci. Suppl. , vol.3 , pp. 1212-1217
    • Wolpert, D.M.1    Ghahramani, Z.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.