Volume 97, Issue 5-6, 2007, Pages 363-378

Chained learning architectures in a simple closed-loop behavioural context

Author keywords

Driving robot; Line following; Sparse inputs; Temporal sequence learning; Weak correlations

Indexed keywords

CORRELATION METHODS; INTELLIGENT ROBOTS; PROBLEM SOLVING; SENSORS

EID: 40749152758     PISSN: 03401200     EISSN: 14320770     Source Type: Journal    
DOI: 10.1007/s00422-007-0176-y     Document Type: Article
Times cited: 10

References (49)
  • 3
    • 0034305816
    • Is heterosynaptic modulation essential for stabilizing Hebbian plasticity and memory
    • 1
    • Bailey CH, Giustetto M, Huang YY, Hawkins RD and Kandel ER (2000). Is heterosynaptic modulation essential for stabilizing Hebbian plasticity and memory. Nat Rev Neurosci 1(1): 11-20
    • (2000) Nat Rev Neurosci , vol.1 , pp. 11-20
    • Bailey, C.H.1    Giustetto, M.2    Huang, Y.Y.3    Hawkins, R.D.4    Kandel, E.R.5
  • 5
    • 0020970738
    • Neuronlike elements that can solve difficult learning control problems
    • Barto AG, Sutton RS and Anderson CW (1983). Neuronlike elements that can solve difficult learning control problems. IEEE Trans Syst Man Cybern 13: 835-846
    • (1983) IEEE Trans Syst Man Cybern , vol.13 , pp. 835-846
    • Barto, A.G.1    Sutton, R.S.2    Anderson, C.W.3
  • 7
    • 0033792710
    • Using pavlovian higher-order conditioning paradigms to investigate the neural substrates of emotional learning and memory
    • 5
    • Gewirtz JC and Davis M (2000). Using pavlovian higher-order conditioning paradigms to investigate the neural substrates of emotional learning and memory. Learn Mem 7(5): 257-266
    • (2000) Learn Mem , vol.7 , pp. 257-266
    • Gewirtz, J.C.1    Davis, M.2
  • 8
    • 0027831281
    • Neural network control for a closed-loop system using feedback-error-learning
    • 7
    • Gomi H and Kawato M (1993). Neural network control for a closed-loop system using feedback-error-learning. Neural Netw 6(7): 933-946
    • (1993) Neural Netw , vol.6 , pp. 933-946
    • Gomi, H.1    Kawato, M.2
  • 9
    • 0346096511
    • Presynaptic induction of heterosynaptic associative plasticity in the mammalian brain
    • 6968
    • Humeau Y, Shaban H, Bissiere S and Luthi A (2003). Presynaptic induction of heterosynaptic associative plasticity in the mammalian brain. Nature 426(6968): 841-845
    • (2003) Nature , vol.426 , pp. 841-845
    • Humeau, Y.1    Shaban, H.2    Bissiere, S.3    Luthi, A.4
  • 10
    • 0037246963
    • Role of AMPA and NMDA receptors in the nucleus accumbens shell in turning behaviour of rats: Interaction with dopamine and receptors
    • Ikeda H, Akiyama G, Fujii Y, Minowa R, Koshikawa N and Cools A (2003). Role of AMPA and NMDA receptors in the nucleus accumbens shell in turning behaviour of rats: interaction with dopamine and receptors. Neuropharmacology 44: 81-87
    • (2003) Neuropharmacology , vol.44 , pp. 81-87
    • Ikeda, H.1    Akiyama, G.2    Fujii, Y.3    Minowa, R.4    Koshikawa, N.5    Cools, A.6
  • 11
    • 33646819011
    • Second-order conditioning of human causal learning
    • Jara E, Vila J and Maldonado A (2006). Second-order conditioning of human causal learning. Learn Motiv 37: 230-246
    • (2006) Learn Motiv , vol.37 , pp. 230-246
    • Jara, E.1    Vila, J.2    Maldonado, A.3
  • 12
    • 0038445711
    • Dopamine: A potential substrate for synaptic plasticity and memory mechanisms
    • 6
    • Jay T (2003). Dopamine: a potential substrate for synaptic plasticity and memory mechanisms. Prog Neurobiol 69(6): 375-390
    • (2003) Prog Neurobiol , vol.69 , pp. 375-390
    • Jay, T.1
  • 14
    • 0032779673
    • Functional specificity of ventral striatal compartments in appetitive behaviors
    • Kelley AE (1999). Functional specificity of ventral striatal compartments in appetitive behaviors. Ann NY Acad Sci 877: 71-90
    • (1999) Ann NY Acad Sci , vol.877 , pp. 71-90
    • Kelley, A.E.1
  • 15
    • 0023878618
    • A neuronal model of classical conditioning
    • 2
    • Klopf AH (1988). A neuronal model of classical conditioning. Psychobiology 16(2): 85-123
    • (1988) Psychobiology , vol.16 , pp. 85-123
    • Klopf, A.H.1
  • 16
    • 40749121160
    • Mathematical properties of neuronal TD-rules and differential Hebbian learning: A comparison
    • submitted
    • Kolodziejski C, Wörgötter F, Porr B (2007) Mathematical properties of neuronal TD-rules and differential Hebbian learning: A comparison. Biol Cybern (submitted)
    • (2007) Biol Cybern
    • Kolodziejski, C.1    Wörgötter, F.2    Porr, B.3
  • 17
    • 0042276165
    • Differential Hebbian learning
    • Denker JS (ed) American Institute of Physics, New York
    • Kosko B (1986) Differential Hebbian learning. In: Denker JS (ed) Neural networks for computing: AIP Conference Proceedings, vol. 151. American Institute of Physics, New York
    • (1986) Neural Networks for Computing: AIP Conference Proceedings , vol.151
    • Kosko, B.1
  • 19
    • 34547609559
    • Adaptive, fast walking in a biped robot under neuronal control and learning
    • doi: 10.1371/journal.pcbi.0030134
    • Manoonpong P, Geng T, Kulvicius T, Porr B, Wörgötter F (2007) Adaptive, fast walking in a biped robot under neuronal control and learning. PLoS Comput Biol 3(7):e134 doi: 10.1371/journal.pcbi.0030134
    • (2007) PLoS Comput Biol , vol.3 , Issue.7
    • Manoonpong, P.1    Geng, T.2    Kulvicius, T.3    Porr, B.4    Wörgötter, F.5
  • 22
    • 33644772461
    • A cerebellar model for predictive motor control tested in a brain-based device
    • 9
    • McKinstry JL, Edelman GM and Krichmar JL (2006). A cerebellar model for predictive motor control tested in a brain-based device. Proc Natl Acad Sci USA 103(9): 3387-3392
    • (2006) Proc Natl Acad Sci USA , vol.103 , pp. 3387-3392
    • McKinstry, J.L.1    Edelman, G.M.2    Krichmar, J.L.3
  • 23
    • 0028972278
    • Bee foraging in uncertain environments using predictive Hebbian learning
    • Montague PR, Dayan P, Person C and Sejnowski TJ (1995). Bee foraging in uncertain environments using predictive Hebbian learning. Nature 377: 725-728
    • (1995) Nature , vol.377 , pp. 725-728
    • Montague, P.R.1    Dayan, P.2    Person, C.3    Sejnowski, T.J.4
  • 24
    • 8444251345
    • Feedback error learning and nonlinear adaptive control
    • Nakanishi J and Schaal S (2004). Feedback error learning and nonlinear adaptive control. Neural Netw 17: 1453-1465
    • (2004) Neural Netw , vol.17 , pp. 1453-1465
    • Nakanishi, J.1    Schaal, S.2
  • 25
    • 0036972336
    • Evolution of reinforcement learning in uncertain environments: A simple explanation for complex foraging behaviors
    • 1
    • Niv Y, Joel D, Meilijson I and Ruppin E (2002). Evolution of reinforcement learning in uncertain environments: a simple explanation for complex foraging behaviors. Adapt Behav 10(1): 5-24
    • (2002) Adapt Behav , vol.10 , pp. 5-24
    • Niv, Y.1    Joel, D.2    Meilijson, I.3    Ruppin, E.4
  • 26
    • 0008570374
    • Neural network vision for robot driving
    • Oxford University Press New York
    • Pomerleau D (1996). Neural network vision for robot driving. In: Nayar, S and Poggio, T (eds) Early visual learning, pp 161-181. Oxford University Press, New York
    • (1996) Early Visual Learning , pp. 161-181
    • Pomerleau, D.1    Nayar, S.2    Poggio, T.3
  • 27
    • 0037686661
    • Isotropic sequence order learning
    • Porr B and Wörgötter F (2003a). Isotropic sequence order learning. Neural Comp 15: 831-864
    • (2003) Neural Comp , vol.15 , pp. 831-864
    • Porr, B.1    Wörgötter, F.2
  • 28
    • 0742301619
    • Isotropic sequence order learning in a closed loop behavioural system
    • 1811
    • Porr B and Wörgötter F (2003b). Isotropic sequence order learning in a closed loop behavioural system. R Soc Phil Trans Math Phys Eng Sci 361(1811): 2225-2244
    • (2003) R Soc Phil Trans Math Phys Eng Sci , vol.361 , pp. 2225-2244
    • Porr, B.1    Wörgötter, F.2
  • 29
    • 33646781302
    • Strongly improved stability and faster convergence of temporal sequence learning by utilising input correlations only
    • 6
    • Porr B and Wörgötter F (2006). Strongly improved stability and faster convergence of temporal sequence learning by utilising input correlations only. Neural Comp 18(6): 1380-1412
    • (2006) Neural Comp , vol.18 , pp. 1380-1412
    • Porr, B.1    Wörgötter, F.2
  • 30
    • 0038362695
    • Iso-learning approximates a solution to the inverse controller problem in an unsupervised behavioural paradigm
    • Porr B, Ferber C and Wörgötter F (2003). Iso-learning approximates a solution to the inverse controller problem in an unsupervised behavioural paradigm. Neural Comp 15: 865-884
    • (2003) Neural Comp , vol.15 , pp. 865-884
    • Porr, B.1    Ferber, C.2    Wörgötter, F.3
  • 31
    • 0038362695
    • ISO-learning approximates a solution to the inverse-controller problem in an unsupervised behavioral paradigm
    • Porr B, Wörgötter F and Ferber C (2003). ISO-learning approximates a solution to the inverse-controller problem in an unsupervised behavioral paradigm. Neural Comp 15: 865-884
    • (2003) Neural Comp , vol.15 , pp. 865-884
    • Porr, B.1    Von Ferber, C.2    Wörgötter, F.3
  • 33
    • 0035315989
    • Temporal difference model reproduces anticipatory neural activity
    • 4
    • Schultz W and Suri RE (2001). Temporal difference model reproduces anticipatory neural activity. Neural Comp 13(4): 841-862
    • (2001) Neural Comp , vol.13 , pp. 841-862
    • Schultz, W.1    Suri, R.E.2
  • 34
    • 0031854385
    • Learning of sequential movements by neural network model with dopamine-like reinforcement signal
    • Suri RE and Schultz W (1998). Learning of sequential movements by neural network model with dopamine-like reinforcement signal. Exp Brain Res 121: 350-354
    • (1998) Exp Brain Res , vol.121 , pp. 350-354
    • Suri, R.E.1    Schultz, W.2
  • 35
    • 0019537951
    • Towards a modern theory of adaptive networks: Expectation and prediction
    • Sutton R and Barto A (1981). Towards a modern theory of adaptive networks: expectation and prediction. Psychol Rev 88: 135-170
    • (1981) Psychol Rev , vol.88 , pp. 135-170
    • Sutton, R.1    Barto, A.2
  • 36
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • Sutton RS (1988). Learning to predict by the methods of temporal differences. Mach Learn 3: 9-44
    • (1988) Mach Learn , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 39
    • 0037321236
    • Mossy fibre synaptic NMDA receptors trigger non-Hebbian long-term potentiation at entorhino-CA3 synapses in the rat
    • 3
    • Tsukamoto M, Yasui T, Yamada MK, Nishiyama N, Matsuki N and Ikegaya Y (2003). Mossy fibre synaptic NMDA receptors trigger non-Hebbian long-term potentiation at entorhino-CA3 synapses in the rat. J Physiol 546(3): 665-675
    • (2003) J Physiol , vol.546 , pp. 665-675
    • Tsukamoto, M.1    Yasui, T.2    Yamada, M.K.3    Nishiyama, N.4    Matsuki, N.5    Ikegaya, Y.6
  • 40
    • 0037972164
    • A real-world rational agent: Unifying old and new AI
    • Verschure P and Althaus P (2003). A real-world rational agent: unifying old and new AI. Cogn Sci 27: 561-590
    • (2003) Cogn Sci , vol.27 , pp. 561-590
    • Verschure, P.1    Althaus, P.2
  • 41
    • 0006138956
    • Adaptive fields: Distributed representations of classically conditioned associations
    • Verschure P and Coolen A (1991). Adaptive fields: distributed representations of classically conditioned associations. Network 2: 189-206
    • (1991) Network , vol.2 , pp. 189-206
    • Verschure, P.1    Coolen, A.2
  • 42
    • 0003117086
    • An imitation of life
    • Walter WG (1950). An imitation of life. Sci Am 182: 42-45
    • (1950) Sci Am , vol.182 , pp. 42-45
    • Walter, W.G.1
  • 44
    • 34249833101
    • Technical note: Q-Learning
    • Watkins CJCH and Dayan P (1992). Technical note: Q-Learning. Mach Learn 8: 279-292
    • (1992) Mach Learn , vol.8 , pp. 279-292
    • Watkins, C.J.C.H.1    Dayan, P.2
  • 45
    • 0037118075
    • Robots in invertebrate neuroscience
    • Webb B (2002). Robots in invertebrate neuroscience. Nature 417: 359-363
    • (2002) Nature , vol.417 , pp. 359-363
    • Webb, B.1
  • 47
    • 0017524329
    • An adaptive optimal controller for discrete-time Markov environments
    • Witten IH (1977). An adaptive optimal controller for discrete-time Markov environments. Inf Control 34: 286-295
    • (1977) Inf Control , vol.34 , pp. 286-295
    • Witten, I.H.1
  • 48
    • 13244267004
    • Temporal sequence learning for prediction and control - A review of different models and their relation to biological mechanisms
    • Wörgötter F and Porr B (2005). Temporal sequence learning for prediction and control - a review of different models and their relation to biological mechanisms. Neural Comp 17: 245-319
    • (2005) Neural Comp , vol.17 , pp. 245-319
    • Wörgötter, F.1    Porr, B.2
  • 49
    • 10744226779
    • Involving the motor system in decision making
    • Suppl 3
    • Wyss R, König P and Verschure PFMJ (2004). Involving the motor system in decision making. Proc Biol Sci 271(Suppl 3): 50-52
    • (2004) Proc Biol Sci , vol.271 , pp. 50-52
    • Wyss, R.1    König, P.2    Verschure, P.F.M.J.3


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.