



Volume 9, Issue 6, 1998, Pages 1502-1508

Reinforcement learning to train a cooperative network with both discrete and continuous output neurons

Author keywords

Aplysia; Cooperative networks; Discrete and continuous output neurons; Fast learning; Fine control; Inverted pendulum; Reinforcement learning

Indexed keywords

COMPUTER SIMULATION; CONTROL; LEARNING SYSTEMS; NEURAL NETWORKS

EID: 0032208721     PISSN: 10459227     EISSN: None     Source Type: Journal    
DOI: 10.1109/72.728399     Document Type: Article
Times cited: 9

References (21)
  • 1
    • 0020970738
    • Neuron like adaptive elements that can solve difficult learning control problems
    • A. G. Barto, R. S. Sutton, and C. W. Anderson, "Neuron like adaptive elements that can solve difficult learning control problems," IEEE Trans. Sys. Man Cybern., vol. SMC-13, pp. 834-846, 1983.
    • (1983) IEEE Trans. Sys. Man Cybern. , vol.SMC-13 , pp. 834-846
    • Barto, A.G.1    Sutton, R.S.2    Anderson, C.W.3
  • 2
    • 0024646143
    • Learning to control an inverted pendulum using neural networks
    • C. W. Anderson, "Learning to control an inverted pendulum using neural networks," IEEE Contr. Syst. Mag., vol. 9, pp. 31-37, 1989.
    • (1989) IEEE Contr. Syst. Mag. , vol.9 , pp. 31-37
    • Anderson, C.W.1
  • 4
    • 0000985504
    • TD-gammon, a self-teaching backgammon program, achieves master-level play
    • G. Tesauro, "TD-gammon, a self-teaching backgammon program, achieves master-level play," Neural Comput., vol. 6, pp. 215-219, 1994.
    • (1994) Neural Comput. , vol.6 , pp. 215-219
    • Tesauro, G.1
  • 5
    • 0028733168
    • Continuous valued reinforcement learning
    • C. H. Dagli, B. R. Fernández, J. Ghosh, and R. T. Soundar Kumara, Eds. Providence, RI: ASME Press
    • W. C. Jouse, "Continuous valued reinforcement learning," in Intell. Eng. Syst. Artificial Neural Networks, C. H. Dagli, B. R. Fernández, J. Ghosh, and R. T. Soundar Kumara, Eds. Providence, RI: ASME Press, vol. 4, 1994, pp. 141-146.
    • (1994) Intell. Eng. Syst. Artificial Neural Networks , vol.4 , pp. 141-146
    • Jouse, W.C.1
  • 6
    • 0027814748
    • Neural-network model to simulate neuronal responses of Aplysia gill-withdrawal reflex
    • S. Yamada, M. Nakashima, and S. Shiono, "Neural-network model to simulate neuronal responses of Aplysia gill-withdrawal reflex," in Proc. 1993 Int. Joint Conf. Neural Networks, 1993, pp. 37-40.
    • (1993) Proc. 1993 Int. Joint Conf. Neural Networks , pp. 37-40
    • Yamada, S.1    Nakashima, M.2    Shiono, S.3
  • 7
    • 0028731983
    • Reinforcement learning to train cooperative networks with both digital and analog motor neurons
    • C. H. Dagli, B. R. Fernández, J. Ghosh, and R. T. Soundar Kumara, Eds. Providence, RI: ASME Press
    • S. Yamada, A. Watanabe, M. Nakashima, and S. Shiono, "Reinforcement learning to train cooperative networks with both digital and analog motor neurons," in Intell. Eng. Syst. Artificial Neural Networks, C. H. Dagli, B. R. Fernández, J. Ghosh, and R. T. Soundar Kumara, Eds. Providence, RI: ASME Press, vol. 4, 1994, pp. 133-140.
    • (1994) Intell. Eng. Syst. Artificial Neural Networks , vol.4 , pp. 133-140
    • Yamada, S.1    Watanabe, A.2    Nakashima, M.3    Shiono, S.4
  • 8
    • 0142129427
    • Aplysia optical recording: Neural response in abdominal ganglion neurons to patterned stimuli and its analysis
    • H. Koike, Y. Kidokoro, K. Takahashi, and T. Kanaseki, Eds. Tokyo, Japan: Japan Sci. Soc. Press
    • S. Shiono, S. Yamada, and M. Nakashima, "Aplysia optical recording: Neural response in abdominal ganglion neurons to patterned stimuli and its analysis," in Basic Neuroscience in Invertebrate, H. Koike, Y. Kidokoro, K. Takahashi, and T. Kanaseki, Eds. Tokyo, Japan: Japan Sci. Soc. Press, 1996, pp. 315-330.
    • (1996) Basic Neuroscience in Invertebrate , pp. 315-330
    • Shiono, S.1    Yamada, S.2    Nakashima, M.3
  • 9
    • 0026573451
    • 448-detector optical recording system: Development and application to Aplysia gill-withdrawal reflex
    • M. Nakashima, S. Yamada, S. Shiono, M. Maeda, and F. Satoh, "448-detector optical recording system: Development and application to Aplysia gill-withdrawal reflex," IEEE Trans. Biomed. Eng., vol. 39, pp. 26-36, 1992.
    • (1992) IEEE Trans. Biomed. Eng. , vol.39 , pp. 26-36
    • Nakashima, M.1    Yamada, S.2    Shiono, S.3    Maeda, M.4    Satoh, F.5
  • 10
    • 33747603496
    • Optical recording and analysis of Aplysia abdominal ganglia
    • M. Nakashima, S. Yamada, and S. Shiono, "Optical recording and analysis of Aplysia abdominal ganglia," Soc. Neurosci. Abst., vol. 18, p. 712, 1992.
    • (1992) Soc. Neurosci. Abst. , vol.18 , p. 712
    • Nakashima, M.1    Yamada, S.2    Shiono, S.3
  • 11
    • 0027769134
    • Optical recording and information theoretic analysis of Aplysia reflex
    • S. Shiono, M. Nakashima, S. Yamada, and K. Matsumoto, "Optical recording and information theoretic analysis of Aplysia reflex," Jap. J. Physiol., vol. 43, pp. s31-s36, 1993.
    • (1993) Jap. J. Physiol. , vol.43 , pp. s31-s36
    • Shiono, S.1    Nakashima, M.2    Yamada, S.3    Matsumoto, K.4
  • 12
    • 0027354703
    • Information theoretic analysis of action potential trains I. Analysis of correlation between two neurons
    • S. Yamada, M. Nakashima, K. Matsumoto, and S. Shiono, "Information theoretic analysis of action potential trains I. Analysis of correlation between two neurons," Biol. Cybern., vol. 68, pp. 215-220, 1993.
    • (1993) Biol. Cybern. , vol.68 , pp. 215-220
    • Yamada, S.1    Nakashima, M.2    Matsumoto, K.3    Shiono, S.4
  • 13
    • 0030008773
    • Information theoretic analysis of action potential trains II. Analysis of correlation among n neurons to deduce connection structure
    • S. Yamada, K. Matsumoto, M. Nakashima, and S. Shiono, "Information theoretic analysis of action potential trains II. Analysis of correlation among n neurons to deduce connection structure," J. Neurosci. Methods, vol. 66, pp. 35-45, 1996.
    • (1996) J. Neurosci. Methods , vol.66 , pp. 35-45
    • Yamada, S.1    Matsumoto, K.2    Nakashima, M.3    Shiono, S.4
  • 15
    • 0017891335
    • Respiratory pumping: Neuronal control of a centrally commanded behavior in Aplysia
    • J. H. Byrne and J. Koester, "Respiratory pumping: Neuronal control of a centrally commanded behavior in Aplysia," Brain Res., vol. 143, pp. 87-105, 1978.
    • (1978) Brain Res. , vol.143 , pp. 87-105
    • Byrne, J.H.1    Koester, J.2
  • 16
    • 0019448549
    • Interneurons involved in mediation and modulation of gill-withdrawal reflex in Aplysia. I. identification and characterization
    • R. D. Hawkins, V. F. Castellucci, and E. R. Kandel, "Interneurons involved in mediation and modulation of gill-withdrawal reflex in Aplysia. I. identification and characterization," J. Neurophysiol., vol. 45, pp. 304-314, 1981.
    • (1981) J. Neurophysiol. , vol.45 , pp. 304-314
    • Hawkins, R.D.1    Castellucci, V.F.2    Kandel, E.R.3
  • 17
    • 0020654196
    • Identification of a cluster of command and pattern-generating neurons underlying respiratory pumping in Aplysia Californica
    • J. H. Byrne, "Identification of a cluster of command and pattern-generating neurons underlying respiratory pumping in Aplysia Californica," J. Neurophysiol., vol. 49, pp. 491-508, 1983.
    • (1983) J. Neurophysiol. , vol.49 , pp. 491-508
    • Byrne, J.H.1
  • 18
    • 0024804445
    • Identified facilitator neurons L29 and L28 are excited by cutaneous stimuli used in dishabituation, sensitization, and classical conditioning of Aplysia
    • R. D. Hawkins and S. Schacher, "Identified facilitator neurons L29 and L28 are excited by cutaneous stimuli used in dishabituation, sensitization, and classical conditioning of Aplysia," J. Neurosci., vol. 9, pp. 4236-4245, 1989.
    • (1989) J. Neurosci. , vol.9 , pp. 4236-4245
    • Hawkins, R.D.1    Schacher, S.2
  • 19
    • 0642354825
    • A neural network model of inhibitory information processing in Aplysia
    • D. E. J. Blazis, T. M. Fisher, and T. J. Carew, "A neural network model of inhibitory information processing in Aplysia," Neural Comput., vol. 5, pp. 213-227, 1993.
    • (1993) Neural Comput. , vol.5 , pp. 213-227
    • Blazis, D.E.J.1    Fisher, T.M.2    Carew, T.J.3
  • 20
    • 0027536884
    • Activity-dependent potentiation of recurrent inhibition: A mechanism for dynamic gain control in the siphon withdrawal reflex of Aplysia
    • T. M. Fisher and T. J. Carew, "Activity-dependent potentiation of recurrent inhibition: A mechanism for dynamic gain control in the siphon withdrawal reflex of Aplysia," J. Neurosci., vol. 13, pp. 1302-1314, 1993.
    • (1993) J. Neurosci. , vol.13 , pp. 1302-1314
    • Fisher, T.M.1    Carew, T.J.2
  • 21
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • R. S. Sutton, "Learning to predict by the methods of temporal differences," Machine Learning, vol. 3, pp. 9-44, 1988.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.