메뉴 건너뛰기




Volumn , Issue , 1997, Pages 1012-1018

Efficient nonlinear control with actor-tutor architecture

Author keywords

[No Author keywords available]

Indexed keywords

REAL TIME CONTROL; REINFORCEMENT LEARNING;

EID: 0000406101     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (16)

References (10)
  • 3
    • 84899022238 scopus 로고    scopus 로고
    • An intepated model of basal ganglia and cerebellum in sequential control tasks
    • Doya, K. (1996a). An intepated model of basal ganglia and cerebellum in sequential control tasks. Society for Neuroscience Abstracts, 22:2029.
    • (1996) Society for Neuroscience Abstracts , vol.22 , pp. 2029
    • Doya, K.1
  • 4
    • 85156231814 scopus 로고    scopus 로고
    • Temporal difference learning in continuous time and space
    • Touretzky, D. S., Mozer, M. C, and Hasselmo, M. E., editors, MIT Press, Cambridge, MA
    • Doya, K. (1996b). Temporal difference learning in continuous time and space. In Touretzky, D. S., Mozer, M. C, and Hasselmo, M. E., editors, Advances in Neural Information Processing Systems 8, pages 1073-1079. MIT Press, Cambridge, MA.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1073-1079
    • Doya, K.1
  • 5
    • 0742266286 scopus 로고    scopus 로고
    • Procedural learning in monkeys-Possible roles of the basal ganglia
    • Ono, T., McNaughton, B. L., Molotchnikoff, S., Rolls, E. T., and Nishijo, H., editors, Pergamon, Oxford
    • Hikosaka, O., Miyachi, S., Miyashita, K., and Rand, M. K. (1996). Procedural learning in monkeys-Possible roles of the basal ganglia. In Ono, T., McNaughton, B. L., Molotchnikoff, S., Rolls, E. T., and Nishijo, H., editors, Perception, Memory and Emotion: Frontiers in Neuroscience, pages 403-420. Pergamon, Oxford.
    • (1996) Perception, Memory and Emotion: Frontiers in Neuroscience , pp. 403-420
    • Hikosaka, O.1    Miyachi, S.2    Miyashita, K.3    Rand, M.K.4
  • 6
    • 0002861883 scopus 로고
    • A model of how the basal ganglia generate and use neural signals that predict reinforcement
    • Houk, J. C., Davis, J. L., and Beiser, D. G., editors, MIT Press, Cambrigde, MA
    • Houk, J. C, Adams, J. L., and Barto, A. G. (1994). A model of how the basal ganglia generate and use neural signals that predict reinforcement. In Houk, J. C., Davis, J. L., and Beiser, D. G., editors, Models of Information Processing in the Basal Ganglia, pages 249-270. MIT Press, Cambrigde, MA.
    • (1994) Models of Information Processing in the Basal Ganglia , pp. 249-270
    • Houk, J.C.1    Adams, J.L.2    Barto, A.G.3
  • 7
  • 8
    • 0023084681 scopus 로고
    • A hierarchical neural network model for control and learning of voluntary movement
    • Kawato, M., Furukawa, K., and Suzuki, R. (1987). A hierarchical neural network model for control and learning of voluntary movement. Biological Cybernetics, 57:169-185.
    • (1987) Biological Cybernetics , vol.57 , pp. 169-185
    • Kawato, M.1    Furukawa, K.2    Suzuki, R.3
  • 9
    • 84964009081 scopus 로고    scopus 로고
    • From isolation to cooperation: An alternative view of a system of experts
    • Touretzky, D. S., Mozer, M. C, and Hasselmo, M. E., editors, MIT Press, Cambridge, MA, USA
    • Schaal, S. and Atkeson, C. C. (1996). From isolation to cooperation: An alternative view of a system of experts. In Touretzky, D. S., Mozer, M. C, and Hasselmo, M. E., editors, Advances in Neural Information Processing Systems 8, pages 605-611. MIT Press, Cambridge, MA, USA.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 605-611
    • Schaal, S.1    Atkeson, C.C.2
  • 10
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal difference
    • Sutton, R. S. (1988). Learning to predict by the methods of temporal difference. Machine Learning, 3 iQ 44.
    • (1988) Machine Learning , vol.3
    • Sutton, R.S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.