메뉴 건너뛰기




Volumn 2, Issue , 1999, Pages 777-782

Combination of actor/critic algorithm with the goal-directed reasoning

Author keywords

actor critic algorithm; basal ganglia; pal directed remoning; rainforcament learning

Indexed keywords

BASAL GANGLIA; GOAL DIRECTED; PAL-DIRECTED REMONING; RAINFORCAMENT LEARNING;

EID: 77957601595     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICONIP.1999.845694     Document Type: Conference Paper
Times cited : (1)

References (12)
  • 2
    • 0000541213 scopus 로고
    • Adaptive critics and the basal ganglia
    • J. C. Houk, J. l. Davis, and D. G. Beiser, Eds., MIT Press, Cambridge, MA, USA
    • A. G. Barto, "Adaptive critics and the basal ganglia," in Models of Information. Processing in the Basal Ganglia, j. C. Houk, J. l. Davis, and D. G. Beiser, Eds., pp, 215-232. MIT Press, Cambridge, MA, USA, 1995
    • (1995) Models of Information. Processing in the Basal Ganglia , pp. 215-232
    • Barto, A.G.1
  • 3
    • 0002861883 scopus 로고
    • A model of how the basal ganglia generate and use neural signals that predict reinforcement
    • L. C. Houk, J. L. Davis, and D. G. Beiser, Eds., MIT Press, Cambridge, MA, USA
    • J. C. Houk, J. L. Adams, and A. G. Barto, "A model of how the basal ganglia generate and use neural signals that predict reinforcement," in Models of Information Processing in the Baeal Ganglia, J. C. Houk, J. L. Davis, and D. G. Beiser, Eds., pp. 249-270. MIT Press, Cambridge, MA, USA, 1995.
    • (1995) Models of Information Processing in the Baeal Ganglia , pp. 249-270
    • Houk, J.C.1    Adams, J.L.2    Barto, A.G.3
  • 4
    • 0029981543 scopus 로고    scopus 로고
    • A framework for mesencephalic dopamine systems based on predictive hebbian learning
    • P Read Montague, Peter Dayan, and Terrence J. Se-jnowski, "A framework for mesencephalic dopamine systems based on predictive hebbian learning," Journal of Neuroscience, vol. 16, no. 5, pp. 1936-1947, 1996.
    • (1996) Journal of Neuroscience , vol.16 , Issue.5 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Se-Jnowski, T.J.3
  • 5
    • 0031854385 scopus 로고    scopus 로고
    • Learning of sequential movements by neural network model with dopamine-jjke reinforcement signal
    • Roland E. Suri and Wolfram Schultz, "Learning of sequential movements by neural network model with dopamine-jjke reinforcement signal," Experimental Brain. Research, vol. 121, pp. 350-354, 1998.
    • (1998) Experimental Brain. Research , vol.121 , pp. 350-354
    • Suri, R.E.1    Schultz, W.2
  • 6
    • 0025321039 scopus 로고
    • Functional architecture of basal ganglia circuits: Neural substrates of parallel processing
    • G. E. Alexander and M. D. Crutcher, "Functional architecture of basal ganglia circuits: neural substrates of parallel processing," Trends in Neurosciences, vol, 13, no. 7, pp. 266-271, 1990
    • (1990) Trends in Neurosciences , vol.13 , Issue.7 , pp. 266-271
    • Alexander, G.E.1    Crutcher, M.D.2
  • 7
    • 0028110209 scopus 로고
    • Anatomical evidence for cerebellar and basal ganglia involvement in higher cognitive function
    • F. A. Micidleton and P. L. Strick, "Anatomical evidence for cerebellar and basal ganglia involvement in higher cognitive function," Science, vol, 266, pp. 458-461, 1994.
    • (1994) Science , vol.266 , pp. 458-461
    • Micidleton, F.A.1    Strick, P.L.2
  • 8
    • 0007907759 scopus 로고    scopus 로고
    • Emergent hierarchical control structures: Learning reactive/hierarchical relationships in reinforcement environments
    • P, Maes, M. Mataric, ,1,-A., Meyer, J. Pollack, and S. W. Wilson, Eds, MIT Press/Bradford Books
    • B. Digney, "Emergent hierarchical control structures: Learning reactive/hierarchical relationships in reinforcement environments," in From animals to ani-mofs 4: Proceedings of the Fourth International Conference on Simulation of Adaptive Behavior, P, Maes, M. Mataric, ,1,-A., Meyer, J. Pollack, and S. W. Wilson, Eds, 1996, pp. 363-372, MIT Press/Bradford Books.
    • (1996) From Animals to Ani-mofs 4: Proceedings of the Fourth International Conference on Simulation of Adaptive Behavior , pp. 363-372
    • Digney, B.1
  • 12
    • 0017524329 scopus 로고
    • An adaptive optimal controller for discrete-time markov environments
    • I. H. Witten, "An adaptive optimal controller for discrete-time markov environments," Information and Control, vol. 34, pp. 336-295, 1977.
    • (1977) Information and Control , vol.34 , pp. 295-336
    • Witten, I.H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.