SCOPUS Information Search Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 3141, 2004, Pages 80-94

Biologically inspired reinforcement learning: Reward-Based decomposition for multi-goal environments

(2) Zhou, Weidong a Coggins, Richard a

a UNIVERSITY OF SYDNEY (Australia)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; DECISION MAKING; REINFORCEMENT LEARNING;

ARTIFICIAL EMOTION INDICATIONS; BIOLOGICALLY INSPIRED; BIOLOGICALLY INSPIRED REINFORCEMENT LEARNING; FRONTAL CORTEX; HIERARCHICAL REINFORCEMENT LEARNING; LEARNING PROBLEM; LEARNING PROCESS; MULTIPLE SOURCE;

LEARNING ALGORITHMS;

EID: 35048843384 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-540-27835-1_7 Document Type: Article

Times cited : (5)

References (12)

1
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- T. G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000.
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.G.¹

2
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- R. S. Sutton, D. Precup, and S. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1):181-211, 1999.
- (1999) Artificial Intelligence , vol.112 , Issue.1 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

3
- 0013465036
- Discovering hierarchy in reinforcement learning with HEXQ
- Sydney, Australia
- B. Hengst. Discovering hierarchy in reinforcement learning with HEXQ. In Claude Sammut and Achim Hoffmann, editors, the Nineteenth International Conference on Machine Learning, pages 243-250, Sydney, Australia, 2002.
- (2002) The Nineteenth International Conference on Machine Learning , pp. 243-250
- Hengst, B.¹

4
- 84899028619
- Balancing multiple sources of reward in reinforcement learning
- MIT Press
- C. R. Shelton. Balancing multiple sources of reward in reinforcement learning. In Advances in Neural Information Processing Systems, volume 13, pages 1082-1088. MIT Press, 2001.
- (2001) Advances in Neural Information Processing Systems , vol.13 , pp. 1082-1088
- Shelton, C.R.¹

5
- 0004248679
- Oxford University Press
- E. T. Rolls. The brain and emotion. Oxford University Press, 1999.
- (1999) The Brain and Emotion
- Rolls, E.T.¹

6
- 0034061495
- Reward processing in primate orbitofrontal cortex and basal ganglia
- W. L. Schultz, L. Tremblay, and J. R. Hollerman. Reward processing in primate orbitofrontal cortex and basal ganglia. Cerebral Cortex, 10:272-283, 2000.
- (2000) Cerebral Cortex , vol.10 , pp. 272-283
- Schultz, W.L.¹ Tremblay, L.² Hollerman, J.R.³

7
- 84958795157
- A biologically inspired hierarchical reinforcement learning system
- to appear
- W. Zhou and R. Coggins. A biologically inspired hierarchical reinforcement learning system. Cybernetics and Systems, to appear, 2004.
- (2004) Cybernetics and Systems
- Zhou, W.¹ Coggins, R.²

8
- 0026847155
- Brain mechanisms of emotion and emotional learning
- J. E. LeDoux. Brain mechanisms of emotion and emotional learning. Current Opinion in Neurobiology, 2:191-197, 1992.
- (1992) Current Opinion in Neurobiology , vol.2 , pp. 191-197
- LeDoux, J.E.¹

9
- 0000541213
- Adaptive critics and the basal ganglia
- J. C. Houk, J. L. Davis, and D. G. Beiser, editors, MIT Press
- A. G. Barto. Adaptive critics and the basal ganglia. In J. C. Houk, J. L. Davis, and D. G. Beiser, editors, Models of Information Processing in the Basal Ganglia, pages 215-232. MIT Press, 1995.
- (1995) Models of Information Processing in the Basal Ganglia , pp. 215-232
- Barto, A.G.¹

10
- 33847202724
- Learning to predict by the methods of temporal differences
- R. S. Sutton. Learning to predict by the methods of temporal differences. Machine Learning, 3:9-44, 1988.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

11
- 85150714688
- Reinforcement learning methods for continuous-time markov decision problems
- MIT Press
- S. J. Bradtke and M. O. Duff. Reinforcement learning methods for continuous-time markov decision problems. In Advances in Neural Information Processing Systems, volume 7, pages 393-500. MIT Press, 1995.
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 393-500
- Bradtke, S.J.¹ Duff, M.O.²

12
- 85047698537
- Emotion-triggered learning in autonomous robot control
- S. C. Gadanho and J. Hallam. Emotion-triggered learning in autonomous robot control. Cybernetics and Systems, 32(5):531-559, 2001.
- (2001) Cybernetics and Systems , vol.32 , Issue.5 , pp. 531-559
- Gadanho, S.C.¹ Hallam, J.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.