SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

1st International Symposium on Systems and Control in Aerospace and Astronautics

Volumn 2006, Issue , 2006, Pages 497-500

Automatic option generation in hierarchical reinforcement learning via immune clustering

(3) Jing, Shen a Guochang, Gu a Haibo, Liu a

a HARBIN ENGINEERING UNIVERSITY (China)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; IMMUNOLOGY; INTELLIGENT AGENTS; ONLINE SYSTEMS; PROBLEM SOLVING;

AUTOMATIC CONSTRUCTION; IMMUNE CLUSTERING; LEARNING AGENTS; STATE TRANSITIONS;

LEARNING SYSTEMS;

EID: 33750954561 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (6)

References (12)

1
- 0141988716
- Recent advances in hierarchical reinforcement learning
- Apr.
- A. G. Barto, S. Mahadevan, "Recent advances in hierarchical reinforcement learning," Discrete Event Dynamic Systems: Theory and Applications, vol.13, pp.41-77, Apr. 2003.
- (2003) Discrete Event Dynamic Systems: Theory and Applications , vol.13 , pp. 41-77
- Barto, A.G.¹ Mahadevan, S.²

2
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Jan.
- R.S. Sutton, D. Precup, S.P. Singh, "Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning," Artificial Intelligence, vol.112, pp.181-211, Jan. 1999.
- (1999) Artificial Intelligence , vol.112 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.P.³

3
- 0003989214
- Ph.D. Thesis, University of California, Berkeley
- R. Parr, "Hierarchical control and learning for Markov decision processes," Ph.D. Thesis, University of California, Berkeley, 1998.
- (1998) Hierarchical Control and Learning for Markov Decision Processes
- Parr, R.¹

4
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- T.G. Dietterich, "Hierarchical reinforcement learning with the MAXQ value function decomposition," Journal of Artificial Intelligence Research, vol.13, pp.227-303, 2000.
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.G.¹

5
- 0004782095
- Learning hierarchical control structures for multiple tasks and changing environments
- Zurich, Switzerland
- B.L. Digney, "Learning hierarchical control structures for multiple tasks and changing environments," in Proc. of the 15th International Conference on Simulation of Adaptive Behavior, Zurich, Switzerland, 1998. pp.321-330.
- (1998) Proc. of the 15th International Conference on Simulation of Adaptive Behavior , pp. 321-330
- Digney, B.L.¹

6
- 0013465187
- Autonomous discovery of subgoals in reinforcement learning using deverse density
- San Fransisco: Morgan Kaufmann
- A. McGovern, A. Barto, "Autonomous discovery of subgoals in reinforcement learning using deverse density," in Proc. of the 8th International Conference on Machine Learning, San Fransisco: Morgan Kaufmann, 2001. pp.361-368.
- (2001) Proc. of the 8th International Conference on Machine Learning , pp. 361-368
- McGovern, A.¹ Barto, A.²

7
- 84945250000
- Q-cut: Dynamic discovery ofsub-goals in reinforcement learning
- Springer
- I. Menache, S. Mannor, N. Shimkin, "Q-cut: dynamic discovery ofsub-goals in reinforcement learning," in Volume 2430 of Lecture Notes in Computer Science, Springer, 2002. pp.295-306.
- (2002) Lecture Notes in Computer Science , vol.2430 , pp. 295-306
- Menache, I.¹ Mannor, S.² Shimkin, N.³

8
- 14344250635
- Dynamic abstraction in reinforcement learning via clustering
- Banff, Canada
- S. Mannor, et al, "Dynamic abstraction in reinforcement learning via clustering," in Proc. of the 21st International Conference on Machine Learning, Banff, Canada, 2004. pp.560-567.
- (2004) Proc. of the 21st International Conference on Machine Learning , pp. 560-567
- Mannor, S.¹

9
- 0015956495
- Towards a network theory of the immune system
- Jan.
- N. K. Jerne, "Towards a network theory of the immune system," Annual Immunology, vol. 125C, pp.373-389, Jan. 1974.
- (1974) Annual Immunology , vol.125 C , pp. 373-389
- Jerne, N.K.¹

10
- 84950235798
- An evolutionary immune network for data clustering
- Rio de Janeiro
- L.N. de Castro, F. N. Von Zuben, "An evolutionary immune network for data clustering," in Proc. of the IEEE Brazilian Symposium on Artificial Neural Networks, vol.1, Rio de Janeiro, 2000. pp.84-89.
- (2000) Proc. of the IEEE Brazilian Symposium on Artificial Neural Networks , vol.1 , pp. 84-89
- De Castro, L.N.¹ Von Zuben, F.N.²

11
- 34249833101
- Q-learning
- Mar.
- C. Watkins, P. Dayan, "Q-learning," Machine Learning, vol. 8, pp.279-292, Mar. 1992.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

12
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- Apr.
- L. G. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Machine Learning, vol. 8, pp.293-321, Apr. 1992.
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.G.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.