SCOPUS Information Retrieval Platform

Volume 3809 LNAI, 2005, Pages 164-175

Structural abstraction experiments in reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SCIENCE; HIERARCHICAL SYSTEMS; MULTI AGENT SYSTEMS; PROBLEM SOLVING; SOFTWARE AGENTS; STORAGE ALLOCATION (COMPUTER);

ABSTRACTION EXPERIMENTS; HIERARCHICAL DECOMPOSITION; REINFORCEMENT LEARNING; TIME COMPLEXITY;

LEARNING SYSTEMS;

EID: 33745586802 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/11589990_19 Document Type: Conference Paper

Times cited: 18

References (16)

1
- 0004102479
- MIT Press, Cambridge, Massachusetts
- Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge, Massachusetts (1998)
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

2
- 0030854548
- Trading spaces: Computation, representation, and the limits of uninformed learning
- Clark, A., Thornton, C.: Trading spaces: Computation, representation, and the limits of uninformed learning. Behavioral and Brain Sciences 20 (1997) 57-66
- (1997) Behavioral and Brain Sciences , vol.20 , pp. 57-66
- Clark, A.¹ Thornton, C.²

4
- 0004208636
- Chapman & Hall, London
- Ashby, R.: Introduction to Cybernetics. Chapman & Hall, London (1956)
- (1956) Introduction to Cybernetics
- Ashby, R.¹

5
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- Dietterich, T.G.: Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research 13 (2000) 227-303
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.G.¹

6
- 0004049893
- PhD thesis, King's College
- Watkins, C.J.C.H.: Learning from Delayed Rewards. PhD thesis, King's College (1989)
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

7
- 84880771557
- SMDP homomorphisms: An algebraic approach to abstraction in semi-Markov decision processes
- Ravindran, B., Barto, A.G.: SMDP homomorphisms: An algebraic approach to abstraction in semi-Markov decision processes. In: Proc. of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI 03). (2003) 1011-1018
- (2003) Proc. of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI 03) , pp. 1011-1018
- Ravindran, B.¹ Barto, A.G.²

9
- 0031370386
- Model minimization in Markov decision processes
- Dean, T., Givan, R.: Model minimization in Markov decision processes. In: AAAI/IAAI. (1997) 106-111
- (1997) AAAI/IAAI , pp. 106-111
- Dean, T.¹ Givan, R.²

10
- 0034272032
- Bounded-parameter Markov decision processes
- Givan, R., Leach, S.M., Dean, T.: Bounded-parameter Markov decision processes. Artificial Intelligence 122 (2000) 71-109
- (2000) Artificial Intelligence , vol.122 , pp. 71-109
- Givan, R.¹ Leach, S.M.² Dean, T.³

11
- 0032208335
- Elevator group control using multiple reinforcement learning agents
- Crites, R.H., Barto, A.G.: Elevator group control using multiple reinforcement learning agents. Machine Learning 33 (1998) 235-262
- (1998) Machine Learning , vol.33 , pp. 235-262
- Crites, R.H.¹ Barto, A.G.²

13
- 34250513249
- Über ein Paradoxon der Verkehrsplanung
- Braess, D.: Über ein Paradoxon der Verkehrsplanung. Unternehmensforschung 12 (1968) 258-268
- (1968) Unternehmensforschung , vol.12 , pp. 258-268
- Braess, D.¹

14
- 85156265058
- Learning to take concurrent actions
- Rohanimanesh, K., Mahadevan, S.: Learning to take concurrent actions. In: NIPS. (2002) 1619-1626
- (2002) NIPS , pp. 1619-1626
- Rohanimanesh, K.¹ Mahadevan, S.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.