-
2
-
-
0030854548
-
Trading spaces: Computation, representation, and the limits of uninformed learning
-
Clark, A., Thornton, C.: Trading spaces: Computation, representation, and the limits of uninformed learning. Behavioral and Brain Sciences 20 (1997) 57-66
-
(1997)
Behavioral and Brain Sciences
, vol.20
, pp. 57-66
-
-
Clark, A.1
Thornton, C.2
-
5
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
Dietterich, T.G.: Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research 13 (2000) 227-303
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 227-303
-
-
Dietterich, T.G.1
-
8
-
-
84956854078
-
Model minimization in hierarchical reinforcement learning
-
Fifth Symposium on Abstraction, Reformulation and Approximation (SARA 2002) Springer Verlag
-
Ravindran, B., Barto, A.G.: Model minimization in hierarchical reinforcement learning. In: Fifth Symposium on Abstraction, Reformulation and Approximation (SARA 2002). LNCS, Springer Verlag (2002) 196-211
-
(2002)
LNCS
, pp. 196-211
-
-
Ravindran, B.1
Barto, A.G.2
-
9
-
-
0031370386
-
Model minimization in Markov decision processes
-
Dean, T., Givan, R.: Model minimization in Markov decision processes. In: AAAI/IAAI. (1997) 106-111
-
(1997)
AAAI/IAAI
, pp. 106-111
-
-
Dean, T.1
Givan, R.2
-
10
-
-
0034272032
-
Bounded-parameter Markov decision processes
-
Givan, R., Leach, S.M., Dean, T.: Bounded-parameter Markov decision processes. Artificial Intelligence 122 (2000) 71-109
-
(2000)
Artificial Intelligence
, vol.122
, pp. 71-109
-
-
Givan, R.1
Leach, S.M.2
Dean, T.3
-
11
-
-
0032208335
-
Elevator group control using multiple reinforcement learning agents
-
Crites, R.H., Barto, A.G.: Elevator group control using multiple reinforcement learning agents. Machine Learning 33 (1998) 235-262
-
(1998)
Machine Learning
, vol.33
, pp. 235-262
-
-
Crites, R.H.1
Barto, A.G.2
-
12
-
-
0004320981
-
An introduction to collective intelligence
-
NASA Ames Research Center, CA
-
Wolpert, D., Turner, K.: An introduction to collective intelligence. Technical Report NASA-ARC-IC-99-63, NASA Ames Research Center, CA (1999)
-
(1999)
Technical Report
, vol.NASA-ARC-IC-99-63
-
-
Wolpert, D.1
Turner, K.2
-
13
-
-
34250513249
-
Über ein Paradoxon der Verkehrsplanung
-
Braess, D.: Über ein Paradoxon der Verkehrsplanung. Unternehmensforschung 12 (1968) 258-268
-
(1968)
Unternehmensforschung
, vol.12
, pp. 258-268
-
-
Braess, D.1
-
14
-
-
85156265058
-
Learning to take concurrent actions
-
Rohanimanesh, K., Mahadevan, S.: Learning to take concurrent actions. In: NIPS. (2002) 1619-1626
-
(2002)
NIPS
, pp. 1619-1626
-
-
Rohanimanesh, K.1
Mahadevan, S.2
-
15
-
-
0013465036
-
Discovering hierarchy in reinforcement learning with HEXQ
-
Sammut, C., Hoffmann, A., eds., Morgan Kaufmann
-
Hengst, B.: Discovering hierarchy in reinforcement learning with HEXQ. In Sammut, C., Hoffmann, A., eds.: Proceedings of the Nineteenth International Conference on Machine Learning, Morgan Kaufmann (2002) 243-250
-
(2002)
Proceedings of the Nineteenth International Conference on Machine Learning
, pp. 243-250
-
-
Hengst, B.1
-
16
-
-
85143168613
-
Hierarchical learning in stochastic domains: Preliminary results
-
San Mateo, CA, Morgan Kaufmann
-
Kaelbling, L.P.: Hierarchical learning in stochastic domains: Preliminary results. In: Machine Learning Proceedings of the Tenth International Conference, San Mateo, CA, Morgan Kaufmann (1993) 167-173
-
(1993)
Machine Learning Proceedings of the Tenth International Conference
, pp. 167-173
-
-
Kaelbling, L.P.1