-
2
-
-
0029679044
-
Reinforcement learning: A survey
-
L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
3
-
-
0029276036
-
Temporal difference learning and TD-gammon
-
G.J. Tesauro, "Temporal difference learning and TD-Gammon," Communications of the ACM, vol. 38, pp. 58-68, 1995.
-
(1995)
Communications of the ACM
, vol.38
, pp. 58-68
-
-
Tesauro, G.J.1
-
4
-
-
85156187730
-
Improving elevator performance using reinforcement learning
-
David S. Touretzky, Michael C. Mozer, and Michael E. Hasselmo, Eds. , The MIT Press
-
Robert H. Crites and Andrew G. Barto, "Improving elevator performance using reinforcement learning," in Advances In Neural Information Processing Systems, David S. Touretzky, Michael C. Mozer, and Michael E. Hasselmo, Eds. 1996, vol. 8, pp. 1017-1023, The MIT Press.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1017-1023
-
-
Crites, R.H.1
Barto, A.G.2
-
5
-
-
0000719863
-
Packet routing in dynamically changing networks: A reinforcement learning approach
-
Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, Eds. , Morgan Kaufmann Publishers
-
Justin A. Boyan and Michael L. Littman, "Packet routing in dynamically changing networks: A reinforcement learning approach," in Advances in Neural Information Processing Systems, Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, Eds. 1994, vol. 6, pp. 671-678, Morgan Kaufmann Publishers.
-
(1994)
Advances in Neural Information Processing Systems
, vol.6
, pp. 671-678
-
-
Boyan, J.A.1
Littman, M.L.2
-
6
-
-
0001232636
-
A cellular automaton model for freeway traffic
-
K. Nagel and M. Schreckenberg, "A cellular automaton model for freeway traffic," J. Phys. I France, vol. 2, pp. 2221-2229, 1992.
-
(1992)
J. Phys. I France
, vol.2
, pp. 2221-2229
-
-
Nagel, K.1
Schreckenberg, M.2
-
7
-
-
84930123273
-
A distributed approach to optimized control of street traffic signals
-
N. Findler and J. Stapp, "A distributed approach to optimized control of street traffic signals," Journal of Transportation Engineering, vol. 118-1, pp. 99-110, 1992.
-
(1992)
Journal of Transportation Engineering
, vol.118
, Issue.1
, pp. 99-110
-
-
Findler, N.1
Stapp, J.2
-
10
-
-
34250768296
-
Intelligent traffic lights control by fuzzy logic
-
K. K. Tan, M. Khalid, and R. Yusof, "Intelligent traffic lights control by fuzzy logic," Malaysian Journal of Computer Science, vol. 9-2, 1995.
-
(1995)
Malaysian Journal of Computer Science
, vol.9
, Issue.2
-
-
Tan, K.K.1
Khalid, M.2
Yusof, R.3
-
11
-
-
0003084088
-
Traffic control of intersection group based on fuzzy logic
-
J.H. Lee, K.M. Lee, K.A. Seong, C.B. Kim, and H. Lee-Kwang, "Traffic control of intersection group based on fuzzy logic," in Proceedings of the 6th International Fuzzy Systems Association World Congress, 1995, pp. 465-468.
-
(1995)
Proceedings of the 6th International Fuzzy Systems Association World Congress
, pp. 465-468
-
-
Lee, J.H.1
Lee, K.M.2
Seong, K.A.3
Kim, C.B.4
Lee-Kwang, H.5
-
12
-
-
0000427726
-
Evolution strategy: Nature's way of optimization
-
Bergmann, Ed., , Lecture notes in Engineering
-
I. Rechenberg, "Evolution strategy: Nature's way of optimization," in Methods and Applications, Possibilities and Limitations, Bergmann, Ed., 1989, pp. 106-126, Lecture notes in Engineering.
-
(1989)
Methods and Applications, Possibilities and Limitations
, pp. 106-126
-
-
Rechenberg, I.1
-
13
-
-
84863637003
-
Optimizing traffic light controllers by means of evolutionary algorithms
-
H. Taale, Th. Bäck, M. Preuß, A. E. Eiben, J. M. de Graaf, and C. A. Schippers, "Optimizing traffic light controllers by means of evolutionary algorithms," in EUFIT'98, 1998.
-
(1998)
EUFIT'98
-
-
Taale, H.1
Bäck, Th.2
Preuß, M.3
Eiben, A.E.4
De Graaf, J.M.5
Schippers, C.A.6
-
14
-
-
4544324293
-
Traffic light control using sarsa with three state representations
-
IBM corporation
-
T. L. Thorpe and C. Andersson, "Traffic light control using sarsa with three state representations," Tech. Rep., IBM corporation, 1996.
-
(1996)
Tech. Rep.
-
-
Thorpe, T.L.1
Andersson, C.2
-
15
-
-
1842538384
-
-
M.S. thesis, Department of Computer Science, Colorado State University
-
Thomas Thorpe, "Vehicle traffic light control using sarsa," M.S. thesis, Department of Computer Science, Colorado State University, 1997.
-
(1997)
Vehicle Traffic Light Control Using Sarsa
-
-
Thorpe, T.1
-
16
-
-
0000723997
-
Generalization in reinforcement learning: Successful examples using sparse coarse coding
-
Richard S. Sutton, "Generalization in reinforcement learning: Successful examples using sparse coarse coding," Advances in Neural Information Processing Systems, vol. 8, 1996.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
-
-
Sutton, R.S.1
|