-
1
-
-
1842486388
-
-
[online]. Available from [accesssed on December]
-
Artificial intelligence depot [online]. Available from http://reinforcementlearning.ai-depot.com/ [accesssed on December 2001].
-
(2001)
Artificial Intelligence Depot
-
-
-
2
-
-
1842538395
-
Reinforcement learning for ITS: Introduction and a case study on adaptive traffic signal control transportation research board
-
Washington, D.C., 7-11 January 2001
-
Abdulhai, B., Pringle, R., and Karakoulas, G.J. 2001. Reinforcement learning for ITS: Introduction and a case study on adaptive traffic signal control transportation research board, Transportation Research Board 80th Annual Meeting, Washington, D.C., 7-11 January 2001.
-
(2001)
Transportation Research Board 80th Annual Meeting
-
-
Abdulhai, B.1
Pringle, R.2
Karakoulas, G.J.3
-
3
-
-
0037616356
-
Reinforcement learning for true adaptive traffic signal control
-
Abdulhai, B., Pringle, R., and Karakoulas, G.J. 2003. Reinforcement learning for true adaptive traffic signal control. ASCE Journal of Transportation Engineering. 129(3): 278-285.
-
(2003)
ASCE Journal of Transportation Engineering
, vol.129
, Issue.3
, pp. 278-285
-
-
Abdulhai, B.1
Pringle, R.2
Karakoulas, G.J.3
-
5
-
-
85151728371
-
Residual algorithms reinforcement learning with function approximation
-
San Francisco, Calif., 9-12 July 1995. Morgan Kaufman Publishers. San Francisco, Calif
-
Baird, L. 1995. Residual algorithms reinforcement learning with function approximation. In Proceedings of the 12th International Conference on Machine Learning, San Francisco, Calif., 9-12 July 1995. Morgan Kaufman Publishers. San Francisco, Calif. pp. 30-37.
-
(1995)
Proceedings of the 12th International Conference on Machine Learning
, pp. 30-37
-
-
Baird, L.1
-
7
-
-
0035372090
-
Reinforcement learning in neurofuzzy traffic signal control
-
Bingham, E. 2001. Reinforcement learning in neurofuzzy traffic signal control. European Journal of Operation Research, 131: 232-241.
-
(2001)
European Journal of Operation Research
, vol.131
, pp. 232-241
-
-
Bingham, E.1
-
8
-
-
85156187730
-
Improving elevator performance using reinforcement learning
-
MIT Press, Cambridge, Mass
-
Crites, R.H., and Barto, A.G. 1996. 9. Improving elevator performance using reinforcement learning. In Advances in neural information processing systems. MIT Press, Cambridge, Mass. pp. 1017-1023.
-
(1996)
Advances in Neural Information Processing Systems
, pp. 1017-1023
-
-
Crites, R.H.1
Barto, A.G.2
-
11
-
-
1842434066
-
-
[online]. Available from [accessed on December 2001]
-
Langley, P. [online]. Available from http://newatlantis.isle.org/~langley/ [accessed on December 2001].
-
-
-
Langley, P.1
-
13
-
-
84925080315
-
Fuzzy model-based reinforcement learning
-
Aachen, Germany, 14-15 September 2000
-
Martin, A., and Brauer, W. 2000. Fuzzy model-based reinforcement learning, European Symposium on Intelligent Techniques (ESIT), Aachen, Germany, 14-15 September 2000, pp. 14-15.
-
(2000)
European Symposium on Intelligent Techniques (ESIT)
, pp. 14-15
-
-
Martin, A.1
Brauer, W.2
-
14
-
-
0031632547
-
Learning cooperative lane selection strategies for highways
-
Menlo Park, Calif., AAAI Press
-
Moriarty, D., and Langley, P. 1998a. Learning cooperative lane selection strategies for highways. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, Menlo Park, Calif., AAAI Press, pp. 684-691.
-
(1998)
Proceedings of the Fifteenth National Conference on Artificial Intelligence
, pp. 684-691
-
-
Moriarty, D.1
Langley, P.2
-
16
-
-
1842486380
-
Learning distributed strategies for traffic control
-
Switzerland
-
Moriarty, D.E., Handley, S., and Langley, P. 1998. Learning distributed strategies for traffic control. In Proceedings of the Fifth International Conference of the Society for Adaptive Behavior Zurich, Switzerland. pp. 437-446.
-
(1998)
Proceedings of the Fifth International Conference of the Society for Adaptive Behavior Zurich
, pp. 437-446
-
-
Moriarty, D.E.1
Handley, S.2
Langley, P.3
-
18
-
-
0000230403
-
Foundations of dynamic traffic assignment: The past, the present and the future
-
Peeta
-
Peeta. 2001. Foundations of dynamic traffic assignment: the past, the present and the future. Networks and Spatial Economics, 1: 223-265.
-
(2001)
Networks and Spatial Economics
, vol.1
, pp. 223-265
-
-
-
19
-
-
0033714691
-
Distributed reinforcement learning for a traffic engineering application
-
Barcelona, Spain. ACM, New York, N.Y
-
Pendrith, M.D. 2000. Distributed reinforcement learning for a traffic engineering application In Proceedings of the 4th International Conference on Autonomous Agents, Barcelona, Spain. ACM, New York, N.Y., pp. 404-411.
-
(2000)
Proceedingsof the 4th International Conference on Autonomous Agents
, pp. 404-411
-
-
Pendrith, M.D.1
-
20
-
-
0003420416
-
-
[online]. Available from [accessed on December 2001]
-
Perez, A. 1998. Introduction to reinforcement learning [online]. Available from http://lslwww.epfl.ch/~aperez/RL/RL.html [accessed on December 2001].
-
(1998)
Introduction to Reinforcement Learning
-
-
Perez, A.1
-
22
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton, R. 1988. Learning to predict by the methods of temporal differences. Machine Learning, 3: 9-44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.1
-
23
-
-
0004007508
-
-
[online]. Available from [accessed on December 2001]
-
Sutton, R. 1999. Reinforcement learning: past, present and future? [online]. Available from http://www-anw.cs.umass.edu/~rich/ Talks/SEAL98/SEAL98.html [accessed on December 2001].
-
(1999)
Reinforcement Learning: Past, Present and Future?
-
-
Sutton, R.1
-
25
-
-
0029276036
-
Temporal difference learning and TD-Gammon
-
Tesauro, G.J. 1995. Temporal difference learning and TD-Gammon. Communications of the ACM, 38: 58-68.
-
(1995)
Communications of the ACM
, vol.38
, pp. 58-68
-
-
Tesauro, G.J.1
-
26
-
-
1842538384
-
-
Master's Project Report. Computer Science Department, Colorado State University, Colo
-
Thorpe, T.L. 1997. Vehicle traffic light control using SARSA. Master's Project Report. Computer Science Department, Colorado State University, Colo.
-
(1997)
Vehicle Traffic Light Control Using SARSA
-
-
Thorpe, T.L.1
-
27
-
-
1842486383
-
Learning to control traffic lights with multi-agent reinforcement learning
-
Utrecht, Netherlands, Basque Country University and Foundation B.B.V. Bilbao, Spain
-
Wiering, M.A. 2000. Learning to control traffic lights with multi-agent reinforcement learning, First World Congress of the Game Theory Society Games 2000, Utrecht, Netherlands, Basque Country University and Foundation B.B.V. Bilbao, Spain.
-
(2000)
First World Congress of the Game Theory Society Games 2000
-
-
Wiering, M.A.1
|