SCOPUS 정보 검색 플랫폼

Canadian Journal of Civil Engineering

Volumn 30, Issue 6, 2003, Pages 981-991

Reinforcement learning: Introduction to theory and potential for transport applications

(2) Abdulhai, Baher a Kattan, Lina a

a UNIVERSITY OF TORONTO (Canada)

Author keywords

Artificial intelligence; Intelligent transportation systems; Machine learning; Reinforcement learning; Traffic control

Indexed keywords

CIVIL ENGINEERING; INTELLIGENT NETWORKS; LEARNING ALGORITHMS; REINFORCEMENT; TRAFFIC CONTROL;

INTELLIGENT TRANSPORT SYSTEMS (ITS); REINFORCEMENT LEARNING (RL);

TRANSPORTATION;

ARTIFICIAL NEURAL NETWORK; INTELLIGENT TRANSPORTATION SYSTEM; TRANSPORTATION PLANNING;

EID: 1842427901 PISSN: 03151468 EISSN: None Source Type: Journal
DOI: 10.1139/l03-014 Document Type: Article

Times cited : (99)

References (28)

1
- 1842486388
- [online]. Available from [accesssed on December]
- Artificial intelligence depot [online]. Available from http://reinforcementlearning.ai-depot.com/ [accesssed on December 2001].
- (2001) Artificial Intelligence Depot

2
- 1842538395
- Reinforcement learning for ITS: Introduction and a case study on adaptive traffic signal control transportation research board
- Washington, D.C., 7-11 January 2001
- Abdulhai, B., Pringle, R., and Karakoulas, G.J. 2001. Reinforcement learning for ITS: Introduction and a case study on adaptive traffic signal control transportation research board, Transportation Research Board 80th Annual Meeting, Washington, D.C., 7-11 January 2001.
- (2001) Transportation Research Board 80th Annual Meeting
- Abdulhai, B.¹ Pringle, R.² Karakoulas, G.J.³

3
- 0037616356
- Reinforcement learning for true adaptive traffic signal control
- Abdulhai, B., Pringle, R., and Karakoulas, G.J. 2003. Reinforcement learning for true adaptive traffic signal control. ASCE Journal of Transportation Engineering. 129(3): 278-285.
- (2003) ASCE Journal of Transportation Engineering , vol.129 , Issue.3 , pp. 278-285
- Abdulhai, B.¹ Pringle, R.² Karakoulas, G.J.³

4
- 0004118921
- The MIT Press, Cambridge, Mass
- Ballard, D. 1997. An introduction to natural computation. The MIT Press, Cambridge, Mass.
- (1997) an Introduction to Natural Computation
- Ballard, D.¹

5
- 85151728371
- Residual algorithms reinforcement learning with function approximation
- San Francisco, Calif., 9-12 July 1995. Morgan Kaufman Publishers. San Francisco, Calif
- Baird, L. 1995. Residual algorithms reinforcement learning with function approximation. In Proceedings of the 12th International Conference on Machine Learning, San Francisco, Calif., 9-12 July 1995. Morgan Kaufman Publishers. San Francisco, Calif. pp. 30-37.
- (1995) Proceedings of the 12th International Conference on Machine Learning , pp. 30-37
- Baird, L.¹

6
- 0003487482
- Athena Scientific, Belmont, Mass
- Bertsekas, D.P., and Tsitsiklis, J.N. 1996. Neuro-dynamic programming. Athena Scientific, Belmont, Mass.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

7
- 0035372090
- Reinforcement learning in neurofuzzy traffic signal control
- Bingham, E. 2001. Reinforcement learning in neurofuzzy traffic signal control. European Journal of Operation Research, 131: 232-241.
- (2001) European Journal of Operation Research , vol.131 , pp. 232-241
- Bingham, E.¹

8
- 85156187730
- Improving elevator performance using reinforcement learning
- MIT Press, Cambridge, Mass
- Crites, R.H., and Barto, A.G. 1996. 9. Improving elevator performance using reinforcement learning. In Advances in neural information processing systems. MIT Press, Cambridge, Mass. pp. 1017-1023.
- (1996) Advances in Neural Information Processing Systems , pp. 1017-1023
- Crites, R.H.¹ Barto, A.G.²

9
- 1842434071
- Reinforcement learning models for transportation infrastructure management
- Durango, P., and Madanat, M. 2001. Reinforcement learning models for transportation infrastructure management, Proceedings of the Second Berkeley-Tottori Joint Seminar on Evolution Processes of Transportation Systems: Analysis and Policy Implementation.
- (2001) Proceedings of the Second Berkeley-Tottori Joint Seminar on Evolution Processes of Transportation Systems: Analysis and Policy Implementation
- Durango, P.¹ Madanat, M.²

10
- 0003753097
- Prentice Hall
- Jang, J., Sun, C., and Mizutani, E. 1997. Neuro-fuzzy and soft computing. Prentice Hall.
- (1997) Neuro-Fuzzy and Soft Computing
- Jang, J.¹ Sun, C.² Mizutani, E.³

11
- 1842434066
- [online]. Available from [accessed on December 2001]
- Langley, P. [online]. Available from http://newatlantis.isle.org/~langley/ [accessed on December 2001].
- Langley, P.¹

12
- 0004255908
- McGraw-Hill
- Mitchel, T. 1997. Machine learning, McGraw-Hill.
- (1997) Machine Learning
- Mitchel, T.¹

13
- 84925080315
- Fuzzy model-based reinforcement learning
- Aachen, Germany, 14-15 September 2000
- Martin, A., and Brauer, W. 2000. Fuzzy model-based reinforcement learning, European Symposium on Intelligent Techniques (ESIT), Aachen, Germany, 14-15 September 2000, pp. 14-15.
- (2000) European Symposium on Intelligent Techniques (ESIT) , pp. 14-15
- Martin, A.¹ Brauer, W.²

14
- 0031632547
- Learning cooperative lane selection strategies for highways
- Menlo Park, Calif., AAAI Press
- Moriarty, D., and Langley, P. 1998a. Learning cooperative lane selection strategies for highways. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, Menlo Park, Calif., AAAI Press, pp. 684-691.
- (1998) Proceedings of the Fifteenth National Conference on Artificial Intelligence , pp. 684-691
- Moriarty, D.¹ Langley, P.²

15
- 33746049305
- 98-2. Daimler-Benz Research and Technology Center, Palo Alto, Calif
- Moriarty, D., and Langley, P. 1998b. Distributed learning of lane-selection strategies for traffic management technical report. 98-2. Daimler-Benz Research and Technology Center, Palo Alto, Calif.
- (1998) Distributed Learning of Lane-Selection Strategies for Traffic Management Technical Report
- Moriarty, D.¹ Langley, P.²

16
- 1842486380
- Learning distributed strategies for traffic control
- Switzerland
- Moriarty, D.E., Handley, S., and Langley, P. 1998. Learning distributed strategies for traffic control. In Proceedings of the Fifth International Conference of the Society for Adaptive Behavior Zurich, Switzerland. pp. 437-446.
- (1998) Proceedings of the Fifth International Conference of the Society for Adaptive Behavior Zurich , pp. 437-446
- Moriarty, D.E.¹ Handley, S.² Langley, P.³

17
- 0003891507
- Prentice-Hall
- Narendra, K., and Thathachar, M. 1989. Learning Automata an introduction. Prentice-Hall.
- (1989) Learning Automata an Introduction
- Narendra, K.¹ Thathachar, M.²

18
- 0000230403
- Foundations of dynamic traffic assignment: The past, the present and the future
- Peeta
- Peeta. 2001. Foundations of dynamic traffic assignment: the past, the present and the future. Networks and Spatial Economics, 1: 223-265.
- (2001) Networks and Spatial Economics , vol.1 , pp. 223-265

19
- 0033714691
- Distributed reinforcement learning for a traffic engineering application
- Barcelona, Spain. ACM, New York, N.Y
- Pendrith, M.D. 2000. Distributed reinforcement learning for a traffic engineering application In Proceedings of the 4th International Conference on Autonomous Agents, Barcelona, Spain. ACM, New York, N.Y., pp. 404-411.
- (2000) Proceedingsof the 4th International Conference on Autonomous Agents , pp. 404-411
- Pendrith, M.D.¹

20
- 0003420416
- [online]. Available from [accessed on December 2001]
- Perez, A. 1998. Introduction to reinforcement learning [online]. Available from http://lslwww.epfl.ch/~aperez/RL/RL.html [accessed on December 2001].
- (1998) Introduction to Reinforcement Learning
- Perez, A.¹

21
- 0003584577
- Prentice Hall Series in Artificial Intelligence, Englewood Cliffs, N.J
- Russel, S., and Norvig, P. 1995. Artificial intelligence: a modern approach. Prentice Hall Series in Artificial Intelligence, Englewood Cliffs, N.J.
- (1995) Artificial Intelligence: A Modern Approach
- Russel, S.¹ Norvig, P.²

22
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton, R. 1988. Learning to predict by the methods of temporal differences. Machine Learning, 3: 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.¹

23
- 0004007508
- [online]. Available from [accessed on December 2001]
- Sutton, R. 1999. Reinforcement learning: past, present and future? [online]. Available from http://www-anw.cs.umass.edu/~rich/ Talks/SEAL98/SEAL98.html [accessed on December 2001].
- (1999) Reinforcement Learning: Past, Present and Future?
- Sutton, R.¹

24
- 0004102479
- MIT Press, Cambridge Mass
- Sutton, R., and Barto, A. 1998. Reinforcement learning: an introduction. MIT Press, Cambridge Mass.
- (1998) Reinforcement Learning: an Introduction
- Sutton, R.¹ Barto, A.²

25
- 0029276036
- Temporal difference learning and TD-Gammon
- Tesauro, G.J. 1995. Temporal difference learning and TD-Gammon. Communications of the ACM, 38: 58-68.
- (1995) Communications of the ACM , vol.38 , pp. 58-68
- Tesauro, G.J.¹

26
- 1842538384
- Master's Project Report. Computer Science Department, Colorado State University, Colo
- Thorpe, T.L. 1997. Vehicle traffic light control using SARSA. Master's Project Report. Computer Science Department, Colorado State University, Colo.
- (1997) Vehicle Traffic Light Control Using SARSA
- Thorpe, T.L.¹

27
- 1842486383
- Learning to control traffic lights with multi-agent reinforcement learning
- Utrecht, Netherlands, Basque Country University and Foundation B.B.V. Bilbao, Spain
- Wiering, M.A. 2000. Learning to control traffic lights with multi-agent reinforcement learning, First World Congress of the Game Theory Society Games 2000, Utrecht, Netherlands, Basque Country University and Foundation B.B.V. Bilbao, Spain.
- (2000) First World Congress of the Game Theory Society Games 2000
- Wiering, M.A.¹

28
- 85156225449
- High-performance job-shop scheduling with a time-delay TDλ network
- Zhang, W., and Dietterich, T. 1996. High-performance job-shop scheduling with a time-delay TDλ network. Advances in neural information processing systems, 8: 1024-1030.
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1024-1030
- Zhang, W.¹ Dietterich, T.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.