Volume 12, Issue 2, 2011, Pages 412-421

Reinforcement learning with function approximation for traffic signal control

Author keywords

Q learning with full state representation (QTLC FS); Q learning with function approximation (QTLC FA); reinforcement learning (RL); traffic signal control

Indexed keywords

CURSE OF DIMENSIONALITY; FEATURE-BASED; FUNCTION APPROXIMATION; HIGH-DIMENSIONAL; ORDERS OF MAGNITUDE; OTHER ALGORITHMS; PERFORMANCE COMPARISON; PRIORITIZATION; Q-LEARNING; QUEUE ALGORITHMS; QUEUE LENGTHS; REINFORCEMENT LEARNING WITH FUNCTION APPROXIMATIONS; ROAD NETWORK; STATE REPRESENTATION; TRAFFIC SIGNAL CONTROL

EID: 79958173163     PISSN: 15249050     EISSN: None     Source Type: Journal    
DOI: 10.1109/TITS.2010.2091408     Document Type: Conference Paper
Times cited: 306

References (29)
  • 1
    • 0037616356
    • Reinforcement learning for true adaptive traffic signal control
    • DOI 10.1061/(ASCE)0733-947X(2003)129:3(278)
    • B. Abdulhai, R. Pringle, and G. Karakoulas, "Reinforcement learning for true adaptive traffic signal control", J. Transp. Eng., vol. 129, no. 3, pp. 278-285, May/Jun. 2003. (Pubitemid 36594889)
    • (2003) Journal of Transportation Engineering , vol.129 , Issue.3 , pp. 278-285
    • Abdulhai, B.1    Pringle, R.2    Karakoulas, G.J.3
  • 3
    • 0345880144
    • Traffic-responsive signal timing for system-wide traffic control
    • PII S0968090X97000120
    • J. Spall and D. Chin, "Traffic-responsive signal timing for systemwide traffic control", Transp. Res. Part C: Emerging Technol., vol. 5, no. 3/4, pp. 153-163, Aug. 1997. (Pubitemid 127438592)
    • (1997) Transportation Research Part C: Emerging Technologies , vol.5 , Issue.3-4 , pp. 153-163
    • Spall, J.C.1    Chin, D.C.2
  • 4
    • 0029504767
    • Evaluation of an adaptive traffic control technique with underlying system changes
    • R. Smith and D. Chin, "Evaluation of an adaptive traffic control technique with underlying system changes", in Proc. Winter Simul. Conf., 1995, pp. 1124-1130.
    • (1995) Proc. Winter Simul. Conf. , pp. 1124-1130
    • Smith, R.1    Chin, D.2
  • 7
    • 10644276401
    • An enhanced 0-1 mixed-integer LP formulation for traffic signal control
    • W. Lin and C. Wang, "An enhanced 0-1 mixed-integer LP formulation for traffic signal control", IEEE Trans. Intell. Transp. Syst., vol. 5, no. 4, pp. 238-245, Dec. 2004.
    • (2004) IEEE Trans. Intell. Transp. Syst. , vol.5 , Issue.4 , pp. 238-245
    • Lin, W.1    Wang, C.2
  • 8
    • 77956344775
    • Distributed geometric fuzzy multiagent urban traffic signal control
    • B. Gokulan and D. Srinivasan, "Distributed geometric fuzzy multiagent urban traffic signal control", IEEE Trans. Intell. Transp. Syst., vol. 11, no. 3, pp. 714-727, Sep. 2010.
    • (2010) IEEE Trans. Intell. Transp. Syst. , vol.11 , Issue.3 , pp. 714-727
    • Gokulan, B.1    Srinivasan, D.2
  • 10
    • 77649272483
    • Traffic signal optimization in "La Almozara" district in Saragossa under congestion conditions, using genetic algorithms, traffic microsimulation, and cluster computing
    • J. Sanchez-Medina, M. Galan-Moreno, and E. Rubio-Royo, "Traffic signal optimization in "La Almozara" district in Saragossa under congestion conditions, using genetic algorithms, traffic microsimulation, and cluster computing", IEEE Trans. Intell. Transp. Syst., vol. 11, no. 1, pp. 132-141, Mar. 2010.
    • (2010) IEEE Trans. Intell. Transp. Syst. , vol.11 , Issue.1 , pp. 132-141
    • Sanchez-Medina, J.1    Galan-Moreno, M.2    Rubio-Royo, E.3
  • 11
    • 33749365892
    • Stochastic adaptive control model for traffic signal systems
    • DOI 10.1016/j.trc.2006.08.002, PII S0968090X06000556
    • X. Yu and W. Recker, "Stochastic adaptive control model for traffic signal systems", Transp. Res. Part C: Emerging Technol., vol. 14, no. 4, pp. 263-282, Aug. 2006. (Pubitemid 44490300)
    • (2006) Transportation Research Part C: Emerging Technologies , vol.14 , Issue.4 , pp. 263-282
    • Yu, X.-H.1    Recker, W.W.2
  • 13
    • 0026114207
    • Optimizing networks of traffic signals in real time - The SCOOT method
    • D. Robertson and R. Bretherton, "Optimizing networks of traffic signals in real time-the SCOOT method", IEEE Trans. Veh. Technol., vol. 40, pt. 2, no. 1, pp. 11-15, Feb. 1991.
    • (1991) IEEE Trans. Veh. Technol. , vol.40 , Issue.1-2 PART , pp. 11-15
    • Robertson, D.1    Bretherton, R.2
  • 15
    • 60749092453
    • Adaptive dynamic programming for multiintersections traffic signal intelligent control
    • T. Li, D. Zhao, and J. Yi, "Adaptive dynamic programming for multiintersections traffic signal intelligent control", in Proc. 11th Int. IEEE ITSC, 2008, pp. 286-291.
    • (2008) Proc. 11th Int. IEEE ITSC , pp. 286-291
    • Li, T.1    Zhao, D.2    Yi, J.3
  • 18
    • 34249833101
    • Q-learning
    • C. J. Watkins and P. Dayan, "Q-learning", Mach. Learn., vol. 8, no. 3, pp. 279-292, May 1992. [Online]. Available: http://dx.doi.org/10.1007/BF00992698
    • (1992) Mach. Learn. , vol.8 , Issue.3 , pp. 279-292
    • Watkins, C.J.1    Dayan, P.2
  • 19
    • 0037294787
    • Design and evaluation of dynamic traffic management strategies for congested conditions
    • DOI 10.1016/S0965-8564(02)00006-X, PII S096585640200006X
    • G. Abu-Lebdeh and R. Benekohal, "Design and evaluation of dynamic traffic management strategies for congested conditions", Transp. Res. Part A: Policy Pract., vol. 37, no. 2, pp. 109-127, Feb. 2003. (Pubitemid 36059578)
    • (2003) Transportation Research Part A: Policy and Practice , vol.37 , Issue.2 , pp. 109-127
    • Abu-Lebdeh, G.1    Benekohal, R.F.2
  • 20
    • 46149083042
    • Application of stochastic optimization method for an urban corridor
    • I. Yun and B. Park, "Application of stochastic optimization method for an urban corridor", in Proc. 38th Winter Simul. Conf., 2006, pp. 1493-1499.
    • (2006) Proc. 38th Winter Simul. Conf. , pp. 1493-1499
    • Yun, I.1    Park, B.2
  • 21
    • 38049011296
    • Q-learning with linear function approximation
    • F. Melo and M. Ribeiro, "Q-learning with linear function approximation", in Proc. Learn. Theory, 2007, pp. 308-322.
    • (2007) Proc. Learn. Theory , pp. 308-322
    • Melo, F.1    Ribeiro, M.2
  • 25
    • 0028497630
    • Asynchronous stochastic approximation and Q-learning
    • J. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning", Mach. Learn., vol. 16, no. 3, pp. 185-202, Sep. 1994.
    • (1994) Mach. Learn. , vol.16 , Issue.3 , pp. 185-202
    • Tsitsiklis, J.1
  • 26
    • 78649750853
    • Real-time measurement of link vehicle count and travel time in a road network
    • K. Kwong, R. Kavaler, R. Rajagopal, and P. Varaiya, "Real-time measurement of link vehicle count and travel time in a road network", IEEE Trans. Intell. Transp. Syst., vol. 11, no. 4, pp. 814-825, Dec. 2010.
    • (2010) IEEE Trans. Intell. Transp. Syst. , vol.11 , Issue.4 , pp. 814-825
    • Kwong, K.1    Kavaler, R.2    Rajagopal, R.3    Varaiya, P.4
  • 27
    • 4043069840
    • On actor-critic algorithms
    • V. Konda and J. Tsitsiklis, "On actor-critic algorithms", SIAM J. Control Optim., vol. 42, no. 4, pp. 1143-1166, 2004.
    • (2004) SIAM J. Control Optim. , vol.42 , Issue.4 , pp. 1143-1166
    • Konda, V.1    Tsitsiklis, J.2
  • 28
    • 70349984547
    • Natural actor-critic algorithms
    • S. Bhatnagar, R. Sutton, M. Ghavamzadeh, and M. Lee, "Natural actor-critic algorithms", Automatica, vol. 45, no. 11, pp. 2471-2482, Nov. 2009.
    • (2009) Automatica , vol.45 , Issue.11 , pp. 2471-2482
    • Bhatnagar, S.1    Sutton, R.2    Ghavamzadeh, M.3    Lee, M.4
  • 29
    • 77953120268
    • Driver behavior analysis during ACC activation and deactivation in a real traffic environment
    • J. Pauwelussen and P. Feenstra, "Driver behavior analysis during ACC activation and deactivation in a real traffic environment", IEEE Trans. Intell. Transp. Syst., vol. 11, no. 2, pp. 329-338, Jun. 2010.
    • (2010) IEEE Trans. Intell. Transp. Syst. , vol.11 , Issue.2 , pp. 329-338
    • Pauwelussen, J.1    Feenstra, P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.