Volume , Issue , 2007, Pages 1169-1176

Natural actor-critic for road traffic optimisation

Author keywords

[No Author keywords available]

Indexed keywords

ACTOR CRITIC; ACTOR-CRITIC ALGORITHM; CONTROL SIGNAL; INFINITE HORIZONS; OPTIMISATIONS; REINFORCEMENT LEARNING APPROACH; ROAD TRAFFIC; TRAFFIC SYSTEMS;

EID: 84864064043     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (85)

References (12)
  • 2. L. Bottou and Y. Le Cun. Large scale online learning. In Proc. NIPS'2003, volume 16, 2004.
  • 5. A. G. Sims and K. W. Dobinson. The Sydney coordinated adaptive traffic (SCAT) system philosophy and benefits. IEEE Transactions on Vehicular Technology, VT-29(2):130-137, 1980.
  • 6. M. Wiering. Multi-agent reinforcement learning for traffic light control. In Proc. ICML 2000, 2000.
  • 8. J. A. Bagnell and A. Y. Ng. On local rewards and scaling distributed reinforcement learning. In Proc. NIPS'2005, volume 18, 2006.
  • 9. R. S. Sutton, D. McAllester, S. Singh, and Y. Mansour. Policy gradient methods for reinforcement learning with function approximation. In Proc. NIPS, volume 12. MIT Press, 2000.
  • 10. S. Kakade. A natural policy gradient. In Proc. NIPS'2001, volume 14, 2002.
  • 11. J. A. Boyan. Least-squares temporal difference learning. In Proc. ICML 16, pages 49-56, 1999.
  • 12. J. Baxter, P. Bartlett, and L. Weaver. Experiments with infinite-horizon, policy-gradient estimation. JAIR, 15:351-381, 2001.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.