SCOPUS Information Search Platform

SASO 2009 - 3rd IEEE International Conference on Self-Adaptive and Self-Organizing Systems

2009, Pages 20-29

Distributed W-learning: Multi-policy optimization in self-organizing systems

Dusparic, Ivana (a); Cahill, Vinny (a)

(a) University of Limerick (Ireland)

Author keywords

[No Author keywords available]

Indexed keywords

AGENT-BASED SYSTEMS; COLLABORATIVE AGENTS; GLOBAL KNOWLEDGE; LOCAL INTERACTIONS; OPERATING ENVIRONMENT; POLICY OPTIMIZATION; PUBLIC TRANSPORT VEHICLES; ROUND ROBIN; SELF ORGANIZING; SELF-OPTIMIZATION; SELF-ORGANIZING SYSTEMS; TRAFFIC CONTROLLERS; URBAN TRAFFIC CONTROL; WAITING TIME;

CYBERNETICS; OPTIMIZATION; REINFORCEMENT; TRAFFIC CONTROL;

LEARNING ALGORITHMS;

EID: 73649130992 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/SASO.2009.23 Document Type: Conference Paper

Times cited: 43

References (27)

1
- 33646590265
- Self-organization in multi-agent systems
- G. Di Marzo Serugendo, M.-P. Gleizes, and A. Karageorgos, "Self-organization in multi-agent systems," Knowledge Engineering Review, vol. 20, no. 2, pp. 165-189, 2005.
- (2005) Knowledge Engineering Review , vol.20 , Issue.2 , pp. 165-189
- Di Marzo Serugendo, G.¹ Gleizes, M.-P.² Karageorgos, A.³

2
- 23144465616
- Messor: Load-balancing through a swarm of autonomous agents
- A. Montresor, H. Meling, and O. Babaoglu, "Messor: Load-balancing through a swarm of autonomous agents," in AP2PC '02, ser. Lecture Notes in Artificial Intelligence, G. Moro and M. Koubarakis, Eds., no. 2530. Bologna, Italy: Springer-Verlag, pp. 125-137.
- ser. Lecture Notes in Artificial Intelligence , Issue.2530 , pp. 125-137
- Montresor, A.¹ Meling, H.² Babaoglu, O.³

3
- 0036037269
- A particle swarm model for swarm-based networked sensor systems
- B. A. Kadrovach and G. B. Lamont, "A particle swarm model for swarm-based networked sensor systems." in SAC, 2002, pp. 918-924.
- (2002) SAC , pp. 918-924
- Kadrovach, B.A.¹ Lamont, G.B.²

4
- 51649084574
- Digital evolution of behavioral models for autonomic systems
- H. J. Goldsby, B. H. C. Cheng, P. K. McKinley, D. B. Knoester, and C. A. Ofria, "Digital evolution of behavioral models for autonomic systems," in ICAC '08. IEEE Computer Society, pp. 87-96.
- ICAC '08 , pp. 87-96
- Goldsby, H.J.¹ Cheng, B.H.C.² McKinley, P.K.³ Knoester, D.B.⁴ Ofria, C.A.⁵

5
- 33750252006
- The decentralised coordination of self-adaptive components for autonomic distributed systems
- Ph.D. dissertation, Trinity College Dublin
- J. Dowling, "The decentralised coordination of self-adaptive components for autonomic distributed systems," Ph.D. dissertation, Trinity College Dublin, 2005.
- (2005)
- Dowling, J.¹

6
- 33847379922
- Reinforcement learning in autonomic computing: A manifesto and case studies
- G. Tesauro, "Reinforcement learning in autonomic computing: A manifesto and case studies," IEEE Internet Computing, vol. 11, no. 1, pp. 22-30, 2007.
- (2007) IEEE Internet Computing , vol.11 , Issue.1 , pp. 22-30
- Tesauro, G.¹

7
- 0008898273
- Action selection methods using reinforcement learning
- Ph.D. dissertation, University of Cambridge
- M. Humphrys, "Action selection methods using reinforcement learning," Ph.D. dissertation, University of Cambridge, 1996.
- (1996)
- Humphrys, M.¹

8
- 70049098795
- Distributed W-learning: An algorithm for multi-policy optimization in decentralized autonomic systems (poster)
- I. Dusparic and V. Cahill, "Distributed W-learning: An algorithm for multi-policy optimization in decentralized autonomic systems (poster)," in Proceedings of the 6th International Conference on Autonomic Computing and Communications, 2009.
- (2009) Proceedings of the 6th International Conference on Autonomic Computing and Communications
- Dusparic, I.¹ Cahill, V.²

9
- 0032096675
- Multiagent systems
- K. Sycara, "Multiagent systems," AI Magazine, vol. 19, no. 2, 1998.
- (1998) AI Magazine , vol.19 , Issue.2
- Sycara, K.¹

10
- 70049098795
- Using reinforcement learning for multi-policy optimization in decentralized autonomic systems - an experimental evaluation
- I. Dusparic and V. Cahill, "Using reinforcement learning for multi-policy optimization in decentralized autonomic systems - an experimental evaluation," in The Proceedings of the 6th International Conference on Autonomic and Trusted Computing, 2009.
- (2009) The Proceedings of the 6th International Conference on Autonomic and Trusted Computing
- Dusparic, I.¹ Cahill, V.²

11
- 0004102479
- Reinforcement Learning: An Introduction
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, Massachusetts: A Bradford Book. The MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

12
- 34249833101
- Technical note: Q-learning
- C. J. C. H. Watkins and P. Dayan, "Technical note: Q-learning," Machine Learning, vol. 8, no. 3, pp. 279-292, May 1992.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

13
- 0037616356
- Reinforcement learning for the true adaptive traffic signal control
- B. Abdulhai, R. Pringle, and G. Karakoulas, "Reinforcement learning for the true adaptive traffic signal control," Journal of Transportation Engineering, vol. 129, no. 3, pp. 278-285, May/June 2003.
- (2003) Journal of Transportation Engineering , vol.129 , Issue.3 , pp. 278-285
- Abdulhai, B.¹ Pringle, R.² Karakoulas, G.³

14
- 73649088207
- Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces
- H. Cuayáhuitl, S. Renals, O. Lemon, and H. Shimodaira, "Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces," in International Journal of Game Theory, 2006, pp. 547-565.
- (2006) International Journal of Game Theory , pp. 547-565
- Cuayáhuitl, H.¹ Renals, S.² Lemon, O.³ Shimodaira, H.⁴

15
- 26444601262
- Cooperative multi-agent learning: The state of the art
- L. Panait and S. Luke, "Cooperative multi-agent learning: The state of the art," Autonomous Agents and Multi-Agent Systems, vol. 11, no. 3, pp. 387-434, 2005.
- (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , Issue.3 , pp. 387-434
- Panait, L.¹ Luke, S.²

16
- 73649129112
- Adaptive traffic control with reinforcement learning
- B. C. da Silva, D. de Oliveira, A. L. Bazzan, and E. W. Basso, "Adaptive traffic control with reinforcement learning," in AAMAS '06, May.
- AAMAS '06
- da Silva, B.C.¹ de Oliveira, D.² Bazzan, A.L.³ Basso, E.W.⁴

17
- 16244367141
- Cooperative multiagent systems for the optimization of urban traffic
- E. Bitting and A. A. Ghorbani, "Cooperative multiagent systems for the optimization of urban traffic," in IAT '04. Washington, DC, USA: IEEE Computer Society, pp. 176-182.
- IAT '04 , pp. 176-182
- Bitting, E.¹ Ghorbani, A.A.²

18
- 0033714691
- Distributed reinforcement learning for a traffic engineering application
- M. D. Pendrith, "Distributed reinforcement learning for a traffic engineering application," in AGENTS '00. New York, NY, USA: ACM Press, pp. 404-411.
- AGENTS '00 , pp. 404-411
- Pendrith, M.D.¹

19
- 62949112174
- A collaborative reinforcement learning approach to urban traffic control optimization
- A. Salkham, R. Cunningham, A. Garg, and V. Cahill, "A collaborative reinforcement learning approach to urban traffic control optimization," in IAT '08.
- IAT '08
- Salkham, A.¹ Cunningham, R.² Garg, A.³ Cahill, V.⁴

20
- 36249019659
- Multi-agent reinforcement learning for traffic light control
- M. Wiering, "Multi-agent reinforcement learning for traffic light control," in ICML '00. Morgan Kaufmann, San Francisco, CA, pp. 1151-1158.
- ICML '00 , pp. 1151-1158
- Wiering, M.¹

21
- 34249316911
- Simulation and evaluation of urban bus-networks using a multiagent approach
- D. Meignan, O. Simonin, and A. Koukam, "Simulation and evaluation of urban bus-networks using a multiagent approach," Simulation Modelling Practice and Theory, vol. 15, no. 6, pp. 659-671, July 2007.
- (2007) Simulation Modelling Practice and Theory , vol.15 , Issue.6 , pp. 659-671
- Meignan, D.¹ Simonin, O.² Koukam, A.³

22
- 70350644120
- Making way for emergency vehicles
- E. Oliveira and N. Duarte, "Making way for emergency vehicles," in Proceedings of the 2005 European Simulation and Modelling Conference, pp. 128-135.
- Proceedings of the 2005 European Simulation and Modelling Conference , pp. 128-135
- Oliveira, E.¹ Duarte, N.²

23
- 0002109085
- Multi-agent reinforcement learning: Independent vs. cooperative agents
- M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents," in ICML '93.
- ICML '93
- Tan, M.¹

24
- 62949152586
- Requirements for an ubiquitous computing simulation and emulation environment
- NY, USA: ACM
- V. Reynolds, V. Cahill, and A. Senart, "Requirements for an ubiquitous computing simulation and emulation environment," in InterSense '06. NY, USA: ACM.
- InterSense '06
- Reynolds, V.¹ Cahill, V.² Senart, A.³

25
- 0019934502
- SCATS: The Sydney co-ordinated adaptive traffic system - principles, methodology, algorithms
- P. Lowrie, "SCATS: The Sydney co-ordinated adaptive traffic system - principles, methodology, algorithms," in Proceedings of the IEE International Conference on Road Traffic Signalling, 1982.
- (1982) Proceedings of the IEE International Conference on Road Traffic Signalling
- Lowrie, P.¹

26
- 62949166865
- Learning traffic control - towards practical traffic control using policy gradients
- S. Richter, "Learning traffic control - towards practical traffic control using policy gradients," Albert-Ludwigs-Universität Freiburg, Tech. Rep., 2006.
- (2006)
- Richter, S.¹

27
- 33644809850
- A distributed approach for coordination of traffic signal agents
- A. L. Bazzan, "A distributed approach for coordination of traffic signal agents," Autonomous Agents and Multi-Agent Systems, vol. 10, no. 1, pp. 131-164, 2005.
- (2005) Autonomous Agents and Multi-Agent Systems , vol.10 , Issue.1 , pp. 131-164
- Bazzan, A.L.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.