메뉴 건너뛰기




Volumn , Issue , 2009, Pages 20-29

Distributed W-learning: Multi-policy optimization in self-organizing systems

Author keywords

[No Author keywords available]

Indexed keywords

AGENT-BASED SYSTEMS; COLLABORATIVE AGENTS; GLOBAL KNOWLEDGE; LOCAL INTERACTIONS; OPERATING ENVIRONMENT; POLICY OPTIMIZATION; PUBLIC TRANSPORT VEHICLES; ROUND ROBIN; SELF ORGANIZING; SELF-OPTIMIZATION; SELF-ORGANIZING SYSTEMS; TRAFFIC CONTROLLERS; URBAN TRAFFIC CONTROL; WAITING TIME;

EID: 73649130992     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SASO.2009.23     Document Type: Conference Paper
Times cited : (43)

References (27)
  • 2
    • 23144465616 scopus 로고    scopus 로고
    • Messor: Load-balancing through a swarm of autonomous agents
    • AP2PC '02, G. Moro and M. Koubarakis, Eds, Bologna, Italy: Springer-Verlag, pp
    • A. Montresor, H. Meling, and O. Baboglu, "Messor: Load-balancing through a swarm of autonomous agents," in AP2PC '02, ser. Lecture Notes in Artificial Intelligence, G. Moro and M. Koubarakis, Eds., no. 2530. Bologna, Italy: Springer-Verlag, pp. 125-137.
    • ser. Lecture Notes in Artificial Intelligence , Issue.2530 , pp. 125-137
    • Montresor, A.1    Meling, H.2    Baboglu, O.3
  • 3
    • 0036037269 scopus 로고    scopus 로고
    • A particle swarm model for swarm-based networked sensor systems
    • B. A. Kadrovach and G. B. Lamont, "A particle swarm model for swarm-based networked sensor systems." in SAC, 2002, pp. 918-924.
    • (2002) SAC , pp. 918-924
    • Kadrovach, B.A.1    Lamont, G.B.2
  • 4
    • 51649084574 scopus 로고    scopus 로고
    • Digital evolution of behavioral models for autonomic systems
    • IEEE Computer Society, pp
    • H. J. Goldsby, B. H. C. Cheng, P. K. McKinley, D. B. Knoester, and C. A. Ofria, "Digital evolution of behavioral models for autonomic systems," in ICAC '08. IEEE Computer Society, pp. 87-96.
    • ICAC '08 , pp. 87-96
    • Goldsby, H.J.1    Cheng, B.H.C.2    McKinley, P.K.3    Knoester, D.B.4    Ofria, C.A.5
  • 5
    • 33750252006 scopus 로고    scopus 로고
    • The decentralised coordination of self-adaptive components for autonomic distributed systems,
    • Ph.D. dissertation, Trinity College Dublin
    • J. Dowling, "The decentralised coordination of self-adaptive components for autonomic distributed systems," Ph.D. dissertation, Trinity College Dublin, 2005.
    • (2005)
    • Dowling, J.1
  • 6
    • 33847379922 scopus 로고    scopus 로고
    • Reinforcement learning in autonomic computing: A manifesto and case studies
    • G. Tesauro, "Reinforcement learning in autonomic computing: A manifesto and case studies," IEEE Internet Computing, vol. 11, no. 1, pp. 22-30, 2007.
    • (2007) IEEE Internet Computing , vol.11 , Issue.1 , pp. 22-30
    • Tesauro, G.1
  • 7
    • 0008898273 scopus 로고    scopus 로고
    • Action selection methods using reinforcement learning,
    • Ph.D. dissertation, University of Cambridge
    • M. Humphrys, "Action selection methods using reinforcement learning," Ph.D. dissertation, University of Cambridge, 1996.
    • (1996)
    • Humphrys, M.1
  • 9
    • 0032096675 scopus 로고    scopus 로고
    • Multiagent systems
    • K. Sycara, "Multiagent systems," AI Magazine, vol. 19, no. 2, 1998.
    • (1998) AI Magazine , vol.19 , Issue.2
    • Sycara, K.1
  • 12
    • 34249833101 scopus 로고
    • Technical note: Q-learning
    • May
    • C. J. C. H. Watkins and P. Dayan, "Technical note: Q-learning," Machine Learning, vol. 8, no. 3, pp. 279-292, May 1992.
    • (1992) Machine Learning , vol.8 , Issue.3 , pp. 279-292
    • Watkins, C.J.C.H.1    Dayan, P.2
  • 13
    • 0037616356 scopus 로고    scopus 로고
    • Reinforcement learning for the true adaptive traffic signal control
    • May/June
    • B. Abdulhai, R. Pringle, and G. Karakoulas, "Reinforcement learning for the true adaptive traffic signal control," Journal of Transportation Engineering, vol. 129, no. 3, pp. 278-285, May/June 2003.
    • (2003) Journal of Transportation Engineering , vol.129 , Issue.3 , pp. 278-285
    • Abdulhai, B.1    Pringle, R.2    Karakoulas, G.3
  • 14
    • 73649088207 scopus 로고    scopus 로고
    • Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces
    • H. Cuayáhuitl, S. Renals, O. Lemon, and H. Shimodaira, "Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces," in International Journal of Game Theory, 2006, pp. 547-565.
    • (2006) International Journal of Game Theory , pp. 547-565
    • Cuayáhuitl, H.1    Renals, S.2    Lemon, O.3    Shimodaira, H.4
  • 15
    • 26444601262 scopus 로고    scopus 로고
    • Cooperative multi-agent learning: The state of the art
    • L. Panait and S. Luke, "Cooperative multi-agent learning: The state of the art," AAMAS '05, vol. 11, no. 3, pp. 387-434.
    • AAMAS '05 , vol.11 , Issue.3 , pp. 387-434
    • Panait, L.1    Luke, S.2
  • 17
    • 16244367141 scopus 로고    scopus 로고
    • Cooperative multiagent systems for the optimization of urban traffic
    • Washington, DC, USA: IEEE Computer Society, pp
    • E. Bitting and A. A. Ghorbani, "Cooperative multiagent systems for the optimization of urban traffic," in IAT '04. Washington, DC, USA: IEEE Computer Society, pp. 176-182.
    • IAT '04 , pp. 176-182
    • Bitting, E.1    Ghorbani, A.A.2
  • 18
    • 0033714691 scopus 로고    scopus 로고
    • Distributed reinforcement learning for a traffic engineering application
    • New York, NY, USA: ACM Press, pp
    • M. D. Pendrith, "Distributed reinforcement learning for a traffic engineering application," in AGENTS '00. New York, NY, USA: ACM Press, pp. 404-411.
    • AGENTS '00 , pp. 404-411
    • Pendrith, M.D.1
  • 19
    • 62949112174 scopus 로고    scopus 로고
    • A collaborative reinforcement learning approach to urban traffic control optimization
    • A. Salkham, R. Cunningham, A. Garg, and V. Cahill, "A collaborative reinforcement learning approach to urban traffic control optimization," in IAT '08.
    • IAT '08
    • Salkham, A.1    Cunningham, R.2    Garg, A.3    Cahill, V.4
  • 20
    • 36249019659 scopus 로고    scopus 로고
    • Multi-agent reinforcement learning for traffic light control
    • Morgan Kaufmann, San Francisco, CA, pp
    • M. Wiering, "Multi-agent reinforcement learning for traffic light control," in ICML '00. Morgan Kaufmann, San Francisco, CA, pp. 1151-1158.
    • ICML '00 , pp. 1151-1158
    • Wiering, M.1
  • 21
    • 34249316911 scopus 로고    scopus 로고
    • Simulation and evaluation of urban bus-networks using a multiagent approach
    • July
    • D. Meignan, O. Simonin, and A. Koukam, "Simulation and evaluation of urban bus-networks using a multiagent approach," Simulation Modelling Practice and Theory, vol. 15, no. 6, pp. 659-671, July 2007.
    • (2007) Simulation Modelling Practice and Theory , vol.15 , Issue.6 , pp. 659-671
    • Meignan, D.1    Simonin, O.2    Koukam, A.3
  • 23
    • 0002109085 scopus 로고    scopus 로고
    • Multi-agent reinforcement learning: Independent vs. cooperative agents
    • M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents," in ICML '93.
    • ICML '93
    • Tan, M.1
  • 24
    • 62949152586 scopus 로고    scopus 로고
    • Requirements for an ubiquitous computing simulation and emulation environment
    • NY, USA: ACM
    • V. Reynolds, V. Cahill, and A. Senart, "Requirements for an ubiquitous computing simulation and emulation environment," in InterSense '06. NY, USA: ACM.
    • InterSense '06
    • Reynolds, V.1    Cahill, V.2    Senart, A.3
  • 26
    • 62949166865 scopus 로고    scopus 로고
    • Learning traffic control - towards practical traffic control using policy gradients
    • Albert-Ludwigs-Universitat Freiburg, Tech. Rep
    • S. Richter, "Learning traffic control - towards practical traffic control using policy gradients," Albert-Ludwigs-Universitat Freiburg, Tech. Rep., 2006.
    • (2006)
    • Richter, S.1
  • 27
    • 33644809850 scopus 로고    scopus 로고
    • A distributed approach for coordination of traffic signal agents
    • A. L. Bazzan, "A distributed approach for coordination of traffic signal agents," Autonomous Agents and Multi-Agent Systems, vol. 10, no. 1, pp. 131-164, 2005.
    • (2005) Autonomous Agents and Multi-Agent Systems , vol.10 , Issue.1 , pp. 131-164
    • Bazzan, A.L.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.