메뉴 건너뛰기




Volumn 5586 LNCS, Issue , 2009, Pages 105-119

Using reinforcement learning for multi-policy optimization in decentralized autonomic systems - An experimental evaluation

Author keywords

[No Author keywords available]

Indexed keywords

AUTONOMIC SYSTEMS; DECENTRALIZED SYSTEM; EXPERIMENTAL EVALUATION; HETEROGENEOUS AGENTS; HIGH LEVEL POLICIES; LEARNING PROCESS; ON CURRENTS; POLICY OPTIMIZATION; REINFORCEMENT LEARNING TECHNIQUES; SELF-OPTIMIZATION; SINGLE STATE; URBAN TRAFFIC CONTROL; VEHICLE TYPES; WAITING TIME;

EID: 70350650156     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-02704-8_9     Document Type: Conference Paper
Times cited : (6)

References (22)
  • 2
    • 33644809850 scopus 로고    scopus 로고
    • A distributed approach for coordination of traffic signal agents
    • Bazzan, A.L.: A distributed approach for coordination of traffic signal agents. Autonomous Agents and Multi-Agent Systems 10(1), 131-164 (2005)
    • (2005) Autonomous Agents and Multi-Agent Systems , vol.10 , Issue.1 , pp. 131-164
    • Bazzan, A.L.1
  • 3
    • 70350652403 scopus 로고    scopus 로고
    • Cuayáhuitl, H., Renals, S., Lemon, O., Shimodaira, H.: Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces. Int. Journal of Game Theory, 547-565 (2006)
    • Cuayáhuitl, H., Renals, S., Lemon, O., Shimodaira, H.: Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces. Int. Journal of Game Theory, 547-565 (2006)
  • 5
    • 0034428959 scopus 로고    scopus 로고
    • Hybrid genetic algorithms for telecommunications network backup routeing
    • October
    • He, L., Nort, N.: Hybrid genetic algorithms for telecommunications network backup routeing. BT Technology Journal 18(4) (October 2000)
    • (2000) BT Technology Journal , vol.18 , Issue.4
    • He, L.1    Nort, N.2
  • 8
    • 0037253062 scopus 로고    scopus 로고
    • The vision of autonomic computing
    • Kephart, J.O., Chess, D.M.: The vision of autonomic computing. Computer 36(1), 41-50 (2003)
    • (2003) Computer , vol.36 , Issue.1 , pp. 41-50
    • Kephart, J.O.1    Chess, D.M.2
  • 9
    • 34249316911 scopus 로고    scopus 로고
    • Simulation and evaluation of urban bus-networks using a multiagent approach
    • Meignan, D., Simonin, O., Koukam, A.: Simulation and evaluation of urban bus-networks using a multiagent approach. Simulation Modelling Practice and Theory 15(6), 659-671 (2007)
    • (2007) Simulation Modelling Practice and Theory , vol.15 , Issue.6 , pp. 659-671
    • Meignan, D.1    Simonin, O.2    Koukam, A.3
  • 10
    • 23144465616 scopus 로고    scopus 로고
    • Montresor, A., Meling, H., Babaoǧlu, Ö.: Messor: Load-balancing through a swarm of autonomous agents. In: Moro, G., Koubarakis, M. (eds.) AP2PC 2002. LNCS (LNAI), 2530, pp. 125-137. Springer, Heidelberg (2003)
    • Montresor, A., Meling, H., Babaoǧlu, Ö.: Messor: Load-balancing through a swarm of autonomous agents. In: Moro, G., Koubarakis, M. (eds.) AP2PC 2002. LNCS (LNAI), vol. 2530, pp. 125-137. Springer, Heidelberg (2003)
  • 14
    • 0033714691 scopus 로고    scopus 로고
    • Pendrith, M.D.: Distributed reinforcement learning for a traffic engineering application. In: AGENTS 2000, pp. 404-411. ACM Press, New York (2000)
    • Pendrith, M.D.: Distributed reinforcement learning for a traffic engineering application. In: AGENTS 2000, pp. 404-411. ACM Press, New York (2000)
  • 15
    • 62949152586 scopus 로고    scopus 로고
    • Requirements for an ubiquitous computing simulation and emulation environment
    • ACM Press, New York
    • Reynolds, V., Cahill, V., Senart, A.: Requirements for an ubiquitous computing simulation and emulation environment. In: InterSense 2006. ACM Press, New York (2006)
    • (2006) InterSense 2006
    • Reynolds, V.1    Cahill, V.2    Senart, A.3
  • 16
    • 62949166865 scopus 로고    scopus 로고
    • Learning traffic control - towards practical traffic control using policy gradients
    • Technical report, Albert-Ludwigs-Universität Freiburg
    • Richter, S.: Learning traffic control - towards practical traffic control using policy gradients. Technical report, Albert-Ludwigs-Universität Freiburg (2006)
    • (2006)
    • Richter, S.1
  • 18
    • 62949112174 scopus 로고    scopus 로고
    • Salkham, A., Cunningham, R., Garg, A., Cahill, V.: A collaborative reinforcement learning approach to urban traffic control optimization. In: International Conference on Intelligent Agent Technology (December 2008)
    • Salkham, A., Cunningham, R., Garg, A., Cahill, V.: A collaborative reinforcement learning approach to urban traffic control optimization. In: International Conference on Intelligent Agent Technology (December 2008)
  • 19
    • 33748438713 scopus 로고    scopus 로고
    • Balancing multiple sources of reward in reinforcement learning
    • Shelton, C.R.: Balancing multiple sources of reward in reinforcement learning. In: Neural Information Processing Systems 2000, pp. 1082-1088 (2000)
    • (2000) Neural Information Processing Systems 2000 , pp. 1082-1088
    • Shelton, C.R.1
  • 21
    • 4544234137 scopus 로고    scopus 로고
    • Tesauro, G., Chess, D.M., Walsh, W.E., Das, R., Segal, A., Whalley, I., Kephart, J.O., White, S.R.: A multi-agent systems approach to autonomic computing. In: AAMAS 2004, pp. 464-471 (2004)
    • Tesauro, G., Chess, D.M., Walsh, W.E., Das, R., Segal, A., Whalley, I., Kephart, J.O., White, S.R.: A multi-agent systems approach to autonomic computing. In: AAMAS 2004, pp. 464-471 (2004)
  • 22
    • 36249019659 scopus 로고    scopus 로고
    • Multi-agent reinforcement learning for traffic light control
    • Morgan Kaufmann, San Francisco
    • Wiering, M.: Multi-agent reinforcement learning for traffic light control. In: Proc. of 17th Int. Conf. on Machine Learning, pp. 1151-1158. Morgan Kaufmann, San Francisco (2000)
    • (2000) Proc. of 17th Int. Conf. on Machine Learning , pp. 1151-1158
    • Wiering, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.