메뉴 건너뛰기




Volumn , Issue , 2009, Pages 361-366

A multi-agent learning approach to online distributed resource allocation

Author keywords

[No Author keywords available]

Indexed keywords

CLUSTER COMPUTING; E-LEARNING; RESOURCE ALLOCATION;

EID: 78751684474     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (50)

References (12)
  • 1
    • 0034441439 scopus 로고    scopus 로고
    • Cluster reserves: A mechanism for resource management in cluster-based network servers
    • Mohit Aron, Peter Druschel, and Willy Zwaenepoel. Cluster reserves: a mechanism for resource management in cluster-based network servers. In Measurement and Modeling of Computer Systems, pages 90-101, 2000.
    • (2000) Measurement and Modeling of Computer Systems , pp. 90-101
    • Aron, M.1    Druschel, P.2    Zwaenepoel, W.3
  • 3
    • 84899027977 scopus 로고    scopus 로고
    • Convergence and noregret in multiagent learning
    • Michael Bowling. Convergence and noregret in multiagent learning. In NIPS'05, pages 209-216, 2005.
    • (2005) NIPS'05 , pp. 209-216
    • Bowling, M.1
  • 4
    • 0000719863 scopus 로고
    • Packet routing in dynamically changing networks: A reinforcement learning approach
    • Justin A. Boyan and Michael L. Littman. Packet routing in dynamically changing networks: A reinforcement learning approach. In NIPS'94, volume 6, pages 671-678, 1994.
    • (1994) NIPS'94 , vol.6 , pp. 671-678
    • Boyan, J.A.1    Littman, M.L.2
  • 5
    • 0031630561 scopus 로고    scopus 로고
    • The dynamics of reinforcement learning in cooperative multiagent systems
    • AAAI Press
    • Caroline Claus and Craig Boutilier. The dynamics of reinforcement learning in cooperative multiagent systems. In AAAI'98, pages 746-752. AAAI Press, 1998.
    • (1998) AAAI'98 , pp. 746-752
    • Claus, C.1    Boutilier, C.2
  • 6
    • 18544386026 scopus 로고    scopus 로고
    • Confidence based dual reinforcement qrouting: An adaptive online network routing algorithm
    • Shailesh Kumar and Risto Miikkulainen. Confidence based dual reinforcement qrouting: An adaptive online network routing algorithm. In IJCAI '99, pages 758-763, 1999.
    • (1999) IJCAI '99 , pp. 758-763
    • Kumar, S.1    Miikkulainen, R.2
  • 7
    • 0024716426 scopus 로고
    • Distributed scheduling of tasks with deadlines and resource requirements
    • K. Ramamritham, J. A. Stankovic, and W. Zhao. Distributed scheduling of tasks with deadlines and resource requirements. IEEE Trans. Comput., 38(8):1110-1123, 1989.
    • (1989) IEEE Trans. Comput. , vol.38 , Issue.8 , pp. 1110-1123
    • Ramamritham, K.1    Stankovic, J.A.2    Zhao, W.3
  • 9
    • 0033901602 scopus 로고    scopus 로고
    • Convergence results for single-step on-policy reinforcement-learning algorithms
    • Satinder P. Singh, Tommi Jaakkola, Michael L. Littman, and Csaba Szepesvari. Convergence results for single-step on-policy reinforcement-learning algorithms. Machine Learning, 38(3):287-308, 2000.
    • (2000) Machine Learning , vol.38 , Issue.3 , pp. 287-308
    • Singh, S.P.1    Jaakkola, T.2    Littman, M.L.3    Szepesvari, C.4
  • 10
    • 29344462255 scopus 로고    scopus 로고
    • Online resource allocation using decompositional reinforcement learning
    • Manuela M. Veloso and Subbarao Kambhampati, editors, AAAI Press / The MIT Press
    • Gerald Tesauro. Online resource allocation using decompositional reinforcement learning. In Manuela M. Veloso and Subbarao Kambhampati, editors, AAAI, pages 886-891. AAAI Press / The MIT Press, 2005.
    • (2005) AAAI , pp. 886-891
    • Tesauro, G.1
  • 12
    • 1942484421 scopus 로고    scopus 로고
    • Online convex programming and generalized infinitesimal gradient ascent
    • Martin Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In ICML'03, pages 928-936, 2003.
    • (2003) ICML'03 , pp. 928-936
    • Zinkevich, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.