메뉴 건너뛰기




Volumn 1, Issue , 2011, Pages 764-770

Coordinated multi-agent reinforcement learning in networked distributed POMDPs

Author keywords

[No Author keywords available]

Indexed keywords

DISTRIBUTED CONSTRAINTS; DISTRIBUTED LEARNING; DISTRIBUTED SENSOR; GLOBAL LEARNING; LEARNING APPROACH; LOCAL INTERACTIONS; MODEL FREE; MULTI-AGENT APPLICATIONS; MULTI-AGENT DECISION MAKING; MULTI-AGENT REINFORCEMENT LEARNING; NETWORK OF AGENTS; OFFLINE; OPTIMAL POLICIES;

EID: 80055062322     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (83)

References (13)
  • 1
    • 0031630561 scopus 로고    scopus 로고
    • The dynamics of reinforcement learning in cooperative multiagent systems
    • AAAI Press
    • Claus, C., and Boutilier, C. 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In AAAI'98, 746-752. AAAI Press.
    • (1998) AAAI'98 , pp. 746-752
    • Claus, C.1    Boutilier, C.2
  • 2
    • 0012296128 scopus 로고    scopus 로고
    • Multiagent planning with factored mdps
    • Guestrin, C.; Koller, D.; and Parr, R. 2001. Multiagent planning with factored mdps. In NIPS-14, 1523-1530.
    • (2001) NIPS-14 , pp. 1523-1530
    • Guestrin, C.1    Koller, D.2    Parr, R.3
  • 4
    • 33748543203 scopus 로고    scopus 로고
    • Collaborative multiagent reinforcement learning by payoff propagation
    • Kok, J. R., and Vlassis, N. 2006. Collaborative multiagent reinforcement learning by payoff propagation. Journal of Machine Learning Research 7:1789-1828. (Pubitemid 44373693)
    • (2006) Journal of Machine Learning Research , vol.7 , pp. 1789-1828
    • Kok, J.R.1    Vlassis, N.2
  • 5
    • 84899828955 scopus 로고    scopus 로고
    • Constraint-based dynamic programming for decentralized pomdps with structured interactions
    • Kumar, A., and Zilberstein, S. 2009. Constraint-based dynamic programming for decentralized pomdps with structured interactions. In AAMAS.
    • (2009) AAMAS
    • Kumar, A.1    Zilberstein, S.2
  • 7
    • 84899969517 scopus 로고    scopus 로고
    • Not all agents are equal: Scaling up distributed pomdps for agent networks
    • Marecki, J.; Gupta, T.; Varakantham, P.; Tambe, M.; and Yokoo, M. 2008. Not all agents are equal: Scaling up distributed pomdps for agent networks. In AAMAS, 485-492.
    • (2008) AAMAS , pp. 485-492
    • Marecki, J.1    Gupta, T.2    Varakantham, P.3    Tambe, M.4    Yokoo, M.5
  • 9
    • 78751696710 scopus 로고    scopus 로고
    • Decentralised coordination of mobile sensors using the max-sum algorithm
    • Stranders, R.; Farinelli, A.; Rogers, A.; and Jennings, N. R. 2009. Decentralised coordination of mobile sensors using the max-sum algorithm. In IJCAI, 299-304.
    • (2009) IJCAI , pp. 299-304
    • Stranders, R.1    Farinelli, A.2    Rogers, A.3    Jennings, N.R.4
  • 11
    • 29344437834 scopus 로고    scopus 로고
    • Networked distributed pomdps: A synthesis of distributed constraint optimization and pomdps
    • Varakantham, P.; Tambe, M.; and Yokoo, M. 2005. Networked distributed pomdps: A synthesis of distributed constraint optimization and pomdps. In AAAI, 133-139.
    • (2005) AAAI , pp. 133-139
    • Varakantham, P.1    Tambe, M.2    Yokoo, M.3
  • 12
    • 84899884456 scopus 로고    scopus 로고
    • Integrating organizational control into multi-agent learning
    • Zhang, C.; Abdallah, S.; and Lesser, V. 2009. Integrating organizational control into multi-agent learning. In AAMAS'09.
    • (2009) AAMAS'09
    • Zhang, C.1    Abdallah, S.2    Lesser, V.3
  • 13
    • 84865781568 scopus 로고    scopus 로고
    • Self-organization for coordinating decentralized reinforcement learning
    • Zhang, C.; Lesser, V.; and Abdallah, S. 2010. Self-organization for coordinating decentralized reinforcement learning. In AAMAS'10.
    • (2010) AAMAS'10
    • Zhang, C.1    Lesser, V.2    Abdallah, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.