SCOPUS 정보 검색 플랫폼

Proceedings of the National Conference on Artificial Intelligence

Volumn 1, Issue , 2011, Pages 764-770

Coordinated multi-agent reinforcement learning in networked distributed POMDPs

a University of Massachusetts Amherst (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DISTRIBUTED CONSTRAINTS; DISTRIBUTED LEARNING; DISTRIBUTED SENSOR; GLOBAL LEARNING; LEARNING APPROACH; LOCAL INTERACTIONS; MODEL FREE; MULTI-AGENT APPLICATIONS; MULTI-AGENT DECISION MAKING; MULTI-AGENT REINFORCEMENT LEARNING; NETWORK OF AGENTS; OFFLINE; OPTIMAL POLICIES;

DECISION MAKING; OPTIMIZATION; REINFORCEMENT LEARNING;

ARTIFICIAL INTELLIGENCE;

EID: 80055062322 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (83)

References (13)

1
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- AAAI Press
- Claus, C., and Boutilier, C. 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In AAAI'98, 746-752. AAAI Press.
- (1998) AAAI'98 , pp. 746-752
- Claus, C.¹ Boutilier, C.²

2
- 0012296128
- Multiagent planning with factored mdps
- Guestrin, C.; Koller, D.; and Parr, R. 2001. Multiagent planning with factored mdps. In NIPS-14, 1523-1530.
- (2001) NIPS-14 , pp. 1523-1530
- Guestrin, C.¹ Koller, D.² Parr, R.³

3
- 4544236179
- Coordinated reinforcement learning
- San Francisco, CA, USA: Morgan Kaufmann Publishers Inc.
- Guestrin, C.; Lagoudakis, M. G.; and Parr, R. 2002. Coordinated reinforcement learning. In ICML '02: Proceedings of the Nineteenth International Conference on Machine Learning, 227-234. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc.
- (2002) ICML '02: Proceedings of the Nineteenth International Conference on Machine Learning , pp. 227-234
- Guestrin, C.¹ Lagoudakis, M.G.² Parr, R.³

4
- 33748543203
- Collaborative multiagent reinforcement learning by payoff propagation
- Kok, J. R., and Vlassis, N. 2006. Collaborative multiagent reinforcement learning by payoff propagation. Journal of Machine Learning Research 7:1789-1828. (Pubitemid 44373693)
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1789-1828
- Kok, J.R.¹ Vlassis, N.²

5
- 84899828955
- Constraint-based dynamic programming for decentralized pomdps with structured interactions
- Kumar, A., and Zilberstein, S. 2009. Constraint-based dynamic programming for decentralized pomdps with structured interactions. In AAMAS.
- (2009) AAMAS
- Kumar, A.¹ Zilberstein, S.²

6
- 3042527480
- Kluwer Academic Publishers
- Lesser, V.; Ortiz, C.; and Tambe, M., eds. 2003. Distributed Sensor Networks: A Multiagent Perspective (Edited book), volume 9. Kluwer Academic Publishers.
- (2003) Distributed Sensor Networks: A Multiagent Perspective (Edited Book) , vol.9
- Lesser, V.¹ Ortiz, C.² Tambe, M.³

7
- 84899969517
- Not all agents are equal: Scaling up distributed pomdps for agent networks
- Marecki, J.; Gupta, T.; Varakantham, P.; Tambe, M.; and Yokoo, M. 2008. Not all agents are equal: Scaling up distributed pomdps for agent networks. In AAMAS, 485-492.
- (2008) AAMAS , pp. 485-492
- Marecki, J.¹ Gupta, T.² Varakantham, P.³ Tambe, M.⁴ Yokoo, M.⁵

8
- 2342482919
- Instance-based utile distinctions for reinforcement learning with hidden state
- Morgan Kaufmann
- Mccallum, R. A. 1995. Instance-based utile distinctions for reinforcement learning with hidden state. In In Proceedings of the Twelfth International Conference on Machine Learning, 387-395. Morgan Kaufmann.
- (1995) Proceedings of the Twelfth International Conference on Machine Learning , pp. 387-395
- Mccallum, R.A.¹

9
- 78751696710
- Decentralised coordination of mobile sensors using the max-sum algorithm
- Stranders, R.; Farinelli, A.; Rogers, A.; and Jennings, N. R. 2009. Decentralised coordination of mobile sensors using the max-sum algorithm. In IJCAI, 299-304.
- (2009) IJCAI , pp. 299-304
- Stranders, R.¹ Farinelli, A.² Rogers, A.³ Jennings, N.R.⁴

10
- 62949185084
- Introducing communication in dis-pomdps with locality of interaction
- Tasaki, M.; Yabu, Y.; Iwanari, Y.; Yokoo, M.; Tambe, M.; Marecki, J.; and Varakantham, P. 2008. Introducing communication in dis-pomdps with locality of interaction. In Proceedings of the 2008 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, volume 2, 169-175.
- (2008) Proceedings of the 2008 IEEE/WIC/ACM International Conference on Intelligent Agent Technology , vol.2 , pp. 169-175
- Tasaki, M.¹ Yabu, Y.² Iwanari, Y.³ Yokoo, M.⁴ Tambe, M.⁵ Marecki, J.⁶ Varakantham, P.⁷

11
- 29344437834
- Networked distributed pomdps: A synthesis of distributed constraint optimization and pomdps
- Varakantham, P.; Tambe, M.; and Yokoo, M. 2005. Networked distributed pomdps: A synthesis of distributed constraint optimization and pomdps. In AAAI, 133-139.
- (2005) AAAI , pp. 133-139
- Varakantham, P.¹ Tambe, M.² Yokoo, M.³

12
- 84899884456
- Integrating organizational control into multi-agent learning
- Zhang, C.; Abdallah, S.; and Lesser, V. 2009. Integrating organizational control into multi-agent learning. In AAMAS'09.
- (2009) AAMAS'09
- Zhang, C.¹ Abdallah, S.² Lesser, V.³

13
- 84865781568
- Self-organization for coordinating decentralized reinforcement learning
- Zhang, C.; Lesser, V.; and Abdallah, S. 2010. Self-organization for coordinating decentralized reinforcement learning. In AAMAS'10.
- (2010) AAMAS'10
- Zhang, C.¹ Lesser, V.² Abdallah, S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.