SCOPUS 정보 검색 플랫폼

12th International Conference on Autonomous Agents and Multiagent Systems 2013, AAMAS 2013

Volumn 1, Issue , 2013, Pages 563-570

Approximate solutions for factored Dec-POMDPs with many agents

(3) Oliehoek, Frans A a Whiteson, Shimon b Spaan, Matthijs T J c

a MAASTRICHT UNIVERSITY (Netherlands)

b UNIVERSITY OF AMSTERDAM (Netherlands)

c DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

Author keywords

Factored decentralized partially observable Markov decision processes; Multi agent planning

Indexed keywords

AUTONOMOUS AGENTS; MULTI AGENT SYSTEMS; SCALABILITY;

APPROXIMATE INFERENCE; APPROXIMATE SOLUTION; COMPUTATION METHODS; COUPLED STRUCTURES; EMPIRICAL EVALUATIONS; INTERACTION STRUCTURES; MULTI-AGENT PLANNING; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS;

HEURISTIC METHODS;

EID: 84899420781 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (50)

References (34)

1
- 4544344444
- Transition-independent decentralized Markov decision processes
- R. Becker, S. Zilberstein, V. Lesser, and C. V. Goldman. Transition-independent decentralized Markov decision processes. In AAMAS, 2003.
- (2003) AAMAS
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.V.⁴

2
- 0036874366
- The complexity of decentralized control of Markov decision processes
- D. S. Bernstein, R. Givan, N. Immerman, and S. Zilberstein. The complexity of decentralized control of Markov decision processes. Math, of OR, 27(4): 819-840, 2002.
- (2002) Math, of OR , vol.27 , Issue.4 , pp. 819-840
- Bernstein, D.S.¹ Givan, R.² Immerman, N.³ Zilberstein, S.⁴

3
- 0346942368
- Decision-theoretic planning: Structural assumptions and computational leverage
- C. Boutilier, T. Dean, and S. Hanks. Decision-theoretic planning: Structural assumptions and computational leverage. J AIR, 11: 1-94, 1999.
- (1999) J AIR , vol.11 , pp. 1-94
- Boutilier, C.¹ Dean, T.² Hanks, S.³

4
- 0002436850
- Tractable inference for complex stochastic processes
- X. Boyen and D. Koller. Tractable inference for complex stochastic processes. In UAI, 1998.
- (1998) UAI
- Boyen, X.¹ Koller, D.²

5
- 4544325183
- Approximate solutions for partially observable stochastic games with common payoffs
- R. Emery-Montemerlo, G. Gordon, J. Schneider, and S. Thrun. Approximate solutions for partially observable stochastic games with common payoffs. In AAMAS, 2004.
- (2004) AAMAS
- Emery-Montemerlo, R.¹ Gordon, G.² Schneider, J.³ Thrun, S.⁴

6
- 33846159516
- Game theoretic control for robot teams
- R. Emery-Montemerlo, G. Gordon, J. Schneider, and S. Thrun. Game theoretic control for robot teams. In Proc. of the IEEE Int. Conf. on Robotics and Automation, 2005.
- (2005) Proc. of the IEEE Int. Conf. on Robotics and Automation
- Emery-Montemerlo, R.¹ Gordon, G.² Schneider, J.³ Thrun, S.⁴

7
- 84899946303
- Decentralised coordination of low-power embedded devices using the max-sum algorithm
- A. Farinelli, A. Rogers, A. Petcu, and N. R. Jennings. Decentralised coordination of low-power embedded devices using the max-sum algorithm. In AAMAS, 2008.
- (2008) AAMAS
- Farinelli, A.¹ Rogers, A.² Petcu, A.³ Jennings, N.R.⁴

8
- 4544318426
- Efficient solution algorithms for factored MDPs
- C. Guestrin, D. Koller, R. Parr, and S. Venkataraman. Efficient solution algorithms for factored MDPs. JAIR, 19: 399-468, 2003.
- (2003) JAIR , vol.19 , pp. 399-468
- Guestrin, C.¹ Koller, D.² Parr, R.³ Venkataraman, S.⁴

9
- 84899466233
- Application of max-sum algorithm to radar coordination and scheduling
- Y. Kim, M. Krainin, and V. Lesser. Application of max-sum algorithm to radar coordination and scheduling. In Workshop on Distributed Constraint Reasoning, 2010.
- (2010) Workshop on Distributed Constraint Reasoning
- Kim, Y.¹ Krainin, M.² Lesser, V.³

10
- 33748543203
- Collaborative multiagent reinforcement learning by payoff propagation
- J. R. Kok and N. Vlassis. Collaborative multiagent reinforcement learning by payoff propagation. JMLR, 7: 1789-1828, 2006.
- (2006) JMLR , vol.7 , pp. 1789-1828
- Kok, J.R.¹ Vlassis, N.²

11
- 84880688552
- Computing factored value functions for policies in structured MDPs
- D. Koller and R. Parr. Computing factored value functions for policies in structured MDPs. In IJCAI, 1999.
- (1999) IJCAI
- Koller, D.¹ Parr, R.²

12
- 84899828955
- Constraint-based dynamic programming for decentralized POMDPs with structured interactions
- A. Kumar and S. Zilberstein. Constraint-based dynamic programming for decentralized POMDPs with structured interactions. In AAMAS, 2009.
- (2009) AAMAS
- Kumar, A.¹ Zilberstein, S.²

13
- 84868288428
- Scalable multiagent planning using probabilistic inference
- A. Kumar, S. Zilberstein, and M. Toussaint. Scalable multiagent planning using probabilistic inference. In IJCAI, 2011.
- (2011) IJCAI
- Kumar, A.¹ Zilberstein, S.² Toussaint, M.³

14
- 84899969517
- Not all agents are equal: Scaling up distributed POMDPs for agent networks
- J. Marecki, T. Gupta, P. Varakantham, M. Tambe, and M. Yokoo. Not all agents are equal: scaling up distributed POMDPs for agent networks. In AAMAS, 2008.
- (2008) AAMAS
- Marecki, J.¹ Gupta, T.² Varakantham, P.³ Tambe, M.⁴ Yokoo, M.⁵

15
- 0007788905
- The factored frontier algorithm for approximate inference in DBNs
- K. P. Murphy and Y. Weiss. The factored frontier algorithm for approximate inference in DBNs. In UAI, 2001.
- (2001) UAI
- Murphy, K.P.¹ Weiss, Y.²

16
- 34247214638
- Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs
- R. Nair, P. Varakantham, M. Tambe, and M. Yokoo. Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In AAAI, 2005.
- (2005) AAAI
- Nair, R.¹ Varakantham, P.² Tambe, M.³ Yokoo, M.⁴

17
- 84868298634
- PhD thesis, Informatics Institute, Univ. of Amsterdam
- F. A. Oliehoek. Value-Based Planning for Teams of Agents in Stochastic Partially Observable Environments. PhD thesis, Informatics Institute, Univ. of Amsterdam, 2010.
- (2010) Value-Based Planning for Teams of Agents in Stochastic Partially Observable Environments
- Oliehoek, F.A.¹

18
- 84868288325
- Decentralized POMDPs
- M. Wiering and M. van Otterlo, editors Springer Berlin Heidelberg
- F. A. Oliehoek. Decentralized POMDPs. In M. Wiering and M. van Otterlo, editors, Reinforcement Learning: State of the Art. Springer Berlin Heidelberg, 2012.
- (2012) Reinforcement Learning: State of the Art
- Oliehoek, F.A.¹

19
- 57349184659
- The cross-entropy method for policy search in decentralized POMDPs
- F. A. Oliehoek, J. F. Kooi, and N. Vlassis. The cross-entropy method for policy search in decentralized POMDPs. Informatica, 32: 341-357, 2008.
- (2008) Informatica , vol.32 , pp. 341-357
- Oliehoek, F.A.¹ Kooi, J.F.² Vlassis, N.³

20
- 52249098423
- Optimal and approximate Q-value functions for decentralized POMDPs
- F. A. Oliehoek, M. T. J. Spaan, and N. Vlassis. Optimal and approximate Q-value functions for decentralized POMDPs. JAIR, 32: 289-353, 2008.
- (2008) JAIR , vol.32 , pp. 289-353
- Oliehoek, F.A.¹ Spaan, M.T.J.² Vlassis, N.³

21
- 84899909133
- Exploiting locality of interaction in factored dec-POMDPs
- F. A. Oliehoek, M. T. J. Spaan, S. Whiteson, and N. Vlassis. Exploiting locality of interaction in factored Dec-POMDPs. In AAMAS, 2008.
- (2008) AAMAS
- Oliehoek, F.A.¹ Spaan, M.T.J.² Whiteson, S.³ Vlassis, N.⁴

22
- 84885985853
- Exploiting structure in cooperative Bayesian games
- F. A. Oliehoek, S. Whiteson, and M. T. J. Spaan. Exploiting structure in cooperative Bayesian games. In UAI, 2012.
- (2012) UAI
- Oliehoek, F.A.¹ Whiteson, S.² Spaan, M.T.J.³

23
- 84860644195
- Efficient planning for factored infinite-horizon DEC-POMDPs
- J. Pajarinen and J. Peltonen. Efficient planning for factored infinite-horizon DEC-POMDPs. In IJCAI, 2011.
- (2011) IJCAI
- Pajarinen, J.¹ Peltonen, J.²

24
- 84881044687
- The complexity of multiagent systems: The price of silence
- Z. Rabinovich, C. V. Goldman, and J. S. Rosenschein. The complexity of multiagent systems: the price of silence. In AAMAS, 2003.
- (2003) AAMAS
- Rabinovich, Z.¹ Goldman, C.V.² Rosenschein, J.S.³

25
- 78650949545
- Bounded approximate decentralised coordination via the max-sum algorithm
- A. Rogers, A. Farinelli, R. Stranders, and N. Jennings. Bounded approximate decentralised coordination via the max-sum algorithm. Artif Intel., 175(2): 730-759, 2011.
- (2011) Artif Intel. , vol.175 , Issue.2 , pp. 730-759
- Rogers, A.¹ Farinelli, A.² Stranders, R.³ Jennings, N.⁴

26
- 51649127552
- Formal models and algorithms for decentralized decision making under uncertainty
- S. Seuken and S. Zilberstein. Formal models and algorithms for decentralized decision making under uncertainty. Autonomous Agents and Multi-Agent Systems, 17(2): 190-250, 2008.
- (2008) Autonomous Agents and Multi-Agent Systems , vol.17 , Issue.2 , pp. 190-250
- Seuken, S.¹ Zilberstein, S.²

27
- 84868299292
- Scaling up optimal heuristic search in dec-POMDPs via incremental expansion
- M. T. J. Spaan, F. A. Oliehoek, and C. Amato. Scaling up optimal heuristic search in Dec-POMDPs via incremental expansion. In IJCAI, 2011.
- (2011) IJCAI
- Spaan, M.T.J.¹ Oliehoek, F.A.² Amato, C.³

28
- 80053226937
- *: A heuristic search algorithm for solving decentralized POMDPs
- *: A heuristic search algorithm for solving decentralized POMDPs. In UAI, 2005.
- (2005) UAI
- Szer, D.¹ Charpillet, F.² Zilberstein, S.³

29
- 68949157375
- Transfer learning for reinforcement learning domains: A survey
- M. E. Taylor and P. Stone. Transfer learning for reinforcement learning domains: A survey. JMLR, 10: 1633-1685, 2009.
- (2009) JMLR , vol.10 , pp. 1633-1685
- Taylor, M.E.¹ Stone, P.²

30
- 78650588227
- Exploiting coordination locales in distributed POMDPs via social model shaping
- P. Varakantham, J. Kwak, M. E. Taylor, J. Marecki, P. Scerri, and M. Tambe. Exploiting coordination locales in distributed POMDPs via social model shaping. In ICAPS, 2009.
- (2009) ICAPS
- Varakantham, P.¹ Kwak, J.² Taylor, M.E.³ Marecki, J.⁴ Scerri, P.⁵ Tambe, M.⁶

31
- 78650622568
- Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies
- P. Varakantham, J. Marecki, Y. Yabu, M. Tambe, and M. Yokoo. Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies. In AAMAS, 2007.
- (2007) AAMAS
- Varakantham, P.¹ Marecki, J.² Yabu, Y.³ Tambe, M.⁴ Yokoo, M.⁵

32
- 84899454751
- Distributed model shaping for scaling to decentralized POMDPs with hundreds of agents
- P. Velagapudi, P. Varakantham, P. Scerri, and K. Sycara. Distributed model shaping for scaling to decentralized POMDPs with hundreds of agents. In AAMAS, 2011.
- (2011) AAMAS
- Velagapudi, P.¹ Varakantham, P.² Scerri, P.³ Sycara, K.⁴

33
- 78650593547
- Influence-based policy abstraction for weakly-coupled dec-POMDPs
- S. J. Witwicki and E. H. Durfee. Influence-based policy abstraction for weakly-coupled Dec-POMDPs. In ICAPS, 2010.
- (2010) ICAPS
- Witwicki, S.J.¹ Durfee, E.H.²

34
- 80053153738
- Rollout sampling policy iteration for decentralized POMDPs
- F. Wu, S. Zilberstein, and X. Chen. Rollout sampling policy iteration for decentralized POMDPs. In UAI, 2010.
- (2010) UAI
- Wu, F.¹ Zilberstein, S.² Chen, X.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.