SCOPUS 정보 검색 플랫폼

Informatica (Ljubljana)

Volumn 32, Issue 4, 2008, Pages 341-357

The cross-entropy method for policy search in decentralized POMDPs

(3) Oliehoek, Frans A a Kooij, Julian F P a Vlassis, Nikos b

a UNIVERSITY OF AMSTERDAM (Netherlands)

b TECHNICAL UNIVERSITY OF CRETE (Greece)

Author keywords

Combinatorial optimization; Decentralized POMDPs; Multiagent planning

Indexed keywords

COMBINATORIAL OPTIMIZATION; MULTI AGENT SYSTEMS; OPTIMIZATION;

APPROXIMATE SOLUTIONS; AS MODELS; CE METHODS; COMBINATORIAL OPTIMIZATION PROBLEMS; CROSS ENTROPIES; DECENTRALIZED POMDPS; MULTI AGENTS; MULTIAGENT PLANNING; PLANNING UNDER UNCERTAINTIES; POLICY SEARCHES; STOCHASTIC POLICIES;

COMBINATORIAL MATHEMATICS;

EID: 57349184659 PISSN: 03505596 EISSN: None Source Type: Journal
DOI: None Document Type: Conference Paper

Times cited : (34)

References (39)

1
- 17444384857
- Application of the cross-entropy method to the buffer allocation problem in a simulation-based environment
- G. Alon, D, Kroese, T, Raviv, and R. Rubinstein, Application of the cross-entropy method to the buffer allocation problem in a simulation-based environment. Annals of Operations Research, 134(1): 137- 151,2005.
- (2005) Annals of Operations Research , vol.134 , Issue.1 , pp. 137-151
- Alon, G.¹ Kroese, D.² Raviv, T.³ Rubinstein, R.⁴

2
- 52249122724
- Optimal fixed-size controllers for decentralized POMDPs
- May
- C. Amato, D. S. Bernstein, and S. Zilberstein. Optimal fixed-size controllers for decentralized POMDPs. In Proc. of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM), May 2006.
- (2006) Proc. of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM)
- Amato, C.¹ Bernstein, D.S.² Zilberstein, S.³

3
- 84899412493
- Bounded dynamic programming for decentralized POMDPs
- May
- C. Amato, A. Carlin, and S. Zilberstein. Bounded dynamic programming for decentralized POMDPs. In Proc. of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM), May 2007.
- (2007) Proc. of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM)
- Amato, C.¹ Carlin, A.² Zilberstein, S.³

4
- 58349107260
- Mixed integer linear programming for exact finite-horizon planning in decentralized POMDPs
- R. Aras, A. Dutech, and F. Charpillet. Mixed integer linear programming for exact finite-horizon planning in decentralized POMDPs. In The International Conference on Automated Planning and Scheduling, 2007.
- (2007) The International Conference on Automated Planning and Scheduling
- Aras, R.¹ Dutech, A.² Charpillet, F.³

5
- 27344432831
- Solving transition independent decentralized Markov decision processes
- December
- R. Becker, S. Zilberstein, V. Lesser, and C. V. Goldman. Solving transition independent decentralized Markov decision processes. Journal of Artificial Intelligence Research (JAIR), 22:423-455, December 2004.
- (2004) Journal of Artificial Intelligence Research (JAIR) , vol.22 , pp. 423-455
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.V.⁴

6
- 0141965747
- The complexity of decentralized control of Markov decision processes
- D. S. Bernstein, S. Zilberstein, and N. Immerman. The complexity of decentralized control of Markov decision processes. In Proc. of Uncertainty in Artificial Intelligence, pages 32-37,2000.
- (2000) Proc. of Uncertainty in Artificial Intelligence , pp. 32-37
- Bernstein, D.S.¹ Zilberstein, S.² Immerman, N.³

7
- 0036874366
- The complexity of decentralized control of Markov decision processes
- D. S. Bernstein, R. Givan, N. Immerman, and S. Zilberstein. The complexity of decentralized control of Markov decision processes. Math. Oper. Res., 27(4): 819-840,2002.
- (2002) Math. Oper. Res , vol.27 , Issue.4 , pp. 819-840
- Bernstein, D.S.¹ Givan, R.² Immerman, N.³ Zilberstein, S.⁴

8
- 84880740944
- Bounded policy iteration for decentralized POMDPs
- D. S. Bernstein, E. A. Hansen, and S. Zilberstein. Bounded policy iteration for decentralized POMDPs. In Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI), 2005.
- (2005) Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI)
- Bernstein, D.S.¹ Hansen, E.A.² Zilberstein, S.³

9
- 17744363105
- Global likelihood optimization via the cross-entropy method with an application to mixture models
- Z. Botev and D. P. Kroese. Global likelihood optimization via the cross-entropy method with an application to mixture models. In WSC '04: Proceedings of the 36th conference on Winter simulation, pages 529-535,2004.
- (2004) WSC '04: Proceedings of the 36th conference on Winter simulation , pp. 529-535
- Botev, Z.¹ Kroese, D.P.²

10
- 0002500351
- Planning, learning and coordination in multiagent decision processes
- San Francisco, CA, USA, Morgan Kaufmann Publishers Inc. ISBN 1-55860-417-9
- C. Boutilier. Planning, learning and coordination in multiagent decision processes. In TARK '96: Proceedings of the 6th conference on Theoretical aspects of rationality and knowledge, pages 195-210, San Francisco, CA, USA, 1996. Morgan Kaufmann Publishers Inc. ISBN 1-55860-417-9.
- (1996) TARK '96: Proceedings of the 6th conference on Theoretical aspects of rationality and knowledge , pp. 195-210
- Boutilier, C.¹

11
- 17444420771
- Managing stochastic finite capacity multi-project systems through the cross-entropy method
- I. Cohen, B. Golany, and A. Shtub. Managing stochastic finite capacity multi-project systems through the cross-entropy method. Annals of Operations Research, 134(1):183-199,2005.
- (2005) Annals of Operations Research , vol.134 , Issue.1 , pp. 183-199
- Cohen, I.¹ Golany, B.² Shtub, A.³

12
- 17444409624
- A tutorial on the cross-entropy method
- P.-T. de Boer, D. P. Kroese, S. Mannor, and R. Y. Rubinstein. A tutorial on the cross-entropy method. Annals of Operations Research, 134(1): 19-67,2005.
- (2005) Annals of Operations Research , vol.134 , Issue.1 , pp. 19-67
- de Boer, P.-T.¹ Kroese, D.P.² Mannor, S.³ Rubinstein, R.Y.⁴

13
- 4544325183
- Approximate solutions for partially observable stochastic games with common payoffs
- R. Emery-Montemerlo, G. Gordon, J. Schneider, and S. Thrun. Approximate solutions for partially observable stochastic games with common payoffs. In Proc. of Int. Joint Conference on Autonomous Agents and Multi Agent Systems, pages 136-143, 2004.
- (2004) Proc. of Int. Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 136-143
- Emery-Montemerlo, R.¹ Gordon, G.² Schneider, J.³ Thrun, S.⁴

14
- 33846159516
- Game theoretic control for robot teams
- R. Emery-Montemerlo, G. Gordon, J. Schneider, and S. Thrun. Game theoretic control for robot teams. In Proceedings of the IEEE International Conference on Robotics and Automation, pages 1175-1181,2005.
- (2005) Proceedings of the IEEE International Conference on Robotics and Automation , pp. 1175-1181
- Emery-Montemerlo, R.¹ Gordon, G.² Schneider, J.³ Thrun, S.⁴

15
- 27344449757
- Decentralized control of cooperative systems: Categorization and complexity analysis
- C. V. Goldman and S. Zilberstein. Decentralized control of cooperative systems: Categorization and complexity analysis. Journal of Artificial Intelligence Research (JAIR), 22:143-174, 2004.
- (2004) Journal of Artificial Intelligence Research (JAIR) , vol.22 , pp. 143-174
- Goldman, C.V.¹ Zilberstein, S.²

16
- 84899028010
- Multiagent planning with factored MDPs
- C. Guestrin, D. Koller, and R. Parr. Multiagent planning with factored MDPs. In Advances in Neural Information Processing Systems 14, pages 1523-1530, 2002.
- (2002) Advances in Neural Information Processing Systems , vol.14 , pp. 1523-1530
- Guestrin, C.¹ Koller, D.² Parr, R.³

17
- 9444233318
- Dynamic programming for partially observable stochastic games
- E. A. Hansen, D. S. Bernstein, and S. Zilberstein. Dynamic programming for partially observable stochastic games. In Proc. of the National Conference on Artificial Intelligence, pages 709-715, 2004.
- (2004) Proc. of the National Conference on Artificial Intelligence , pp. 709-715
- Hansen, E.A.¹ Bernstein, D.S.² Zilberstein, S.³

18
- 84947403595
- Probability inequalities for sums of bounded random variables
- Mar
- W. Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301): 13-30, Mar. 1963.
- (1963) Journal of the American Statistical Association , vol.58 , Issue.301 , pp. 13-30
- Hoeffding, W.¹

19
- 0032073263
- Planning and acting in partially observable stochastic domains
- L. P. Kaelbling, M. L. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2):99-134, 1998.
- (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

20
- 33747180593
- Exploiting locality of interaction in networked distributed POMDPs
- Y. Kim, R. Nair, P. Varakantham, M. Tambe, and M. Yokoo. Exploiting locality of interaction in networked distributed POMDPs. In Proceedings of the of the AAAI Spring Symposium on Distributed Plan and Schedule Management, 2006.
- (2006) Proceedings of the of the AAAI Spring Symposium on Distributed Plan and Schedule Management
- Kim, Y.¹ Nair, R.² Varakantham, P.³ Tambe, M.⁴ Yokoo, M.⁵

21
- 33748562008
- Using the max-plus algorithm for multiagent decision making in coordination graphs
- Osaka, Japan, July
- J. R. Kok and N, Vlassis. Using the max-plus algorithm for multiagent decision making in coordination graphs. In RoboCup-2005: Robot Soccer World Cup IX, Osaka, Japan, July 2005.
- (2005) RoboCup-2005: Robot Soccer World Cup IX
- Kok, J.R.¹ Vlassis, N.²

22
- 0031192989
- Representations and solutions for game-theoretic problems
- D. Koller and A. Pfeffer. Representations and solutions for game-theoretic problems. Artificial Intelligence, 94(1-2): 167-215,1997.
- (1997) Artificial Intelligence , vol.94 , Issue.1-2 , pp. 167-215
- Koller, D.¹ Pfeffer, A.²

23
- 0032596468
- On the un-decidability of probabilistic planning and infinite-horizon partially observable Markov decision problems
- O. Madani, S. Hanks, and A. Condon. On the un-decidability of probabilistic planning and infinite-horizon partially observable Markov decision problems. In Proc. of the National Conference on Artificial Intelligence, pages 541-548,1999.
- (1999) Proc. of the National Conference on Artificial Intelligence , pp. 541-548
- Madani, O.¹ Hanks, S.² Condon, A.³

24
- 1942516890
- The cross entropy method for fast policy search
- S. Mannor, R. Rubinstein, and Y. Gat. The cross entropy method for fast policy search. In Proc. of the International Conference on Machine Learning, pages 512-519,2003.
- (2003) Proc. of the International Conference on Machine Learning , pp. 512-519
- Mannor, S.¹ Rubinstein, R.² Gat, Y.³

25
- 84880823326
- Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings
- R. Nair, M. Tambe, M. Yokoo, D. V. Pynadath, and S. Marsella. Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings. In Proc. of the Int. Joint Conf. on Artificial Intelligence, pages 705-711,2003.
- (2003) Proc. of the Int. Joint Conf. on Artificial Intelligence , pp. 705-711
- Nair, R.¹ Tambe, M.² Yokoo, M.³ Pynadath, D.V.⁴ Marsella, S.⁵

26
- 29344437834
- Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs
- R. Nair, P. Varakantham, M. Tambe, and M. Yokoo. Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In Proc. of the National Conference on Artificial Intelligence, pages 133-139,2005.
- (2005) Proc. of the National Conference on Artificial Intelligence , pp. 133-139
- Nair, R.¹ Varakantham, P.² Tambe, M.³ Yokoo, M.⁴

27
- 77951732939
- A cross-entropy approach to solving Dec-POMDPs
- Oct
- F. A. Oliehoek, J. F. Kooij, and N. Vlassis. A cross-entropy approach to solving Dec-POMDPs. In International Symposium on Intelligent and Distributed Computing, pages 145-154, Oct. 2007.
- (2007) International Symposium on Intelligent and Distributed Computing , pp. 145-154
- Oliehoek, F.A.¹ Kooij, J.F.² Vlassis, N.³

28
- 52249098423
- Optimal and approximate Q-value functions for decentralized POMDPs
- F. A. Oliehoek, M. T. J. Spaan, and N. Vlassis. Optimal and approximate Q-value functions for decentralized POMDPs. Journal of Artificial Intelligence Research, 32:289-353, 2008.'
- (2008) Journal of Artificial Intelligence Research , vol.32 , pp. 289-353
- Oliehoek, F.A.¹ Spaan, M.T.J.² Vlassis, N.³

29
- 84899909133
- Exploiting locality of interaction in factored Dec-POMDPs
- F. A. Oliehoek, M. T. J. Spaan, S. Whiteson, and N. Vlassis. Exploiting locality of interaction in factored Dec-POMDPs. In Proc. of Int. Joint Conference on Autonomous Agents and Multi Agent Systems, pages 517-524,2008.
- (2008) Proc. of Int. Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 517-524
- Oliehoek, F.A.¹ Spaan, M.T.J.² Whiteson, S.³ Vlassis, N.⁴

30
- 0030396683
- Decentralized control of a multiple access broadcast channel: Performance bounds
- J. M. Ooi and G. W. Wornell. Decentralized control of a multiple access broadcast channel: Performance bounds. In Proc. 35th Conf. on Decision and Control, 1996.
- (1996) Proc. 35th Conf. on Decision and Control
- Ooi, J.M.¹ Wornell, G.W.²

31
- 0003427725
- The MIT Press, July
- M. J. Osborne and A. Rubinstein. A Course in Game Theory. The MIT Press, July 1994.
- (1994) A Course in Game Theory
- Osborne, M.J.¹ Rubinstein, A.²

32
- 1142292938
- The communicative multiagent team decision problem: Analyzing teamwork theories and models
- D. V. Pynadath and M. Tambe. The communicative multiagent team decision problem: Analyzing teamwork theories and models. Journal of AI research (JAIR), 16:389-423,2002.
- (2002) Journal of AI research (JAIR) , vol.16 , pp. 389-423
- Pynadath, D.V.¹ Tambe, M.²

33
- 60349107649
- Exploiting factored representations for decentralized execution in multi-agent teams
- May
- M, Roth, R. Simmons, and M. Veloso. Exploiting factored representations for decentralized execution in multi-agent teams. In Proc. of Int. Joint Conference on Autonomous Agents and Multi Agent Systems, pages 467-463, May 2007.
- (2007) Proc. of Int. Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 467-463
- Roth, M.¹ Simmons, R.² Veloso, M.³

34
- 51649085567
- Improved memory-bounded dynamic programming for decentralized POMDPs
- July
- S. Seuken and S. Zilberstein. Improved memory-bounded dynamic programming for decentralized POMDPs. In Proc. of Uncertainty in Artificial Intelligence, July 2007.
- (2007) Proc. of Uncertainty in Artificial Intelligence
- Seuken, S.¹ Zilberstein, S.²

35
- 84880856384
- Memory-bounded dynamic programming for DEC-POMDPs
- S. Seuken and S. Zilberstein. Memory-bounded dynamic programming for DEC-POMDPs. In Proc. of the Int. Joint Conf. on Artificial Intelligence, pages 2009-2015,2007.
- (2007) Proc. of the Int. Joint Conf. on Artificial Intelligence , pp. 2009-2015
- Seuken, S.¹ Zilberstein, S.²

36
- 84899992307
- Interaction-driven Markov games for decentralized multiagent planning under uncertainty
- M. T. J. Spaan and F. S. Melo. Interaction-driven Markov games for decentralized multiagent planning under uncertainty. In Proc. of Int. Joint Conference on Autonomous Agents and Multi Agent Systems, pages 525-532,2008.
- (2008) Proc. of Int. Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 525-532
- Spaan, M.T.J.¹ Melo, F.S.²

37
- 33750691009
- Point-based dynamic programming for DEC-POMDPs
- D. Szer and F. Charpillet. Point-based dynamic programming for DEC-POMDPs. In Proc. of the National Conference on Artificial Intelligence, 2006.
- (2006) Proc. of the National Conference on Artificial Intelligence
- Szer, D.¹ Charpillet, F.²

38
- 80053226937
- MA A: A heuristic search algorithm for solving decentralized POMDPs
- D. Szer, F. Charpillet, and S. Zilberstein. MA A: A heuristic search algorithm for solving decentralized POMDPs. In Proc. of Uncertainty in Artificial Intelligence, 2005.
- (2005) Proc. of Uncertainty in Artificial Intelligence
- Szer, D.¹ Charpillet, F.² Zilberstein, S.³

39
- 60349101997
- Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies
- P. Varakantham, J. Marecki, Y. Yabu, M. Tambe, and M. Yokoo. Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies. In Proc. of Int. Joint Conference on Autonomous Agents and Multi Agent Systems, 2007.
- (2007) Proc. of Int. Joint Conference on Autonomous Agents and Multi Agent Systems
- Varakantham, P.¹ Marecki, J.² Yabu, Y.³ Tambe, M.⁴ Yokoo, M.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.