SCOPUS 정보 검색 플랫폼

Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence, UAI 2010

Volumn , Issue , 2010, Pages 294-301

Anytime planning for decentralized POMDPs using Expectation Maximization

(2) Kumar, Akshat a Zilberstein, Shlomo a

a Biologically Inspired Neural and Dynamical Systems Laboratory (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DECISION MAKING; INFERENCE ENGINES; MULTI AGENT SYSTEMS; STOCHASTIC SYSTEMS;

BENCHMARK DOMAINS; EXPECTATION - MAXIMIZATIONS; EXPECTATION-MAXIMIZATION ALGORITHMS; INFERENCE TECHNIQUES; INHERENT COMPLEXITY; OPTIMIZATION PROBLEMS; SEQUENTIAL DECISION MAKING; STATE OF THE ART;

MAXIMUM PRINCIPLE;

EID: 80053161304 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (32)

References (20)

1
- 77958556757
- Optimizing -xed-size stochastic controllers for POMDPs and decentralized POMDPs
- C. Amato, D. S. Bernstein, and S. Zilberstein. Optimizing -xed-size stochastic controllers for POMDPs and decentralized POMDPs. JAAMAS, 2009.
- (2009) JAAMAS
- Amato, C.¹ Bernstein, D.S.² Zilberstein, S.³

2
- 33749242151
- Planning by probabilistic inference
- H. Attias. Planning by probabilistic inference. In Workshop on AISTATS, 2003.
- (2003) Workshop on AISTATS
- Attias, H.¹

3
- 27344432831
- Solving transition independent decentralized Markov decision processes
- R. Becker, S. Zilberstein, V. Lesser, and C. V. Goldman. Solving transition independent decentralized markov decision processes. JAIR, 22:423-455, 2004. (Pubitemid 41525892)
- (2004) Journal of Artificial Intelligence Research , vol.22 , pp. 423-455
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.V.⁴

4
- 65349083220
- Policy iteration for decentralized control of Markov decision processes
- D. S. Bernstein, C. Amato, E. A. Hansen, and S. Zilberstein. Policy iteration for decentralized control of Markov decision processes. JAIR, 34:89-132, 2009.
- (2009) JAIR , vol.34 , pp. 89-132
- Bernstein, D.S.¹ Amato, C.² Hansen, E.A.³ Zilberstein, S.⁴

5
- 0036874366
- The complexity of decentralized control of Markov decision processes
- D. S. Bernstein, R. Givan, N. Immerman, and S. Zilberstein. The complexity of decentralized control of Markov decision processes. J. MOR, 27:819-840, 2002.
- (2002) J. MOR , vol.27 , pp. 819-840
- Bernstein, D.S.¹ Givan, R.² Immerman, N.³ Zilberstein, S.⁴

6
- 73649114265
- MapReduce: A exible data processing tool
- J. Dean and S. Ghemawat. MapReduce: a exible data processing tool. CACM, 53(1):72-77, 2010.
- (2010) CACM , vol.53 , Issue.1 , pp. 72-77
- Dean, J.¹ Ghemawat, S.²

7
- 0002629270
- Maximum likelihood from incomplete data via the em algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical society, Series B, 39(1):1-38, 1977.
- (1977) Journal of the Royal Statistical Society, Series B , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

8
- 80053136160
- Point-based incremental pruning heuristic for solving -nite-horizon DEC-POMDPs
- J. S. Dibangoye, A.-I. Mouaddib, and B. Chaib-draa. Point-based incremental pruning heuristic for solving -nite-horizon DEC-POMDPs. In AAMAS, pages 569-576, 2009.
- (2009) AAMAS , pp. 569-576
- Dibangoye, J.S.¹ Mouaddib, A.-I.² Chaib-Draa, B.³

9
- 0036433588
- SNOPT: An SQP algorithm for large-scale constrained optimization
- P. E. Gill, W. Murray, and M. A. Saunders. SNOPT: An SQP algorithm for large-scale constrained optimization. SIOPT, 12(4):979-1006, 2002.
- (2002) SIOPT , vol.12 , Issue.4 , pp. 979-1006
- Gill, P.E.¹ Murray, W.² Saunders, M.A.³

10
- 78751705157
- New inference strategies for solving Markov decision processes using reversible jump MCMC
- M. Hoffman, H. Kueck, N. de Freitas, and A. Doucet. New inference strategies for solving Markov decision processes using reversible jump MCMC. In UAI, 2009.
- (2009) UAI
- Hoffman, M.¹ Kueck, H.² De Freitas, N.³ Doucet, A.⁴

11
- 84899456155
- Point based backup for decentralized POMDPs: Complexity and new algorithms
- A. Kumar and S. Zilberstein. Point based backup for decentralized POMDPs: Complexity and new algorithms. In AAMAS, pages 1315-1322, 2010.
- (2010) AAMAS , pp. 1315-1322
- Kumar, A.¹ Zilberstein, S.²

12
- 0001205548
- Complexity of finite-horizon Markov decision process problems
- M. Mundhenk, J. Goldsmith, C. Lusena, and E. Allender. Complexity of -nite-horizon Markov decision process problems. J. ACM, 47(4):681-720, 2000.
- (2000) J. ACM , vol.47 , Issue.4 , pp. 681-720
- Mundhenk, M.¹ Goldsmith, J.² Lusena, C.³ Allender, E.⁴

13
- 29344437834
- Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs
- Proceedings of the 20th National Conference on Artificial Intelligence and the 17th Innovative Applications of Artificial Intelligence Conference, AAAI-05/IAAI-05
- R. Nair, P. Varakantham, M. Tambe, and M. Yokoo. Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In AAAI, pages 133-139, 2005. (Pubitemid 43006767)
- (2005) Proceedings of the National Conference on Artificial Intelligence , vol.1 , pp. 133-139
- Nair, R.¹ Varakantham, P.² Tambe, M.³ Yokoo, M.⁴

14
- 52249098423
- Optimal and approximate Q-value functions for decentralized POMDPs
- F. A. Oliehoek, M. T. J. Spaan, and N. A. Vlassis. Optimal and approximate Q-value functions for decentralized POMDPs. JAIR, 32:289-353, 2008.
- (2008) JAIR , vol.32 , pp. 289-353
- Oliehoek, F.A.¹ Spaan, M.T.J.² Vlassis, N.A.³

15
- 52249090123
- Anytime point-based approximations for large POMDPs
- J. Pineau, G. Gordon, and S. Thrun. Anytime point- based approximations for large POMDPs. JAIR, 27:335-380, 2006.
- (2006) JAIR , vol.27 , pp. 335-380
- Pineau, J.¹ Gordon, G.² Thrun, S.³

16
- 84880856384
- Memory-bounded dynamic programming for DEC-POMDPs
- S. Seuken and S. Zilberstein. Memory-bounded dynamic programming for DEC-POMDPs. In IJCAI, pages 2009-2015, 2007.
- (2007) IJCAI , pp. 2009-2015
- Seuken, S.¹ Zilberstein, S.²

17
- 33750297371
- Heuristic search value iteration for POMDPs
- T. Smith and R. Simmons. Heuristic search value iteration for POMDPs. In UAI, pages 520-527, 2004.
- (2004) UAI , pp. 520-527
- Smith, T.¹ Simmons, R.²

18
- 67349102783
- Hierarchical POMDP controller optimization by likelihood maximization
- M. Toussaint, L. Charlin, and P. Poupart. Hierarchical POMDP controller optimization by likelihood maximization. In UAI, pages 562-570, 2008.
- (2008) UAI , pp. 562-570
- Toussaint, M.¹ Charlin, L.² Poupart, P.³

19
- 51349153274
- Technical Report EDIINF-RR-0934, University of Edinburgh, School of Informatics
- M. Toussaint, S. Harmeling, and A. Storkey. Probabilistic inference for solving (PO)MDPs. Technical Report EDIINF-RR-0934, University of Edinburgh, School of Informatics, 2006.
- (2006) Probabilistic Inference for Solving (PO)MDPs
- Toussaint, M.¹ Harmeling, S.² Storkey, A.³

20
- 33749234798
- Probabilistic inference for solving discrete and continuous state markov decision processes
- M. Toussaint and A. J. Storkey. Probabilistic inference for solving discrete and continuous state markov decision processes. In ICML, pages 945-952, 2006.
- (2006) ICML , pp. 945-952
- Toussaint, M.¹ Storkey, A.J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.