2010, Pages 294-301

Anytime planning for decentralized POMDPs using Expectation Maximization

Author keywords

[No Author keywords available]

Indexed keywords

DECISION MAKING; INFERENCE ENGINES; MULTI AGENT SYSTEMS; STOCHASTIC SYSTEMS

EID: 80053161304     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (32)

References (20)
  • 1
    • 77958556757
    • Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs
    • C. Amato, D. S. Bernstein, and S. Zilberstein. Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs. JAAMAS, 2009.
    • (2009) JAAMAS
    • Amato, C.1    Bernstein, D.S.2    Zilberstein, S.3
  • 2
    • 33749242151
    • Planning by probabilistic inference
    • H. Attias. Planning by probabilistic inference. In Workshop on AISTATS, 2003.
    • (2003) Workshop on AISTATS
    • Attias, H.1
  • 4
    • 65349083220
    • Policy iteration for decentralized control of Markov decision processes
    • D. S. Bernstein, C. Amato, E. A. Hansen, and S. Zilberstein. Policy iteration for decentralized control of Markov decision processes. JAIR, 34:89-132, 2009.
    • (2009) JAIR , vol.34 , pp. 89-132
    • Bernstein, D.S.1    Amato, C.2    Hansen, E.A.3    Zilberstein, S.4
  • 5
    • 0036874366
    • The complexity of decentralized control of Markov decision processes
    • D. S. Bernstein, R. Givan, N. Immerman, and S. Zilberstein. The complexity of decentralized control of Markov decision processes. J. MOR, 27:819-840, 2002.
    • (2002) J. MOR , vol.27 , pp. 819-840
    • Bernstein, D.S.1    Givan, R.2    Immerman, N.3    Zilberstein, S.4
  • 6
    • 73649114265
    • MapReduce: A flexible data processing tool
    • J. Dean and S. Ghemawat. MapReduce: A flexible data processing tool. CACM, 53(1):72-77, 2010.
    • (2010) CACM , vol.53 , Issue.1 , pp. 72-77
    • Dean, J.1    Ghemawat, S.2
  • 8
    • 80053136160
    • Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs
    • J. S. Dibangoye, A.-I. Mouaddib, and B. Chaib-draa. Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs. In AAMAS, pages 569-576, 2009.
    • (2009) AAMAS , pp. 569-576
    • Dibangoye, J.S.1    Mouaddib, A.-I.2    Chaib-Draa, B.3
  • 9
    • 0036433588
    • SNOPT: An SQP algorithm for large-scale constrained optimization
    • P. E. Gill, W. Murray, and M. A. Saunders. SNOPT: An SQP algorithm for large-scale constrained optimization. SIOPT, 12(4):979-1006, 2002.
    • (2002) SIOPT , vol.12 , Issue.4 , pp. 979-1006
    • Gill, P.E.1    Murray, W.2    Saunders, M.A.3
  • 10
    • 78751705157
    • New inference strategies for solving Markov decision processes using reversible jump MCMC
    • M. Hoffman, H. Kueck, N. de Freitas, and A. Doucet. New inference strategies for solving Markov decision processes using reversible jump MCMC. In UAI, 2009.
    • (2009) UAI
    • Hoffman, M.1    Kueck, H.2    De Freitas, N.3    Doucet, A.4
  • 11
    • 84899456155
    • Point based backup for decentralized POMDPs: Complexity and new algorithms
    • A. Kumar and S. Zilberstein. Point based backup for decentralized POMDPs: Complexity and new algorithms. In AAMAS, pages 1315-1322, 2010.
    • (2010) AAMAS , pp. 1315-1322
    • Kumar, A.1    Zilberstein, S.2
  • 12
    • 0001205548
    • Complexity of finite-horizon Markov decision process problems
    • M. Mundhenk, J. Goldsmith, C. Lusena, and E. Allender. Complexity of finite-horizon Markov decision process problems. J. ACM, 47(4):681-720, 2000.
    • (2000) J. ACM , vol.47 , Issue.4 , pp. 681-720
    • Mundhenk, M.1    Goldsmith, J.2    Lusena, C.3    Allender, E.4
  • 13
    • 29344437834
    • Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs
    • Proceedings of the 20th National Conference on Artificial Intelligence and the 17th Innovative Applications of Artificial Intelligence Conference, AAAI-05/IAAI-05
    • R. Nair, P. Varakantham, M. Tambe, and M. Yokoo. Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In AAAI, pages 133-139, 2005. (Pubitemid 43006767)
    • (2005) Proceedings of the National Conference on Artificial Intelligence , vol.1 , pp. 133-139
    • Nair, R.1    Varakantham, P.2    Tambe, M.3    Yokoo, M.4
  • 14
    • 52249098423
    • Optimal and approximate Q-value functions for decentralized POMDPs
    • F. A. Oliehoek, M. T. J. Spaan, and N. A. Vlassis. Optimal and approximate Q-value functions for decentralized POMDPs. JAIR, 32:289-353, 2008.
    • (2008) JAIR , vol.32 , pp. 289-353
    • Oliehoek, F.A.1    Spaan, M.T.J.2    Vlassis, N.A.3
  • 15
    • 52249090123
    • Anytime point-based approximations for large POMDPs
    • J. Pineau, G. Gordon, and S. Thrun. Anytime point-based approximations for large POMDPs. JAIR, 27:335-380, 2006.
    • (2006) JAIR , vol.27 , pp. 335-380
    • Pineau, J.1    Gordon, G.2    Thrun, S.3
  • 16
    • 84880856384
    • Memory-bounded dynamic programming for DEC-POMDPs
    • S. Seuken and S. Zilberstein. Memory-bounded dynamic programming for DEC-POMDPs. In IJCAI, pages 2009-2015, 2007.
    • (2007) IJCAI , pp. 2009-2015
    • Seuken, S.1    Zilberstein, S.2
  • 17
    • 33750297371
    • Heuristic search value iteration for POMDPs
    • T. Smith and R. Simmons. Heuristic search value iteration for POMDPs. In UAI, pages 520-527, 2004.
    • (2004) UAI , pp. 520-527
    • Smith, T.1    Simmons, R.2
  • 18
    • 67349102783
    • Hierarchical POMDP controller optimization by likelihood maximization
    • M. Toussaint, L. Charlin, and P. Poupart. Hierarchical POMDP controller optimization by likelihood maximization. In UAI, pages 562-570, 2008.
    • (2008) UAI , pp. 562-570
    • Toussaint, M.1    Charlin, L.2    Poupart, P.3
  • 20
    • 33749234798
    • Probabilistic inference for solving discrete and continuous state Markov decision processes
    • M. Toussaint and A. J. Storkey. Probabilistic inference for solving discrete and continuous state Markov decision processes. In ICML, pages 945-952, 2006.
    • (2006) ICML , pp. 945-952
    • Toussaint, M.1    Storkey, A.J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.