메뉴 건너뛰기




Volumn 46, Issue , 2013, Pages 449-509

Incremental clustering and expansion for faster optimal planning in decentralized POMDPs

Author keywords

[No Author keywords available]

Indexed keywords

GENERAL MODEL; HYBRID HEURISTICS; INCREMENTAL CLUSTERING; MULTI-AGENT PLANNING; OPTIMAL PLANNING; OPTIMAL SOLUTIONS; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; THEORETICAL GUARANTEES;

EID: 84878301624     PISSN: None     EISSN: 10769757     Source Type: Journal    
DOI: 10.1613/jair.3804     Document Type: Review
Times cited : (47)

References (107)
  • 4
    • 77954951649 scopus 로고    scopus 로고
    • Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs
    • Amato, C., Bernstein, D. S., & Zilberstein, S. (2010). Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs. Autonomous Agents and Multi-Agent Systems, 21 (3), 293-320.
    • (2010) Autonomous Agents and Multi-Agent Systems , vol.21 , Issue.3 , pp. 293-320
    • Amato, C.1    Bernstein, D.S.2    Zilberstein, S.3
  • 8
    • 77952736651 scopus 로고    scopus 로고
    • An investigation into mathematical programming for finite horizon decentralized POMDPs
    • Aras, R., & Dutech, A. (2010). An investigation into mathematical programming for finite horizon decentralized POMDPs. Journal of Artificial Intelligence Research, 37, 329- 396.
    • (2010) Journal of Artificial Intelligence Research , vol.37 , pp. 329-396
    • Aras, R.1    Dutech, A.2
  • 15
    • 79961008708 scopus 로고    scopus 로고
    • Solving efficiently decentralized MDPs with temporal and resource constraints
    • Beynier, A., & Mouaddib, A.-I. (2011). Solving efficiently decentralized MDPs with temporal and resource constraints. Autonomous Agents and Multi-Agent Systems, 23 (3), 486- 539.
    • (2011) Autonomous Agents and Multi-Agent Systems , vol.23 , Issue.3 , pp. 486-539
    • Beynier, A.1    Mouaddib, A.-I.2
  • 19
    • 0000303438 scopus 로고
    • The multiple sequence alignment problem in biology
    • Carrillo, H., & Lipman, D. (1988). The multiple sequence alignment problem in biology. SIAM Journal on Applied Mathematics, 48 (5), 1073-1082.
    • (1988) SIAM Journal on Applied Mathematics , vol.48 , Issue.5 , pp. 1073-1082
    • Carrillo, H.1    Lipman, D.2
  • 27
    • 59849115174 scopus 로고    scopus 로고
    • Graphical models for interactive POMDPs: Representations and solutions
    • Doshi, P., Zeng, Y., & Chen, Q. (2008). Graphical models for interactive POMDPs: representations and solutions. Autonomous Agents and Multi-Agent Systems, 18 (3), 376-416.
    • (2008) Autonomous Agents and Multi-Agent Systems , vol.18 , Issue.3 , pp. 376-416
    • Doshi, P.1    Zeng, Y.2    Chen, Q.3
  • 30
    • 84873750402 scopus 로고    scopus 로고
    • Solving decentralized POMDP problems using genetic algorithms
    • Eker, B., & Akin, H. L. (2013). Solving decentralized POMDP problems using genetic algorithms. Autonomous Agents and Multi-Agent Systems, 27 (1), 161-196.
    • (2013) Autonomous Agents and Multi-Agent Systems , vol.27 , Issue.1 , pp. 161-196
    • Eker, B.1    Akin, H.L.2
  • 34
    • 0038517214 scopus 로고    scopus 로고
    • Equivalence notions and model minimization in Markov decision processes
    • Givan, R., Dean, T., & Greig, M. (2003). Equivalence notions and model minimization in Markov decision processes. Artificial Intelligence, 14 (1-2), 163-223.
    • (2003) Artificial Intelligence , vol.14 , Issue.1-2 , pp. 163-223
    • Givan, R.1    Dean, T.2    Greig, M.3
  • 38
    • 27344449757 scopus 로고    scopus 로고
    • Decentralized control of cooperative systems: Categorization and complexity analysis
    • Goldman, C. V., & Zilberstein, S. (2004). Decentralized control of cooperative systems: Categorization and complexity analysis. Journal of Artificial Intelligence Research, 22, 143-174.
    • (2004) Journal of Artificial Intelligence Research , vol.22 , pp. 143-174
    • Goldman, C.V.1    Zilberstein, S.2
  • 41
    • 0001770240 scopus 로고    scopus 로고
    • Value-function approximations for partially observable Markov decision processes
    • Hauskrecht, M. (2000). Value-function approximations for partially observable Markov decision processes. Journal of Artificial Intelligence Research, 13, 33-94.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 33-94
    • Hauskrecht, M.1
  • 42
    • 0020113091 scopus 로고
    • Decentralized control of finite state Markov processes
    • Hsu, K., & Marcus, S. (1982). Decentralized control of finite state Markov processes. IEEE Transactions on Automatic Control, 27 (2), 426-431.
    • (1982) IEEE Transactions on Automatic Control , vol.27 , Issue.2 , pp. 426-431
    • Hsu, K.1    Marcus, S.2
  • 44
    • 0001650390 scopus 로고    scopus 로고
    • Enhanced A&z.ast; Algorithms for multiple alignments: Optimal alignments for several sequences and k-opt approximate alignments for large cases
    • Ikeda, T., & Imai, H. (1999). Enhanced A&z.ast; algorithms for multiple alignments: optimal alignments for several sequences and k-opt approximate alignments for large cases. Theoretical Computer Science, 210 (2), 341-374.
    • (1999) Theoretical Computer Science , vol.210 , Issue.2 , pp. 341-374
    • Ikeda, T.1    Imai, H.2
  • 45
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • Kaelbling, L. P., Littman, M. L., & Cassandra, A. R. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101 (1-2), 99-134.
    • (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 52
    • 79955976414 scopus 로고    scopus 로고
    • Decentralized MDPs with sparse interactions
    • Melo, F. S., & Veloso, M. (2011). Decentralized MDPs with sparse interactions. Artificial Intelligence, 175 (11), 1757-1789.
    • (2011) Artificial Intelligence , vol.175 , Issue.11 , pp. 1757-1789
    • Melo, F.S.1    Veloso, M.2
  • 58
    • 85042914320 scopus 로고    scopus 로고
    • Decentralized POMDPs
    • M. Wiering & M. van Otterlo (Eds.), Springer Berlin Heidelberg
    • Oliehoek, F. A. (2012). Decentralized POMDPs. In M. Wiering & M. van Otterlo (Eds.), Reinforcement learning: State of the art (Vol. 12). Springer Berlin Heidelberg.
    • (2012) Reinforcement Learning: State of the Art , vol.12
    • Oliehoek, F.A.1
  • 59
    • 57349184659 scopus 로고    scopus 로고
    • The cross-entropy method for policy search in decentralized POMDPs
    • Oliehoek, F. A., Kooi, J. F., & Vlassis, N. (2008). The cross-entropy method for policy search in decentralized POMDPs. Informatica, 32, 341-357.
    • (2008) Informatica , vol.32 , pp. 341-357
    • Oliehoek, F.A.1    Kooi, J.F.2    Vlassis, N.3
  • 72
    • 26444601262 scopus 로고    scopus 로고
    • Cooperative multi-agent learning: The state of the art
    • Panait, L., & Luke, S. (2005). Cooperative multi-agent learning: The state of the art. Autonomous Agents and Multi-Agent Systems, 11 (3), 387-434.
    • (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , Issue.3 , pp. 387-434
    • Panait, L.1    Luke, S.2
  • 75
    • 1142292938 scopus 로고    scopus 로고
    • The communicative multiagent team decision problem: Analyzing teamwork theories and models
    • Pynadath, D. V., & Tambe, M. (2002). The communicative multiagent team decision problem: Analyzing teamwork theories and models. Journal of Artificial Intelligence Research, 16, 389-423.
    • (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 389-423
    • Pynadath, D.V.1    Tambe, M.2
  • 83
    • 51649127552 scopus 로고    scopus 로고
    • Formal models and algorithms for decentralized decision making under uncertainty
    • Seuken, S., & Zilberstein, S. (2008). Formal models and algorithms for decentralized decision making under uncertainty. Autonomous Agents and Multi-Agent Systems, 17 (2), 190- 250.
    • (2008) Autonomous Agents and Multi-Agent Systems , vol.17 , Issue.2 , pp. 190-250
    • Seuken, S.1    Zilberstein, S.2
  • 89
    • 0032096675 scopus 로고    scopus 로고
    • Multiagent systems
    • Sycara, K. P. (1998). Multiagent systems. AI Magazine, 19 (2), 79-92.
    • (1998) AI Magazine , vol.19 , Issue.2 , pp. 79-92
    • Sycara, K.P.1
  • 91
    • 0022062265 scopus 로고
    • On the complexity of decentralized decision making and detection problems
    • Tsitsiklis, J., & Athans, M. (1985). On the complexity of decentralized decision making and detection problems. IEEE Transactions on Automatic Control, 30 (5), 440-446.
    • (1985) IEEE Transactions on Automatic Control , vol.30 , Issue.5 , pp. 440-446
    • Tsitsiklis, J.1    Athans, M.2
  • 103
    • 78650942474 scopus 로고    scopus 로고
    • Online planning for multi-agent systems with bounded communication
    • Wu, F., Zilberstein, S., & Chen, X. (2011). Online planning for multi-agent systems with bounded communication. Artificial Intelligence, 175 (2), 487-511.
    • (2011) Artificial Intelligence , vol.175 , Issue.2 , pp. 487-511
    • Wu, F.1    Zilberstein, S.2    Chen, X.3
  • 106
    • 84862644620 scopus 로고    scopus 로고
    • Exploiting model equivalences for solving interactive dynamic influence diagrams
    • Zeng, Y., & Doshi, P. (2012). Exploiting model equivalences for solving interactive dynamic influence diagrams. Journal of Artificial Intelligence Research, 43, 211-255.
    • (2012) Journal of Artificial Intelligence Research , vol.43 , pp. 211-255
    • Zeng, Y.1    Doshi, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.