메뉴 건너뛰기




Volumn 32, Issue 4, 2008, Pages 341-357

The cross-entropy method for policy search in decentralized POMDPs

Author keywords

Combinatorial optimization; Decentralized POMDPs; Multiagent planning

Indexed keywords

COMBINATORIAL OPTIMIZATION; MULTI AGENT SYSTEMS; OPTIMIZATION;

EID: 57349184659     PISSN: 03505596     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Conference Paper
Times cited : (34)

References (39)
  • 1
    • 17444384857 scopus 로고    scopus 로고
    • Application of the cross-entropy method to the buffer allocation problem in a simulation-based environment
    • G. Alon, D, Kroese, T, Raviv, and R. Rubinstein, Application of the cross-entropy method to the buffer allocation problem in a simulation-based environment. Annals of Operations Research, 134(1): 137- 151,2005.
    • (2005) Annals of Operations Research , vol.134 , Issue.1 , pp. 137-151
    • Alon, G.1    Kroese, D.2    Raviv, T.3    Rubinstein, R.4
  • 7
    • 0036874366 scopus 로고    scopus 로고
    • The complexity of decentralized control of Markov decision processes
    • D. S. Bernstein, R. Givan, N. Immerman, and S. Zilberstein. The complexity of decentralized control of Markov decision processes. Math. Oper. Res., 27(4): 819-840,2002.
    • (2002) Math. Oper. Res , vol.27 , Issue.4 , pp. 819-840
    • Bernstein, D.S.1    Givan, R.2    Immerman, N.3    Zilberstein, S.4
  • 9
    • 17744363105 scopus 로고    scopus 로고
    • Global likelihood optimization via the cross-entropy method with an application to mixture models
    • Z. Botev and D. P. Kroese. Global likelihood optimization via the cross-entropy method with an application to mixture models. In WSC '04: Proceedings of the 36th conference on Winter simulation, pages 529-535,2004.
    • (2004) WSC '04: Proceedings of the 36th conference on Winter simulation , pp. 529-535
    • Botev, Z.1    Kroese, D.P.2
  • 10
    • 0002500351 scopus 로고    scopus 로고
    • Planning, learning and coordination in multiagent decision processes
    • San Francisco, CA, USA, Morgan Kaufmann Publishers Inc. ISBN 1-55860-417-9
    • C. Boutilier. Planning, learning and coordination in multiagent decision processes. In TARK '96: Proceedings of the 6th conference on Theoretical aspects of rationality and knowledge, pages 195-210, San Francisco, CA, USA, 1996. Morgan Kaufmann Publishers Inc. ISBN 1-55860-417-9.
    • (1996) TARK '96: Proceedings of the 6th conference on Theoretical aspects of rationality and knowledge , pp. 195-210
    • Boutilier, C.1
  • 11
    • 17444420771 scopus 로고    scopus 로고
    • Managing stochastic finite capacity multi-project systems through the cross-entropy method
    • I. Cohen, B. Golany, and A. Shtub. Managing stochastic finite capacity multi-project systems through the cross-entropy method. Annals of Operations Research, 134(1):183-199,2005.
    • (2005) Annals of Operations Research , vol.134 , Issue.1 , pp. 183-199
    • Cohen, I.1    Golany, B.2    Shtub, A.3
  • 15
    • 27344449757 scopus 로고    scopus 로고
    • Decentralized control of cooperative systems: Categorization and complexity analysis
    • C. V. Goldman and S. Zilberstein. Decentralized control of cooperative systems: Categorization and complexity analysis. Journal of Artificial Intelligence Research (JAIR), 22:143-174, 2004.
    • (2004) Journal of Artificial Intelligence Research (JAIR) , vol.22 , pp. 143-174
    • Goldman, C.V.1    Zilberstein, S.2
  • 18
    • 84947403595 scopus 로고
    • Probability inequalities for sums of bounded random variables
    • Mar
    • W. Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301): 13-30, Mar. 1963.
    • (1963) Journal of the American Statistical Association , vol.58 , Issue.301 , pp. 13-30
    • Hoeffding, W.1
  • 19
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • L. P. Kaelbling, M. L. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2):99-134, 1998.
    • (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 21
    • 33748562008 scopus 로고    scopus 로고
    • Using the max-plus algorithm for multiagent decision making in coordination graphs
    • Osaka, Japan, July
    • J. R. Kok and N, Vlassis. Using the max-plus algorithm for multiagent decision making in coordination graphs. In RoboCup-2005: Robot Soccer World Cup IX, Osaka, Japan, July 2005.
    • (2005) RoboCup-2005: Robot Soccer World Cup IX
    • Kok, J.R.1    Vlassis, N.2
  • 22
    • 0031192989 scopus 로고    scopus 로고
    • Representations and solutions for game-theoretic problems
    • D. Koller and A. Pfeffer. Representations and solutions for game-theoretic problems. Artificial Intelligence, 94(1-2): 167-215,1997.
    • (1997) Artificial Intelligence , vol.94 , Issue.1-2 , pp. 167-215
    • Koller, D.1    Pfeffer, A.2
  • 23
    • 0032596468 scopus 로고    scopus 로고
    • On the un-decidability of probabilistic planning and infinite-horizon partially observable Markov decision problems
    • O. Madani, S. Hanks, and A. Condon. On the un-decidability of probabilistic planning and infinite-horizon partially observable Markov decision problems. In Proc. of the National Conference on Artificial Intelligence, pages 541-548,1999.
    • (1999) Proc. of the National Conference on Artificial Intelligence , pp. 541-548
    • Madani, O.1    Hanks, S.2    Condon, A.3
  • 30
    • 0030396683 scopus 로고    scopus 로고
    • Decentralized control of a multiple access broadcast channel: Performance bounds
    • J. M. Ooi and G. W. Wornell. Decentralized control of a multiple access broadcast channel: Performance bounds. In Proc. 35th Conf. on Decision and Control, 1996.
    • (1996) Proc. 35th Conf. on Decision and Control
    • Ooi, J.M.1    Wornell, G.W.2
  • 32
    • 1142292938 scopus 로고    scopus 로고
    • The communicative multiagent team decision problem: Analyzing teamwork theories and models
    • D. V. Pynadath and M. Tambe. The communicative multiagent team decision problem: Analyzing teamwork theories and models. Journal of AI research (JAIR), 16:389-423,2002.
    • (2002) Journal of AI research (JAIR) , vol.16 , pp. 389-423
    • Pynadath, D.V.1    Tambe, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.