메뉴 건너뛰기




Volumn 32, Issue , 2008, Pages 289-353

Optimal and approximate Q-value functions for decentralized POMDPs

Author keywords

[No Author keywords available]

Indexed keywords

DECISION MAKING; DYNAMIC PROGRAMMING;

EID: 52249098423     PISSN: None     EISSN: 10769757     Source Type: Journal    
DOI: 10.1613/jair.2447     Document Type: Article
Times cited : (455)

References (86)
  • 1
    • 0023453847 scopus 로고
    • Decentralized optimal control of Markov chains with a common past information set
    • Aicardi, M., Davoli, F., & Minciardi, R. (1987). Decentralized optimal control of Markov chains with a common past information set. IEEE Transactions on Automatic Control, 32(11), 1028-1031.
    • (1987) IEEE Transactions on Automatic Control , vol.32 , Issue.11 , pp. 1028-1031
    • Aicardi, M.1    Davoli, F.2    Minciardi, R.3
  • 2
    • 2342463476 scopus 로고    scopus 로고
    • Applications of Markov decision processes in communication networks
    • Feinberg, E. A, & Shwartz, A, Eds, Kluwer Academic Publishers
    • Altman, E. (2002). Applications of Markov decision processes in communication networks. In Feinberg, E. A., & Shwartz, A. (Eds.), Handbook of Markov Decision Processes: Methods and Applications. Kluwer Academic Publishers.
    • (2002) Handbook of Markov Decision Processes: Methods and Applications
    • Altman, E.1
  • 17
  • 20
    • 0346942368 scopus 로고    scopus 로고
    • Decision-theoretic planning: Structural assumptions and computational leverage
    • Boutilier, C., Dean, T., & Hanks, S. (1999). Decision-theoretic planning: Structural assumptions and computational leverage. Journal of Artificial Intelligence Research, 11, 1-94.
    • (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 1-94
    • Boutilier, C.1    Dean, T.2    Hanks, S.3
  • 26
    • 33645684503 scopus 로고    scopus 로고
    • Heuristic anytime approaches to stochastic decision processes
    • Fernández, J. L., Sanz, R., Simmons, R. G., & Diéguez, A. R. (2006). Heuristic anytime approaches to stochastic decision processes. Journal of Heuristics, 12(3), 181-209.
    • (2006) Journal of Heuristics , vol.12 , Issue.3 , pp. 181-209
    • Fernández, J.L.1    Sanz, R.2    Simmons, R.G.3    Diéguez, A.R.4
  • 30
    • 27344449757 scopus 로고    scopus 로고
    • Decentralized control of cooperative systems: Categorization and complexity analysis
    • Goldman, C. V., & Zilberstein, S. (2004). Decentralized control of cooperative systems: Categorization and complexity analysis.. Journal of Artificial Intelligence Research, 22, 143-174.
    • (2004) Journal of Artificial Intelligence Research , vol.22 , pp. 143-174
    • Goldman, C.V.1    Zilberstein, S.2
  • 33
    • 0001770240 scopus 로고    scopus 로고
    • Value-function approximations for partially observable Markov decision processes
    • Hauskrecht, M. (2000). Value-function approximations for partially observable Markov decision processes.. Journal of Artificial Intelligence Research, 13, 33-94.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 33-94
    • Hauskrecht, M.1
  • 34
    • 0020113091 scopus 로고
    • Decentralized control of finite state Markov processes
    • Hsu, K., & Marcus, S. (1982). Decentralized control of finite state Markov processes. IEEE Transactions on Automatic Control, 27(2), 426-431.
    • (1982) IEEE Transactions on Automatic Control , vol.27 , Issue.2 , pp. 426-431
    • Hsu, K.1    Marcus, S.2
  • 35
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • Kaelbling, L. P., Littman, M. L., & Cassandra, A. R. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2), 99-134.
    • (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 40
    • 0031192989 scopus 로고    scopus 로고
    • Representations and solutions for game-theoretic problems
    • Koller, D., & Pfeffer, A. (1997). Representations and solutions for game-theoretic problems. Artificial Intelligence, 94(1-2), 167-215.
    • (1997) Artificial Intelligence , vol.94 , Issue.1-2 , pp. 167-215
    • Koller, D.1    Pfeffer, A.2
  • 41
    • 0000619048 scopus 로고
    • Extensive games and the problem of information
    • Kuhn, H. (1953). Extensive games and the problem of information. Annals of Mathematics Studies, 28, 193-216.
    • (1953) Annals of Mathematics Studies , vol.28 , pp. 193-216
    • Kuhn, H.1
  • 42
    • 3042527480 scopus 로고    scopus 로고
    • Lesser, V, Ortiz Jr, C. L, & Tambe, M, Eds, Kluwer Academic Publishers
    • Lesser, V., Ortiz Jr., C. L., & Tambe, M. (Eds.). (2003). Distributed Sensor Networks: A Multiagent Perspective, Vol. 9. Kluwer Academic Publishers.
    • (2003) Distributed Sensor Networks: A Multiagent Perspective , vol.9
  • 52
    • 52249121083 scopus 로고    scopus 로고
    • Oliehoek, F., & Vlassis, N. (2006). Dec-POMDPs and extensive form games: equivalence of models and algorithms. Ias technical report IAS-UVA-06-02, University of Amsterdam, Intelligent Systems Lab, Amsterdam, The Netherlands.
    • Oliehoek, F., & Vlassis, N. (2006). Dec-POMDPs and extensive form games: equivalence of models and algorithms. Ias technical report IAS-UVA-06-02, University of Amsterdam, Intelligent Systems Lab, Amsterdam, The Netherlands.
  • 58
    • 0030396683 scopus 로고    scopus 로고
    • Decentralized control of a multiple access broadcast channel: Performance bounds
    • Ooi, J. M., & Wornell, G. W. (1996). Decentralized control of a multiple access broadcast channel: Performance bounds. In Proc. of the 35th Conference on Decision and Control, pp. 293-298.
    • (1996) Proc. of the 35th Conference on Decision and Control , pp. 293-298
    • Ooi, J.M.1    Wornell, G.W.2
  • 66
    • 1142292938 scopus 로고    scopus 로고
    • The communicative multiagent team decision problem: Analyzing teamwork theories and models
    • Pynadath, D. V., & Tambe, M. (2002). The communicative multiagent team decision problem: Analyzing teamwork theories and models. Journal of Artificial Intelligence Research, 16, 389-423.
    • (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 389-423
    • Pynadath, D.V.1    Tambe, M.2
  • 67
    • 0008787431 scopus 로고
    • Reduction of a game with complete memory to a. matrix game
    • Romanovskii, I. (1962). Reduction of a game with complete memory to a. matrix game. Soviet Mathematics, 3, 678-681.
    • (1962) Soviet Mathematics , vol.3 , pp. 678-681
    • Romanovskii, I.1
  • 71
    • 0017961760 scopus 로고
    • Symmetric team problems and multi access wire communication
    • Schoute, F. C. (1978). Symmetric team problems and multi access wire communication. Automatica, 14, 255-269.
    • (1978) Automatica , vol.14 , pp. 255-269
    • Schoute, F.C.1
  • 79


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.