SCOPUS 정보 검색 플랫폼

Journal of Artificial Intelligence Research

Volumn 32, Issue , 2008, Pages 289-353

Optimal and approximate Q-value functions for decentralized POMDPs

(3) Oliehoek, Frans A a Spaan, Matthijs T J b Vlassis, Nikos c

a UNIVERSITY OF AMSTERDAM (Netherlands)

b INSTITUTO SUPERIOR TÉCNICO (Portugal)

c TECHNICAL UNIVERSITY OF CRETE (Greece)

Author keywords

[No Author keywords available]

Indexed keywords

DECISION MAKING; DYNAMIC PROGRAMMING;

BENCH-MARK PROBLEMS; DECISION-THEORETIC PLANNING; EFFICIENT COMPUTATION; EXPERIMENTAL EVALUATION; OPTIMAL POLICIES; SEQUENTIAL DECISION MAKING; SINGLE-AGENT; VALUE FUNCTIONS;

RATIONAL FUNCTIONS;

EID: 52249098423 PISSN: None EISSN: 10769757 Source Type: Journal
DOI: 10.1613/jair.2447 Document Type: Article

Times cited : (455)

References (86)

1
- 0023453847
- Decentralized optimal control of Markov chains with a common past information set
- Aicardi, M., Davoli, F., & Minciardi, R. (1987). Decentralized optimal control of Markov chains with a common past information set. IEEE Transactions on Automatic Control, 32(11), 1028-1031.
- (1987) IEEE Transactions on Automatic Control , vol.32 , Issue.11 , pp. 1028-1031
- Aicardi, M.¹ Davoli, F.² Minciardi, R.³

2
- 2342463476
- Applications of Markov decision processes in communication networks
- Feinberg, E. A, & Shwartz, A, Eds, Kluwer Academic Publishers
- Altman, E. (2002). Applications of Markov decision processes in communication networks. In Feinberg, E. A., & Shwartz, A. (Eds.), Handbook of Markov Decision Processes: Methods and Applications. Kluwer Academic Publishers.
- (2002) Handbook of Markov Decision Processes: Methods and Applications
- Altman, E.¹

3
- 52249122724
- Optimal fixed-size controllers for decentralized POMDPs
- Amato, C., Bernstein, D. S., & Zilberstein, S. (2006). Optimal fixed-size controllers for decentralized POMDPs. In Proc. of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM).
- (2006) Proc. of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM)
- Amato, C.¹ Bernstein, D.S.² Zilberstein, S.³

4
- 80053179816
- Optimizing memory-bounded controllers for decentralized POMDPs
- Amato, C., Bernstein, D. S., & Zilberstein, S. (2007a). Optimizing memory-bounded controllers for decentralized POMDPs. In Proc. of Uncertainty in Artificial Intelligence.
- (2007) Proc. of Uncertainty in Artificial Intelligence
- Amato, C.¹ Bernstein, D.S.² Zilberstein, S.³

5
- 84899412493
- Bounded dynamic programming for decentralized POMDPs
- Amato, C., Carlin, A., & Zilberstein, S. (2007b). Bounded dynamic programming for decentralized POMDPs. In Proc. of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM).
- (2007) Proc. of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM)
- Amato, C.¹ Carlin, A.² Zilberstein, S.³

6
- 0036817725
- Editorial: Advances in multirobot systems
- Arai, T., Pagello, E., & Parker, L. (2002). Editorial: Advances in multirobot systems. IEEE Transactions on Robotics and Automation, 18(5), 655-661.
- (2002) IEEE Transactions on Robotics and Automation , vol.18 , Issue.5 , pp. 655-661
- Arai, T.¹ Pagello, E.² Parker, L.³

7
- 58349107260
- Mixed integer linear programming for exact finite-horizon planning in decentralized POMDPs
- Aras, R., Dutech, A., & Charpillet, F. (2007). Mixed integer linear programming for exact finite-horizon planning in decentralized POMDPs. In Proc. of the International Conference on Automated Planning and Scheduling.
- (2007) Proc. of the International Conference on Automated Planning and Scheduling
- Aras, R.¹ Dutech, A.² Charpillet, F.³

8
- 33846298515
- Analyzing myopic approaches for multiagent communication
- Becker, R., Lesser, V., & Zilberstein, S. (2005). Analyzing myopic approaches for multiagent communication. In Proc. of the International Conference on Intelligent Agent Technology, pp. 550-557.
- (2005) Proc. of the International Conference on Intelligent Agent Technology , pp. 550-557
- Becker, R.¹ Lesser, V.² Zilberstein, S.³

9
- 4544301377
- Decentralized Markov decision processes with event-driven interactions
- Becker, R., Zilberstein, S., & Lesser, V. (2004a). Decentralized Markov decision processes with event-driven interactions. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 302-309.
- (2004) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 302-309
- Becker, R.¹ Zilberstein, S.² Lesser, V.³

10
- 27344432831
- Solving transition independent decentralized Markov decision processes
- Becker, R., Zilberstein, S., Lesser, V., & Goldman, C. V. (2004b). Solving transition independent decentralized Markov decision processes. Journal of Artificial Intelligence Research, 22, 423-455.
- (2004) Journal of Artificial Intelligence Research , vol.22 , pp. 423-455
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.V.⁴

11
- 48049099468
- Ph.D. thesis, University of Massachusets Amherst
- Bernstein, D. S. (2005). Complexity Analysis and Optimal Algorithms for Decentralized Decision Making. Ph.D. thesis, University of Massachusets Amherst.
- (2005) Complexity Analysis and Optimal Algorithms for Decentralized Decision Making
- Bernstein, D.S.¹

12
- 0036874366
- The complexity of decentralized control of Markov decision processes
- Bernstein, D. S., Givan, R., Immerman, N., & Zilberstein, S. (2002). The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 27(4), 819-840.
- (2002) Mathematics of Operations Research , vol.27 , Issue.4 , pp. 819-840
- Bernstein, D.S.¹ Givan, R.² Immerman, N.³ Zilberstein, S.⁴

13
- 84880740944
- Bounded policy iteration for decentralized POMDPs
- Bernstein, D. S., Hansen, E. A., & Zilberstein, S. (2005). Bounded policy iteration for decentralized POMDPs. In Proc. of the International Joint Conference on Artificial Intelligence, pp. 1287-1292.
- (2005) Proc. of the International Joint Conference on Artificial Intelligence , pp. 1287-1292
- Bernstein, D.S.¹ Hansen, E.A.² Zilberstein, S.³

14
- 0003565783
- 3rd edition, Athena Scientific
- Bertsekas, D. P. (2005). Dynamic Programming and Optimal Control (3rd edition)., Vol. I. Athena Scientific.
- (2005) Dynamic Programming and Optimal Control , vol.1
- Bertsekas, D.P.¹

15
- 52249124415
- A polynomial algorithm for decentralized Markov decision processes with temporal constraints
- Beynier, A., & Mouaddib, A.-I. (2005). A polynomial algorithm for decentralized Markov decision processes with temporal constraints. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 963-969.
- (2005) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 963-969
- Beynier, A.¹ Mouaddib, A.-I.²

16
- 33750726391
- An iterative algorithm for solving constrained decentralized Markov decision processes
- Beynier, A., & Mouaddib, A.-I. (2006). An iterative algorithm for solving constrained decentralized Markov decision processes. In Proc. of the National Conference on Artificial Intelligence.
- (2006) Proc. of the National Conference on Artificial Intelligence
- Beynier, A.¹ Mouaddib, A.-I.²

17
- 0004003758
- D.C. Heath and Company
- Binmore, K. (1992). Fun and Games. D.C. Heath and Company.
- (1992) Fun and Games
- Binmore, K.¹

18
- 17444409624
- A tutorial on the cross-entropy method
- de Boer, P.-T., Kroese, D. P., Mannor, S., & Rubinstein, R. Y. (2005). A tutorial on the cross-entropy method. Annals of Operations Research, 134(1), 19-67.
- (2005) Annals of Operations Research , vol.134 , Issue.1 , pp. 19-67
- de Boer, P.-T.¹ Kroese, D.P.² Mannor, S.³ Rubinstein, R.Y.⁴

19
- 0002500351
- Planning, learning and coordination in multiagent decision processes
- Boutilier, C. (1996). Planning, learning and coordination in multiagent decision processes. In Proc. of the 6th Conference on Theoretical Aspects of Rationality and Knowledge, pp. 195-210.
- (1996) Proc. of the 6th Conference on Theoretical Aspects of Rationality and Knowledge , pp. 195-210
- Boutilier, C.¹

20
- 0346942368
- Decision-theoretic planning: Structural assumptions and computational leverage
- Boutilier, C., Dean, T., & Hanks, S. (1999). Decision-theoretic planning: Structural assumptions and computational leverage. Journal of Artificial Intelligence Research, 11, 1-94.
- (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 1-94
- Boutilier, C.¹ Dean, T.² Hanks, S.³

21
- 0036040313
- A heuristic approach for solving decentralized-POMDP: Assessment on the pursuit problem
- Chades, I., Scherrer, B., & Charpillet, F. (2002). A heuristic approach for solving decentralized-POMDP: assessment on the pursuit problem. In Proc. of the 2002 ACM Symposium on Applied Computing, pp. 57-62.
- (2002) Proc. of the 2002 ACM Symposium on Applied Computing , pp. 57-62
- Chades, I.¹ Scherrer, B.² Charpillet, F.³

22
- 33748808086
- An approximate dynamic programming approach to decentralized control of stochastic systems
- Cogill, R., Rotkowitz, M., Roy, B. V., & Lall, S. (2004). An approximate dynamic programming approach to decentralized control of stochastic systems. In Proc. of the 2004 Allerton Conference on Communication, Control, and Computing.
- (2004) Proc. of the 2004 Allerton Conference on Communication, Control, and Computing
- Cogill, R.¹ Rotkowitz, M.² Roy, B.V.³ Lall, S.⁴

23
- 33746040213
- Ph.D. thesis, Carnegie Mellon University
- Emery-Montemerlo, R. (2005). Game-Theoretic Control for Robot Teams. Ph.D. thesis, Carnegie Mellon University.
- (2005) Game-Theoretic Control for Robot Teams
- Emery-Montemerlo, R.¹

24
- 4544325183
- Approximate solutions for partially observable stochastic games with common payoffs
- Emery-Montemerlo, R., Gordon, G., Schneider, J., & Thrun, S. (2004). Approximate solutions for partially observable stochastic games with common payoffs. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 136-143.
- (2004) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 136-143
- Emery-Montemerlo, R.¹ Gordon, G.² Schneider, J.³ Thrun, S.⁴

25
- 33846159516
- Game theoretic control for robot teams
- Emery-Montemerlo, R., Gordon, G., Schneider, J., & Thrun, S. (2005). Game theoretic control for robot teams. In Proc. of the IEEE International Conference on Robotics and Automation, pp. 1175-1181.
- (2005) Proc. of the IEEE International Conference on Robotics and Automation , pp. 1175-1181
- Emery-Montemerlo, R.¹ Gordon, G.² Schneider, J.³ Thrun, S.⁴

26
- 33645684503
- Heuristic anytime approaches to stochastic decision processes
- Fernández, J. L., Sanz, R., Simmons, R. G., & Diéguez, A. R. (2006). Heuristic anytime approaches to stochastic decision processes. Journal of Heuristics, 12(3), 181-209.
- (2006) Journal of Heuristics , vol.12 , Issue.3 , pp. 181-209
- Fernández, J.L.¹ Sanz, R.² Simmons, R.G.³ Diéguez, A.R.⁴

27
- 29344465971
- A framework for sequential planning in multiagent settings
- Gmytrasiewicz, P. J., & Doshi, P. (2005). A framework for sequential planning in multiagent settings. Journal of Artificial Intelligence Research, 24, 49-79.
- (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 49-79
- Gmytrasiewicz, P.J.¹ Doshi, P.²

28
- 34249732041
- Learning to communicate in a decentralized environment
- Goldman, C. V., Allen, M., & Zilberstein, S. (2007). Learning to communicate in a decentralized environment. Autonomous Agents and Multi-Agent Systems, 15(1), 47-90.
- (2007) Autonomous Agents and Multi-Agent Systems , vol.15 , Issue.1 , pp. 47-90
- Goldman, C.V.¹ Allen, M.² Zilberstein, S.³

29
- 1142293050
- Optimizing information exchange in cooperative multi-agent systems
- Goldman, C. V., & Zilberstein, S. (2003). Optimizing information exchange in cooperative multi-agent systems. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 137-144.
- (2003) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 137-144
- Goldman, C.V.¹ Zilberstein, S.²

30
- 27344449757
- Decentralized control of cooperative systems: Categorization and complexity analysis
- Goldman, C. V., & Zilberstein, S. (2004). Decentralized control of cooperative systems: Categorization and complexity analysis.. Journal of Artificial Intelligence Research, 22, 143-174.
- (2004) Journal of Artificial Intelligence Research , vol.22 , pp. 143-174
- Goldman, C.V.¹ Zilberstein, S.²

31
- 4544318426
- Efficient solution algorithms for factored MDPs
- Guestrin, C., Koller, D., Parr, R., & Venkataraman, S. (2003). Efficient solution algorithms for factored MDPs. Journal of Artificial Intelligence Research, 19, 399-468.
- (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 399-468
- Guestrin, C.¹ Koller, D.² Parr, R.³ Venkataraman, S.⁴

32
- 9444233318
- Dynamic programming for partially observable stochastic games
- Hansen, E. A., Bernstein, D. S., & Zilberstein, S. (2004). Dynamic programming for partially observable stochastic games. In Proc. of the National Conference on Artificial Intelligence, pp. 709-715.
- (2004) Proc. of the National Conference on Artificial Intelligence , pp. 709-715
- Hansen, E.A.¹ Bernstein, D.S.² Zilberstein, S.³

33
- 0001770240
- Value-function approximations for partially observable Markov decision processes
- Hauskrecht, M. (2000). Value-function approximations for partially observable Markov decision processes.. Journal of Artificial Intelligence Research, 13, 33-94.
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 33-94
- Hauskrecht, M.¹

34
- 0020113091
- Decentralized control of finite state Markov processes
- Hsu, K., & Marcus, S. (1982). Decentralized control of finite state Markov processes. IEEE Transactions on Automatic Control, 27(2), 426-431.
- (1982) IEEE Transactions on Automatic Control , vol.27 , Issue.2 , pp. 426-431
- Hsu, K.¹ Marcus, S.²

35
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling, L. P., Littman, M. L., & Cassandra, A. R. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2), 99-134.
- (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

36
- 33747180593
- Exploiting locality of interaction in networked distributed POMDPs
- Kim, Y., Nair, R., Varakantham, P., Tambe, M., & Yokoo, M. (2006). Exploiting locality of interaction in networked distributed POMDPs. In Proc. of the AAAI Spring Symposium on Distributed Plan and Schedule Management.
- (2006) Proc. of the AAAI Spring Symposium on Distributed Plan and Schedule Management
- Kim, Y.¹ Nair, R.² Varakantham, P.³ Tambe, M.⁴ Yokoo, M.⁵

37
- 0030655611
- RoboCup: The robot world cup initiative
- Kitano, H., Asada, M., Kuniyoshi, Y., Noda, I., & Osawa, E. (1997). RoboCup: The robot world cup initiative. In Proc. of the International Conference on Autonomous Agents.
- (1997) Proc. of the International Conference on Autonomous Agents
- Kitano, H.¹ Asada, M.² Kuniyoshi, Y.³ Noda, I.⁴ Osawa, E.⁵

38
- 0033333228
- Robocup rescue: Search and rescue in large-scale disasters as a domain for autonomous agents research
- Man and Cybernetics, pp
- Kitano, H., Tadokoro, S., Noda, I., Matsubara, H., Takahashi, T., Shinjoh, A., & Shimada, S. (1999). Robocup rescue: Search and rescue in large-scale disasters as a domain for autonomous agents research. In Proc. of the International Conference on Systems, Man and Cybernetics, pp. 739-743.
- (1999) Proc. of the International Conference on Systems , pp. 739-743
- Kitano, H.¹ Tadokoro, S.² Noda, I.³ Matsubara, H.⁴ Takahashi, T.⁵ Shinjoh, A.⁶ Shimada, S.⁷

39
- 0027964134
- Fast algorithms for finding randomized strategies in game trees
- Koller, D., Megiddo, N., & von Stengel, B. (1994). Fast algorithms for finding randomized strategies in game trees. In Proc. of the 26th ACM Symposium on Theory of Computing, pp. 750-759.
- (1994) Proc. of the 26th ACM Symposium on Theory of Computing , pp. 750-759
- Koller, D.¹ Megiddo, N.² von Stengel, B.³

40
- 0031192989
- Representations and solutions for game-theoretic problems
- Koller, D., & Pfeffer, A. (1997). Representations and solutions for game-theoretic problems. Artificial Intelligence, 94(1-2), 167-215.
- (1997) Artificial Intelligence , vol.94 , Issue.1-2 , pp. 167-215
- Koller, D.¹ Pfeffer, A.²

41
- 0000619048
- Extensive games and the problem of information
- Kuhn, H. (1953). Extensive games and the problem of information. Annals of Mathematics Studies, 28, 193-216.
- (1953) Annals of Mathematics Studies , vol.28 , pp. 193-216
- Kuhn, H.¹

42
- 3042527480
- Lesser, V, Ortiz Jr, C. L, & Tambe, M, Eds, Kluwer Academic Publishers
- Lesser, V., Ortiz Jr., C. L., & Tambe, M. (Eds.). (2003). Distributed Sensor Networks: A Multiagent Perspective, Vol. 9. Kluwer Academic Publishers.
- (2003) Distributed Sensor Networks: A Multiagent Perspective , vol.9

43
- 85138579181
- Learning policies for partially observable environments: Scaling up
- Littman, M., Cassandra, A., & Kaelbling, L. (1995). Learning policies for partially observable environments: Scaling up. In Proc. of the International Conference on Machine Learning, pp. 362-370.
- (1995) Proc. of the International Conference on Machine Learning , pp. 362-370
- Littman, M.¹ Cassandra, A.² Kaelbling, L.³

44
- 84881081732
- On opportunistic techniques for solving decentralized Markov decision processes with temporal constraints
- Marecki, J., & Tambe, M. (2007). On opportunistic techniques for solving decentralized Markov decision processes with temporal constraints. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 1-8.
- (2007) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 1-8
- Marecki, J.¹ Tambe, M.²

45
- 9444269433
- Team formation for reformation in multiagent domains like RoboCupRescue
- Nair, R., Tambe, M., & Marsella, S. (2003). Team formation for reformation in multiagent domains like RoboCupRescue. In Proc. of Robo Cup-2002 International Symposium.
- (2003) Proc. of Robo Cup-2002 International Symposium
- Nair, R.¹ Tambe, M.² Marsella, S.³

46
- 4544315369
- Communication for improving policy computation in distributed POMDPs
- Nair, R., Roth, M., & Yohoo, M. (2004). Communication for improving policy computation in distributed POMDPs. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 1098-1105.
- (2004) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 1098-1105
- Nair, R.¹ Roth, M.² Yohoo, M.³

47
- 32844455934
- Team formation for reformation
- Nair, R., Tambe, M., & Marsella, S. (2002). Team formation for reformation. In Proc. of the AAAI Spring Symposium on Intelligent Distributed and Embedded Systems.
- (2002) Proc. of the AAAI Spring Symposium on Intelligent Distributed and Embedded Systems
- Nair, R.¹ Tambe, M.² Marsella, S.³

48
- 1142280934
- Role allocation and reallocation in multiagent teams: Towards a practical analysis
- Nair, R., Tambe, M., & Marsella, S. (2003a). Role allocation and reallocation in multiagent teams: towards a practical analysis. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 552-559.
- (2003) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 552-559
- Nair, R.¹ Tambe, M.² Marsella, S.³

49
- 84880823326
- Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings
- Nair, R., Tambe, M., Yokoo, M., Pynadath, D. V., & Marsella, S. (2003b). Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings. In Proc. of the International Joint Conference on Artificial Intelligence, pp. 705-711.
- (2003) Proc. of the International Joint Conference on Artificial Intelligence , pp. 705-711
- Nair, R.¹ Tambe, M.² Yokoo, M.³ Pynadath, D.V.⁴ Marsella, S.⁵

50
- 29344437834
- Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs
- Nair, R., Varakantham, P., Tambe, M., & Yokoo, M. (2005). Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In Proc. of the National Conference on Artificial Intelligence, pp. 133-139.
- (2005) Proc. of the National Conference on Artificial Intelligence , pp. 133-139
- Nair, R.¹ Varakantham, P.² Tambe, M.³ Yokoo, M.⁴

51
- 0002021736
- Equilibrium points in N-person games
- Nash, J. F. (1950). Equilibrium points in N-person games. Proc. of the National Academy of Sciences of the United States of America, 36, 48-49.
- (1950) Proc. of the National Academy of Sciences of the United States of America , pp. 48-49
- Nash, J.F.¹

52
- 52249121083
- Oliehoek, F., & Vlassis, N. (2006). Dec-POMDPs and extensive form games: equivalence of models and algorithms. Ias technical report IAS-UVA-06-02, University of Amsterdam, Intelligent Systems Lab, Amsterdam, The Netherlands.
- Oliehoek, F., & Vlassis, N. (2006). Dec-POMDPs and extensive form games: equivalence of models and algorithms. Ias technical report IAS-UVA-06-02, University of Amsterdam, Intelligent Systems Lab, Amsterdam, The Netherlands.

53
- 77951732939
- A cross-entropy approach to solving Dec-POMDPs
- Oliehoek, F. A., Kooij, J. F., & Vlassis, N. (2007a). A cross-entropy approach to solving Dec-POMDPs. In Proc. of the International Symposium on Intelligent and Distributed Computing, pp. 145-154.
- (2007) Proc. of the International Symposium on Intelligent and Distributed Computing , pp. 145-154
- Oliehoek, F.A.¹ Kooij, J.F.² Vlassis, N.³

54
- 78650936066
- Dec-POMDPs with delayed communication
- Oliehoek, F. A., Spaan, M. T. J., & Vlassis, N. (2007b). Dec-POMDPs with delayed communication. In Proc. of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM).
- (2007) Proc. of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM)
- Oliehoek, F.A.¹ Spaan, M.T.J.² Vlassis, N.³

55
- 84899909133
- Exploiting locality of interaction in factored Dec-POMDPs
- Oliehoek, F. A., Spaan, M. T. J., Whiteson, S., & Vlassis, N. (2008). Exploiting locality of interaction in factored Dec-POMDPs. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems.
- (2008) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems
- Oliehoek, F.A.¹ Spaan, M.T.J.² Whiteson, S.³ Vlassis, N.⁴

56
- 52249092515
- A hierarchical model for decentralized fighting of large scale urban fires
- Oliehoek, F. A., & Visser, A. (2006). A hierarchical model for decentralized fighting of large scale urban fires. In Proc. of the AAMAS'06 Workshop on Hierarchical Autonomous Agents and Multi-Agent Systems (H-AAMAS), pp. 14-21.
- (2006) Proc. of the AAMAS'06 Workshop on Hierarchical Autonomous Agents and Multi-Agent Systems (H-AAMAS) , pp. 14-21
- Oliehoek, F.A.¹ Visser, A.²

57
- 60349096407
- Q-value functions for decentralized POMDPs
- Oliehoek, F. A., & Vlassis, N. (2007). Q-value functions for decentralized POMDPs. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 833-840.
- (2007) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 833-840
- Oliehoek, F.A.¹ Vlassis, N.²

58
- 0030396683
- Decentralized control of a multiple access broadcast channel: Performance bounds
- Ooi, J. M., & Wornell, G. W. (1996). Decentralized control of a multiple access broadcast channel: Performance bounds. In Proc. of the 35th Conference on Decision and Control, pp. 293-298.
- (1996) Proc. of the 35th Conference on Decision and Control , pp. 293-298
- Ooi, J.M.¹ Wornell, G.W.²

59
- 0003427725
- The MIT Press
- Osborne, M. J., & Rubinstein, A. (1994). A Course in Game Theory. The MIT Press.
- (1994) A Course in Game Theory
- Osborne, M.J.¹ Rubinstein, A.²

60
- 0000977910
- The complexity of Markov decision processes
- Papadimitriou, C. H., & Tsitsiklis, J. N. (1987). The complexity of Markov decision processes. Mathematics of Operations Research, 12(3), 441-451.
- (1987) Mathematics of Operations Research , vol.12 , Issue.3 , pp. 441-451
- Papadimitriou, C.H.¹ Tsitsiklis, J.N.²

61
- 33644792131
- An online POMDP algorithm for complex multiagent environments
- Paquet, S., Tobin, L., & Chaib-draa, B. (2005). An online POMDP algorithm for complex multiagent environments. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems.
- (2005) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems
- Paquet, S.¹ Tobin, L.² Chaib-draa, B.³

62
- 0005943267
- Ph.D. thesis, Brown University
- Peshkin, L. (2001). Reinforcement Learning by Policy Search. Ph.D. thesis, Brown University.
- (2001) Reinforcement Learning by Policy Search
- Peshkin, L.¹

63
- 0012646255
- Learning to cooperate via policy search
- Peshkin, L., Kim, K.-E., Meuleau, N., & Kaelbling, L. P. (2000). Learning to cooperate via policy search. In Proc. of Uncertainty in Artificial Intelligence, pp. 307-314.
- (2000) Proc. of Uncertainty in Artificial Intelligence , pp. 307-314
- Peshkin, L.¹ Kim, K.-E.² Meuleau, N.³ Kaelbling, L.P.⁴

64
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- Pineau, J., Gordon, G., & Thrun, S. (2003). Point-based value iteration: An anytime algorithm for POMDPs. In Proc. of the International Joint Conference on Artificial Intelligence, pp. 1025-1032.
- (2003) Proc. of the International Joint Conference on Artificial Intelligence , pp. 1025-1032
- Pineau, J.¹ Gordon, G.² Thrun, S.³

65
- 0003998452
- John Wiley & Sons, Inc
- Puterman, M. L. (1994). Markov Decision Processes-Discrete Stochastic Dynamic Programming. John Wiley & Sons, Inc.
- (1994) Markov Decision Processes-Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

66
- 1142292938
- The communicative multiagent team decision problem: Analyzing teamwork theories and models
- Pynadath, D. V., & Tambe, M. (2002). The communicative multiagent team decision problem: Analyzing teamwork theories and models. Journal of Artificial Intelligence Research, 16, 389-423.
- (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 389-423
- Pynadath, D.V.¹ Tambe, M.²

67
- 0008787431
- Reduction of a game with complete memory to a. matrix game
- Romanovskii, I. (1962). Reduction of a game with complete memory to a. matrix game. Soviet Mathematics, 3, 678-681.
- (1962) Soviet Mathematics , vol.3 , pp. 678-681
- Romanovskii, I.¹

68
- 33644803977
- Reasoning about joint beliefs for executiontime communication decisions
- Roth, M., Simmons, R., & Veloso, M. (2005). Reasoning about joint beliefs for executiontime communication decisions. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 786-793.
- (2005) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 786-793
- Roth, M.¹ Simmons, R.² Veloso, M.³

69
- 60349107649
- Exploiting factored representations for decentralized execution in multi-agent teams
- Roth, M., Simmons, R., & Veloso, M. (2007). Exploiting factored representations for decentralized execution in multi-agent teams. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 467-463.
- (2007) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 467-463
- Roth, M.¹ Simmons, R.² Veloso, M.³

70
- 0003584577
- 2nd edition, Pearson Education
- Russell, S., & Norvig, P. (2003). Artificial Intelligence: A Modern Approach (2nd edition). Pearson Education.
- (2003) Artificial Intelligence: A Modern Approach
- Russell, S.¹ Norvig, P.²

71
- 0017961760
- Symmetric team problems and multi access wire communication
- Schoute, F. C. (1978). Symmetric team problems and multi access wire communication. Automatica, 14, 255-269.
- (1978) Automatica , vol.14 , pp. 255-269
- Schoute, F.C.¹

72
- 51649085567
- Improved memory-bounded dynamic programming for decentralized POMDPs
- Seuken, S., & Zilberstein, S. (2007a). Improved memory-bounded dynamic programming for decentralized POMDPs. In Proc. of Uncertainty in Artificial Intelligence.
- (2007) Proc. of Uncertainty in Artificial Intelligence
- Seuken, S.¹ Zilberstein, S.²

73
- 84880856384
- Memory-bounded dynamic programming for DECPOMDPs
- Seuken, S., & Zilberstein, S. (2007b). Memory-bounded dynamic programming for DECPOMDPs.. In Proc. of the International Joint Conference on Artificial Intelligence, pp. 2009-2015.
- (2007) Proc. of the International Joint Conference on Artificial Intelligence , pp. 2009-2015
- Seuken, S.¹ Zilberstein, S.²

74
- 0003871607
- Ph.D. thesis, Stanford University
- Sondik, E. J. (1971). The optimal control of partially observable Markov decision processes. Ph.D. thesis, Stanford University.
- (1971) The optimal control of partially observable Markov decision processes
- Sondik, E.J.¹

75
- 34247245809
- Decentralized planning under uncertainty for teams of communicating agents
- Spaan, M. T. J., Gordon, G. J., & Vlassis, N. (2006). Decentralized planning under uncertainty for teams of communicating agents. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 249-256.
- (2006) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 249-256
- Spaan, M.T.J.¹ Gordon, G.J.² Vlassis, N.³

76
- 84899992307
- Interaction-driven Markov games for decentralized multiagent planning under uncertainty
- Spaan, M. T. J., & Melo, F. S. (2008). Interaction-driven Markov games for decentralized multiagent planning under uncertainty. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems.
- (2008) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems
- Spaan, M.T.J.¹ Melo, F.S.²

77
- 31144472319
- Perseus: Randomized point-based value iteration for POMDPs
- Spaan, M. T. J., & Vlassis, N. (2005). Perseus: Randomized point-based value iteration for POMDPs. Journal of Artificial Intelligence Research, 24, 195-220.
- (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 195-220
- Spaan, M.T.J.¹ Vlassis, N.²

78
- 0004102479
- The MIT Press
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement Learning: An Introduction. The MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

79
- 33646423007
- An optimal best-first search algorithm for solving infinite horizon DEC-POMDPs
- Szer, D., & Charpillet, F. (2005). An optimal best-first search algorithm for solving infinite horizon DEC-POMDPs. In Proc. of the European Conference on Machine Learning, pp. 389-399.
- (2005) Proc. of the European Conference on Machine Learning , pp. 389-399
- Szer, D.¹ Charpillet, F.²

80
- 33750691009
- Point-based dynamic programming for DEC-POMDPs
- Szer, D., & Charpillet, F. (2006). Point-based dynamic programming for DEC-POMDPs.. In Proc. of the National Conference on Artificial Intelligence.
- (2006) Proc. of the National Conference on Artificial Intelligence
- Szer, D.¹ Charpillet, F.²

81
- 80053226937
- MAA*: A heuristic search algorithm for solving decentralized POMDPs
- Szer, D., Charpillet, F., & Zilberstein, S. (2005). MAA*: A heuristic search algorithm for solving decentralized POMDPs. In Proc. of Uncertainty in Artificial Intelligence, pp. 576-583.
- (2005) Proc. of Uncertainty in Artificial Intelligence , pp. 576-583
- Szer, D.¹ Charpillet, F.² Zilberstein, S.³

82
- 13444294406
- A multi-agent policy-gradient approach to network routing
- Tao, N., Baxter, J., & Weaver, L. (2001). A multi-agent policy-gradient approach to network routing. In Proc. of the International Conference on Machine Learning, pp. 553-560.
- (2001) Proc. of the International Conference on Machine Learning , pp. 553-560
- Tao, N.¹ Baxter, J.² Weaver, L.³

83
- 60349101997
- Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies
- Varakantham, P., Marecki, J., Yabu, Y., Tambe, M., & Yokoo, M. (2007). Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems.
- (2007) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems
- Varakantham, P.¹ Marecki, J.² Yabu, Y.³ Tambe, M.⁴ Yokoo, M.⁵

84
- 34247187490
- Winning back the cup for distributed POMDPs: Planning over continuous belief spaces
- Varakantham, P., Nair, R., Tambe, M., & Yokoo, M. (2006). Winning back the cup for distributed POMDPs: planning over continuous belief spaces. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 289-296.
- (2006) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 289-296
- Varakantham, P.¹ Nair, R.² Tambe, M.³ Yokoo, M.⁴

85
- 34247270255
- Mixed-integer linear programming for transition-independent decentralized MDPs
- Wu, J., & Durfee, E. H. (2006). Mixed-integer linear programming for transition-independent decentralized MDPs. In Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 1058-1060.
- (2006) Proc. of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 1058-1060
- Wu, J.¹ Durfee, E.H.²

86
- 0034827257
- Communication decisions in multi-agent cooperation: Model and experiments
- Xuan, P., Lesser, V., & Zilberstein, S. (2001). Communication decisions in multi-agent cooperation: Model and experiments. In Proc. of the International Conference on Autonomous Agents.
- (2001) Proc. of the International Conference on Autonomous Agents
- Xuan, P.¹ Lesser, V.² Zilberstein, S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.