SCOPUS Information Retrieval Platform

Journal of Artificial Intelligence Research

Volume 37, 2010, Pages 329-396

An investigation into mathematical programming for finite horizon decentralized POMDPs

(2) Aras, Raghav a Dutech, Alain b

a SUPELEC (France)

b LORIA (France)

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATE SOLUTION; CLASSICAL TESTS; COMPLEX TASK; DECENTRALIZED PLANNING; DECISION-THEORETIC; EXPERIMENTAL VALIDATIONS; FINITE HORIZONS; MIXED-INTEGER LINEAR PROGRAMMING; MODELING TOOL; NOVEL ALGORITHM; OPTIMAL SOLUTIONS; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; UNCERTAIN ENVIRONMENTS;

ALGORITHMS; GAME THEORY; INTEGER PROGRAMMING; LINEARIZATION; MARKOV PROCESSES; OPTIMAL SYSTEMS; OPTIMIZATION;

DYNAMIC PROGRAMMING;

EID: 77952736651 PISSN: None EISSN: 10769757 Source Type: Journal
DOI: 10.1613/jair.2915 Document Type: Article

Times cited : (21)

References (57)

1
- 80053179816
- Optimizing memory-bounded controllers for decentralized POMDPs
- Amato, C., Bernstein, D. S., & Zilberstein, S. (2007a). Optimizing memory-bounded controllers for decentralized POMDPs. In Proc. of the Twenty-Third Conf. on Uncertainty in Artificial Intelligence (UAI-07).
- (2007) Proc. of the Twenty-Third Conf. on Uncertainty in Artificial Intelligence (UAI-07)
- Amato, C.¹ Bernstein, D.S.² Zilberstein, S.³

2
- 34247254959
- Solving POMDPs using quadratically constrained linear programs
- Amato, C., Bernstein, D. S., & Zilberstein, S. (2007b). Solving POMDPs using quadratically constrained linear programs. In Proc. of the Twentieth Int. Joint Conf. on Artificial Intelligence (IJCAI'07).
- (2007) Proc. of the Twentieth Int. Joint Conf. on Artificial Intelligence (IJCAI'07)
- Amato, C.¹ Bernstein, D.S.² Zilberstein, S.³

3
- 84899412493
- Bounded dynamic programming for decentralized POMDPs
- Amato, C., Carlin, A., & Zilberstein, S. (2007c). Bounded dynamic programming for decentralized POMDPs. In Proc. of the Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM) in AAMAS'07.
- (2007) Proc. of the Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM) in AAMAS'07
- Amato, C.¹ Carlin, A.² Zilberstein, S.³

4
- 77958561050
- Incremental policy generation for finite-horizon DEC-POMDPs
- Amato, C., Dibangoye, J., & Zilberstein, S. (2009). Incremental policy generation for finite-horizon DEC-POMDPs. In Proc. of the Nineteenth Int. Conf. on Automated Planning and Scheduling (ICAPS-09).
- (2009) Proc. of the Nineteenth Int. Conf. on Automated Planning and Scheduling (ICAPS-09)
- Amato, C.¹ Dibangoye, J.² Zilberstein, S.³

5
- 0019263725
- Time-varying feedback laws for decentralized control
- Anderson, B., & Moore, J. (1980). Time-varying feedback laws for decentralized control. Nineteenth IEEE Conference on Decision and Control including the Symposium on Adaptive Processes, 19(1), 519-524.
- (1980) Nineteenth IEEE Conference on Decision and Control including the Symposium on Adaptive Processes , vol.19 , Issue.1 , pp. 519-524
- Anderson, B.¹ Moore, J.²

6
- 27344432831
- Solving transition independent decentralized Markov decision processes
- Becker, R., Zilberstein, S., Lesser, V., & Goldman, C. (2004). Solving transition independent decentralized Markov decision processes. Journal of Artificial Intelligence Research, 22, 423-455.
- (2004) Journal of Artificial Intelligence Research , vol.22 , pp. 423-455
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.⁴

7
- 85012688561
- Princeton University Press, Princeton, New Jersey
- Bellman, R. (1957). Dynamic programming. Princeton University Press, Princeton, New Jersey.
- (1957) Dynamic Programming
- Bellman, R.¹

8
- 0036874366
- The complexity of decentralized control of Markov decision processes
- Bernstein, D., Givan, R., Immerman, N., & Zilberstein, S. (2002). The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 27(4), 819-840.
- (2002) Mathematics of Operations Research , vol.27 , Issue.4 , pp. 819-840
- Bernstein, D.¹ Givan, R.² Immerman, N.³ Zilberstein, S.⁴

9
- 84880740944
- Bounded policy iteration for decentralized POMDPs
- Bernstein, D. S., Hansen, E. A., & Zilberstein, S. (2005). Bounded policy iteration for decentralized POMDPs. In Proc. of the Nineteenth Int. Joint Conf. on Artificial Intelligence (IJCAI), pp. 1287-1292.
- (2005) Proc. of the Nineteenth Int. Joint Conf. on Artificial Intelligence (IJCAI) , pp. 1287-1292
- Bernstein, D.S.¹ Hansen, E.A.² Zilberstein, S.³

10
- 58849095461
- Exact dynamic programming for decentralized POMDPs with lossless policy compression
- Boularias, A., & Chaib-draa, B. (2008). Exact dynamic programming for decentralized POMDPs with lossless policy compression. In Proc. of the Int. Conf. on Automated Planning and Scheduling (ICAPS'08).
- (2008) Proc. of the Int. Conf. on Automated Planning and Scheduling (ICAPS'08)
- Boularias, A.¹ Chaib-draa, B.²

11
- 0002500351
- Planning, learning and coordination in multiagent decision processes
- Boutilier, C. (1996). Planning, learning and coordination in multiagent decision processes. In Proceedings of the 6th Conference on Theoretical Aspects of Rationality and Knowledge (TARK '96), De Zeeuwse Stromen, Nederlands.
- (1996) Proceedings of the 6th Conference on Theoretical Aspects of Rationality and Knowledge (TARK '96), De Zeeuwse Stromen, Nederlands
- Boutilier, C.¹

12
- 34548099216
- Shaping multi-agent systems with gradient reinforcement learning
- Buffet, O., Dutech, A., & Charpillet, F. (2007). Shaping multi-agent systems with gradient reinforcement learning. Autonomous Agent and Multi-Agent System Journal (AAMASJ), 15(2), 197-220.
- (2007) Autonomous Agent and Multi-Agent System Journal (AAMASJ) , vol.15 , Issue.2 , pp. 197-220
- Buffet, O.¹ Dutech, A.² Charpillet, F.³

13
- 0028564629
- Acting optimally in partially observable stochastic domains
- Cassandra, A., Kaelbling, L., & Littman, M. (1994). Acting optimally in partially observable stochastic domains. In Proc. of the 12th Nat. Conf. on Artificial Intelligence (AAAI).
- (1994) Proc. of the 12th Nat. Conf. on Artificial Intelligence (AAAI)
- Cassandra, A.¹ Kaelbling, L.² Littman, M.³

14
- 0036040313
- A heuristic approach for solving decentralized-POMDP: Assessment on the pursuit problem
- Chadès, I., Scherrer, B., & Charpillet, F. (2002). A heuristic approach for solving decentralized-POMDP: assessment on the pursuit problem. In Proc. of the 2002 ACM Symposium on Applied Computing, pp. 57-62.
- (2002) Proc. of the 2002 ACM Symposium on Applied Computing , pp. 57-62
- Chadès, I.¹ Scherrer, B.² Charpillet, F.³

15
- 34547469909
- Valid inequalities for mixed integer linear programs
- Cornuéjols, G. (2008). Valid inequalities for mixed integer linear programs. Mathematical Programming B, 112, 3-44.
- (2008) Mathematical Programming B , vol.112 , pp. 3-44
- Cornuéjols, G.¹

16
- 0008644215
- On the significance of solving linear programming problems with some integer variables
- Dantzig, G. B. (1960). On the significance of solving linear programming problems with some integer variables. Econometrica, 28(1), 30-44.
- (1960) Econometrica , vol.28 , Issue.1 , pp. 30-44
- Dantzig, G.B.¹

17
- 0006464452
- A probabilistic production and inventory problem
- d'Epenoux, F. (1963). A probabilistic production and inventory problem. Management Science, 10(1), 98-108.
- (1963) Management Science , vol.10 , Issue.1 , pp. 98-108
- D'Epenoux, F.¹

18
- 0004244511
- (2 edition). Springer
- Diwekar, U. (2008). Introduction to Applied Optimization (2 edition). Springer.
- (2008) Introduction to Applied Optimization
- Diwekar, U.¹

19
- 0026838072
- Multilinear programming: Duality theories
- Drenick, R. (1992). Multilinear programming: Duality theories. Journal of Optimization Theory and Applications, 72(3), 459-486.
- (1992) Journal of Optimization Theory and Applications , vol.72 , Issue.3 , pp. 459-486
- Drenick, R.¹

20
- 0003768769
- John Wiley & Sons, New York
- Fletcher, R. (1987). Practical Methods of Optimization. John Wiley & Sons, New York.
- (1987) Practical Methods of Optimization
- Fletcher, R.¹

21
- 77952710739
- Learning to communicate and act in cooperative multiagent systems using hierarchical reinforcement learning
- Ghavamzadeh, M., & Mahadevan, S. (2004). Learning to communicate and act in cooperative multiagent systems using hierarchical reinforcement learning. In Proc. of the 3rd Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems (AAMAS'04).
- (2004) Proc. of the 3rd Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems (AAMAS'04)
- Ghavamzadeh, M.¹ Mahadevan, S.²

22
- 0038009937
- A global Newton method to compute Nash equilibria
- Govindan, S., & Wilson, R. (2001). A global Newton method to compute Nash equilibria. Journal of Economic Theory, 110, 65-86.
- (2001) Journal of Economic Theory , vol.110 , pp. 65-86
- Govindan, S.¹ Wilson, R.²

23
- 9444233318
- Dynamic programming for partially observable stochastic games
- Hansen, E., Bernstein, D., & Zilberstein, S. (2004). Dynamic programming for partially observable stochastic games. In Proc. of the Nineteenth National Conference on Artificial Intelligence (AAAI-04).
- (2004) Proc. of the Nineteenth National Conference on Artificial Intelligence (AAAI-04)
- Hansen, E.¹ Bernstein, D.² Zilberstein, S.³

24
- 0004070444
- (3rd edition). Springer
- Horst, R., & Tuy, H. (2003). Global Optimization: Deterministic Approaches (3rd edition). Springer.
- (2003) Global Optimization: Deterministic Approaches
- Horst, R.¹ Tuy, H.²

25
- 14344251007
- Learning and discovery of predictive state representations in dynamical systems with reset
- James, M., & Singh, S. (2004). Learning and discovery of predictive state representations in dynamical systems with reset. In Proc. of the Twenty-first Int. Conf. of Machine Learning (ICML'04).
- (2004) Proc. of the Twenty-first Int. Conf. of Machine Learning (ICML'04)
- James, M.¹ Singh, S.²

26
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling, L., Littman, M., & Cassandra, A. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101, 99-134.
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.¹ Littman, M.² Cassandra, A.³

27
- 0027964134
- Fast algorithms for finding randomized strategies in game trees
- Koller, D., Megiddo, N., & von Stengel, B. (1994). Fast algorithms for finding randomized strategies in game trees. In Proceedings of the 26th ACM Symposium on Theory of Computing (STOC '94), pp. 750-759.
- (1994) Proceedings of the 26th ACM Symposium on Theory of Computing (STOC '94) , pp. 750-759
- Koller, D.¹ Megiddo, N.² Von Stengel, B.³

28
- 0030535025
- Finding mixed strategies with small supports in extensive form games
- Koller, D., & Megiddo, N. (1996). Finding mixed strategies with small supports in extensive form games. International Journal of Game Theory, 25(1), 73-92.
- (1996) International Journal of Game Theory , vol.25 , Issue.1 , pp. 73-92
- Koller, D.¹ Megiddo, N.²

29
- 0001644591
- Bimatrix equilibrium points and mathematical programming
- Lemke, C. (1965). Bimatrix Equilibrium Points and Mathematical Programming. Management Science, 11(7), 681-689.
- (1965) Management Science , vol.11 , Issue.7 , pp. 681-689
- Lemke, C.¹

30
- 0003488911
- Addison-Wesley Publishing Company, Reading, Massachusetts
- Luenberger, D. (1984). Linear and Nonlinear Programming. Addison-Wesley Publishing Company, Reading, Massachusetts.
- (1984) Linear and Nonlinear Programming
- Luenberger, D.¹

31
- 84864070408
- Online discovery and learning of predictive state representations
- McCracken, P., & Bowling, M. H. (2005). Online discovery and learning of predictive state representations. In Advances in Neural Information Processing Systems 18 (NIPS'05).
- (2005) Advances in Neural Information Processing Systems 18 (NIPS'05)
- McCracken, P.¹ Bowling, M.H.²

32
- 84880823326
- Taming decentralized POMDPs: towards efficient policy computation for multiagent setting
- Nair, R., Tambe, M., Yokoo, M., Pynadath, D., & Marsella, S. (2003). Taming decentralized POMDPs: towards efficient policy computation for multiagent setting. In Proc. of Int. Joint Conference on Artificial Intelligence, IJCAI'03.
- (2003) Proc. of Int. Joint Conference on Artificial Intelligence, IJCAI'03
- Nair, R.¹ Tambe, M.² Yokoo, M.³ Pynadath, D.⁴ Marsella, S.⁵

33
- 52249098423
- Optimal and approximate Q-value functions for decentralized POMDPs
- Oliehoek, F., Spaan, M., & Vlassis, N. (2008). Optimal and approximate Q-value functions for decentralized POMDPs. Journal of Artificial Intelligence Research (JAIR), 32, 289-353.
- (2008) Journal of Artificial Intelligence Research (JAIR) , vol.32 , pp. 289-353
- Oliehoek, F.¹ Spaan, M.² Vlassis, N.³

34
- 84899811776
- Lossless clustering of histories in decentralized POMDPs
- Oliehoek, F., Whiteson, S., & Spaan, M. (2009). Lossless clustering of histories in decentralized POMDPs. In Proc. of The International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 577-584.
- (2009) Proc. of The International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 577-584
- Oliehoek, F.¹ Whiteson, S.² Spaan, M.³

35
- 0003427725
- The MIT Press, Cambridge, Mass
- Osborne, M. J., & Rubinstein, A. (1994). A Course in Game Theory. The MIT Press, Cambridge, Mass.
- (1994) A Course in Game Theory
- Osborne, M.J.¹ Rubinstein, A.²

36
- 0003725604
- Dover Publications
- Papadimitriou, C. H., & Steiglitz, K. (1982). Combinatorial Optimization: Algorithms and Complexity. Dover Publications.
- (1982) Combinatorial Optimization: Algorithms and Complexity
- Papadimitriou, C.H.¹ Steiglitz, K.²

37
- 0000977910
- The Complexity Of Markov Decision Processes
- Papadimitriou, C. H., & Tsitsiklis, J. (1987). The Complexity Of Markov Decision Processes. Mathematics of Operations Research, 12 (3), 441-450.
- (1987) Mathematics of Operations Research , vol.12 , Issue.3 , pp. 441-450
- Papadimitriou, C.H.¹ Tsitsiklis, J.²

38
- 0036267866
- Game theory and decision theory in multi-agent systems
- Parsons, S., & Wooldridge, M. (2002). Game theory and decision theory in multi-agent systems. Autonomous Agents and Multi-Agent Systems (JAAMAS), 5(3), 243-254.
- (2002) Autonomous Agents and Multi-Agent Systems (JAAMAS) , vol.5 , Issue.3 , pp. 243-254
- Parsons, S.¹ Wooldridge, M.²

39
- 36348956362
- Average-reward decentralized Markov decision processes
- Petrik, M., & Zilberstein, S. (2007). Average-reward decentralized Markov decision processes. In Proc. of the Twentieth Int. Joint Conf. on Artificial Intelligence (IJCAI 2007).
- (2007) Proc. of the Twentieth Int. Joint Conf. on Artificial Intelligence (IJCAI 2007)
- Petrik, M.¹ Zilberstein, S.²

40
- 68349086890
- A bilinear programming approach for multiagent planning
- Petrik, M., & Zilberstein, S. (2009). A bilinear programming approach for multiagent planning. Journal of Artificial Intelligence Research (JAIR), 35, 235-274.
- (2009) Journal of Artificial Intelligence Research (JAIR) , vol.35 , pp. 235-274
- Petrik, M.¹ Zilberstein, S.²

41
- 0003998452
- John Wiley & Sons, Inc. New York, NY
- Puterman, M. (1994). Markov Decision Processes: discrete stochastic dynamic programming. John Wiley & Sons, Inc. New York, NY.
- (1994) Markov Decision Processes: Discrete stochastic dynamic programming
- Puterman, M.¹

42
- 1142292938
- The communicative multiagent team decision problem: Analyzing teamwork theories and models
- Pynadath, D., & Tambe, M. (2002). The Communicative Multiagent Team Decision Problem: Analyzing Teamwork Theories And Models. Journal of Artificial Intelligence Research, 16, 389-423. (Pubitemid 43057178)
- (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 389-423
- Pynadath, D.V.¹ Tambe, M.²

43
- 0038246079
- The application of linear programming to team decision problems
- Radner, R. (1959). The application of linear programming to team decision problems. Management Science, 5, 143-150.
- (1959) Management Science , vol.5 , pp. 143-150
- Radner, R.¹

44
- 0003584577
- Prentice Hall
- Russell, S., & Norvig, P. (1995). Artificial Intelligence: A modern approach. Prentice Hall.
- (1995) Artificial Intelligence: A Modern Approach
- Russell, S.¹ Norvig, P.²

45
- 0000685151
- chap. Distributed rational decision making, The MIT Press. Ed. by G. Weiss
- Sandholm, T. (1999). Multiagent systems, chap. Distributed rational decision making, pp. 201-258. The MIT Press. Ed. by G. Weiss.
- (1999) Multiagent Systems , pp. 201-258
- Sandholm, T.¹

46
- 29344453416
- Mixed-integer programming methods for finding Nash equilibria
- Sandholm, T., Gilpin, A., & Conitzer, V. (2005). Mixed-integer programming methods for finding Nash equilibria. In Proc. of the National Conference on Artificial Intelligence (AAAI).
- (2005) Proc. of the National Conference on Artificial Intelligence (AAAI)
- Sandholm, T.¹ Gilpin, A.² Conitzer, V.³

47
- 0036927789
- Cooperative co-learning: A model based approach for solving multi agent reinforcement problems
- Scherrer, B., & Charpillet, F. (2002). Cooperative co-learning: A model based approach for solving multi agent reinforcement problems. In Proc. of the IEEE Int. Conf. on Tools with Artificial Intelligence (ICTAI'02).
- (2002) Proc. of the IEEE Int. Conf. on Tools with Artificial Intelligence (ICTAI'02)
- Scherrer, B.¹ Charpillet, F.²

48
- 84880856384
- Memory-bounded dynamic programming for DEC-POMDPs
- Seuken, S., & Zilberstein, S. (2007). Memory-bounded dynamic programming for DEC-POMDPs. In Proc. of the Twentieth Int. Joint Conf. on Artificial Intelligence (IJCAI'07).
- (2007) Proc. of the Twentieth Int. Joint Conf. on Artificial Intelligence (IJCAI'07)
- Seuken, S.¹ Zilberstein, S.²

49
- 2142812536
- Learning without state estimation in partially observable Markovian decision processes
- Singh, S., Jaakkola, T., & Jordan, M. (1994). Learning without state estimation in partially observable Markovian decision processes. In Proceedings of the Eleventh International Conference on Machine Learning.
- (1994) Proceedings of the Eleventh International Conference on Machine Learning
- Singh, S.¹ Jaakkola, T.² Jordan, M.³

50
- 1942452236
- Learning predictive state representations
- Singh, S., Littman, M., Jong, N., Pardoe, D., & Stone, P. (2003). Learning predictive state representations. In Proc. of the Twentieth Int. Conf. of Machine Learning (ICML'03).
- (2003) Proc. of the Twentieth Int. Conf. of Machine Learning (ICML'03)
- Singh, S.¹ Littman, M.² Jong, N.³ Pardoe, D.⁴ Stone, P.⁵

51
- 33750691009
- Point-based Dynamic Programming for DEC-POMDPs
- Szer, D., & Charpillet, F. (2006). Point-based Dynamic Programming for DEC-POMDPs. In Proc. of the Twenty-First National Conf. on Artificial Intelligence (AAAI 2006).
- (2006) Proc. of the Twenty-First National Conf. on Artificial Intelligence (AAAI 2006)
- Szer, D.¹ Charpillet, F.²

52
- 80053226937
- MAA*: A heuristic search algorithm for solving decentralized POMDPs
- Szer, D., Charpillet, F., & Zilberstein, S. (2005). MAA*: A heuristic search algorithm for solving decentralized POMDPs. In Proc. of the Twenty-First Conf. on Uncertainty in Artificial Intelligence (UAI'05), pp. 576-583.
- (2005) Proc. of the Twenty-First Conf. on Uncertainty in Artificial Intelligence (UAI'05) , pp. 576-583
- Szer, D.¹ Charpillet, F.² Zilberstein, S.³

53
- 4544322319
- Interac-DEC-MDP : Towards the use of interactions in DEC-MDP
- Thomas, V., Bourjot, C., & Chevrier, V. (2004). Interac-DEC-MDP : Towards the use of interactions in DEC-MDP. In Proc. of the Third Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems (AAMAS'04), New York, USA, pp. 1450-1451.
- (2004) Proc. of the Third Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems (AAMAS'04), New York, USA , pp. 1450-1451
- Thomas, V.¹ Bourjot, C.² Chevrier, V.³

54
- 0003782186
- (3rd edition). Springer
- Vanderbei, R. J. (2008). Linear Programming: Foundations and Extensions (3rd edition). Springer.
- (2008) Linear Programming: Foundations and Extensions
- Vanderbei, R.J.¹

55
- 67649370955
- chap. 45-"Computing equilibria for two-person games", North-Holland, Amsterdam
- von Stengel, B. (2002). Handbook of Game Theory, Vol.3, chap. 45-"Computing equilibria for two-person games", pp. 1723-1759. North-Holland, Amsterdam.
- (2002) Handbook of Game Theory , vol.3 , pp. 1723-1759
- Von Stengel, B.¹

56
- 34247270255
- Mixed-integer linear programming for transition-independent decentralized MDPs, New York, NY, USA. ACM
- Wu, J., & Durfee, E. H. (2006). Mixed-integer linear programming for transition-independent decentralized MDPs. In Proc. of the fifth Int. Joint Conf. on Autonomous Agents and Multiagent Systems (AAMAS'06), pp. 1058-1060. New York, NY, USA. ACM.
- (2006) Proc. of the fifth Int. Joint Conf. on Autonomous Agents and Multiagent Systems (AAMAS'06) , pp. 1058-1060
- Wu, J.¹ Durfee, E.H.²

57
- 84962090726
- Communication in multi-agent Markov decision processes
- Boston, MA
- Xuan, P., Lesser, V., & Zilberstein, S. (2000). Communication in multi-agent Markov decision processes. In Proc. of ICMAS Workshop on Game Theoretic and Decision Theoretics Agents, Boston, MA.
- (2000) Proc. of ICMAS Workshop on Game Theoretic and Decision Theoretics Agents
- Xuan, P.¹ Lesser, V.² Zilberstein, S.³

* This information was extracted from Elsevier's SCOPUS database through analysis by KISTI.