SCOPUS 정보 검색 플랫폼

Autonomous Agents and Multi-Agent Systems

Volumn 21, Issue 3, 2010, Pages 293-320

Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs

(3) Amato, Christopher a Bernstein, Daniel S a Zilberstein, Shlomo a

a Biologically Inspired Neural and Dynamical Systems Laboratory (United States)

Author keywords

DEC POMDPs; Decision theory; Multiagent systems; Planning under uncertainty; POMDPs

Indexed keywords

EID: 77954951649 PISSN: 13872532 EISSN: 15737454 Source Type: Journal
DOI: 10.1007/s10458-009-9103-z Document Type: Article

Times cited : (131)

References (35)

1
- 84880889008
- Amato, C., Bernstein, D. S., & Zilberstein, S. (2007). Solving POMDPs using quadratically constrained linear programs. In: Proceedings of the twentieth international joint conference on artificial intelligence. (pp. 2418-2424.) Hyderabad, India.

2
- 27344432831
- Solving transition-independent decentralized Markov decision processes
- Becker R., Zilberstein S., Lesser V., Goldman C. V. (2004) Solving transition-independent decentralized Markov decision processes. Journal of AI Research 22: 423-455.
- (2004) Journal of AI Research , vol.22 , pp. 423-455
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.V.⁴

3
- 84880740944
- Bernstein, D. S., Hansen, E., & Zilberstein, S. (2005). Bounded policy iteration for decentralized POMDPs. In: Proceedings of the nineteenth international joint conference on artificial intelligence. (pp. 1287-1292). Edinburgh, Scotland.

4
- 65349083220
- Policy iteration for decentralized control of Markov decision processes
- Bernstein D. S., Amato C., Hansen E. A., Zilberstein S. (2009) Policy iteration for decentralized control of Markov decision processes. Journal of AI Research 34: 89-132.
- (2009) Journal of AI Research , vol.34 , pp. 89-132
- Bernstein, D.S.¹ Amato, C.² Hansen, E.A.³ Zilberstein, S.⁴

5
- 0003713964
- Athena Scientific: Belmont, MA
- Bertsekas D. P. (2004) Nonlinear programming. Belmont, MA, Athena Scientific.
- (2004) Nonlinear Programming
- Bertsekas, D.P.¹

6
- 77954953364
- Cassandra, A. R. (1998a). A survey of POMDP applications. In: AAAI fall symposium: Planning with POMDPs. Orlando, FL.

7
- 77954955348
- Cassandra, A. R. (1998b). Exact and approximate algorithms for partially observable Markov decision processes. PhD thesis. Brown University Providence, RI.

8
- 0011718924
- Optimum maintenance with incomplete information
- Eckles J. E. (1968) Optimum maintenance with incomplete information. Operations Research 16: 1058-1067.
- (1968) Operations Research , vol.16 , pp. 1058-1067
- Eckles, J.E.¹

9
- 4544325183
- Emery-Montemerlo, R., Gordon, G., Schneider, J., & Thrun, S. (2004). Approximate solutions for partially observable stochastic games with common payoffs. In: Proceedings of the third international joint conference on autonomous agents and multiagent systems (pp. 136-143). New York, NY.

10
- 17444372724
- Snopt: An SQP algorithm for large-scale constrained optimization
- Gill P. E., Murray W., Saunders M. (2005) Snopt: An SQP algorithm for large-scale constrained optimization. SIAM Review 47: 99-131.
- (2005) SIAM Review , vol.47 , pp. 99-131
- Gill, P.E.¹ Murray, W.² Saunders, M.³

11
- 77954958062
- Hansen, E. A. (1998). Solving POMDPs by searching in policy space. In: Proceedings of the fourteenth conference on uncertainty in artificial intelligence. (pp. 211-219). Madison, WI.

12
- 9444233318
- Hansen, E. A., Bernstein, D. S., & Zilberstein, S. (2004). Dynamic programming for partially observable stochastic games. In: Proceedings of the nineteenth national conference on artificial intelligence. (pp. 709-715). San Jose, CA.

13
- 0032246990
- Hauskrecht, M., & Fraser, H. (1998). Modeling treatment of ischemic heart disease with partially observable Markov decision processes. In: Proceedings of American medical informatics association annual symposium on computer applications in health care. (pp. 538-542). Orlando, Florida.

14
- 0004070444
- New York: Springer
- Horst R., Tuy H. (1996) Global optimization: Deterministic approaches. Springer, New York.
- (1996) Global Optimization: Deterministic Approaches
- Horst, R.¹ Tuy, H.²

15
- 36348942884
- Ji, S., Parr, R., Li, H., Liao, X., & Carin, L. (2007). Point-based policy iteration. In: Proceedings of the twenty-second national conference on artificial intelligence. (pp. 1243-1249). Vancouver, Canada.

16
- 33646427325
- Providence, RI: Brown University, Department of Computer Science
- Littman M. L., Cassandra A. R., Kaelbling L. P. (1995) Learning policies for partially observable environments: Scaling up. Technical report CS-95-11. Brown University, Department of Computer Science, Providence, RI.
- (1995) Learning Policies for Partially Observable Environments: Scaling up. Technical Report CS-95-11
- Littman, M.L.¹ Cassandra, A.R.² Kaelbling, L.P.³

17
- 84899969517
- Marecki, J., Gupta, T., Varakantham, P., Tambe, M., & Yokoo, M. (2008). Not all agents are equal: Scaling up distributed POMDPs for agent networks. In: Proceedings of the seventh international joint conference on autonomous agents and multiagent systems. (pp. 485-492). Estoril, Portugal.

18
- 77954951286
- Meuleau, N., Kim, K. E., Kaelbling, L. P., & Cassandra, A. R. (1999). Solving POMDPs by searching the space of finite policies. In: Proceedings of the fifteenth conference on uncertainty in artificial intelligence. (pp. 417-426). Stockholm, Sweden.

19
- 84880823326
- Nair, R., Pynadath, D., Yokoo, M., Tambe, M., & Marsella, S. (2003). Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings. In: Proceedings of the nineteenth international joint conference on artificial intelligence. (pp. 705-711). Acapulco, Mexico.

20
- 36348956362
- Petrik, M., & Zilberstein, S. (2007). Average-reward decentralized Markov decision processes. In Proceedings of the twentieth international joint conference on artificial intelligence (pp. 1997-2002). Hyderabad, India.

21
- 84880772945
- Pineau, J., Gordon, G., & Thrun, S. (2003). Point-based value iteration: An anytime algorithm for POMDPs. In: Proceedings of the eighteenth international joint conference on artificial intelligence. (pp. 1025-1032). Acapulco, Mexico.

22
- 77954951770
- Poupart, P. (2005). Exploiting structure to efficiently solve large scale partially observable Markov decision processes. PhD thesis. University of Toronto.

23
- 84898959164
- Bounded finite state controllers
- S. Thrun, L. Saul, B. Schölkopf (Eds.), Cambridge, MA: MIT Press
- Poupart P., Boutilier C. (2004) Bounded finite state controllers. In: Thrun S., Saul L., Schölkopf B. (eds) Advances in neural information processing systems 16. MIT Press, Cambridge, MA.
- (2004) Advances in Neural Information Processing Systems 16
- Poupart, P.¹ Boutilier, C.²

24
- 84880856384
- Seuken, S., & Zilberstein, S. (2007a). Memory-bounded dynamic programming for DEC-POMDPs. In: Proceedings of the twentieth international joint conference on artificial intelligence. (pp. 2009-2015). Hyderabad, India.

25
- 51649085567
- Seuken, S., & Zilberstein, S. (2007b). Improved memory-bounded dynamic programming for decentralized POMDPs. In: Proceedings of the twenty-third conference on uncertainty in artificial intelligence. Vancouver, Canada.

26
- 77954959889
- Simmons, R., & Koenig, S. (1995). Probabilistic navigation in partially observable environments. In: Proceedings of the fourteenth international joint conference on artificial intelligence. (pp. 1080-1087). Montral, Canada.

27
- 77954953147
- Singh, S., Jaakkola, T., & Jordan, M. (1994). Learning without state-estimation in partially observable Markovian decision processes. In: Proceedings of the eleventh international conference on machine learning. (pp. 284-292). New Brunswick, NJ.

28
- 77954957472
- Smith, T., & Simmons, R. (2004). Heuristic search value iteration for POMDPs. In: Proceedings of the twentieth conference on uncertainty in artificial intelligence. (pp. 520-527). Banff, Canada.

29
- 85131708448
- Smith, T., & Simmons, R. (2005). Point-based POMDP algorithms: Improved analysis and implementation. In: Proceedings of the twenty-first conference on uncertainty in artificial intelligence. Edinburgh, Scotland.

30
- 77954956779
- Sondik, E. J. (1971). The optimal control of partially observable Markov processes. PhD thesis. Stanford University.

31
- 31144472319
- Perseus: Randomized point-based value iteration for POMDPs
- Spaan M. T. J., Vlassis N. (2005) Perseus: Randomized point-based value iteration for POMDPs. Journal of AI Research 24: 195-220.
- (2005) Journal of AI Research , vol.24 , pp. 195-220
- Spaan, M.T.J.¹ Vlassis, N.²

32
- 0004102479
- Cambridge, MA: MIT Press
- Sutton R. S., Barto A. G. (1995) Reinforcement learning: An introduction. MIT Press, Cambridge, MA.
- (1995) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

33
- 33646423007
- Szer, D., & Charpillet, F. (2005). An optimal best-first search algorithm for solving infinite horizon DEC-POMDPs. In: Proceedings of the sixteenth European conference on machine learning. (pp. 389-399). Porto, Portugal.

34
- 85131709787
- Szer, D., Charpillet, F., & Zilberstein, S. (2005). MAA*: A heuristic search algorithm for solving decentralized POMDPs. In: Proceedings of the twenty-first conference on uncertainty in artificial intelligence. (pp. 576-583). Edinburgh, Scotland.

35
- 33646171746
- Wah, B. W., & Chen, Y. (2005). Solving large-scale nonlinear programming problems by constraint partitioning. In: Proceedings of the eleventh international conference on principles and practice of constraint programming. (pp. 697-711).

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.