SCOPUS 정보 검색 플랫폼

Journal of Artificial Intelligence Research

Volumn 34, Issue , 2009, Pages 89-132

Policy iteration for decentralized control of markov decision processes

(4) Bernstein, Daniel S a Amato, Christopher a Hansen, Eric A b Zilberstein, Shlomo a

a Biologically Inspired Neural and Dynamical Systems Laboratory (United States)

b MISSISSIPPI STATE UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BEHAVIORAL RESEARCH; CONTROLLERS; DYNAMIC PROGRAMMING; ITERATIVE METHODS; LEARNING ALGORITHMS; MARKOV PROCESSES; MULTIPURPOSE ROBOTS; PROBABILITY DISTRIBUTIONS; SOFTWARE AGENTS; STOCHASTIC SYSTEMS;

DISTRIBUTED AGENTS; FINITE-STATE CONTROLLERS; MARKOV DECISION PROCESSES; MULTI-ROBOT SYSTEMS; OPTIMAL ALGORITHM; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; POLICY ITERATION; POLICY ITERATION ALGORITHMS;

PROCESS CONTROL;

EID: 65349083220 PISSN: None EISSN: 10769757 Source Type: Journal
DOI: 10.1613/jair.2667 Document Type: Article

Times cited : (87)

References (41)

1
- 80053179816
- Optimizing memory-bounded controllers for decentralized POMDPs
- Amato, C, Bernstein, D. S., & Zilberstein, S. (2007). Optimizing memory-bounded controllers for decentralized POMDPs. In Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence.
- (2007) Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence
- Amato, C.¹ Bernstein, D.S.² Zilberstein, S.³

2
- 50549213583
- Optimal control of Markov decision processes with incomplete state estimation
- Astrom, K. J. (1965). Optimal control of Markov decision processes with incomplete state estimation. Journal of Mathematical Analysis and Applications, 10, 174-205.
- (1965) Journal of Mathematical Analysis and Applications , vol.10 , pp. 174-205
- Astrom, K.J.¹

3
- 0002430114
- Subjectivity and correlation in randomized strategies
- Aumann, R. J. (1974). Subjectivity and correlation in randomized strategies. Journal of Mathematical Economics, 1, 67-96.
- (1974) Journal of Mathematical Economics , vol.1 , pp. 67-96
- Aumann, R.J.¹

4
- 0036874366
- The complexity of decentralized control of Markov decision processes
- Bernstein, D. S., Givan, R., Immerman, N., & Zilberstein, S. (2002). The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 27(A), 819-840.
- (2002) Mathematics of Operations Research , vol.27 , Issue.A , pp. 819-840
- Bernstein, D.S.¹ Givan, R.² Immerman, N.³ Zilberstein, S.⁴

5
- 84880740944
- Bounded policy iteration for decentralized POMDPs
- Bernstein, D. S., Hansen, E. A., & Zilberstein, S. (2005). Bounded policy iteration for decentralized POMDPs. In Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, pp. 1287-1292.
- (2005) Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence , pp. 1287-1292
- Bernstein, D.S.¹ Hansen, E.A.² Zilberstein, S.³

6
- 85166261608
- Planning with incomplete information as heuristic search in belief space
- Bonet, B., & Geffner, H. (2000). Planning with incomplete information as heuristic search in belief space. In Proceedings of the Fifth International Conference on AI Planning and Scheduling, pp. 52-61.
- (2000) Proceedings of the Fifth International Conference on AI Planning and Scheduling , pp. 52-61
- Bonet, B.¹ Geffner, H.²

7
- 0001909869
- Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes
- Cassandra, A., Littman, M. L., & Zhang, N. L. (1997). Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes. In Proceedings of the Thirteenth Annual Conference on Uncertainty in Artificial Intelligence, pp. 54-61.
- (1997) Proceedings of the Thirteenth Annual Conference on Uncertainty in Artificial Intelligence , pp. 54-61
- Cassandra, A.¹ Littman, M.L.² Zhang, N.L.³

8
- 0003818801
- Ph.D. thesis, University of British Columbia
- Cheng, H.-T. (1988). Algorithms for Partially Observable Markov Decision Processes. Ph.D. thesis, University of British Columbia.
- (1988) Algorithms for Partially Observable Markov Decision Processes
- Cheng, H.-T.¹

9
- 4544325183
- Approximate solutions for partially observable stochastic games with common payoffs
- Emery-Montemerlo, R., Gordon, G., Schnieder, J., & Thrun, S. (2004). Approximate solutions for partially observable stochastic games with common payoffs. In Proceedings of the Third International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 136-143.
- (2004) Proceedings of the Third International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 136-143
- Emery-Montemerlo, R.¹ Gordon, G.² Schnieder, J.³ Thrun, S.⁴

10
- 29344440790
- Region-based incremental pruning for POMDPs
- Feng, Z., & Zilberstein, S. (2004). Region-based incremental pruning for POMDPs. In Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence, pp. 146-153.
- (2004) Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence , pp. 146-153
- Feng, Z.¹ Zilberstein, S.²

11
- 29344456775
- Efficient maximization in solving POMDPs
- Feng, Z., & Zilberstein, S. (2005). Efficient maximization in solving POMDPs. In Proceedings of the Twentieth National Conference on Artificial Intelligence, pp. 975-980.
- (2005) Proceedings of the Twentieth National Conference on Artificial Intelligence , pp. 975-980
- Feng, Z.¹ Zilberstein, S.²

12
- 0003125478
- Solving POMDPs by searching in policy space
- Hansen, E. (1998a). Solving POMDPs by searching in policy space. In Proceedings of the Fourteenth Annual Conference on Uncertainty in Artificial Intelligence, pp. 211-219.
- (1998) Proceedings of the Fourteenth Annual Conference on Uncertainty in Artificial Intelligence , pp. 211-219
- Hansen, E.¹

13
- 0003659747
- Ph.D. thesis, University of Massachusetts Amherst, Amherst, Massachusetts
- Hansen, E. A. (1998b). Finite-Memory Control of Partially Observable Systems. Ph.D. thesis, University of Massachusetts Amherst, Amherst, Massachusetts.
- (1998) Finite-Memory Control of Partially Observable Systems
- Hansen, E.A.¹

14
- 9444233318
- Dynamic programming for partially observable stochastic games
- Hansen, E. A., Bernstein, D. S., & Zilberstein, S. (2004). Dynamic programming for partially observable stochastic games. In Proceedings of the Nineteenth National Conference on Artificial Intelligence, pp. 709-715.
- (2004) Proceedings of the Nineteenth National Conference on Artificial Intelligence , pp. 709-715
- Hansen, E.A.¹ Bernstein, D.S.² Zilberstein, S.³

15
- 36348942884
- Point-based policy iteration
- Ji, S., Parr, R., Li, H., Liao, X., & Carin, L. (2007). Point-based policy iteration. In Proceedings of the Twenty-Second National Conference on Artificial Intelligence, pp. 1243-1249.
- (2007) Proceedings of the Twenty-Second National Conference on Artificial Intelligence , pp. 1243-1249
- Ji, S.¹ Parr, R.² Li, H.³ Liao, X.⁴ Carin, L.⁵

16
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling, L. P., Littman, M. L., & Cassandra, A. R. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2), 99-134.
- (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

17
- 0141503453
- Multi-agent influence diagrams for representing and solving games
- Koller, D., & Milch, B. (2003). Multi-agent influence diagrams for representing and solving games. Games and Economic Behavior, 45(l), 181-221.
- (2003) Games and Economic Behavior , vol.45 , Issue.L , pp. 181-221
- Koller, D.¹ Milch, B.²

18
- 0242719540
- Game networks
- La Mura, P. (2000). Game networks. In Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence, pp. 335-342.
- (2000) Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence , pp. 335-342
- La Mura, P.¹

19
- 65349189307
- Ph.D. thesis, University of Virginia
- Lark III, J. W. (1990). Applications of Best-First Heuristic Search to Finite-Horizon Partially Observed Markov Decision Processes. Ph.D. thesis, University of Virginia.
- (1990) Applications of Best-First Heuristic Search to Finite-Horizon Partially Observed Markov Decision Processes
- Lark III, J.W.¹

20
- 85138579181
- Learning policies for partially observable environments: Scaling up
- Littman, M. L., Cassandra, A. R., & Kaelbling, L. P. (1995). Learning policies for partially observable environments: Scaling up. In Proceedings of the Twelfth International Conference on Machine Learning, pp. 362-370.
- (1995) Proceedings of the Twelfth International Conference on Machine Learning , pp. 362-370
- Littman, M.L.¹ Cassandra, A.R.² Kaelbling, L.P.³

21
- 84880823326
- Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings
- Nair, R., Pynadath, D., Yokoo, M., Tambe, M., & Marsella, S. (2003). Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings. In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, pp. 705- 711.
- (2003) Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence , pp. 705-711
- Nair, R.¹ Pynadath, D.² Yokoo, M.³ Tambe, M.⁴ Marsella, S.⁵

22
- 29344437834
- Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs
- Nair, R., Varakantham, P., Tambe, M., & Yokoo, M. (2005). Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In Proceedings of the Twentieth National Conference on Artificial Intelligence, pp. 133-139.
- (2005) Proceedings of the Twentieth National Conference on Artificial Intelligence , pp. 133-139
- Nair, R.¹ Varakantham, P.² Tambe, M.³ Yokoo, M.⁴

23
- 0037903748
- Ph.D. thesis, University of Birmingham, Birmingham, England
- Parker, D. A. (2002). Implementation of Symbolic Model Checking for Probabilistic Systems. Ph.D. thesis, University of Birmingham, Birmingham, England.
- (2002) Implementation of Symbolic Model Checking for Probabilistic Systems
- Parker, D.A.¹

24
- 0012646255
- Learning to cooperate via policy search
- Peshkin, L., Kim, K.-E., Meuleau, N., & Kaelbling, L. P. (2000). Learning to cooperate via policy search. In Proceedings of the Sixteenth International Conference on Uncertainty in Artificial Intelligence, pp. 489-496.
- (2000) Proceedings of the Sixteenth International Conference on Uncertainty in Artificial Intelligence , pp. 489-496
- Peshkin, L.¹ Kim, K.-E.² Meuleau, N.³ Kaelbling, L.P.⁴

25
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- Pineau, J., Gordon, G., & Thrun, S. (2003). Point-based value iteration: An anytime algorithm for POMDPs. In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, pp. 1025-1031.
- (2003) Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence , pp. 1025-1031
- Pineau, J.¹ Gordon, G.² Thrun, S.³

26
- 0042658750
- A feasible computational approach to infinite-horizon partially- observed Markov decision processes
- Tech. rep, Georgia Institute of Technology. Reprinted in Working Notes of the 1998 AAA I Fall Symposium on Planning Using Partially Observable Markov Decision Processes
- Platzman, L. K. (1980). A feasible computational approach to infinite-horizon partially- observed Markov decision processes. Tech. rep., Georgia Institute of Technology. Reprinted in Working Notes of the 1998 AAA I Fall Symposium on Planning Using Partially Observable Markov Decision Processes.
- (1980)
- Platzman, L.K.¹

27
- 34247196722
- Bounded finite state controllers
- Poupart, P., & Boutilier, C. (2003). Bounded finite state controllers. In Proceedings of Advances in Neural Information Processing Systems 16.
- (2003) Proceedings of Advances in Neural Information Processing Systems , vol.16
- Poupart, P.¹ Boutilier, C.²

28
- 51649085567
- Improved memory-bounded dynamic programming for decentralized POMDPs
- Seuken, S., & Zilberstein, S. (2007). Improved memory-bounded dynamic programming for decentralized POMDPs. In Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence.
- (2007) Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence
- Seuken, S.¹ Zilberstein, S.²

29
- 0005610003
- Probabilistic navigation in partially observable environments
- Simmons, R., & Koenig, S. (1995). Probabilistic navigation in partially observable environments. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1080-1087.
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence , pp. 1080-1087
- Simmons, R.¹ Koenig, S.²

30
- 0003824303
- Ph.D. thesis, University of Massachusetts, Amherst, Massachusetts
- Singh, S. (1994). Learning to Solve Markovian Decision Processes. Ph.D. thesis, University of Massachusetts, Amherst, Massachusetts.
- (1994) Learning to Solve Markovian Decision Processes
- Singh, S.¹

31
- 2142812536
- Learning without state-estimation in partially observable markovian decision processes
- Singh, S. P., Jaakkola, T., & Jordan, M. I. (1994). Learning without state-estimation in partially observable markovian decision processes. In Proceedings of the Eleventh International Conference on Machine Learning, pp. 284-292.
- (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 284-292
- Singh, S.P.¹ Jaakkola, T.² Jordan, M.I.³

32
- 0015658957
- The optimal control of partially observable Markov processes over a finite horizon
- Small wood, R. D., & Sondik, E. J. (1973). The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21(h), 1071-1088.
- (1973) Operations Research, 21(h) , pp. 1071-1088
- Small wood, R.D.¹ Sondik, E.J.²

33
- 80053262864
- Point-based POMDP algorithms: Improved analysis and implementation
- Smith, T., & Simmons, R. (2005). Point-based POMDP algorithms: Improved analysis and implementation. In Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence, pp. 542-547.
- (2005) Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence , pp. 542-547
- Smith, T.¹ Simmons, R.²

34
- 58349115194
- Generating exponentially smaller POMDP models using conditionally irrelevant variable abstraction
- Smith, T., Thompson, D. R., & Wettergreen, D. S. (2007). Generating exponentially smaller POMDP models using conditionally irrelevant variable abstraction. In Proceedings of the Seventeenth International Conference on Applied Planning and Scheduling.
- (2007) Proceedings of the Seventeenth International Conference on Applied Planning and Scheduling
- Smith, T.¹ Thompson, D.R.² Wettergreen, D.S.³

35
- 0017943242
- The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs
- Sondik, E. J. (1978). The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs. Operations Research, 26, 282-304.
- (1978) Operations Research , vol.26 , pp. 282-304
- Sondik, E.J.¹

36
- 33646423007
- An optimal best-first search algorithm for solving infinite horizon DEC-POMDPs
- Szer, D., & Charpillet, F. (2005). An optimal best-first search algorithm for solving infinite horizon DEC-POMDPs. In Proceedings of the Sixteenth European Conference on Machine Learning, pp. 389-399.
- (2005) Proceedings of the Sixteenth European Conference on Machine Learning , pp. 389-399
- Szer, D.¹ Charpillet, F.²

37
- 33750691009
- Point-based dynamic programming for DEC-POMDPs
- Szer, D., & Charpillet, F. (2006). Point-based dynamic programming for DEC-POMDPs. In Proceedings of the Twenty-First National Conference on Artificial Intelligence, pp. 1233-1238.
- (2006) Proceedings of the Twenty-First National Conference on Artificial Intelligence , pp. 1233-1238
- Szer, D.¹ Charpillet, F.²

38
- 80053226937
- MAA*: A heuristic search algorithm for solving decentralized POMDPs
- Szer, D., Charpillet, F., & Zilberstein, S. (2005). MAA*: A heuristic search algorithm for solving decentralized POMDPs. In Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence, pp. 576-590.
- (2005) Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence , pp. 576-590
- Szer, D.¹ Charpillet, F.² Zilberstein, S.³

39
- 0015158656
- Separation of estimation and control for discrete time systems
- Witsenhausen, H. S. (1971). Separation of estimation and control for discrete time systems. Proceedings of the IEEE, 55(11), 1557-1566.
- (1971) Proceedings of the IEEE , vol.55 , Issue.11 , pp. 1557-1566
- Witsenhausen, H.S.¹

40
- 0346087531
- Planning with partially observable Markov decision processes: Advances in exact solution methods
- Zhang, N. L., & Lee, S. S. (1998). Planning with partially observable Markov decision processes: Advances in exact solution methods. In Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, pp. 523-530.
- (1998) Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence , pp. 523-530
- Zhang, N.L.¹ Lee, S.S.²

41
- 0036374229
- Speeding up the convergence of value iteration in partially observable Markov decision processes
- Zhang, N. L., & Zhang, W. (2001). Speeding up the convergence of value iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research, 14, 29-51.
- (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 29-51
- Zhang, N.L.¹ Zhang, W.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.