SCOPUS 정보 검색 플랫폼

Volumn WS-04-08, Issue , 2004, Pages 25-30

Dynamic programming for partially observable stochastic games

Author keywords

[No Author keywords available]

Indexed keywords

FINITE-HORIZON POSGS; PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES (POMDPS); PARTIALLY OBSERVABLE STOCHASTIC GAMES; PAYOFFS;

ALGORITHMS; DECISION THEORY; ITERATIVE METHODS; MARKOV PROCESSES; SOFTWARE AGENTS; STOCHASTIC CONTROL SYSTEMS;

DYNAMIC PROGRAMMING;

EID: 32144461572 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (60)

References (18)

1
- 1142293055
- Transition-independent decentralized Markov decision processes
- Becker, R.; Zilberstein, S.; Lesser, V.; and Goldman, C. V. 2003. Transition-independent decentralized Markov decision processes. In Proceedings of the 2nd International Conference on Autonomous Agents and Multi-agent Systems, 41-48.
- (2003) Proceedings of the 2nd International Conference on Autonomous Agents and Multi-agent Systems , pp. 41-48
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.V.⁴

2
- 0036874366
- The complexity of decentralized control of Markov decision processes
- Bernstein, D.; Givan, R.; Immerman, N.; and Zilberstein, S. 2002. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research 27 (4):819-840.
- (2002) Mathematics of Operations Research , vol.27 , Issue.4 , pp. 819-840
- Bernstein, D.¹ Givan, R.² Immerman, N.³ Zilberstein, S.⁴

3
- 84880690163
- Sequential optimality and coordination in multiagent systems
- Boutilier, C. 1999. Sequential optimality and coordination in multiagent systems. In Proceedings of the 16th International Joint Conference on Artificial Intelligence, 478-485.
- (1999) Proceedings of the 16th International Joint Conference on Artificial Intelligence , pp. 478-485
- Boutilier, C.¹

4
- 0041965975
- R-MAX-a general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R., and Tennenholtz, M. 2002. R-MAX-a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research 3:213-231.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.¹ Tennenholtz, M.²

5
- 0003989209
- Springer-Verlag.
- Filar, J., and Vrieze, K. 1997. Competitive Markov Decision Processes. Springer-Verlag.
- (1997) Competitive Markov Decision Processes
- Filar, J.¹ Vrieze, K.²

6
- 0020113091
- Decentralized control of finite state Markov processes
- Hsu, K., and Marcus, S. I. 1982. Decentralized control of finite state Markov processes. IEEE Transactions on Automatic Control AC-27(2):426-431.
- (1982) IEEE Transactions on Automatic Control , vol.AC-27 , Issue.2 , pp. 426-431
- Hsu, K.¹ Marcus, S.I.²

7
- 4644369748
- Nash Q-learaing for general-sum stochastic games
- Hu, J., and Wellman, M. 2003. Nash Q-learaing for general-sum stochastic games. Journal of Machine Learning Research 4:1039-1069.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
- Hu, J.¹ Wellman, M.²

8
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling, L.; Littman, M.; and Cassandra, A. 1998. Planning and acting in partially observable stochastic domains. Artificial Intelligence 101:99-134.
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.¹ Littman, M.² Cassandra, A.³

9
- 9444295723
- Fast planning in stochastic games
- Kearns, M.; Mansour, Y.; and Singh, S. 2000. Fast planning in stochastic games. In Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence (UAI-00), 309-316.
- (2000) Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence (UAI-00) , pp. 309-316
- Kearns, M.¹ Mansour, Y.² Singh, S.³

10
- 0027964134
- Fast algorithms for finding randomized strategies in game trees
- Koller, D.; Megiddo, N.; and von Stengel, B. 1994. Fast algorithms for finding randomized strategies in game trees. In Proceedings of the 26th ACM Symposium on Theory of Computing, 750-759.
- (1994) Proceedings of the 26th ACM Symposium on Theory of Computing , pp. 750-759
- Koller, D.¹ Megiddo, N.² Von Stengel, B.³

12
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Littman, M. 1994. Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the 11th International Conference on Machine Learning, 157-163.
- (1994) Proceedings of the 11th International Conference on Machine Learning , pp. 157-163
- Littman, M.¹

13
- 84880823326
- Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings
- Nair, R.; Pynadath, D.; Yokoo, M.; Tambe, M.; and Marsella, S. 2003. Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings. In Proceedings of the 18th International Joint Conference on Artificial Intelligence, 705-711.
- (2003) Proceedings of the 18th International Joint Conference on Artificial Intelligence , pp. 705-711
- Nair, R.¹ Pynadath, D.² Yokoo, M.³ Tambe, M.⁴ Marsella, S.⁵

14
- 0030396683
- Decentralized control of a multiple access broadcast channel: Performance bounds
- Ooi, J. M., and Wornell, G. W. 1996. Decentralized control of a multiple access broadcast channel: Performance bounds. In Proceedings of the 35th Conference on Decision and Control, 293-298.
- (1996) Proceedings of the 35th Conference on Decision and Control , pp. 293-298
- Ooi, J.M.¹ Wornell, G.W.²

15
- 0012646255
- Learning to cooperate via policy search
- Peshkin, L.; Kim, K.-E.; Meuleau, N.; and Kaelbling, L. P. 2000. Learning to cooperate via policy search. In Proceedings of the 16th International Conference on Uncertainty in Artificial Intelligence, 489-496.
- (2000) Proceedings of the 16th International Conference on Uncertainty in Artificial Intelligence , pp. 489-496
- Peshkin, L.¹ Kim, K.-E.² Meuleau, N.³ Kaelbling, L.P.⁴

17
- 0000392613
- Stochastic games
- Shapley, L. 1953. Stochastic games. Proceedings of the National Academy of Sciences of the United States of America 39:1095-1100.
- (1953) Proceedings of the National Academy of Sciences of the United States of America , vol.39 , pp. 1095-1100
- Shapley, L.¹

18
- 0015658957
- The optimal control of partially observable Markov processes over a finite horizon
- Smallwood, R., and Sondik, E. 1973. The optimal control of partially observable Markov processes over a finite horizon. Operations Research 21:1071-1088.
- (1973) Operations Research , vol.21 , pp. 1071-1088
- Smallwood, R.¹ Sondik, E.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.