SCOPUS 정보 검색 플랫폼

Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence, UAI 2010

Volumn , Issue , 2010, Pages 666-673

Rollout sampling policy iteration for decentralized POMDPs

(3) Wu, Feng a Zilberstein, Shlomo b Chen, Xiaoping a

a UNIVERSITY OF SCIENCE AND TECHNOLOGY OF CHINA (China)

b University of Massachusetts Amherst (United States)

Author keywords

[No Author keywords available]

Indexed keywords

MONTE CARLO METHODS; MULTI AGENT SYSTEMS; SCALABILITY; SOFTWARE AGENTS;

BOUNDED MEMORY; DECISION PROBLEMS; EXPLICIT MODELING; LINEAR TIME COMPLEXITY; MULTI AGENT; PLANNING ALGORITHMS; POLICY ITERATION; SOLUTION QUALITY;

ITERATIVE METHODS;

EID: 80053153738 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (22)

References (20)

1
- 77958561050
- Incremental policy generation for finite-horizon DECPOMDPs
- Chistopher Amato, Jilles S. Dibangoye, and Shlomo Zilberstein. Incremental policy generation for finite-horizon DECPOMDPs. In Proc. of the 19th Int'l Conf. on Automated Planning and Scheduling, pages 2-9, 2009.
- (2009) Proc. of the 19th Int'l Conf. on Automated Planning and Scheduling , pp. 2-9
- Amato, C.¹ Dibangoye, J.S.² Zilberstein, S.³

2
- 0141965747
- The complexity of decentralized control of Markov decision processes
- Daniel S. Bernstein, Shlomo Zilberstein, and Neil Immerman. The complexity of decentralized control of Markov decision processes. In Proc. of the 16th Conf. on Uncertainty in Artificial Intelligence, pages 32-37, 2000.
- (2000) Proc. of the 16th Conf. on Uncertainty in Artificial Intelligence , pp. 32-37
- Bernstein, D.S.¹ Zilberstein, S.² Immerman, N.³

3
- 84880740944
- Bounded policy iteration for decentralized POMDPs
- Daniel S. Bernstein, Eric A. Hansen, and Shlomo Zilberstein. Bounded policy iteration for decentralized POMDPs. In Proc. of the 19th Int'l Joint Conf. on Artificial Intelligence, pages 1287-1292, 2005.
- (2005) Proc. of the 19th Int'l Joint Conf. on Artificial Intelligence , pp. 1287-1292
- Bernstein, D.S.¹ Hansen, E.A.² Zilberstein, S.³

4
- 0031272681
- Rollout algorithms for combinatorial optimization
- Dimitri P. Bertsekas, John N. Tsitsiklis, and Cynara Wu. Rollout algorithms for combinatorial optimization. Journal of Heuristics, 3(3):245-262, 1997. (Pubitemid 127509041)
- (1997) Journal of Heuristics , vol.3 , Issue.3 , pp. 245-262
- Bertsekas, D.P.¹ Tsitsiklis, J.N.² Wu, C.³

5
- 55849151629
- Parallel rollout for online solution of Dec-POMDPs
- Camille Besse and Brahim Chaib-draa. Parallel rollout for online solution of Dec-POMDPs. In Proc. of the 21st Int'l FLAIRS Conf., pages 619-624, 2008.
- (2008) Proc. of the 21st Int'l FLAIRS Conf. , pp. 619-624
- Besse, C.¹ Chaib-Draa, B.²

6
- 40949147745
- A comprehensive survey of multiagent reinforcement learning
- DOI 10.1109/TSMCC.2007.913919
- Lucian Busoniu, Robert Babuska, and Bart D. Schutter. A comprehensive survey of multiagent reinforcement learning. IEEE Trans. on SMC, Part C, 38(2):156-172, 2008. (Pubitemid 351404112)
- (2008) IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews , vol.38 , Issue.2 , pp. 156-172
- Busoniu, L.¹ Babuska, R.² De Schutter, B.³

7
- 3543128853
- Parallel rollout for online solution of partially observable Markov decision processes
- Hyeong Soo Chang, Robert Givan, and Edwin K. P. Chong. Parallel rollout for online solution of partially observable Markov decision processes. Discrete Event Dynamic Systems, 14(3):309-341, 2004.
- (2004) Discrete Event Dynamic Systems , vol.14 , Issue.3 , pp. 309-341
- Chang, H.S.¹ Givan, R.² Chong, E.K.P.³

8
- 84899853392
- Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs
- Jilles S. Dibangoye, Abdel-Illah Mouaddib, and Brahim Chaib-draa. Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs. In Proc. of the 8th Int'l Joint Conf. on Autonomous Agents and Multi-Agent Systems, pages 569-576, 2009.
- (2009) Proc. of the 8th Int'l Joint Conf. on Autonomous Agents and Multi-Agent Systems , pp. 569-576
- Dibangoye, J.S.¹ Mouaddib, A.-I.² Chaib-Draa, B.³

9
- 48349140736
- Rollout sampling approximate policy iteration
- Christos Dimitrakakis and Michail G. Lagoudakis. Rollout sampling approximate policy iteration. Machine Learning, 72(3):157-171, 2008.
- (2008) Machine Learning , vol.72 , Issue.3 , pp. 157-171
- Dimitrakakis, C.¹ Lagoudakis, M.G.²

10
- 22944468731
- Approximate policy iteration with a policy language bias
- Alan Fern, Sung Wook Yoon, and Robert Givan. Approximate policy iteration with a policy language bias. In Proc. of the 17th Conf. on Neural Info. Processing Systems, pages 847-854, 2003.
- (2003) Proc. of the 17th Conf. on Neural Info. Processing Systems , pp. 847-854
- Fern, A.¹ Yoon, S.W.² Givan, R.³

11
- 9444233318
- Dynamic programming for partially observable stochastic games
- Eric A. Hansen, Daniel S. Bernstein, and Shlomo Zilberstein. Dynamic programming for partially observable stochastic games. In Proc. of the 19th National Conf. on Artificial Intelligence, pages 709-715, 2004.
- (2004) Proc. of the 19th National Conf. on Artificial Intelligence , pp. 709-715
- Hansen, E.A.¹ Bernstein, D.S.² Zilberstein, S.³

12
- 1942420814
- Reinforcement learning as classification: Leveraging modern classifiers
- Michail G. Lagoudakis and Ronald Parr. Reinforcement learning as classification: Leveraging modern classifiers. In Proc. of the 20th Int'l Conf. on Machine Learning, pages 424-431, 2003.
- (2003) Proc. of the 20th Int'l Conf. on Machine Learning , pp. 424-431
- Lagoudakis, M.G.¹ Parr, R.²

13
- 80053169654
- Exploiting locality of interactions using a policy-gradient approach in multiagent learning
- Francisco S. Melo. Exploiting locality of interactions using a policy-gradient approach in multiagent learning. In Proc. of the 18th European Conf. on Artificial Intelligence, volume 178, pages 157-161, 2008.
- (2008) Proc. of the 18th European Conf. on Artificial Intelligence , vol.178 , pp. 157-161
- Melo, F.S.¹

14
- 84880823326
- Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings
- Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. Pynadath, and Stacy Marsella. Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings. In Proc. of the 18th Int'l Joint Conf. on Artificial Intelligence, pages 705-711, 2003.
- (2003) Proc. of the 18th Int'l Joint Conf. on Artificial Intelligence , pp. 705-711
- Nair, R.¹ Tambe, M.² Yokoo, M.³ Pynadath, D.V.⁴ Marsella, S.⁵

15
- 0012646255
- Learning to cooperate via policy search
- Leonid Peshkin, Kee-Eung Kim, Nicolas Meuleau, and Leslie Pack Kaelbling. Learning to cooperate via policy search. In Proc. of the 16th Conf. on Uncertainty in Artificial Intelligence, pages 489-496, 2000.
- (2000) Proc. of the 16th Conf. on Uncertainty in Artificial Intelligence , pp. 489-496
- Peshkin, L.¹ Kim, K.-E.² Meuleau, N.³ Kaelbling, L.P.⁴

16
- 84898959164
- Bounded finite state controllers
- Pascal Poupart and Craig Boutilier. Bounded finite state controllers. In Proc. of the 17th Annual Conference on Neural Information Processing Systems, pages 823-830, 2003.
- (2003) Proc. of the 17th Annual Conference on Neural Information Processing Systems , pp. 823-830
- Poupart, P.¹ Boutilier, C.²

17
- 51649085567
- Improved memorybounded dynamic programming for decentralized POMDPs
- Sven Seuken and Shlomo Zilberstein. Improved memorybounded dynamic programming for decentralized POMDPs. In Proc. of the 23rd Conf. on Uncertainty in Artificial Intelligence, pages 344-351, 2007.
- (2007) Proc. of the 23rd Conf. on Uncertainty in Artificial Intelligence , pp. 344-351
- Seuken, S.¹ Zilberstein, S.²

18
- 84880856384
- Memory-bounded dynamic programming for DEC-POMDPs
- Sven Seuken and Shlomo Zilberstein. Memory-bounded dynamic programming for DEC-POMDPs. In Proc. of the 20th Int'll Joint Conf. on Artificial Intelligence, pages 2009- 2015, 2007.
- (2007) Proc. of the 20th Int'll Joint Conf. on Artificial Intelligence , pp. 2009-2015
- Seuken, S.¹ Zilberstein, S.²

19
- 0004102479
- MIT Press, Cambridge, MA
- Richard Sutton and Andrew Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

20
- 34547980892
- Conditional random fields for multi-agent reinforcement learning
- Xinhua Zhang, Douglas Aberdeen, and S. V. N. Vishwanathan. Conditional random fields for multi-agent reinforcement learning. In Proc. of the 24th Int'l Conf. on Machine Learning, volume 227, pages 1143-1150, 2007.
- (2007) Proc. of the 24th Int'l Conf. on Machine Learning , vol.227 , pp. 1143-1150
- Zhang, X.¹ Aberdeen, D.² Vishwanathan, S.V.N.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.