SCOPUS 정보 검색 플랫폼

Proceedings of the National Conference on Artificial Intelligence

Volumn 2, Issue , 2012, Pages 1256-1262

Sample bounded distributed reinforcement learning for decentralized POMDPs

(4) Banerjee, Bikramjit a Lyle, Jeremy b Kraemer, Landon a Yellamraju, Rajesh a

a UNIVERSITY OF SOUTHERN MISSISSIPPI (United States)

b University of Southern Mississippi (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BEST RESPONSE; COMPUTATION PROBLEMS; DISTRIBUTED REINFORCEMENT LEARNING; ERROR TOLERANCE; LEARNING APPROACH; MODELING TECHNIQUE; MULTI-AGENT COORDINATIONS; OPTIMAL POLICIES; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; PRIOR KNOWLEDGE; PROBLEM PARAMETERS; SAMPLE COMPLEXITY; SOLUTION TECHNIQUES;

UNCERTAINTY ANALYSIS;

ARTIFICIAL INTELLIGENCE;

EID: 84868275593 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (18)

References (19)

1
- 80053179816
- Optimizing memory-bounded controllers for decentralized POMDPs
- Amato, C.; Bernstein, D.; and Zilberstein, S. 2007. Optimizing memory-bounded controllers for decentralized POMDPs. In Proc. UAI.
- (2007) Proc. UAI
- Amato, C.¹ Bernstein, D.² Zilberstein, S.³

2
- 0036874366
- The complexity of decentralized control of markov decision processes
- Bernstein, D. S.; Givan, R.; Immerman, N.; and Zilberstein, S. 2002. The complexity of decentralized control of markov decision processes. Mathematics of Operations Research 27:819-840.
- (2002) Mathematics of Operations Research , vol.27 , pp. 819-840
- Bernstein, D.S.¹ Givan, R.² Immerman, N.³ Zilberstein, S.⁴

3
- 0041965975
- R-max - A general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R. I., and Tennenholtz, M. 2002. R-max - A general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research 3:213-231.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.I.¹ Tennenholtz, M.²

4
- 0026998041
- Reinforcement learning with perceptual aliasing: The perceptual distinctions approach
- San Jose, CA: AAAI Press
- Chrisman, L. 1992. Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. In Proceedings of the Tenth National Conference on Articial Intelligence, 183-188. San Jose, CA: AAAI Press.
- (1992) Proceedings of the Tenth National Conference on Articial Intelligence , pp. 183-188
- Chrisman, L.¹

5
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Menlo Park, CA: AAAI Press/MIT Press
- Claus, C., and Boutilier, C. 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the 15th National Conference on Artificial Intelligence, 746-752. Menlo Park, CA: AAAI Press/MIT Press.
- (1998) Proceedings of the 15th National Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

6
- 4544325183
- Approximate solutions for partially observable stochastic games with common payoffs
- Emery-Montemerlo, R.; Gordon, G.; Schneider, J.; and Thrun, S. 2004. Approximate solutions for partially observable stochastic games with common payoffs. Autonomous Agents and Multiagent Systems, International Joint Conference on 1:136-143.
- (2004) Autonomous Agents and Multiagent Systems, International Joint Conference on , vol.1 , pp. 136-143
- Emery-Montemerlo, R.¹ Gordon, G.² Schneider, J.³ Thrun, S.⁴

7
- 84868297932
- Informed initial policies for learning in dec-pomdps
- To appear
- Kraemer, L., and Banerjee, B. 2012. Informed initial policies for learning in dec-pomdps. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence Student Abstract and Poster Program. To appear.
- (2012) Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence Student Abstract and Poster Program
- Kraemer, L.¹ Banerjee, B.²

8
- 0003932121
- Ph.D. Dissertation, Department of Computer Science, University of Rochester
- McCallum, A. K. 1995. Reinforcement Learning with Selective Perception and Hidden State. Ph.D. Dissertation, Department of Computer Science, University of Rochester.
- (1995) Reinforcement Learning with Selective Perception and Hidden State
- McCallum, A.K.¹

9
- 0002103968
- Learning finite-state controllers for partially observable environments
- Meuleau, N.; Peshkin, L.; Kim, K.; and Kaelbling, L. 1999. Learning finite-state controllers for partially observable environments. In Proc. UAI, 427-436.
- (1999) Proc. UAI , pp. 427-436
- Meuleau, N.¹ Peshkin, L.² Kim, K.³ Kaelbling, L.⁴

10
- 84880823326
- Taming decentralized pomdps: Towards efficient policy computation for multiagent settings
- Nair, R.; Tambe, M.; Yokoo, M.; Pynadath, D.; and Marsella, S. 2003. Taming decentralized pomdps: Towards efficient policy computation for multiagent settings. In Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI-03), 705-711.
- (2003) Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI-03) , pp. 705-711
- Nair, R.¹ Tambe, M.² Yokoo, M.³ Pynadath, D.⁴ Marsella, S.⁵

11
- 84868289680
- Heuristic search for identical payoff bayesian games
- Oliehoek, F. A.; Spaan, M. T. J.; Dibangoye, J. S.; and Amato, C. 2010. Heuristic search for identical payoff bayesian games. In Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS-10), 1115-1122.
- (2010) Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS-10) , pp. 1115-1122
- Oliehoek, F.A.¹ Spaan, M.T.J.² Dibangoye, J.S.³ Amato, C.⁴

12
- 0012646255
- Learning to cooperate via policy search
- Peshkin, L.; Kim, K.; Meuleau, N.; and Kaelbling, L. 2000. Learning to cooperate via policy search. In Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence (UAI '00).
- (2000) Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence (UAI '00)
- Peshkin, L.¹ Kim, K.² Meuleau, N.³ Kaelbling, L.⁴

13
- 84880856384
- Memory-bounded dynamic programming for dec-pomdps
- Seuken, S. 2007. Memory-bounded dynamic programming for dec-pomdps. In Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI-07), 2009-2015.
- (2007) Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI-07) , pp. 2009-2015
- Seuken, S.¹

14
- 33646435268
- Model-based online learning of POMDPs
- Proceedings of the European Conference on Machine Learning (ECML), volume Springer
- Shani, G.; Brafman, R.; and Shimony, S. 2005. Model-based online learning of POMDPs. In Proceedings of the European Conference on Machine Learning (ECML), volume Lecture Notes in Computer Science 3720, 353-364. Springer.
- (2005) Lecture Notes in Computer Science , vol.3720 , pp. 353-364
- Shani, G.¹ Brafman, R.² Shimony, S.³

15
- 84868299292
- Scaling up optimal heuristic search in Dec-POMDPs via incremental expansion
- Spaan, M. T. J.; Oliehoek, F. A.; and Amato, C. 2011. Scaling up optimal heuristic search in Dec-POMDPs via incremental expansion. In Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence (IJCAI-11), 2027-2032.
- (2011) Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence (IJCAI-11) , pp. 2027-2032
- Spaan, M.T.J.¹ Oliehoek, F.A.² Amato, C.³

16
- 0004102479
- MIT Press
- Sutton, R., and Barto, A. G. 1998. Reinforcement Learning: An Introduction. MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.G.²

17
- 33750691009
- Point-based dynamic programming for dec-pomdps
- Szer, D., and Charpillet, F. 2006. Point-based dynamic programming for dec-pomdps. In Proceedings of the 21st National Conference on Artificial Intelligence, 1233-1238.
- (2006) Proceedings of the 21st National Conference on Artificial Intelligence , pp. 1233-1238
- Szer, D.¹ Charpillet, F.²

18
- 80053153738
- Rollout sampling policy iteration for decentralized POMDPs
- Wu, F.; Zilberstein, S.; and Chen, X. 2010. Rollout sampling policy iteration for decentralized POMDPs. In Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence (UAI-10), 666-673.
- (2010) Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence (UAI-10) , pp. 666-673
- Wu, F.¹ Zilberstein, S.² Chen, X.³

19
- 85140781301
- Coordinated multi-agent reinforcement learning in networked distributed POMDPs
- Zhang, C., and Lesser, V. 2011. Coordinated multi-agent reinforcement learning in networked distributed POMDPs. In Proc. AAAl-11.
- (2011) Proc. AAAl-11
- Zhang, C.¹ Lesser, V.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.