Volume 2, 2012, Pages 1256-1262

Sample bounded distributed reinforcement learning for decentralized POMDPs

Author keywords

[No Author keywords available]

Indexed keywords

BEST RESPONSE; COMPUTATION PROBLEMS; DISTRIBUTED REINFORCEMENT LEARNING; ERROR TOLERANCE; LEARNING APPROACH; MODELING TECHNIQUE; MULTI-AGENT COORDINATIONS; OPTIMAL POLICIES; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; PRIOR KNOWLEDGE; PROBLEM PARAMETERS; SAMPLE COMPLEXITY; SOLUTION TECHNIQUES

EID: 84868275593     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 18

References (19)
  • 1
    • 80053179816
    • Optimizing memory-bounded controllers for decentralized POMDPs
    • Amato, C.; Bernstein, D.; and Zilberstein, S. 2007. Optimizing memory-bounded controllers for decentralized POMDPs. In Proc. UAI.
    • (2007) Proc. UAI
    • Amato, C.; Bernstein, D.; Zilberstein, S.
  • 3
    • 0041965975
    • R-max - A general polynomial time algorithm for near-optimal reinforcement learning
    • Brafman, R. I., and Tennenholtz, M. 2002. R-max - A general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research 3:213-231.
    • (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
    • Brafman, R.I.; Tennenholtz, M.
  • 4
    • 0026998041
    • Reinforcement learning with perceptual aliasing: The perceptual distinctions approach
    • San Jose, CA: AAAI Press
    • Chrisman, L. 1992. Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. In Proceedings of the Tenth National Conference on Artificial Intelligence, 183-188. San Jose, CA: AAAI Press.
    • (1992) Proceedings of the Tenth National Conference on Artificial Intelligence , pp. 183-188
    • Chrisman, L.
  • 5
    • 0031630561
    • The dynamics of reinforcement learning in cooperative multiagent systems
    • Menlo Park, CA: AAAI Press/MIT Press
    • Claus, C., and Boutilier, C. 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the 15th National Conference on Artificial Intelligence, 746-752. Menlo Park, CA: AAAI Press/MIT Press.
    • (1998) Proceedings of the 15th National Conference on Artificial Intelligence , pp. 746-752
    • Claus, C.; Boutilier, C.
  • 9
    • 0002103968
    • Learning finite-state controllers for partially observable environments
    • Meuleau, N.; Peshkin, L.; Kim, K.; and Kaelbling, L. 1999. Learning finite-state controllers for partially observable environments. In Proc. UAI, 427-436.
    • (1999) Proc. UAI , pp. 427-436
    • Meuleau, N.; Peshkin, L.; Kim, K.; Kaelbling, L.
  • 14
    • 33646435268
    • Model-based online learning of POMDPs
    • Proceedings of the European Conference on Machine Learning (ECML), Springer
    • Shani, G.; Brafman, R.; and Shimony, S. 2005. Model-based online learning of POMDPs. In Proceedings of the European Conference on Machine Learning (ECML), Lecture Notes in Computer Science, volume 3720, 353-364. Springer.
    • (2005) Lecture Notes in Computer Science , vol.3720 , pp. 353-364
    • Shani, G.; Brafman, R.; Shimony, S.
  • 19
    • 85140781301
    • Coordinated multi-agent reinforcement learning in networked distributed POMDPs
    • Zhang, C., and Lesser, V. 2011. Coordinated multi-agent reinforcement learning in networked distributed POMDPs. In Proc. AAAI-11.
    • (2011) Proc. AAAI-11
    • Zhang, C.; Lesser, V.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.