Volume , Issue , 2010, Pages 666-673

Rollout sampling policy iteration for decentralized POMDPs

Author keywords

[No Author keywords available]

Indexed keywords

MONTE CARLO METHODS; MULTI AGENT SYSTEMS; SCALABILITY; SOFTWARE AGENTS

EID: 80053153738     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (22)

References (20)
  • 4
    • 0031272681
    • Rollout algorithms for combinatorial optimization
    • Dimitri P. Bertsekas, John N. Tsitsiklis, and Cynara Wu. Rollout algorithms for combinatorial optimization. Journal of Heuristics, 3(3):245-262, 1997. (Pubitemid 127509041)
    • (1997) Journal of Heuristics , vol.3 , Issue.3 , pp. 245-262
    • Bertsekas, D.P.1    Tsitsiklis, J.N.2    Wu, C.3
  • 7
    • 3543128853
    • Parallel rollout for online solution of partially observable Markov decision processes
    • Hyeong Soo Chang, Robert Givan, and Edwin K. P. Chong. Parallel rollout for online solution of partially observable Markov decision processes. Discrete Event Dynamic Systems, 14(3):309-341, 2004.
    • (2004) Discrete Event Dynamic Systems , vol.14 , Issue.3 , pp. 309-341
    • Chang, H.S.1    Givan, R.2    Chong, E.K.P.3
  • 9
    • 48349140736
    • Rollout sampling approximate policy iteration
    • Christos Dimitrakakis and Michail G. Lagoudakis. Rollout sampling approximate policy iteration. Machine Learning, 72(3):157-171, 2008.
    • (2008) Machine Learning , vol.72 , Issue.3 , pp. 157-171
    • Dimitrakakis, C.1    Lagoudakis, M.G.2
  • 13
    • 80053169654
    • Exploiting locality of interactions using a policy-gradient approach in multiagent learning
    • Francisco S. Melo. Exploiting locality of interactions using a policy-gradient approach in multiagent learning. In Proc. of the 18th European Conf. on Artificial Intelligence, volume 178, pages 157-161, 2008.
    • (2008) Proc. of the 18th European Conf. on Artificial Intelligence , vol.178 , pp. 157-161
    • Melo, F.S.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.