메뉴 건너뛰기




Volumn , Issue , 2008, Pages

Bayes-adaptive POMDPs

Author keywords

[No Author keywords available]

Indexed keywords

ECONOMIC AND SOCIAL EFFECTS; ESTIMATION; REINFORCEMENT LEARNING;

EID: 85162018872     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (108)

References (12)
  • 3
    • 33749251297 scopus 로고    scopus 로고
    • An analytic solution to discrete bayesian reinforcement learning
    • P. Poupart, N. Vlassis, J. Hoey, and K. Regan. An analytic solution to discrete bayesian reinforcement learning. In Proc. ICML, 2006.
    • (2006) Proc. ICML
    • Poupart, P.1    Vlassis, N.2    Hoey, J.3    Regan, K.4
  • 4
    • 39649090194 scopus 로고    scopus 로고
    • Active learning in partially observable markov decision processes
    • R. Jaulmes, J. Pineau, and D. Precup. Active learning in partially observable markov decision processes. In ECML, 2005.
    • (2005) ECML
    • Jaulmes, R.1    Pineau, J.2    Precup, D.3
  • 5
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • PII S000437029800023X
    • L. P. Kaelbling,M. L. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99-134, 1998. (Pubitemid 128387390)
    • (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 6
    • 84880772945 scopus 로고    scopus 로고
    • Point-based value iteration: An anytime algorithm for POMDPs
    • Acapulco, Mexico
    • J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: an anytime algorithm for POMDPs. In IJCAI, pages 1025-1032, Acapulco, Mexico, 2003.
    • (2003) IJCAI , pp. 1025-1032
    • Pineau, J.1    Gordon, G.2    Thrun, S.3
  • 8
    • 31144465830 scopus 로고    scopus 로고
    • Heuristic search value iteration for POMDPs
    • Banff, Canada
    • T. Smith and R. Simmons. Heuristic search value iteration for POMDPs. In UAI, Banff, Canada, 2004.
    • (2004) UAI
    • Smith, T.1    Simmons, R.2
  • 9
    • 44649095768 scopus 로고    scopus 로고
    • An online POMDP algorithm for complex multiagent environments
    • S. Paquet, L. Tobin, and B. Chaib-draa. An online POMDP algorithm for complex multiagent environments. In AAMAS, 2005.
    • (2005) AAMAS
    • Paquet, S.1    Tobin, L.2    Chaib-Draa, B.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.