2009, Pages 1641-1646

Solving POMDPs: RTDP-Bel vs. point-based algorithms

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE;

EID: 78751689703     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 102

References (19)
  • 1
    • 50549213583
    • Optimal control of Markov Decision Processes with incomplete state estimation
    • K. Astrom. Optimal control of Markov Decision Processes with incomplete state estimation. J. Math. Anal. Appl., 10:174-205, 1965.
    • (1965) J. Math. Anal. Appl. , vol.10 , pp. 174-205
    • Astrom, K.1
  • 2
    • 0029210635
    • Learning to act using real-time dynamic programming
    • A. Barto, S. Bradtke, and S. Singh. Learning to act using real-time dynamic programming. Art. Int., 72:81-138, 1995.
    • (1995) Art. Int. , vol.72 , pp. 81-138
    • Barto, A.1    Bradtke, S.2    Singh, S.3
  • 5
    • 0002098456
    • Learning sorting and decision trees with POMDPs
    • B. Bonet and H. Geffner. Learning sorting and decision trees with POMDPs. In Proc. ICML, pages 73-81, 1998.
    • (1998) Proc. ICML , pp. 73-81
    • Bonet, B.1    Geffner, H.2
  • 6
    • 0012252088
    • Solving large POMDPs using real time dynamic programming
    • AAAI Press
    • B. Bonet and H. Geffner. Solving large POMDPs using real time dynamic programming. In Proc. AAAI Fall Symp. on POMDPs. AAAI Press, 1998.
    • (1998) Proc. AAAI Fall Symp. on POMDPs
    • Bonet, B.1    Geffner, H.2
  • 7
    • 85166261608
    • Planning with incomplete information as heuristic search in belief space
    • B. Bonet and H. Geffner. Planning with incomplete information as heuristic search in belief space. In Proc. ICAPS, pages 52-61, 2000.
    • (2000) Proc. ICAPS , pp. 52-61
    • Bonet, B.1    Geffner, H.2
  • 8
    • 31144460375
    • An ε-optimal grid-based algorithm for Partially Observable Markov Decision Processes
    • B. Bonet. An ε-optimal grid-based algorithm for Partially Observable Markov Decision Processes. In Proc. ICML, pages 51-58, 2002.
    • (2002) Proc. ICML , pp. 51-58
    • Bonet, B.1
  • 9
    • 0001770240
    • Value-Function Approximations for Partially Observable Markov Decision Processes
    • M. Hauskrecht. Value-function approximations for partially observable Markov decision processes. JAIR, 13:33-94, 2000. (Pubitemid 33682083)
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 33-94
    • Hauskrecht, M.1
  • 10
    • 0032073263
    • Planning and acting in partially observable stochastic domains
    • L. P. Kaelbling, M. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Art. Int., 101:99-134, 1999.
    • (1999) Art. Int. , vol.101 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.2    Cassandra, A.R.3
  • 11
    • 0025400088
    • Real-time heuristic search
    • R. Korf. Real-time heuristic search. Art. Int., 42(2-3):189-211, 1990.
    • (1990) Art. Int. , vol.42 , Issue.2-3 , pp. 189-211
    • Korf, R.1
  • 12
    • 0019909899
    • A survey of partially observable Markov decision processes: Theory, models and algorithms
    • G. Monahan. A survey of partially observable Markov decision processes: Theory, models and algorithms. Management Science, 28(1):1-16, 1983.
    • (1983) Management Science , vol.28 , Issue.1 , pp. 1-16
    • Monahan, G.1
  • 13
    • 52249090123
    • Anytime point-based approximations for large POMDPs
    • J. Pineau, G. J. Gordon, and S. Thrun. Anytime point-based approximations for large POMDPs. JAIR, 27:335-380, 2006.
    • (2006) JAIR , vol.27 , pp. 335-380
    • Pineau, J.1    Gordon, G.J.2    Thrun, S.3
  • 14
    • 84880906197
    • Forward search value iteration for POMDPs
    • G. Shani, R. I. Brafman, and S. E. Shimony. Forward search value iteration for POMDPs. In Proc. IJCAI, pages 2619-2624, 2007.
    • (2007) Proc. IJCAI , pp. 2619-2624
    • Shani, G.1    Brafman, R.I.2    Shimony, S.E.3
  • 15
    • 80053262864
    • Point-based POMDP algorithms: Improved analysis and implementation
    • T. Smith and R. Simmons. Point-based POMDP algorithms: Improved analysis and implementation. In Proc. UAI, pages 542-547, 2005.
    • (2005) Proc. UAI , pp. 542-547
    • Smith, T.1    Simmons, R.2
  • 17
    • 0017943242
    • The optimal control of partially observable Markov decision processes over the infinite horizon: Discounted costs
    • E. Sondik. The optimal control of partially observable Markov decision processes over the infinite horizon: discounted costs. Oper. Res., 26(2), 1978.
    • (1978) Oper. Res. , vol.26 , Issue.2
    • Sondik, E.1
  • 18
    • 31144472319
    • Perseus: Randomized point-based value iteration for POMDPs
    • M. T. J. Spaan and N. A. Vlassis. Perseus: Randomized point-based value iteration for POMDPs. JAIR, 24:195-220, 2005.
    • (2005) JAIR , vol.24 , pp. 195-220
    • Spaan, M.T.J.1    Vlassis, N.A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.