



Volume 2, 2007, Pages 1243-1249

Point-based policy iteration

Author keywords

[No Author keywords available]

Indexed keywords

HANSEN'S POLICY; POINT-BASED VALUE ITERATION (PBVI); VALUE FUNCTIONS

EID: 36348942884     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 30

References (13)
  • 1
    • 0001909869
    • Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes
    • Cassandra, A. R.; Littman, M.; and Zhang, N. 1997. Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes. In UAI 13, 54-61.
    • (1997) UAI 13 , pp. 54-61
    • Cassandra, A.R.1    Littman, M.2    Zhang, N.3
  • 2
    • 0003125478
    • Solving POMDPs by searching in policy space
    • Hansen, E. A. 1998. Solving POMDPs by searching in policy space. In UAI 14, 211-219.
    • (1998) UAI 14 , pp. 211-219
    • Hansen, E.A.1
  • 3
    • 0032073263
    • Planning and acting in partially observable stochastic domains
    • Kaelbling, L. P.; Littman, M. L.; and Cassandra, A. R. 1998. Planning and acting in partially observable stochastic domains. Artificial Intelligence 101:99-134.
    • (1998) Artificial Intelligence , vol.101 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 4
    • 0002500946
    • Solving POMDPs by searching the space of finite policies
    • Meuleau, N.; Kim, K. E.; Kaelbling, L. P.; and Cassandra, A. R. 1999. Solving POMDPs by searching the space of finite policies. In UAI 15, 417-426.
    • (1999) UAI 15 , pp. 417-426
    • Meuleau, N.1    Kim, K.E.2    Kaelbling, L.P.3    Cassandra, A.R.4
  • 5
    • 84880772945
    • Point-based value iteration: An anytime algorithm for POMDPs
    • Pineau, J.; Gordon, G.; and Thrun, S. 2003. Point-based value iteration: An anytime algorithm for POMDPs. In IJCAI, 1025-1032.
    • (2003) IJCAI , pp. 1025-1032
    • Pineau, J.1    Gordon, G.2    Thrun, S.3
  • 6
    • 33745673007
    • Bounded finite state controllers
    • Poupart, P., and Boutilier, C. 2003. Bounded finite state controllers. In NIPS 16.
    • (2003) NIPS 16
    • Poupart, P.1    Boutilier, C.2
  • 8
    • 0037581251
    • Modified policy iteration algorithms for discounted Markov decision problems
    • Puterman, M. L., and Shin, M. C. 1978. Modified policy iteration algorithms for discounted Markov decision problems. Management Science 24(1).
    • (1978) Management Science , vol.24 , Issue.1
    • Puterman, M.L.1    Shin, M.C.2
  • 9
    • 0015658957
    • The optimal control of partially observable Markov processes over a finite horizon
    • Smallwood, R. D., and Sondik, E. J. 1973. The optimal control of partially observable Markov processes over a finite horizon. Operations Research 21 (5): 1071-1088.
    • (1973) Operations Research , vol.21 , Issue.5 , pp. 1071-1088
    • Smallwood, R.D.1    Sondik, E.J.2
  • 10
    • 33750297371
    • Heuristic search value iteration for POMDPs
    • Smith, T., and Simmons, R. 2004. Heuristic search value iteration for POMDPs. In UAI 20.
    • (2004) UAI 20
    • Smith, T.1    Simmons, R.2
  • 11
    • 80053262864
    • Point-based POMDP algorithms: Improved analysis and implementation
    • Smith, T., and Simmons, R. 2005. Point-based POMDP algorithms: Improved analysis and implementation. In UAI 21.
    • (2005) UAI 21
    • Smith, T.1    Simmons, R.2
  • 12
    • 0017943242
    • The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs
    • Sondik, E. J. 1978. The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs. Operations Research 26(2):282-304.
    • (1978) Operations Research , vol.26 , Issue.2 , pp. 282-304
    • Sondik, E.J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.