메뉴 건너뛰기




Volumn , Issue , 2005, Pages 1332-1338

Solving POMDPs with continuous or large discrete observation spaces

Author keywords

[No Author keywords available]

Indexed keywords

ASSISTED LIVING; DECISION PROBLEMS; DISCRETE OBSERVATIONS; DISCRETISATION; LOSSLESS; OBSERVATION SPACE; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; RELEVANT FEATURES;

EID: 84880741298     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (106)

References (27)
  • 1
    • 50549213583 scopus 로고
    • Optimal control of Markov decision processes with incomplete state estimation
    • K. J. Åström. Optimal control of Markov decision processes with incomplete state estimation. Journal of Mathematical Analysis and Applications, 10:174-205, 1965.
    • (1965) Journal of Mathematical Analysis and Applications , vol.10 , pp. 174-205
    • Åström, K.J.1
  • 2
    • 1942514241 scopus 로고    scopus 로고
    • Scaling internal-state policy-gradient methods for POMDPs
    • Sydney, Australia
    • D. Aberdeen and J. Baxter. Scaling internal-state policy-gradient methods for POMDPs. In ICML, pages 3-10, Sydney, Australia, 2002.
    • (2002) ICML , pp. 3-10
    • Aberdeen, D.1    Baxter, J.2
  • 9
    • 84947403595 scopus 로고
    • Probability inequalities for sums of bounded random variables
    • W. Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13-30, 1963.
    • (1963) Journal of the American Statistical Association , vol.58 , Issue.301 , pp. 13-30
    • Hoeffding, W.1
  • 12
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • Leslie Pack Kaelbling, Michael Littman, and Anthony R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99-134, 1998.
    • (1998) Artificial Intelligence , vol.101 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.2    Cassandra, A.R.3
  • 13
    • 33646430192 scopus 로고    scopus 로고
    • Learning finite-state controllers for partially observable environments
    • Stockholm
    • N. Meuleau, L. Peshkin, K.-E. Kim, and L. P. Kaelbling. Learning finite-state controllers for partially observable environments. In UAI, pages 427-436, Stockholm, 1999.
    • (1999) UAI , pp. 427-436
    • Meuleau, N.1    Peshkin, L.2    Kim, K.-E.3    Kaelbling, L.P.4
  • 15
    • 0036931186 scopus 로고    scopus 로고
    • Experiences with a mobile robotic guide for the elderly
    • Edmonton, AB
    • M. Montemerlo, J. Pineau, N. Roy, S. Thrun, and V. Verma. Experiences with a mobile robotic guide for the elderly. In AAAI, pages 587-592, Edmonton, AB, 2002.
    • (2002) AAAI , pp. 587-592
    • Montemerlo, M.1    Pineau, J.2    Roy, N.3    Thrun, S.4    Verma, V.5
  • 16
    • 0141819580 scopus 로고    scopus 로고
    • PEGASUS: A policy search method for large MDPs and POMDPs
    • Stanford, CA
    • A. Y. Ng and M. Jordan. PEGASUS: A policy search method for large MDPs and POMDPs. In UAI, pages 406-415, Stanford, CA, 2000.
    • (2000) UAI , pp. 406-415
    • Ng, A.Y.1    Jordan, M.2
  • 20
    • 85156196231 scopus 로고    scopus 로고
    • Exponential family PCA for belief compression in POMDPs
    • Vancouver, BC
    • N. Roy and G. Gordon. Exponential family PCA for belief compression in POMDPs. In NIPS, pages 1635-1642, Vancouver, BC, 2002.
    • (2002) NIPS , pp. 1635-1642
    • Roy, N.1    Gordon, G.2
  • 22
    • 0015658957 scopus 로고
    • The optimal control of partially observable Markov processes over a finite horizon
    • R. Smallwood and E. Sondik. The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21:1071-1088, 1973.
    • (1973) Operations Research , vol.21 , pp. 1071-1088
    • Smallwood, R.1    Sondik, E.2
  • 23
    • 84898978676 scopus 로고    scopus 로고
    • Monte Carlo POMDPs
    • Denver
    • S. Thrun. Monte Carlo POMDPs. In NIPS, pages 1064-1070, Denver, 1999.
    • (1999) NIPS , pp. 1064-1070
    • Thrun, S.1
  • 27
    • 0036374229 scopus 로고    scopus 로고
    • Speeding up the convergence of value iteration in partially observable Markov decision processes
    • N. Zhang and W. Zhang. Speeding up the convergence of value-iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research, 14:29-51, 2001. (Pubitemid 33738058)
    • (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 29-51
    • Zhang, N.L.1    Zhang, W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.