SCOPUS 정보 검색 플랫폼

IJCAI International Joint Conference on Artificial Intelligence

Volumn , Issue , 2005, Pages 1332-1338

Solving POMDPs with continuous or large discrete observation spaces

(2) Hoey, Jesse a Poupart, Pascal b

a UNIVERSITY OF TORONTO (Canada)

b UNIVERSITY OF WATERLOO (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

ASSISTED LIVING; DECISION PROBLEMS; DISCRETE OBSERVATIONS; DISCRETISATION; LOSSLESS; OBSERVATION SPACE; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; RELEVANT FEATURES;

ARTIFICIAL INTELLIGENCE;

ALGORITHMS;

EID: 84880741298 PISSN: 10450823 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (106)

References (27)

1
- 50549213583
- Optimal control of Markov decision processes with incomplete state estimation
- K. J. Åström. Optimal control of Markov decision processes with incomplete state estimation. Journal of Mathematical Analysis and Applications, 10:174-205, 1965.
- (1965) Journal of Mathematical Analysis and Applications , vol.10 , pp. 174-205
- Åström, K.J.¹

2
- 1942514241
- Scaling internal-state policy-gradient methods for POMDPs
- Sydney, Australia
- D. Aberdeen and J. Baxter. Scaling internal-state policy-gradient methods for POMDPs. In ICML, pages 3-10, Sydney, Australia, 2002.
- (2002) ICML , pp. 3-10
- Aberdeen, D.¹ Baxter, J.²

3
- 0003487482
- Athena Scientific, Belmont, MA
- D. P. Bertsekas and J. N. Tsitsiklis. Neuro-dynamic programming. Athena Scientific, Belmont, MA, 1996.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

4
- 84880705310
- A decision-theoretic approach to task assistance for persons with dementia
- J. Boger, P. Poupart, J. Hoey, C. Boutilier, G. Fernie, and Alex Mihailidis. A decision-theoretic approach to task assistance for persons with dementia. In Proc. IJCAI, Edinburgh, 2005.
- Proc. IJCAI, Edinburgh, 2005
- Boger, J.¹ Poupart, P.² Hoey, J.³ Boutilier, C.⁴ Fernie, G.⁵ Mihailidis, A.⁶

5
- 84880747107
- Acting optimally in partially observable stochastic domains
- A. R. Cassandra, L. P. Kaelbling, and M. L. Littman. Acting optimally in partially observable stochastic domains. In AAAI, Seattle, WA, 1994.
- AAAI, Seattle, WA, 1994
- Cassandra, A.R.¹ Kaelbling, L.P.² Littman, M.L.³

6
- 0003818801
- PhD thesis, University of British Columbia, Vancouver
- H.-T. Cheng. Algorithms for Partially Observable Markov Decision Processes. PhD thesis, University of British Columbia, Vancouver, 1988.
- (1988) Algorithms for Partially Observable Markov Decision Processes
- Cheng, H.-T.¹

7
- 84898804274
- Active gesture recognition using partially observable Markov decision processes
- T. Darrell and A. P. Pentland. Active gesture recognition using partially observable Markov decision processes. In IEEE Intl. Conf. on Pattern Recognition, Vienna, Austria, 1996.
- IEEE Intl. Conf. on Pattern Recognition, Vienna, Austria, 1996
- Darrell, T.¹ Pentland, A.P.²

8
- 84880747253
- Approximate planning for factored POMDPs
- Z. Feng and E. Hansen. Approximate planning for factored POMDPs. In Proc. ECP, Toledo, Spain, 2001.
- Proc. ECP, Toledo, Spain, 2001
- Feng, Z.¹ Hansen, E.²

9
- 84947403595
- Probability inequalities for sums of bounded random variables
- W. Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13-30, 1963.
- (1963) Journal of the American Statistical Association , vol.58 , Issue.301 , pp. 13-30
- Hoeffding, W.¹

10
- 84880739310
- Working paper
- Jesse Hoey, Pascal Poupart, Craig Boutilier, and Alex Mihailidis. Semi-supervised learning of patient-caregiver interactions using partially observable Markov decision processes. Working paper, 2005.
- (2005) Semi-supervised Learning of Patient-caregiver Interactions Using Partially Observable Markov Decision Processes
- Hoey, J.¹ Poupart, P.² Boutilier, C.³ Mihailidis, A.⁴

11
- 34247193910
- Technical Report IAS-UVA-04-04, Informatics Institute, University of Amsterdam
- M. Spaan J. Porta and N. Vlassis. Value iteration for continuous-state POMDPs. Technical Report IAS-UVA-04-04, Informatics Institute, University of Amsterdam, 2004.
- (2004) Value Iteration for Continuous-state POMDPs
- Spaan, M.¹ Porta, J.² Vlassis, N.³

12
- 0032073263
- Planning and acting in partially observable stochastic domains
- Leslie Pack Kaelbling, Michael Littman, and Anthony R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99-134, 1998.
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.² Cassandra, A.R.³

13
- 33646430192
- Learning finite-state controllers for partially observable environments
- Stockholm
- N. Meuleau, L. Peshkin, K.-E. Kim, and L. P. Kaelbling. Learning finite-state controllers for partially observable environments. In UAI, pages 427-436, Stockholm, 1999.
- (1999) UAI , pp. 427-436
- Meuleau, N.¹ Peshkin, L.² Kim, K.-E.³ Kaelbling, L.P.⁴

14
- 4644247530
- The use of computer vision in an intelligent environment to support aging-inplace, safety, and independence in the home
- A. Mihailidis, B. Carmichael, and J. Boger. The use of computer vision in an intelligent environment to support aging-inplace, safety, and independence in the home. IEEE Trans. on Information Technology in Biomedicine (Spec. Issue on Pervasive Healthcare), 8(3):1-11, 2004.
- (2004) IEEE Trans. on Information Technology in Biomedicine (Spec. Issue on Pervasive Healthcare) , vol.8 , Issue.3 , pp. 1-11
- Mihailidis, A.¹ Carmichael, B.² Boger, J.³

15
- 0036931186
- Experiences with a mobile robotic guide for the elderly
- Edmonton, AB
- M. Montemerlo, J. Pineau, N. Roy, S. Thrun, and V. Verma. Experiences with a mobile robotic guide for the elderly. In AAAI, pages 587-592, Edmonton, AB, 2002.
- (2002) AAAI , pp. 587-592
- Montemerlo, M.¹ Pineau, J.² Roy, N.³ Thrun, S.⁴ Verma, V.⁵

16
- 0141819580
- PEGASUS: A policy search method for large MDPs and POMDPs
- Stanford, CA
- A. Y. Ng and M. Jordan. PEGASUS: A policy search method for large MDPs and POMDPs. In UAI, pages 406-415, Stanford, CA, 2000.
- (2000) UAI , pp. 406-415
- Ng, A.Y.¹ Jordan, M.²

17
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: an anytime algorithm for POMDPs. In IJCAI, Acapulco, Mexico, 2003.
- IJCAI, Acapulco, Mexico, 2003
- Pineau, J.¹ Gordon, G.² Thrun, S.³

18
- 84898959164
- Bounded finite state controllers
- P. Poupart and C. Boutilier. Bounded finite state controllers. In NIPS, Vancouver, BC, 2003.
- NIPS, Vancouver, BC, 2003
- Poupart, P.¹ Boutilier, C.²

19
- 31144457984
- VDCBPI: An approximate scalable algorithm for large POMDPs
- P. Poupart and C. Boutilier. VDCBPI: an approximate scalable algorithm for large POMDPs. In NIPS, Vancouver, BC, 2004.
- NIPS, Vancouver, BC, 2004
- Poupart, P.¹ Boutilier, C.²

20
- 85156196231
- Exponential family PCA for belief compression in POMDPs
- Vancouver, BC
- N. Roy and G. Gordon. Exponential family PCA for belief compression in POMDPs. In NIPS, pages 1635-1642, Vancouver, BC, 2002.
- (2002) NIPS , pp. 1635-1642
- Roy, N.¹ Gordon, G.²

21
- 84880707672
- Spoken dialog management using probabilistic reasoning
- N. Roy, J. Pineau, and S. Thrun. Spoken dialog management using probabilistic reasoning. In ACL, Hong Kong, 2000.
- ACL, Hong Kong, 2000
- Roy, N.¹ Pineau, J.² Thrun, S.³

22
- 0015658957
- The optimal control of partially observable Markov processes over a finite horizon
- R. Smallwood and E. Sondik. The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21:1071-1088, 1973.
- (1973) Operations Research , vol.21 , pp. 1071-1088
- Smallwood, R.¹ Sondik, E.²

23
- 84898978676
- Monte Carlo POMDPs
- Denver
- S. Thrun. Monte Carlo POMDPs. In NIPS, pages 1064-1070, Denver, 1999.
- (1999) NIPS , pp. 1064-1070
- Thrun, S.¹

24
- 84880748332
- A fast point-based algorithm for POMDPs
- N. Vlassis and M. T. J. Spaan. A fast point-based algorithm for POMDPs. In Proc. Belgian-Dutch Conference on Machine Learning, Brussels, Belgium, 2004.
- Proc. Belgian-Dutch Conference on Machine Learning, Brussels, Belgium, 2004
- Vlassis, N.¹ Spaan, M.T.J.²

25
- 33750709342
- Technical Report CUED/F-INFEG/TR.520, Cambridge University, Engineering Department
- J. Williams, P. Poupart, and S. Young. Using factored Markov decision processes with continuous observations for dialogue management. Technical Report CUED/F-INFEG/TR.520, Cambridge University, Engineering Department, 2005.
- (2005) Using Factored Markov Decision Processes with Continuous Observations for Dialogue Management
- Williams, J.¹ Poupart, P.² Young, S.³

26
- 0010810245
- Technical Report HKUST-CS96-31, Hong Kong University of Science and Technology
- N. Zhang and W. Liu. Planning in stochastic domains: Problem characteristics and approximation. Technical Report HKUST-CS96-31, Hong Kong University of Science and Technology, 1996.
- (1996) Planning in Stochastic Domains: Problem Characteristics and Approximation
- Zhang, N.¹ Liu, W.²

27
- 0036374229
- Speeding up the convergence of value iteration in partially observable Markov decision processes
- N. Zhang and W. Zhang. Speeding up the convergence of value-iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research, 14:29-51, 2001. (Pubitemid 33738058)
- (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 29-51
- Zhang, N.L.¹ Zhang, W.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.