메뉴 건너뛰기




Volumn 171, Issue 8-9, 2007, Pages 453-490

Partially observable Markov decision processes with imprecise parameters

Author keywords

Parameter set; POMDP; Probability interval; Second order beliefs

Indexed keywords

ALGORITHMS; COMPUTATIONAL METHODS; COST EFFECTIVENESS; DECISION THEORY; OPTIMIZATION; PARAMETER ESTIMATION; PROBABILITY DISTRIBUTIONS;

EID: 34249672336     PISSN: 00043702     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.artint.2007.03.004     Document Type: Article
Times cited : (49)

References (59)
  • 1
    • 34249696459 scopus 로고    scopus 로고
    • D. Aberdeen, J. Baxter, Scaling internal-state policy-gradient methods for POMDPs, in: International Conference on Machine Learning (ICML-02), Sydney, Australia, July 2002, pp. 1-12
  • 2
    • 50549213583 scopus 로고
    • Optimal control of Markov decision processes with incomplete state estimation
    • Aström K.J. Optimal control of Markov decision processes with incomplete state estimation. Journal of Mathematical Analysis and Applications 10 (1965) 174-205
    • (1965) Journal of Mathematical Analysis and Applications , vol.10 , pp. 174-205
    • Aström, K.J.1
  • 3
    • 34249727744 scopus 로고    scopus 로고
    • T. Augustin, On the suboptimality of the generalized Bayes rule and robust Bayesian procedures from the decision theoretic point of view-a cautionary note on updating imprecise priors, in: Proceedings of 3rd International Symposium on Imprecise Probabilities and their Applications (ISIPTA-03), 2003
  • 4
    • 0003787146 scopus 로고
    • Princeton Univ. Press, Princeton, NJ
    • Bellman R. Dynamic Programming (1957), Princeton Univ. Press, Princeton, NJ
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 7
    • 31144460375 scopus 로고    scopus 로고
    • An epsilon-optimal grid-based algorithm for partially observable Markov decision processes
    • Morgan Kaufmann
    • Bonet B. An epsilon-optimal grid-based algorithm for partially observable Markov decision processes. Proc. 19th International Conf. on Machine Learning (ICML-02) (2002), Morgan Kaufmann 51-58
    • (2002) Proc. 19th International Conf. on Machine Learning (ICML-02) , pp. 51-58
    • Bonet, B.1
  • 8
  • 9
    • 0030349220 scopus 로고    scopus 로고
    • Computing optimal policies for partially observable decision processes using compact representations
    • Portland, OR, AAAI Press/The MIT Press
    • Boutilier C., and Poole D. Computing optimal policies for partially observable decision processes using compact representations. Proceedings of the Thirteenth National Conference on Artificial Intelligence (AAAI-96). Portland, OR (1996), AAAI Press/The MIT Press 1168-1175
    • (1996) Proceedings of the Thirteenth National Conference on Artificial Intelligence (AAAI-96) , pp. 1168-1175
    • Boutilier, C.1    Poole, D.2
  • 14
    • 0009236173 scopus 로고    scopus 로고
    • Quasi-Bayesian strategies for efficient plan generation: application to the 'planning to observe' problem
    • Horvitz E., and Jensen F.V. (Eds). San Francisco, CA, Morgan Kaufmann
    • Cozman F.G., and Krotkov E. Quasi-Bayesian strategies for efficient plan generation: application to the 'planning to observe' problem. In: Horvitz E., and Jensen F.V. (Eds). Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI-96). San Francisco, CA (1996), Morgan Kaufmann 186-193
    • (1996) Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI-96) , pp. 186-193
    • Cozman, F.G.1    Krotkov, E.2
  • 15
    • 34249743120 scopus 로고    scopus 로고
    • A. Drake, Observation of a Markov process through a noisy channel, PhD thesis, Massachusetts Institute of Technology, 1962
  • 16
    • 34249726620 scopus 로고    scopus 로고
    • Z. Feng, E.A. Hansen, Approximate planning for factored POMDPs, in: Proceedings of the 6th European Conference on Planning (ECP-01), Toledo, Spain, September 2001
  • 20
    • 0034272032 scopus 로고    scopus 로고
    • Bounded-parameter Markov decision processes
    • Givan R., Leach S.M., and Dean T. Bounded-parameter Markov decision processes. Artificial Intelligence 122 1-2 (2000) 71-109
    • (2000) Artificial Intelligence , vol.122 , Issue.1-2 , pp. 71-109
    • Givan, R.1    Leach, S.M.2    Dean, T.3
  • 24
    • 34249708923 scopus 로고    scopus 로고
    • E.A. Hansen, Solving POMDPs by searching in policy space, in: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98), 1998, pp. 211-219
  • 25
    • 34249748352 scopus 로고    scopus 로고
    • E.A. Hansen, Z. Feng, Dynamic programming for POMDPs using a factored state representation, in: Artificial Intelligence Planning Systems (AIPS-00), 2000, pp. 130-139
  • 26
    • 34249740981 scopus 로고    scopus 로고
    • E.A. Hansen, R. Zhou, Synthesis of hierarchical finite-state controllers for POMDPs, in: Thirteenth International Conference on Automated Planning and Scheduling (ICAPS-03), June 2003
  • 27
    • 0037097188 scopus 로고    scopus 로고
    • Generalizing Markov decision processes to imprecise probabilities
    • Harmanec D. Generalizing Markov decision processes to imprecise probabilities. Journal of Statistical Planning and Inference 105 (2002) 199-213
    • (2002) Journal of Statistical Planning and Inference , vol.105 , pp. 199-213
    • Harmanec, D.1
  • 28
    • 0001770240 scopus 로고    scopus 로고
    • Value-function approximations for partially observable Markov decision processes
    • Hauskrecht M. Value-function approximations for partially observable Markov decision processes. Journal of Artificial Intelligence Research 13 (2000) 33-94
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 33-94
    • Hauskrecht, M.1
  • 29
    • 0034160101 scopus 로고    scopus 로고
    • Planning treatment of ischemic heart disease with partially observable Markov decision processes
    • Hauskrecht M., and Fraser H. Planning treatment of ischemic heart disease with partially observable Markov decision processes. Artificial Intelligence in Medicine 18 (2000) 221-244
    • (2000) Artificial Intelligence in Medicine , vol.18 , pp. 221-244
    • Hauskrecht, M.1    Fraser, H.2
  • 31
    • 51249181779 scopus 로고
    • A new polynomial-time algorithm for linear programming
    • Karmarkar N. A new polynomial-time algorithm for linear programming. Combinatorica 4 (1984) 373-395
    • (1984) Combinatorica , vol.4 , pp. 373-395
    • Karmarkar, N.1
  • 33
    • 0040069490 scopus 로고
    • On indeterminate probabilities
    • Levi I. On indeterminate probabilities. Journal of Philosophy 71 (1974) 391-418
    • (1974) Journal of Philosophy , vol.71 , pp. 391-418
    • Levi, I.1
  • 35
    • 0002679852 scopus 로고
    • A survey of algorithmic methods for partially observed Markov decision processes
    • Lovejoy W.S. A survey of algorithmic methods for partially observed Markov decision processes. Annals of Operations Research 28 (1991) 47-66
    • (1991) Annals of Operations Research , vol.28 , pp. 47-66
    • Lovejoy, W.S.1
  • 37
    • 34249649586 scopus 로고    scopus 로고
    • D.A. McAllester, S. Singh, Approximate planning for factored POMDPs using belief state simplification, in: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI-99), 1999, pp. 409-416
  • 38
    • 0036931186 scopus 로고    scopus 로고
    • M. Montemerlo, J. Pineau, N. Roy, S. Thrun, V. Verma, Experiences with a mobile robotic guide for the elderly, in: Proceedings of the National Conference of Artificial Intelligence (AAAI-02), Edmonton, AB, July 2002, pp. 587-592
  • 40
    • 14344250395 scopus 로고    scopus 로고
    • Robust control of Markov decision processes with uncertain transition matrices
    • Nilim A., and El-Ghaoui L. Robust control of Markov decision processes with uncertain transition matrices. Operations Research 53 (2005) 780-798
    • (2005) Operations Research , vol.53 , pp. 780-798
    • Nilim, A.1    El-Ghaoui, L.2
  • 43
    • 34249697523 scopus 로고    scopus 로고
    • J. Pineau, Tractable planning under uncertainty: Exploiting structure, PhD thesis, Robotics Institute, Carnegie Mellon University, Pittsburgh, PA, 2004
  • 45
    • 34249677829 scopus 로고    scopus 로고
    • P. Poupart, Exploiting structure to efficiently solve large scale partially observable Markov decision processes, PhD thesis, Department of Computer Science, University of Toronto, Toronto, Ontario, Canada, 2005
  • 48
    • 0015630091 scopus 로고
    • Markovian decision processes with uncertain transition probabilities
    • Satia J.K., and Lave R.E. Markovian decision processes with uncertain transition probabilities. Operations Research 21 (1973) 728-740
    • (1973) Operations Research , vol.21 , pp. 728-740
    • Satia, J.K.1    Lave, R.E.2
  • 51
    • 34249714722 scopus 로고    scopus 로고
    • E.J. Sondik, The optimal control of partially observable Markov processes, PhD thesis, Stanford University, 1971
  • 53
    • 34249743118 scopus 로고    scopus 로고
    • N. Vlassis, M.T.J. Spaan, A fast point-based algorithm for POMDPs, in: Benelearn 2004: Proceedings of the Annual Machine Learning Conference of Belgium and the Netherlands, Brussels, Belgium, 2004, pp. 170-176
  • 55
    • 0022581409 scopus 로고
    • Parameter imprecision in finite state, finite action dynamic programs
    • White C.C., and Eldeib H.K. Parameter imprecision in finite state, finite action dynamic programs. Operations Research 34 (1986) 120-129
    • (1986) Operations Research , vol.34 , pp. 120-129
    • White, C.C.1    Eldeib, H.K.2
  • 56
    • 0028460403 scopus 로고
    • Markov decision processes with imprecise transition probabilities
    • White C.C., and Eldeib H.K. Markov decision processes with imprecise transition probabilities. Operations Research 43 (1994) 739-749
    • (1994) Operations Research , vol.43 , pp. 739-749
    • White, C.C.1    Eldeib, H.K.2
  • 57
    • 0036374229 scopus 로고    scopus 로고
    • Speeding up the convergence of value iteration in partially observable Markov decision processes
    • Zhang N.L., and Zhang W. Speeding up the convergence of value iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research 14 (2001) 29-51
    • (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 29-51
    • Zhang, N.L.1    Zhang, W.2
  • 59
    • 84880904402 scopus 로고    scopus 로고
    • R. Zhou, E.A. Hansen, An improved grid-based approximation algorithm for POMDPs, in: Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI-01), 2001, pp. 707-716


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.