SCOPUS Information Retrieval Platform

Artificial Intelligence

Volume 171, Issue 8-9, 2007, Pages 453-490

Partially observable Markov decision processes with imprecise parameters

(2) Itoh, Hideaki a Nakamura, Kiyohiko a

a TOKYO INSTITUTE OF TECHNOLOGY (Japan)

Author keywords

Parameter set; POMDP; Probability interval; Second order beliefs

Indexed keywords

ALGORITHMS; COMPUTATIONAL METHODS; COST EFFECTIVENESS; DECISION THEORY; OPTIMIZATION; PARAMETER ESTIMATION; PROBABILITY DISTRIBUTIONS;

PARAMETER SETS; PROBABILITY INTERVALS; SECOND-ORDER BELIEFS;

MARKOV PROCESSES;

EID: 34249672336 PISSN: 00043702 EISSN: None Source Type: Journal
DOI: 10.1016/j.artint.2007.03.004 Document Type: Article

Times cited : (49)

References (59)

1
- 34249696459
- D. Aberdeen, J. Baxter, Scaling internal-state policy-gradient methods for POMDPs, in: International Conference on Machine Learning (ICML-02), Sydney, Australia, July 2002, pp. 1-12

2
- 50549213583
- Optimal control of Markov decision processes with incomplete state estimation
- Åström K.J. Optimal control of Markov decision processes with incomplete state estimation. Journal of Mathematical Analysis and Applications 10 (1965) 174-205
- (1965) Journal of Mathematical Analysis and Applications , vol.10 , pp. 174-205
- Åström, K.J.¹

3
- 34249727744
- T. Augustin, On the suboptimality of the generalized Bayes rule and robust Bayesian procedures from the decision theoretic point of view - a cautionary note on updating imprecise priors, in: Proceedings of 3rd International Symposium on Imprecise Probabilities and their Applications (ISIPTA-03), 2003

4
- 0003787146
- Princeton Univ. Press, Princeton, NJ
- Bellman R. Dynamic Programming (1957), Princeton Univ. Press, Princeton, NJ
- (1957) Dynamic Programming
- Bellman, R.¹

5
- 34249649587
- Bernard J.M., Seidenfeld T., and Zaffalon M. (Eds), Carleton Scientific
- In: Bernard J.M., Seidenfeld T., and Zaffalon M. (Eds). Proceedings of the Third International Symposium on Imprecise Probabilities and Their Applications (2003), Carleton Scientific
- (2003) Proceedings of the Third International Symposium on Imprecise Probabilities and Their Applications

6
- 0003565783
- Athena Scientific, Belmont, MA
- Bertsekas D.P. Dynamic Programming and Optimal Control, vol. 2. second ed. (2001), Athena Scientific, Belmont, MA
- (2001) Dynamic Programming and Optimal Control, vol. 2. second ed.
- Bertsekas, D.P.¹

7
- 31144460375
- An epsilon-optimal grid-based algorithm for partially observable Markov decision processes
- Morgan Kaufmann
- Bonet B. An epsilon-optimal grid-based algorithm for partially observable Markov decision processes. Proc. 19th International Conf. on Machine Learning (ICML-02) (2002), Morgan Kaufmann 51-58
- (2002) Proc. 19th International Conf. on Machine Learning (ICML-02) , pp. 51-58
- Bonet, B.¹

8
- 0346942368
- Decision-theoretic planning: Structural assumptions and computational leverage
- Boutilier C., Dean T., and Hanks S. Decision-theoretic planning: Structural assumptions and computational leverage. Journal of Artificial Intelligence Research 11 (1999) 1-94
- (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 1-94
- Boutilier, C.¹ Dean, T.² Hanks, S.³

9
- 0030349220
- Computing optimal policies for partially observable decision processes using compact representations
- Portland, OR, AAAI Press/The MIT Press
- Boutilier C., and Poole D. Computing optimal policies for partially observable decision processes using compact representations. Proceedings of the Thirteenth National Conference on Artificial Intelligence (AAAI-96). Portland, OR (1996), AAAI Press/The MIT Press 1168-1175
- (1996) Proceedings of the Thirteenth National Conference on Artificial Intelligence (AAAI-96) , pp. 1168-1175
- Boutilier, C.¹ Poole, D.²

10
- 0008621136
- Decision making with interval influence diagrams
- New York, Elsevier Science
- Breese J., and Fertig K. Decision making with interval influence diagrams. Proceedings of the 6th Annual Conference on Uncertainty in Artificial Intelligence (UAI-91). New York (1991), Elsevier Science 467-478
- (1991) Proceedings of the 6th Annual Conference on Uncertainty in Artificial Intelligence (UAI-91) , pp. 467-478
- Breese, J.¹ Fertig, K.²

11
- 0001909869
- Incremental Pruning: A simple, fast, exact method for partially observable Markov decision processes
- Geiger D., and Shenoy P.P. (Eds). San Francisco, CA, Morgan Kaufmann
- Cassandra A., Littman M.L., and Zhang N.L. Incremental Pruning: A simple, fast, exact method for partially observable Markov decision processes. In: Geiger D., and Shenoy P.P. (Eds). Proceedings of the Thirteenth Annual Conference on Uncertainty in Artificial Intelligence (UAI-97). San Francisco, CA (1997), Morgan Kaufmann 54-61
- (1997) Proceedings of the Thirteenth Annual Conference on Uncertainty in Artificial Intelligence (UAI-97) , pp. 54-61
- Cassandra, A.¹ Littman, M.L.² Zhang, N.L.³

12
- 0008637688
- Independence with lower and upper probabilities
- San Francisco, CA, Morgan Kaufmann
- Chrisman L. Independence with lower and upper probabilities. Proceedings of the 12th Annual Conference on Uncertainty in Artificial Intelligence (UAI-96). San Francisco, CA (1996), Morgan Kaufmann 169-177
- (1996) Proceedings of the 12th Annual Conference on Uncertainty in Artificial Intelligence (UAI-96) , pp. 169-177
- Chrisman, L.¹

13
- 0010358630
- Credal networks
- Cozman F.G. Credal networks. Artificial Intelligence 120 (2000) 199-233
- (2000) Artificial Intelligence , vol.120 , pp. 199-233
- Cozman, F.G.¹

14
- 0009236173
- Quasi-Bayesian strategies for efficient plan generation: application to the 'planning to observe' problem
- Horvitz E., and Jensen F.V. (Eds). San Francisco, CA, Morgan Kaufmann
- Cozman F.G., and Krotkov E. Quasi-Bayesian strategies for efficient plan generation: application to the 'planning to observe' problem. In: Horvitz E., and Jensen F.V. (Eds). Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI-96). San Francisco, CA (1996), Morgan Kaufmann 186-193
- (1996) Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI-96) , pp. 186-193
- Cozman, F.G.¹ Krotkov, E.²

15
- 34249743120
- A. Drake, Observation of a Markov process through a noisy channel, PhD thesis, Massachusetts Institute of Technology, 1962

16
- 34249726620
- Z. Feng, E.A. Hansen, Approximate planning for factored POMDPs, in: Proceedings of the 6th European Conference on Planning (ECP-01), Toledo, Spain, September 2001

17
- 85012579920
- Interval influence diagrams
- New York, Elsevier Science
- Fertig K., and Breese J. Interval influence diagrams. Proceedings of the 5th Annual Conference on Uncertainty in Artificial Intelligence (UAI-90) (1990), New York, Elsevier Science 149-161
- (1990) Proceedings of the 5th Annual Conference on Uncertainty in Artificial Intelligence (UAI-90) , pp. 149-161
- Fertig, K.¹ Breese, J.²

18
- 0027554261
- Probability intervals over influence diagrams
- Fertig K.W., and Breese J.S. Probability intervals over influence diagrams. IEEE Transactions on Pattern Analysis and Machine Intelligence 15 3 (1993) 280-286
- (1993) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.15 , Issue.3 , pp. 280-286
- Fertig, K.W.¹ Breese, J.S.²

19
- 0041153021
- A theory of higher order probabilities
- Morgan Kaufmann
- Gaifman H. A theory of higher order probabilities. Proceedings of the 1986 Conference on Theoretical Aspects of Reasoning about Knowledge (1986), Morgan Kaufmann 275-292
- (1986) Proceedings of the 1986 Conference on Theoretical Aspects of Reasoning about Knowledge , pp. 275-292
- Gaifman, H.¹

20
- 0034272032
- Bounded-parameter Markov decision processes
- Givan R., Leach S.M., and Dean T. Bounded-parameter Markov decision processes. Artificial Intelligence 122 1-2 (2000) 71-109
- (2000) Artificial Intelligence , vol.122 , Issue.1-2 , pp. 71-109
- Givan, R.¹ Leach, S.M.² Dean, T.³

21
- 0003697868
- University of Minnesota Press, Minneapolis
- Good I.J. Good Thinking: The Foundations of Probability and its Applications (1983), University of Minnesota Press, Minneapolis
- (1983) Good Thinking: The Foundations of Probability and its Applications
- Good, I.J.¹

22
- 18644375926
- Updating sets of probabilities
- San Francisco, CA, Morgan Kaufmann
- Grove A.J., and Halpern J.Y. Updating sets of probabilities. Proceedings of the 14th Annual Conference on Uncertainty in Artificial Intelligence (UAI-98). San Francisco, CA (1998), Morgan Kaufmann 173-182
- (1998) Proceedings of the 14th Annual Conference on Uncertainty in Artificial Intelligence (UAI-98) , pp. 173-182
- Grove, A.J.¹ Halpern, J.Y.²

23
- 0009089682
- Theoretical foundations for abstraction-based probabilistic planning
- San Francisco, CA, Morgan Kaufmann
- Ha V., and Haddawy P. Theoretical foundations for abstraction-based probabilistic planning. Proceedings of the 12th Annual Conference on Uncertainty in Artificial Intelligence (UAI-96). San Francisco, CA (1996), Morgan Kaufmann 291-298
- (1996) Proceedings of the 12th Annual Conference on Uncertainty in Artificial Intelligence (UAI-96) , pp. 291-298
- Ha, V.¹ Haddawy, P.²

24
- 34249708923
- E.A. Hansen, Solving POMDPs by searching in policy space, in: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98), 1998, pp. 211-219

25
- 34249748352
- E.A. Hansen, Z. Feng, Dynamic programming for POMDPs using a factored state representation, in: Artificial Intelligence Planning Systems (AIPS-00), 2000, pp. 130-139

26
- 34249740981
- E.A. Hansen, R. Zhou, Synthesis of hierarchical finite-state controllers for POMDPs, in: Thirteenth International Conference on Automated Planning and Scheduling (ICAPS-03), June 2003

27
- 0037097188
- Generalizing Markov decision processes to imprecise probabilities
- Harmanec D. Generalizing Markov decision processes to imprecise probabilities. Journal of Statistical Planning and Inference 105 (2002) 199-213
- (2002) Journal of Statistical Planning and Inference , vol.105 , pp. 199-213
- Harmanec, D.¹

28
- 0001770240
- Value-function approximations for partially observable Markov decision processes
- Hauskrecht M. Value-function approximations for partially observable Markov decision processes. Journal of Artificial Intelligence Research 13 (2000) 33-94
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 33-94
- Hauskrecht, M.¹

29
- 0034160101
- Planning treatment of ischemic heart disease with partially observable Markov decision processes
- Hauskrecht M., and Fraser H. Planning treatment of ischemic heart disease with partially observable Markov decision processes. Artificial Intelligence in Medicine 18 (2000) 221-244
- (2000) Artificial Intelligence in Medicine , vol.18 , pp. 221-244
- Hauskrecht, M.¹ Fraser, H.²

30
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling L.P., Littman M.L., and Cassandra A.R. Planning and acting in partially observable stochastic domains. Artificial Intelligence 101 (1998) 99-134
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

31
- 51249181779
- A new polynomial-time algorithm for linear programming
- Karmarkar N. A new polynomial-time algorithm for linear programming. Combinatorica 4 (1984) 373-395
- (1984) Combinatorica , vol.4 , pp. 373-395
- Karmarkar, N.¹

32
- 0030150627
- An introduction to issues in higher order uncertainty
- Lehner P.E., Laskey K.B., and Dubois D. An introduction to issues in higher order uncertainty. IEEE Transactions on Systems, Man and Cybernetics, Part A 26 3 (1996) 289-293
- (1996) IEEE Transactions on Systems, Man and Cybernetics, Part A , vol.26 , Issue.3 , pp. 289-293
- Lehner, P.E.¹ Laskey, K.B.² Dubois, D.³

33
- 0040069490
- On indeterminate probabilities
- Levi I. On indeterminate probabilities. Journal of Philosophy 71 (1974) 391-418
- (1974) Journal of Philosophy , vol.71 , pp. 391-418
- Levi, I.¹

34
- 0004045895
- MIT Press, Cambridge, MA
- Levi I. The Enterprise of Knowledge (1980), MIT Press, Cambridge, MA
- (1980) The Enterprise of Knowledge
- Levi, I.¹

35
- 0002679852
- A survey of algorithmic methods for partially observed Markov decision processes
- Lovejoy W.S. A survey of algorithmic methods for partially observed Markov decision processes. Annals of Operations Research 28 (1991) 47-66
- (1991) Annals of Operations Research , vol.28 , pp. 47-66
- Lovejoy, W.S.¹

36
- 0036374190
- Nonapproximability results for partially observable Markov decision processes
- Lusena C., Goldsmith J., and Mundhenk M. Nonapproximability results for partially observable Markov decision processes. Journal of Artificial Intelligence Research 14 (2001) 83-103
- (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 83-103
- Lusena, C.¹ Goldsmith, J.² Mundhenk, M.³

37
- 34249649586
- D.A. McAllester, S. Singh, Approximate planning for factored POMDPs using belief state simplification, in: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI-99), 1999, pp. 409-416

38
- 0036931186
- M. Montemerlo, J. Pineau, N. Roy, S. Thrun, V. Verma, Experiences with a mobile robotic guide for the elderly, in: Proceedings of the National Conference of Artificial Intelligence (AAAI-02), Edmonton, AB, July 2002, pp. 587-592

39
- 13244260002
- Robustness in Markov decision problems with uncertain transition matrices
- MIT Press, Cambridge, MA
- Nilim A., and El-Ghaoui L. Robustness in Markov decision problems with uncertain transition matrices. Advances in Neural Information Processing Systems 16 (NIPS-03) (2004), MIT Press, Cambridge, MA
- (2004) Advances in Neural Information Processing Systems 16 (NIPS-03)
- Nilim, A.¹ El-Ghaoui, L.²

40
- 14344250395
- Robust control of Markov decision processes with uncertain transition matrices
- Nilim A., and El-Ghaoui L. Robust control of Markov decision processes with uncertain transition matrices. Operations Research 53 (2005) 780-798
- (2005) Operations Research , vol.53 , pp. 780-798
- Nilim, A.¹ El-Ghaoui, L.²

41
- 0342715115
- Second order probabilities for uncertain and conflicting evidence
- New York, Elsevier Science
- Paaß G. Second order probabilities for uncertain and conflicting evidence. Proceedings of the 6th Annual Conference on Uncertainty in Artificial Intelligence (UAI-91). New York (1991), Elsevier Science 447-456
- (1991) Proceedings of the 6th Annual Conference on Uncertainty in Artificial Intelligence (UAI-91) , pp. 447-456
- Paaß, G.¹

42
- 0000977910
- The complexity of Markov decision processes
- Papadimitriou C.H., and Tsitsiklis J.N. The complexity of Markov decision processes. Mathematics of Operations Research 12 3 (1987) 441-450
- (1987) Mathematics of Operations Research , vol.12 , Issue.3 , pp. 441-450
- Papadimitriou, C.H.¹ Tsitsiklis, J.N.²

43
- 34249697523
- J. Pineau, Tractable planning under uncertainty: Exploiting structure, PhD thesis, Robotics Institute, Carnegie Mellon University, Pittsburgh, PA, 2004

44
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- AAAI Press, Menlo Park, CA
- Pineau J., Gordon G., and Thrun S. Point-based value iteration: An anytime algorithm for POMDPs. Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI-03) (2003), AAAI Press, Menlo Park, CA
- (2003) Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI-03)
- Pineau, J.¹ Gordon, G.² Thrun, S.³

45
- 34249677829
- P. Poupart, Exploiting structure to efficiently solve large scale partially observable Markov decision processes, PhD thesis, Department of Computer Science, University of Toronto, Toronto, Ontario, Canada, 2005

46
- 84898959164
- Bounded finite state controllers
- MIT Press, Cambridge, MA
- Poupart P., and Boutilier C. Bounded finite state controllers. Advances in Neural Information Processing Systems 16 (NIPS-03) (2004), MIT Press, Cambridge, MA
- (2004) Advances in Neural Information Processing Systems 16 (NIPS-03)
- Poupart, P.¹ Boutilier, C.²

47
- 31144457984
- VDCBPI: An approximate scalable algorithm for large scale POMDPs
- MIT Press, Cambridge, MA
- Poupart P., and Boutilier C. VDCBPI: An approximate scalable algorithm for large scale POMDPs. Advances in Neural Information Processing Systems 17 (NIPS-04) (2005), MIT Press, Cambridge, MA
- (2005) Advances in Neural Information Processing Systems 17 (NIPS-04)
- Poupart, P.¹ Boutilier, C.²

48
- 0015630091
- Markovian decision processes with uncertain transition probabilities
- Satia J.K., and Lave R.E. Markovian decision processes with uncertain transition probabilities. Operations Research 21 (1973) 728-740
- (1973) Operations Research , vol.21 , pp. 728-740
- Satia, J.K.¹ Lave, R.E.²

49
- 0025400328
- Two perspectives on consensus for (Bayesian) inference and decisions
- Seidenfeld T., and Schervish M.J. Two perspectives on consensus for (Bayesian) inference and decisions. IEEE Transactions on Systems, Man and Cybernetics 20 2 (1990) 318-325
- (1990) IEEE Transactions on Systems, Man and Cybernetics , vol.20 , Issue.2 , pp. 318-325
- Seidenfeld, T.¹ Schervish, M.J.²

50
- 0004209735
- Princeton Univ. Press, Princeton, NJ
- Shafer G. A Mathematical Theory of Evidence (1976), Princeton Univ. Press, Princeton, NJ
- (1976) A Mathematical Theory of Evidence
- Shafer, G.¹

51
- 34249714722
- E.J. Sondik, The optimal control of partially observable Markov processes, PhD thesis, Stanford University, 1971

52
- 31144472319
- Perseus: Randomized point-based value iteration for POMDPs
- Spaan M.T.J., and Vlassis N. Perseus: Randomized point-based value iteration for POMDPs. Journal of Artificial Intelligence Research 24 (2005) 195-220
- (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 195-220
- Spaan, M.T.J.¹ Vlassis, N.²

53
- 34249743118
- N. Vlassis, M.T.J. Spaan, A fast point-based algorithm for POMDPs, in: Benelearn 2004: Proceedings of the Annual Machine Learning Conference of Belgium and the Netherlands, Brussels, Belgium, 2004, pp. 170-176

54
- 0003649050
- Chapman and Hall, London
- Walley P. Statistical Reasoning with Imprecise Probabilities (1991), Chapman and Hall, London
- (1991) Statistical Reasoning with Imprecise Probabilities
- Walley, P.¹

55
- 0022581409
- Parameter imprecision in finite state, finite action dynamic programs
- White C.C., and Eldeib H.K. Parameter imprecision in finite state, finite action dynamic programs. Operations Research 34 (1986) 120-129
- (1986) Operations Research , vol.34 , pp. 120-129
- White, C.C.¹ Eldeib, H.K.²

56
- 0028460403
- Markov decision processes with imprecise transition probabilities
- White C.C., and Eldeib H.K. Markov decision processes with imprecise transition probabilities. Operations Research 42 (1994) 739-749
- (1994) Operations Research , vol.42 , pp. 739-749
- White, C.C.¹ Eldeib, H.K.²

57
- 0036374229
- Speeding up the convergence of value iteration in partially observable Markov decision processes
- Zhang N.L., and Zhang W. Speeding up the convergence of value iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research 14 (2001) 29-51
- (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 29-51
- Zhang, N.L.¹ Zhang, W.²

58
- 27344454651
- Restricted value iteration: Theory and algorithms
- Zhang W., and Zhang N.L. Restricted value iteration: Theory and algorithms. Journal of Artificial Intelligence Research 23 (2005) 123-165
- (2005) Journal of Artificial Intelligence Research , vol.23 , pp. 123-165
- Zhang, W.¹ Zhang, N.L.²

59
- 84880904402
- R. Zhou, E.A. Hansen, An improved grid-based approximation algorithm for POMDPs, in: Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI-01), 2001, pp. 707-716

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.