-
1
-
-
34249696459
-
-
D. Aberdeen, J. Baxter, Scaling internal-state policy-gradient methods for POMDPs, in: International Conference on Machine Learning (ICML-02), Sydney, Australia, July 2002, pp. 1-12
-
-
-
-
2
-
-
50549213583
-
Optimal control of Markov decision processes with incomplete state estimation
-
Aström K.J. Optimal control of Markov decision processes with incomplete state estimation. Journal of Mathematical Analysis and Applications 10 (1965) 174-205
-
(1965)
Journal of Mathematical Analysis and Applications
, vol.10
, pp. 174-205
-
-
Aström, K.J.1
-
3
-
-
34249727744
-
-
T. Augustin, On the suboptimality of the generalized Bayes rule and robust Bayesian procedures from the decision theoretic point of view-a cautionary note on updating imprecise priors, in: Proceedings of 3rd International Symposium on Imprecise Probabilities and their Applications (ISIPTA-03), 2003
-
-
-
-
4
-
-
0003787146
-
-
Princeton Univ. Press, Princeton, NJ
-
Bellman R. Dynamic Programming (1957), Princeton Univ. Press, Princeton, NJ
-
(1957)
Dynamic Programming
-
-
Bellman, R.1
-
7
-
-
31144460375
-
An epsilon-optimal grid-based algorithm for partially observable Markov decision processes
-
Morgan Kaufmann
-
Bonet B. An epsilon-optimal grid-based algorithm for partially observable Markov decision processes. Proc. 19th International Conf. on Machine Learning (ICML-02) (2002), Morgan Kaufmann 51-58
-
(2002)
Proc. 19th International Conf. on Machine Learning (ICML-02)
, pp. 51-58
-
-
Bonet, B.1
-
9
-
-
0030349220
-
Computing optimal policies for partially observable decision processes using compact representations
-
Portland, OR, AAAI Press/The MIT Press
-
Boutilier C., and Poole D. Computing optimal policies for partially observable decision processes using compact representations. Proceedings of the Thirteenth National Conference on Artificial Intelligence (AAAI-96). Portland, OR (1996), AAAI Press/The MIT Press 1168-1175
-
(1996)
Proceedings of the Thirteenth National Conference on Artificial Intelligence (AAAI-96)
, pp. 1168-1175
-
-
Boutilier, C.1
Poole, D.2
-
11
-
-
0001909869
-
Incremental Pruning: A simple, fast, exact method for partially observable Markov decision processes
-
Geiger D., and Shenoy P.P. (Eds). San Francisco, CA, Morgan Kaufmann
-
Cassandra A., Littman M.L., and Zhang N.L. Incremental Pruning: A simple, fast, exact method for partially observable Markov decision processes. In: Geiger D., and Shenoy P.P. (Eds). Proceedings of the Thirteenth Annual Conference on Uncertainty in Artificial Intelligence (UAI-97). San Francisco, CA (1997), Morgan Kaufmann 54-61
-
(1997)
Proceedings of the Thirteenth Annual Conference on Uncertainty in Artificial Intelligence (UAI-97)
, pp. 54-61
-
-
Cassandra, A.1
Littman, M.L.2
Zhang, N.L.3
-
14
-
-
0009236173
-
Quasi-Bayesian strategies for efficient plan generation: application to the 'planning to observe' problem
-
Horvitz E., and Jensen F.V. (Eds). San Francisco, CA, Morgan Kaufmann
-
Cozman F.G., and Krotkov E. Quasi-Bayesian strategies for efficient plan generation: application to the 'planning to observe' problem. In: Horvitz E., and Jensen F.V. (Eds). Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI-96). San Francisco, CA (1996), Morgan Kaufmann 186-193
-
(1996)
Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI-96)
, pp. 186-193
-
-
Cozman, F.G.1
Krotkov, E.2
-
15
-
-
34249743120
-
-
A. Drake, Observation of a Markov process through a noisy channel, PhD thesis, Massachusetts Institute of Technology, 1962
-
-
-
-
16
-
-
34249726620
-
-
Z. Feng, E.A. Hansen, Approximate planning for factored POMDPs, in: Proceedings of the 6th European Conference on Planning (ECP-01), Toledo, Spain, September 2001
-
-
-
-
20
-
-
0034272032
-
Bounded-parameter Markov decision processes
-
Givan R., Leach S.M., and Dean T. Bounded-parameter Markov decision processes. Artificial Intelligence 122 1-2 (2000) 71-109
-
(2000)
Artificial Intelligence
, vol.122
, Issue.1-2
, pp. 71-109
-
-
Givan, R.1
Leach, S.M.2
Dean, T.3
-
24
-
-
34249708923
-
-
E.A. Hansen, Solving POMDPs by searching in policy space, in: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98), 1998, pp. 211-219
-
-
-
-
25
-
-
34249748352
-
-
E.A. Hansen, Z. Feng, Dynamic programming for POMDPs using a factored state representation, in: Artificial Intelligence Planning Systems (AIPS-00), 2000, pp. 130-139
-
-
-
-
26
-
-
34249740981
-
-
E.A. Hansen, R. Zhou, Synthesis of hierarchical finite-state controllers for POMDPs, in: Thirteenth International Conference on Automated Planning and Scheduling (ICAPS-03), June 2003
-
-
-
-
27
-
-
0037097188
-
Generalizing Markov decision processes to imprecise probabilities
-
Harmanec D. Generalizing Markov decision processes to imprecise probabilities. Journal of Statistical Planning and Inference 105 (2002) 199-213
-
(2002)
Journal of Statistical Planning and Inference
, vol.105
, pp. 199-213
-
-
Harmanec, D.1
-
28
-
-
0001770240
-
Value-function approximations for partially observable Markov decision processes
-
Hauskrecht M. Value-function approximations for partially observable Markov decision processes. Journal of Artificial Intelligence Research 13 (2000) 33-94
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 33-94
-
-
Hauskrecht, M.1
-
29
-
-
0034160101
-
Planning treatment of ischemic heart disease with partially observable Markov decision processes
-
Hauskrecht M., and Fraser H. Planning treatment of ischemic heart disease with partially observable Markov decision processes. Artificial Intelligence in Medicine 18 (2000) 221-244
-
(2000)
Artificial Intelligence in Medicine
, vol.18
, pp. 221-244
-
-
Hauskrecht, M.1
Fraser, H.2
-
31
-
-
51249181779
-
A new polynomial-time algorithm for linear programming
-
Karmarkar N. A new polynomial-time algorithm for linear programming. Combinatorica 4 (1984) 373-395
-
(1984)
Combinatorica
, vol.4
, pp. 373-395
-
-
Karmarkar, N.1
-
32
-
-
0030150627
-
An introduction to issues in higher order uncertainty
-
Lehner P.E., Laskey K.B., and Dubois D. An introduction to issues in higher order uncertainty. IEEE Transactions on Systems, Man and Cybernetics, Part A 26 3 (1996) 289-293
-
(1996)
IEEE Transactions on Systems, Man and Cybernetics, Part A
, vol.26
, Issue.3
, pp. 289-293
-
-
Lehner, P.E.1
Laskey, K.B.2
Dubois, D.3
-
33
-
-
0040069490
-
On indeterminate probabilities
-
Levi I. On indeterminate probabilities. Journal of Philosophy 71 (1974) 391-418
-
(1974)
Journal of Philosophy
, vol.71
, pp. 391-418
-
-
Levi, I.1
-
35
-
-
0002679852
-
A survey of algorithmic methods for partially observed Markov decision processes
-
Lovejoy W.S. A survey of algorithmic methods for partially observed Markov decision processes. Annals of Operations Research 28 (1991) 47-66
-
(1991)
Annals of Operations Research
, vol.28
, pp. 47-66
-
-
Lovejoy, W.S.1
-
37
-
-
34249649586
-
-
D.A. McAllester, S. Singh, Approximate planning for factored POMDPs using belief state simplification, in: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI-99), 1999, pp. 409-416
-
-
-
-
38
-
-
0036931186
-
-
M. Montemerlo, J. Pineau, N. Roy, S. Thrun, V. Verma, Experiences with a mobile robotic guide for the elderly, in: Proceedings of the National Conference of Artificial Intelligence (AAAI-02), Edmonton, AB, July 2002, pp. 587-592
-
-
-
-
40
-
-
14344250395
-
Robust control of Markov decision processes with uncertain transition matrices
-
Nilim A., and El-Ghaoui L. Robust control of Markov decision processes with uncertain transition matrices. Operations Research 53 (2005) 780-798
-
(2005)
Operations Research
, vol.53
, pp. 780-798
-
-
Nilim, A.1
El-Ghaoui, L.2
-
43
-
-
34249697523
-
-
J. Pineau, Tractable planning under uncertainty: Exploiting structure, PhD thesis, Robotics Institute, Carnegie Mellon University, Pittsburgh, PA, 2004
-
-
-
-
44
-
-
84880772945
-
Point-based value iteration: An anytime algorithm for POMDPs
-
AAAI Press, Menlo Park, CA
-
Pineau J., Gordon G., and Thrun S. Point-based value iteration: An anytime algorithm for POMDPs. Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI-03) (2003), AAAI Press, Menlo Park, CA
-
(2003)
Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI-03)
-
-
Pineau, J.1
Gordon, G.2
Thrun, S.3
-
45
-
-
34249677829
-
-
P. Poupart, Exploiting structure to efficiently solve large scale partially observable Markov decision processes, PhD thesis, Department of Computer Science, University of Toronto, Toronto, Ontario, Canada, 2005
-
-
-
-
48
-
-
0015630091
-
Markovian decision processes with uncertain transition probabilities
-
Satia J.K., and Lave R.E. Markovian decision processes with uncertain transition probabilities. Operations Research 21 (1973) 728-740
-
(1973)
Operations Research
, vol.21
, pp. 728-740
-
-
Satia, J.K.1
Lave, R.E.2
-
51
-
-
34249714722
-
-
E.J. Sondik, The optimal control of partially observable Markov processes, PhD thesis, Stanford University, 1971
-
-
-
-
53
-
-
34249743118
-
-
N. Vlassis, M.T.J. Spaan, A fast point-based algorithm for POMDPs, in: Benelearn 2004: Proceedings of the Annual Machine Learning Conference of Belgium and the Netherlands, Brussels, Belgium, 2004, pp. 170-176
-
-
-
-
55
-
-
0022581409
-
Parameter imprecision in finite state, finite action dynamic programs
-
White C.C., and Eldeib H.K. Parameter imprecision in finite state, finite action dynamic programs. Operations Research 34 (1986) 120-129
-
(1986)
Operations Research
, vol.34
, pp. 120-129
-
-
White, C.C.1
Eldeib, H.K.2
-
56
-
-
0028460403
-
Markov decision processes with imprecise transition probabilities
-
White C.C., and Eldeib H.K. Markov decision processes with imprecise transition probabilities. Operations Research 43 (1994) 739-749
-
(1994)
Operations Research
, vol.43
, pp. 739-749
-
-
White, C.C.1
Eldeib, H.K.2
-
57
-
-
0036374229
-
Speeding up the convergence of value iteration in partially observable Markov decision processes
-
Zhang N.L., and Zhang W. Speeding up the convergence of value iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research 14 (2001) 29-51
-
(2001)
Journal of Artificial Intelligence Research
, vol.14
, pp. 29-51
-
-
Zhang, N.L.1
Zhang, W.2
-
59
-
-
84880904402
-
-
R. Zhou, E.A. Hansen, An improved grid-based approximation algorithm for POMDPs, in: Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI-01), 2001, pp. 707-716
-
-
-
|