-
1
-
-
50549213583
-
Optimal control of Markov Decision Processes with incomplete state estimation
-
K. Astrom. Optimal control of Markov Decision Processes with incomplete state estimation. J. Math. Anal. Appl., 10:174-205, 1965.
-
(1965)
J. Math. Anal. Appl.
, vol.10
, pp. 174-205
-
-
Astrom, K.1
-
2
-
-
0029210635
-
Learning to act using real-time dynamic programming
-
A. Barto, S. Bradtke, and S. Singh. Learning to act using real-time dynamic programming. Art. Int., 72:81-138, 1995.
-
(1995)
Art. Int.
, vol.72
, pp. 81-138
-
-
Barto, A.1
Bradtke, S.2
Singh, S.3
-
5
-
-
0002098456
-
Learning sorting and decision trees with POMDPs
-
B. Bonet and H. Geffner. Learning sorting and decision trees with POMDPs. In Proc. ICML, pages 73-81, 1998.
-
(1998)
Proc. ICML
, pp. 73-81
-
-
Bonet, B.1
Geffner, H.2
-
6
-
-
0012252088
-
Solving large POMDPs using real time dynamic programming
-
AAAI Press
-
B. Bonet and H. Geffner. Solving large POMDPs using real time dynamic programming. In Proc. AAAI Fall Symp. on POMDPs. AAAI Press, 1998.
-
(1998)
Proc. AAAI Fall Symp. on POMDPs
-
-
Bonet, B.1
Geffner, H.2
-
7
-
-
85166261608
-
Planning with incomplete information as heuristic search in belief space
-
B. Bonet and H. Geffner. Planning with incomplete information as heuristic search in belief space. In Proc. ICAPS, pages 52-61, 2000.
-
(2000)
Proc. ICAPS
, pp. 52-61
-
-
Bonet, B.1
Geffner, H.2
-
8
-
-
31144460375
-
An ε-optimal grid-based algorithm for Partially Observable Markov Decision Processes
-
B. Bonet. An ε-optimal grid-based algorithm for Partially Observable Markov Decision Processes. In Proc. ICML, pages 51-58, 2002.
-
(2002)
Proc. ICML
, pp. 51-58
-
-
Bonet, B.1
-
9
-
-
0001770240
-
Value-Function Approximations for Partially Observable Markov Decision Processes
-
M. Hauskrecht. Value-function approximations for partially observable Markov decision processes. JAIR, 13:33-94, 2000. (Pubitemid 33682083)
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 33-94
-
-
Hauskrecht, M.1
-
10
-
-
0032073263
-
Planning and acting in partially observable stochastic domains
-
L. P. Kaelbling, M. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Art. Int., 101:99-134, 1999.
-
(1999)
Art. Int.
, vol.101
, pp. 99-134
-
-
Kaelbling, L.P.1
Littman, M.2
Cassandra, A.R.3
-
11
-
-
0025400088
-
Real-time heuristic search
-
R. Korf. Real-time heuristic search. Art. Int., 42(2-3):189-211, 1990.
-
(1990)
Art. Int.
, vol.42
, Issue.2-3
, pp. 189-211
-
-
Korf, R.1
-
12
-
-
0019909899
-
A survey of partially observable Markov decision processes: Theory, models and algorithms
-
G. Monahan. A survey of partially observable Markov decision processes: Theory, models and algorithms. Management Science, 28(1):1-16, 1983.
-
(1983)
Management Science
, vol.28
, Issue.1
, pp. 1-16
-
-
Monahan, G.1
-
13
-
-
52249090123
-
Anytime point-based approximations for large POMDPs
-
J. Pineau, G. J. Gordon, and S. Thrun. Anytime point-based approximations for large POMDPs. JAIR, 27:335-380, 2006.
-
(2006)
JAIR
, vol.27
, pp. 335-380
-
-
Pineau, J.1
Gordon, G.J.2
Thrun, S.3
-
14
-
-
84880906197
-
Forward search value iteration for POMDPs
-
G. Shani, R. I. Brafman, and S. E. Shimony. Forward search value iteration for POMDPs. In Proc. IJCAI, pages 2619-2624, 2007.
-
(2007)
Proc. IJCAI
, pp. 2619-2624
-
-
Shani, G.1
Brafman, R.I.2
Shimony, S.E.3
-
15
-
-
80053262864
-
Point-based POMDP algorithms: Improved analysis and implementation
-
T. Smith and R. Simmons. Point-based POMDP algorithms: Improved analysis and implementation. In Proc. UAI, pages 542-547, 2005.
-
(2005)
Proc. UAI
, pp. 542-547
-
-
Smith, T.1
Simmons, R.2
-
17
-
-
0017943242
-
The optimal control of partially observable Markov decision processes over the infinite horizon: Discounted costs
-
E. Sondik. The optimal control of partially observable Markov decision processes over the infinite horizon: discounted costs. Oper. Res., 26(2), 1978.
-
(1978)
Oper. Res.
, vol.26
, Issue.2
-
-
Sondik, E.1
-
18
-
-
31144472319
-
Perseus: Randomized point-based value iteration for POMDPs
-
M. T. J. Spaan and N. A. Vlassis. Perseus: Randomized point-based value iteration for POMDPs. JAIR, 24:195-220, 2005.
-
(2005)
JAIR
, vol.24
, pp. 195-220
-
-
Spaan, M.T.J.1
Vlassis, N.A.2
|