-
1
-
-
0029210635
-
Learning to act using real-time dynamic programming
-
Barto, A. G., Bradtke, S. J., & Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artif. Intell., 72, 81-138.
-
(1995)
Artif. Intell.
, vol.72
, pp. 81-138
-
-
Barto, A.G.1
Bradtke, S.J.2
Singh, S.P.3
-
3
-
-
84880851659
-
Faster heuristic search algorithms for planning with uncertainty and full feedback
-
Acapulco, Mexico: Morgan Kaufmann
-
Bonet, B., & Geffner, H. (2003a). Faster heuristic search algorithms for planning with uncertainty and full feedback. Proc. 18th International Joint Conf. on Artificial Intelligence (pp. 1233-1238). Acapulco, Mexico: Morgan Kaufmann.
-
(2003)
Proc. 18th International Joint Conf. on Artificial Intelligence
, pp. 1233-1238
-
-
Bonet, B.1
Geffner, H.2
-
4
-
-
31844451230
-
-
Labeled RTDP: Improving the convergence of real-time dynamic programming
-
Bonet, B., & Geffner, H. (2003b). Labeled RTDP: Improving the convergence of real-time dynamic programming. Proc. of ICAPS-03 (pp. 12-21).
-
(2003)
Proc. of ICAPS-03
, pp. 12-21
-
-
Bonet, B.1
Geffner, H.2
-
6
-
-
0029332887
-
Planning under time constraints in stochastic domains
-
Dean, T., Kaelbling, L. P., Kirman, J., & Nicholson, A. (1995). Planning under time constraints in stochastic domains. Artif. Intell., 76, 35-74.
-
(1995)
Artif. Intell.
, vol.76
, pp. 35-74
-
-
Dean, T.1
Kaelbling, L.P.2
Kirman, J.3
Nicholson, A.4
-
7
-
-
16244399286
-
-
(Technical Report CMU-RI-TR-04-13). Robotics Institute, Carnegie Mellon University, Pittsburgh, PA.
-
Ferguson, D., & Stentz, A. T. (2004). Focussed dynamic programming: Extensive comparative results (Technical Report CMU-RI-TR-04-13). Robotics Institute, Carnegie Mellon University, Pittsburgh, PA.
-
(2004)
Focussed Dynamic Programming: Extensive Comparative Results
-
-
Ferguson, D.1
Stentz, A.T.2
-
8
-
-
0035369425
-
LAO*: A heuristic search algorithm that finds solutions with loops
-
Hansen, E. A., & Zilberstein, S. (2001). LAO*: A heuristic search algorithm that finds solutions with loops. Artif. Intell., 129, 35-62.
-
(2001)
Artif. Intell.
, vol.129
, pp. 35-62
-
-
Hansen, E.A.1
Zilberstein, S.2
-
9
-
-
84890267871
-
Fast exact planning in Markov decision processes
-
McMahan, H. B., & Gordon, G. J. (2005). Fast exact planning in Markov decision processes. To appear in ICAPS.
-
(2005)
To Appear in ICAPS
-
-
McMahan, H.B.1
Gordon, G.J.2
-
10
-
-
31844443291
-
Inverted autonomous helicopter flight via reinforcement learning
-
Springer.
-
Ng, A. Y., Coates, A., Diel, M., Ganapathi, V., Schulte, J., Tse, B., Berger, E., & Liang, E. (2004). Inverted autonomous helicopter flight via reinforcement learning. ISER. Springer.
-
(2004)
ISER.
-
-
Ng, A.Y.1
Coates, A.2
Diel, M.3
Ganapathi, V.4
Schulte, J.5
Tse, B.6
Berger, E.7
Liang, E.8
-
12
-
-
31144465830
-
Heuristic search value iteration for POMDPs
-
Banff, Alberta.
-
Smith, T., & Simmons, R. (2004). Heuristic search value iteration for POMDPs. Proc. of UAI 2004. Banff, Alberta.
-
(2004)
Proc. of UAI 2004
-
-
Smith, T.1
Simmons, R.2