-
2
-
-
78650145214
-
Monte Carlo value iteration for continuous-state POMDPs
-
Springer
-
H. Bai, D. Hsu, W. S. Lee, and V. Ngo. Monte Carlo Value Iteration for Continuous-State POMDPs. In Algorithmic Foundations of Robotics IX-Proc. Int.Workshop o n the Algorithmic Foundations of Robotics (WAFR), pages 175-191. Springer, 2011.
-
(2011)
Algorithmic Foundations of Robotics IX-Proc. Int.Workshop O N the Algorithmic Foundations of Robotics (WAFR)
, pp. 175-191
-
-
Bai, H.1
Hsu, D.2
Lee, W.S.3
Ngo, V.4
-
3
-
-
0037288370
-
Recent advances in hierarchical reinforcement learning
-
2003
-
Andrew G. Barto and Sridhar Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13:2003, 2003.
-
(2003)
Discrete Event Dynamic Systems
, vol.13
-
-
Barto, A.G.1
Mahadevan, S.2
-
4
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
T. G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. J. Artificial Intelligence Research, 13:227-303, 2000.
-
(2000)
J. Artificial Intelligence Research
, vol.13
, pp. 227-303
-
-
Dietterich, T.G.1
-
6
-
-
0006419533
-
Hierarchical solution of Markov decision processes using macro-actions
-
Citeseer
-
M. Hauskrecht, N. Meuleau, L.P. Kaelbling, T. Dean, and C. Boutilier. Hierarchical solution of Markov decision processes using macro-actions. In Proc. Conf. on Uncertainty in Artificial Intelligence, pages 220-229. Citeseer, 1998.
-
(1998)
Proc. Conf. on Uncertainty in Artificial Intelligence
, pp. 220-229
-
-
Hauskrecht, M.1
Meuleau, N.2
Kaelbling, L.P.3
Dean, T.4
Boutilier, C.5
-
8
-
-
79952126758
-
Motion planning under uncertainty for robotic tasks with long time horizons
-
H. Kurniawati, Y. Du, D. Hsu, and W. S. Lee. Motion planning under uncertainty for robotic tasks with long time horizons. Int. J. Robotics Research, 30(3):308-323, 2010.
-
(2010)
Int. J. Robotics Research
, vol.30
, Issue.3
, pp. 308-323
-
-
Kurniawati, H.1
Du, Y.2
Hsu, D.3
Lee, W.S.4
-
9
-
-
70349645087
-
SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces
-
H. Kurniawati, D. Hsu, and W.S. Lee. SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces. In Proc. Robotics: Science & Systems, 2008.
-
(2008)
Proc. Robotics: Science & Systems
-
-
Kurniawati, H.1
Hsu, D.2
Lee, W.S.3
-
10
-
-
84880772945
-
Point-based value iteration: An anytime algorithm for POMDPs
-
J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: An anytime algorithm for POMDPs. In Int. Jnt. Conf. on Artificial Intelligence, volume 18, pages 1025-1032, 2003.
-
(2003)
Int. Jnt. Conf. on Artificial Intelligence
, vol.18
, pp. 1025-1032
-
-
Pineau, J.1
Gordon, G.2
Thrun, S.3
-
12
-
-
52249086942
-
Online planning algorithms for POMDPs
-
S. Ross, J. Pineau, S. Paquet, and B. Chaib-Draa.Online planning algorithms for POMDPs. Journal of Artificial Intelligence Research, 32(1):663-704, 2008.
-
(2008)
Journal of Artificial Intelligence Research
, vol.32
, Issue.1
, pp. 663-704
-
-
Ross, S.1
Pineau, J.2
Paquet, S.3
Chaib-Draa, B.4
-
13
-
-
84899444044
-
UAV swarm coordination using cooperative control for establishing a wireless communications backbone
-
A. Sivakumar and C.K.Y. Tan. UAV swarm coordination using cooperative control for establishing a wireless communications backbone. In Proc. Int. Conf. on Autonomous Agents & Multiagent Systems, pages 1157-1164, 2010.
-
(2010)
Proc. Int. Conf. on Autonomous Agents & Multiagent Systems
, pp. 1157-1164
-
-
Sivakumar, A.1
Tan, C.K.Y.2
-
15
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
R.S. Sutton, D. Precup, and S. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1):181-211, 1999.
-
(1999)
Artificial Intelligence
, vol.112
, Issue.1
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.3
-
18
-
-
0016928374
-
Procedures for the solution of a finite-horizon, partially observed, semi-Markov optimization problem
-
C.C. White. Procedures for the solution of a finite-horizon, partially observed, semi-Markov optimization problem.Operations Research, 24(2):348-358, 1976.
-
(1976)
Operations Research
, vol.24
, Issue.2
, pp. 348-358
-
-
White, C.C.1
|