SCOPUS Information Search Platform

Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011

Volume , Issue , 2011, Pages

Monte Carlo Value Iteration with macro-actions

(3) Lim, Zhan Wei (a); Hsu, David (a); Lee, Wee Sun (a)

(a) National University of Singapore (Singapore)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE;

COMPUTATIONAL CHALLENGES; CONDITION; CONTINUOUS STATE SPACE; DISCRETE STATE SPACE; PERFORMANCE; PLANNING HORIZONS; SPACE STATE; STATE-SPACE; TEMPORAL ABSTRACTION; VALUE ITERATION;

MONTE CARLO METHODS;

EID: 85162333012
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: (36)

References (18)

1
- 84929851464
- Unmanned aircraft collision avoidance using continuous-state POMDPs
- H. Bai, D. Hsu, M.J. Kochenderfer, and W. S. Lee. Unmanned aircraft collision avoidance using continuous-state POMDPs. In Proc. Robotics: Science & Systems, 2011.
- (2011) Proc. Robotics: Science & Systems
- Bai, H.¹ Hsu, D.² Kochenderfer, M.J.³ Lee, W.S.⁴

2
- 78650145214
- Monte Carlo value iteration for continuous-state POMDPs
- Springer
- H. Bai, D. Hsu, W. S. Lee, and V. Ngo. Monte Carlo Value Iteration for Continuous-State POMDPs. In Algorithmic Foundations of Robotics IX-Proc. Int. Workshop on the Algorithmic Foundations of Robotics (WAFR), pages 175-191. Springer, 2011.
- (2011) Algorithmic Foundations of Robotics IX-Proc. Int. Workshop on the Algorithmic Foundations of Robotics (WAFR) , pp. 175-191
- Bai, H.¹ Hsu, D.² Lee, W.S.³ Ngo, V.⁴

3
- 0037288370
- Recent advances in hierarchical reinforcement learning
- 2003
- Andrew G. Barto and Sridhar Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13:2003, 2003.
- (2003) Discrete Event Dynamic Systems , vol.13
- Barto, A.G.¹ Mahadevan, S.²

4
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- T. G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. J. Artificial Intelligence Research, 13:227-303, 2000.
- (2000) J. Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.G.¹

5
- 27344455577
- Synthesis of hierarchical finite-state controllers for POMDPs
- E. Hansen and R. Zhou. Synthesis of hierarchical finite-state controllers for POMDPs. In Proc. Int. Conf. on Automated Planning and Scheduling, 2003.
- (2003) Proc. Int. Conf. on Automated Planning and Scheduling
- Hansen, E.¹ Zhou, R.²

6
- 0006419533
- Hierarchical solution of Markov decision processes using macro-actions
- Citeseer
- M. Hauskrecht, N. Meuleau, L.P. Kaelbling, T. Dean, and C. Boutilier. Hierarchical solution of Markov decision processes using macro-actions. In Proc. Conf. on Uncertainty in Artificial Intelligence, pages 220-229. Citeseer, 1998.
- (1998) Proc. Conf. on Uncertainty in Artificial Intelligence , pp. 220-229
- Hauskrecht, M.¹ Meuleau, N.² Kaelbling, L.P.³ Dean, T.⁴ Boutilier, C.⁵

7
- 77958563254
- PUMA: Planning under uncertainty with macro-actions
- R. He, E. Brunskill, and N. Roy. PUMA: Planning under uncertainty with macro-actions. In Proc. AAAI Conf. on Artificial Intelligence, 2010.
- (2010) Proc. AAAI Conf. on Artificial Intelligence
- He, R.¹ Brunskill, E.² Roy, N.³

8
- 79952126758
- Motion planning under uncertainty for robotic tasks with long time horizons
- H. Kurniawati, Y. Du, D. Hsu, and W. S. Lee. Motion planning under uncertainty for robotic tasks with long time horizons. Int. J. Robotics Research, 30(3):308-323, 2010.
- (2010) Int. J. Robotics Research , vol.30 , Issue.3 , pp. 308-323
- Kurniawati, H.¹ Du, Y.² Hsu, D.³ Lee, W.S.⁴

9
- 70349645087
- SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces
- H. Kurniawati, D. Hsu, and W.S. Lee. SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces. In Proc. Robotics: Science & Systems, 2008.
- (2008) Proc. Robotics: Science & Systems
- Kurniawati, H.¹ Hsu, D.² Lee, W.S.³

10
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: An anytime algorithm for POMDPs. In Int. Jnt. Conf. on Artificial Intelligence, volume 18, pages 1025-1032, 2003.
- (2003) Int. Jnt. Conf. on Artificial Intelligence , vol.18 , pp. 1025-1032
- Pineau, J.¹ Gordon, G.² Thrun, S.³

11
- 27944443806
- A hierarchical approach to POMDP planning and execution
- J. Pineau, N. Roy, and S. Thrun. A hierarchical approach to POMDP planning and execution. In Workshop on Hierarchy & Memory in Reinforcement Learning (ICML), volume 156, 2001.
- (2001) Workshop on Hierarchy & Memory in Reinforcement Learning (ICML) , vol.156
- Pineau, J.¹ Roy, N.² Thrun, S.³

12
- 52249086942
- Online planning algorithms for POMDPs
- S. Ross, J. Pineau, S. Paquet, and B. Chaib-Draa. Online planning algorithms for POMDPs. Journal of Artificial Intelligence Research, 32(1):663-704, 2008.
- (2008) Journal of Artificial Intelligence Research , vol.32 , Issue.1 , pp. 663-704
- Ross, S.¹ Pineau, J.² Paquet, S.³ Chaib-Draa, B.⁴

13
- 84899444044
- UAV swarm coordination using cooperative control for establishing a wireless communications backbone
- A. Sivakumar and C.K.Y. Tan. UAV swarm coordination using cooperative control for establishing a wireless communications backbone. In Proc. Int. Conf. on Autonomous Agents & Multiagent Systems, pages 1157-1164, 2010.
- (2010) Proc. Int. Conf. on Autonomous Agents & Multiagent Systems , pp. 1157-1164
- Sivakumar, A.¹ Tan, C.K.Y.²

14
- 31144465830
- Heuristic search value iteration for POMDPs
- AUAI Press
- T. Smith and R. Simmons. Heuristic search value iteration for POMDPs. In Proc. Conf. on Uncertainty in Artificial Intelligence, pages 520-527. AUAI Press, 2004.
- (2004) Proc. Conf. on Uncertainty in Artificial Intelligence , pp. 520-527
- Smith, T.¹ Simmons, R.²

15
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- R.S. Sutton, D. Precup, and S. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1):181-211, 1999.
- (1999) Artificial Intelligence , vol.112 , Issue.1 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

16
- 80052287410
- Approximate planning in POMDPs with macro-actions
- G. Theocharous and L. P. Kaelbling. Approximate planning in POMDPs with macro-actions. Advances in Neural Information Processing Systems, 17, 2003.
- (2003) Advances in Neural Information Processing Systems , vol.17
- Theocharous, G.¹ Kaelbling, L.P.²

17
- 67349102783
- Hierarchical POMDP controller optimization by likelihood maximization
- M. Toussaint, L. Charlin, and P. Poupart. Hierarchical POMDP controller optimization by likelihood maximization. Proc. Conf. on Uncertainty in Artificial Intelligence, 2008.
- (2008) Proc. Conf. on Uncertainty in Artificial Intelligence
- Toussaint, M.¹ Charlin, L.² Poupart, P.³

18
- 0016928374
- Procedures for the solution of a finite-horizon, partially observed, semi-Markov optimization problem
- C.C. White. Procedures for the solution of a finite-horizon, partially observed, semi-Markov optimization problem. Operations Research, 24(2):348-358, 1976.
- (1976) Operations Research , vol.24 , Issue.2 , pp. 348-358
- White, C.C.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.