2010, Pages 1089-1095

PUMA: Planning under Uncertainty with Macro-Actions

Author keywords

[No Author keywords available]

Indexed keywords

ACTION PLANNING; ANYTIME ALGORITHM; EXTENDED SEQUENCES; FUTURE OBSERVATIONS; OPEN-LOOP; OPTIMAL POLICIES; PLANNING UNDER UNCERTAINTY; SEQUENCE OF ACTIONS; STATE OF THE ART;

EID: 85167427989     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 27

References (18)
  • 1 Cassandra, A.; Kaelbling, L.; and Kurien, J. 1996. Acting under uncertainty: Discrete Bayesian models for mobile robot navigation. In IROS.
  • 3 Hsiao, K.; Lozano-Pérez, T.; and Kaelbling, L. 2008. Robust belief-based execution of manipulation programs. In WAFR.
  • 4 Hsu, D.; Lee, W.; and Rong, N. 2008. A point-based POMDP planner for target tracking. In ICRA.
  • 5 Iba, G. 1989. A heuristic approach to the discovery of macro-operators. Machine Learning 3(4):285-317.
  • 6 Kurniawati, H.; Du, Y.; Hsu, D.; and Lee, W. 2009. Motion planning under uncertainty for robotic tasks with long time horizons. In ISRR.
  • 7 Kurniawati, H.; Hsu, D.; and Lee, W. 2008. SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces. In RSS.
  • 8 McGovern, A., and Barto, A. 2001. Automatic discovery of subgoals in reinforcement learning using diverse density. In ICML.
  • 9 McGovern, A. 1998. acQuire-macros: An algorithm for automatically learning macro-actions. In NIPS AHRL.
  • 10 Pineau, J.; Gordon, G.; and Thrun, S. 2003. Policy-contingent abstraction for robust robot control. In UAI.
  • 12 Ross, S.; Pineau, J.; Paquet, S.; and Chaib-draa, B. 2008. Online planning algorithms for POMDPs. JAIR 32:663-704.
  • 13 Smith, T., and Simmons, R. 2004. Heuristic search value iteration for POMDPs. In UAI.
  • 15 Sutton, R.; Precup, D.; and Singh, S. 1999. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112(1):181-211.
  • 16 Theocharous, G., and Kaelbling, L. 2003. Approximate planning in POMDPs with macro-actions. In NIPS.
  • 17 Toussaint, M.; Charlin, L.; and Poupart, P. 2008. Hierarchical POMDP controller optimization by likelihood maximization. In UAI.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.