SCOPUS Information Search Platform

Journal of Artificial Intelligence Research

Volume 40, 2011, Pages 523-570

Efficient planning under uncertainty with macro-actions

Authors (3): He, Ruijie (a); Brunskill, Emma (b); Roy, Nicholas (a)

a MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

b UNIVERSITY OF CALIFORNIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACTIVE AREA; CONTROL PERFORMANCE; EFFICIENT PLANNING; FORMAL ANALYSIS; LOOK-AHEAD; MULTI-STEP; NOVEL METHODS; PARTIALLY OBSERVABLE ENVIRONMENTS; POSTERIOR DISTRIBUTIONS; PRIMITIVE ACTIONS; SCIENTIFIC EXPLORATION; SEQUENCE OF ACTIONS; SIMULATION EXPERIMENTS;

ALGORITHMS; BUDGET CONTROL; EXPERIMENTS; ROBOTICS;

ROBOT PROGRAMMING;

EID: 79956361567 PISSN: None EISSN: 10769757 Source Type: Journal
DOI: 10.1613/jair.3171 Document Type: Article

Times cited : (74)

References (52)

1
- 77955832297
- Autonomous flight in unstructured and unknown indoor environments
- Bachrach, A., He, R., & Roy, N. (2009). Autonomous flight in unstructured and unknown indoor environments. In Proceedings of the European Micro Aerial Vehicle (EMAV) Conference.
- (2009) Proceedings of the European Micro Aerial Vehicle (EMAV) Conference
- Bachrach, A.¹ He, R.² Roy, N.³

2
- 85008411102
- Information and exponential families in statistical theory
- Barndorff-Nielsen, O. (1979). Information and exponential families in statistical theory. Bulletin of the American Mathematics Society, 273, 667-668.
- (1979) Bulletin of the American Mathematics Society , vol.273 , pp. 667-668
- Barndorff-Nielsen, O.¹

3
- 0036056595
- Receding horizon control of autonomous aerial vehicles
- Bellingham, J., Richards, A., & How, J. (2002). Receding horizon control of autonomous aerial vehicles. In Proceedings of the American Control Conference (ACC), Vol. 5, pp. 3741-3746.
- (2002) Proceedings of the American Control Conference (ACC) , vol.5 , pp. 3741-3746
- Bellingham, J.¹ Richards, A.² How, J.³

4
- 0003565783
- Dynamic Programming and Optimal Control
- Bertsekas, D. (2007). Dynamic Programming and Optimal Control, vol. 1 & 2, 2nd. Athena Scientific.
- (2007) Dynamic Programming and Optimal Control , vol.1-2
- Bertsekas, D.¹

5
- 78751689703
- Solving POMDPs: RTDP-Bel vs. point-based algorithms
- Bonet, B., & Geffner, H. (2009). Solving POMDPs: RTDP-Bel vs. point-based algorithms. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), pp. 1641-1646.
- (2009) Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , pp. 1641-1646
- Bonet, B.¹ Geffner, H.²

6
- 33750007368
- Parametric POMDPs for planning in continuous state spaces
- DOI 10.1016/j.robot.2006.05.007, PII S0921889006000960
- Brooks, A., Makarenko, A., Williams, S., & Durrant-Whyte, H. (2006). Parametric POMDPs for planning in continuous state spaces. Robotics and Autonomous Systems, 54(11), 887-897. (Pubitemid 44572669)
- (2006) Robotics and Autonomous Systems , vol.54 , Issue.11 , pp. 887-897
- Brooks, A.¹ Makarenko, A.² Williams, S.³ Durrant-Whyte, H.⁴

7
- 84864577176
- Continuous-state POMDPs with hybrid dynamics
- Brunskill, E., Kaelbling, L., Lozano-Perez, T., & Roy, N. (2008). Continuous-state POMDPs with hybrid dynamics. In Proceedings of the International Symposium on Artificial Intelligence and Mathematics (ISAIM).
- (2008) Proceedings of the International Symposium on Artificial Intelligence and Mathematics (ISAIM)
- Brunskill, E.¹ Kaelbling, L.² Lozano-Perez, T.³ Roy, N.⁴

8
- 77954761115
- Automated hierarchy discovery for planning in partially observable environments
- Charlin, L., Poupart, P., & Shioda, R. (2007). Automated hierarchy discovery for planning in partially observable environments. In Advances in Neural Information Processing Systems (NIPS).
- (2007) Advances in Neural Information Processing Systems (NIPS)
- Charlin, L.¹ Poupart, P.² Shioda, R.³

9
- 0034354798
- Time series analysis of non-Gaussian observations based on state space models from both classical and Bayesian perspectives
- Durbin, J., & Koopman, S. (2000). Time series analysis of non-Gaussian observations based on state space models from both classical and Bayesian perspectives. Journal of the Royal Statistical Society: Series B (Methodological), 62(1), 3-56.
- (2000) Journal of the Royal Statistical Society: Series B (Methodological) , vol.62 , Issue.1 , pp. 3-56
- Durbin, J.¹ Koopman, S.²

10
- 79956353833
- A scalable method for solving high-dimensional continuous POMDPs using local approximation
- Erez, T., & Smart, W. (2010). A Scalable Method for Solving High-Dimensional Continuous POMDPs Using Local Approximation. In Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI).
- (2010) Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI)
- Erez, T.¹ Smart, W.²

11
- 34248656990
- Real-time hierarchical POMDPs for autonomous robot navigation
- DOI 10.1016/j.robot.2007.01.004, PII S0921889007000279
- Foka, A., & Trahanias, P. (2007). Real-time hierarchical POMDPs for autonomous robot navigation. Robotics and Autonomous Systems, 55(7), 561-571. (Pubitemid 46777274)
- (2007) Robotics and Autonomous Systems , vol.55 , Issue.7 , pp. 561-571
- Foka, A.¹ Trahanias, P.²

12
- 27344455577
- Synthesis of hierarchical finite-state controllers for POMDPs
- Hansen, E., & Zhou, R. (2003). Synthesis of hierarchical finite-state controllers for POMDPs. In Proceedings of the Thirteenth International Conference on Automated Planning and Scheduling (ICAPS).
- (2003) Proceedings of the Thirteenth International Conference on Automated Planning and Scheduling (ICAPS)
- Hansen, E.¹ Zhou, R.²

13
- 77951108715
- On the design and use of a micro air vehicle to track and avoid adversaries
- He, R., Bachrach, A., Achtelik, M., Geramifard, A., Gurdan, D., Prentice, S., Stumpf, J., & Roy, N. (2010a). On the design and use of a micro air vehicle to track and avoid adversaries. International Journal of Robotics Research, 29(5), 529-546.
- (2010) International Journal of Robotics Research , vol.29 , Issue.5 , pp. 529-546
- He, R.¹ Bachrach, A.² Achtelik, M.³ Geramifard, A.⁴ Gurdan, D.⁵ Prentice, S.⁶ Stumpf, J.⁷ Roy, N.⁸

14
- 77958563254
- PUMA: Planning under uncertainty with macro-actions
- He, R., Brunskill, E., & Roy, N. (2010b). PUMA: Planning under uncertainty with macro-actions. In Proceedings of the Association for the Advancement of Artificial Intelligence (AAAI).
- (2010) Proceedings of the Association for the Advancement of Artificial Intelligence (AAAI)
- He, R.¹ Brunskill, E.² Roy, N.³

15
- 51649127092
- Planning in information space for a quadrotor helicopter in GPS-denied environments
- He, R., Prentice, S., & Roy, N. (2008). Planning in information space for a quadrotor helicopter in GPS-denied environments. In Proceedings of the International Conference of Robotics and Automation (ICRA), pp. 1814-1820.
- (2008) Proceedings of the International Conference of Robotics and Automation (ICRA) , pp. 1814-1820
- He, R.¹ Prentice, S.² Roy, N.³

16
- 84947403595
- Probability inequalities for sums of bounded random variables
- Hoeffding, W. (1963). Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301), 13-30.
- (1963) Journal of the American Statistical Association , vol.58 , Issue.301 , pp. 13-30
- Hoeffding, W.¹

17
- 84880741298
- Solving POMDPs with continuous or large discrete observation spaces
- Hoey, J., & Poupart, P. (2005). Solving POMDPs with continuous or large discrete observation spaces. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI).
- (2005) Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI)
- Hoey, J.¹ Poupart, P.²

18
- 84867429158
- Task-driven tactile exploration
- Hsiao, K., Kaelbling, L., & Lozano-Pérez, T. (2010). Task-driven tactile exploration. In Proceedings of Robotics: Science and Systems (RSS).
- (2010) Proceedings of Robotics: Science and Systems (RSS)
- Hsiao, K.¹ Kaelbling, L.² Lozano-Pérez, T.³

19
- 77958566137
- Robust belief-based execution of manipulation programs
- Hsiao, K., Lozano-Pérez, T., & Kaelbling, L. (2008). Robust belief-based execution of manipulation programs. In Proceedings of the Workshop on the Algorithmic Foundations of Robotics (WAFR).
- (2008) Proceedings of the Workshop on the Algorithmic Foundations of Robotics (WAFR)
- Hsiao, K.¹ Lozano-Pérez, T.² Kaelbling, L.³

20
- 85024429815
- A new approach to linear filtering and prediction problems
- Kalman, R. E. (1960). A new approach to linear filtering and prediction problems. Transactions of the ASME-Journal of Basic Engineering, 82(Series D), 35-45.
- (1960) Transactions of the ASME-Journal of Basic Engineering , vol.82 , pp. 35-45
- Kalman, R.E.¹

21
- 0036832951
- A sparse sampling algorithm for near-optimal planning in large Markov decision processes
- Kearns, M., Mansour, Y., & Ng, A. (2002). A sparse sampling algorithm for near-optimal planning in large Markov decision processes. Machine Learning, 49(2-3), 193-209.
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 193-209
- Kearns, M.¹ Mansour, Y.² Ng, A.³

22
- 36349024290
- Near-optimal observation selection using submodular functions
- AAAI-07/IAAI-07 Proceedings: 22nd AAAI Conference on Artificial Intelligence and the 19th Innovative Applications of Artificial Intelligence Conference
- Krause, A., & Guestrin, C. (2007). Near-optimal observation selection using submodular functions. In Proceedings of the National Conference on Artificial Intelligence (AAAI), Vol. 22, pp. 1650-1654. (Pubitemid 350149805)
- (2007) Proceedings of the National Conference on Artificial Intelligence , vol.2 , pp. 1650-1654
- Krause, A.¹ Guestrin, C.²

23
- 14344249315
- Efficient methods of non-myopic sensor management for multitarget tracking
- TuB08.3, 2004 43rd IEEE Conference on Decision and Control (CDC)
- Kreucher, C., Hero III, A., Kastella, K., & Chang, D. (2004). Efficient methods of non-myopic sensor management for multitarget tracking. In Proceedings of the IEEE Conference on Decision and Control (CDC), Vol. 1, pp. 722-727. (Pubitemid 40291418)
- (2004) Proceedings of the IEEE Conference on Decision and Control , vol.1 , pp. 722-727
- Kreucher, C.¹ Hero III, A.O.² Kastella, K.³ Chang, D.⁴

24
- 78650161912
- Motion planning under uncertainty for robotic tasks with long time horizons
- Kurniawati, H., Du, Y., Hsu, D., & Lee, W. (2009). Motion planning under uncertainty for robotic tasks with long time horizons. In Proceedings of the International Symposium of Robotics Research (ISRR).
- (2009) Proceedings of the International Symposium of Robotics Research (ISRR)
- Kurniawati, H.¹ Du, Y.² Hsu, D.³ Lee, W.⁴

25
- 70349645087
- SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces
- Kurniawati, H., Hsu, D., & Lee, W. (2008). SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces. In Proceedings of the Robotics: Science and Systems (RSS).
- (2008) Proceedings of the Robotics: Science and Systems (RSS)
- Kurniawati, H.¹ Hsu, D.² Lee, W.³

26
- 19844372830
- Three dimensional receding horizon control for UAVs
- Kuwata, Y., & How, J. (2004). Three dimensional receding horizon control for UAVs. In Proceedings of the AIAA Guidance, Navigation, and Control Conference and Exhibit (GNC), pp. 16-19.
- (2004) Proceedings of the AIAA Guidance, Navigation, and Control Conference and Exhibit (GNC) , pp. 16-19
- Kuwata, Y.¹ How, J.²

27
- 85138579181
- Learning policies for partially observable environments: Scaling up
- Littman, M., Cassandra, A., & Kaelbling, L. (1995). Learning policies for partially observable environments: Scaling up. In Proceedings of the Twelfth International Conference on Machine Learning (ICML), pp. 362-370.
- (1995) Proceedings of the Twelfth International Conference on Machine Learning (ICML) , pp. 362-370
- Littman, M.¹ Cassandra, A.² Kaelbling, L.³

28
- 0001963197
- Self-improving factory simulation using continuous-time average-reward reinforcement learning
- Mahadevan, S., Marchalleck, N., Das, T., & Gosavi, A. (1997). Self-improving factory simulation using continuous-time average-reward reinforcement learning. In Proceedings of the International Conference on Machine Learning (ICML), pp. 202-210.
- (1997) Proceedings of the International Conference on Machine Learning (ICML) , pp. 202-210
- Mahadevan, S.¹ Marchalleck, N.² Das, T.³ Gosavi, A.⁴

29
- 0033876326
- Constrained model predictive control: Stability and optimality
- DOI 10.1016/S0005-1098(99)00214-9
- Mayne, D. Q., Rawlings, J. B., Rao, C. V., & Scokaert, P. O. M. (2000). Constrained model predictive control: Stability and optimality. Automatica, 36, 789-814. (Pubitemid 30587683)
- (2000) Automatica , vol.36 , Issue.6 , pp. 789-814
- Mayne, D.Q.¹ Rawlings, J.B.² Rao, C.V.³ Scokaert, P.O.M.⁴

30
- 0004835198
- Approximate planning for factored POMDPs using belief state simplification
- McAllester, D., & Singh, S. (1999). Approximate planning for factored POMDPs using belief state simplification. In Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI), pp. 409-416.
- (1999) Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI) , pp. 409-416
- McAllester, D.¹ Singh, S.²

31
- 84897718799
- AcQuire-macros: An algorithm for automatically learning macro-actions
- McGovern, A. (1998). acQuire-macros: An algorithm for automatically learning macro-actions. In NIPS 98 Workshop on Abstraction and Hierarchy in Reinforcement Learning.
- (1998) NIPS 98 Workshop on Abstraction and Hierarchy in Reinforcement Learning
- McGovern, A.¹

32
- 52249103621
- Hybrid POMDP algorithms
- Paquet, S., Chaib-draa, B., & Ross, S. (2006). Hybrid POMDP algorithms. In Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM), pp. 133-147.
- (2006) Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM) , pp. 133-147
- Paquet, S.¹ Chaib-draa, B.² Ross, S.³

33
- 44649095768
- An online POMDP algorithm for complex multiagent environments
- Paquet, S., Tobin, L., & Chaib-draa, B. (2005). An online POMDP algorithm for complex multiagent environments. In Proceedings of the Conference on Autonomous agents and Multiagent systems (AAMAS), pp. 970-977.
- (2005) Proceedings of the Conference on Autonomous agents and Multiagent systems (AAMAS) , pp. 970-977
- Paquet, S.¹ Tobin, L.² Chaib-draa, B.³

34
- 77956151954
- Linear quadratic Gaussian-based closed-loop control of type 1 diabetes
- Patek, S., Breton, M., Chen, Y., Solomon, C., & Kovatchev, B. (2007). Linear quadratic Gaussian-based closed-loop control of type 1 diabetes. Journal of Diabetes Science and Technology, 1.
- (2007) Journal of Diabetes Science and Technology , vol.1
- Patek, S.¹ Breton, M.² Chen, Y.³ Solomon, C.⁴ Kovatchev, B.⁵

35
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- Pineau, J., Gordon, G., & Thrun, S. (2003a). Point-based value iteration: An anytime algorithm for POMDPs. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), Vol. 18, pp. 1025-1032.
- (2003) Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , vol.18 , pp. 1025-1032
- Pineau, J.¹ Gordon, G.² Thrun, S.³

36
- 20444478005
- Policy-contingent abstraction for robust robot control
- Pineau, J., Gordon, G., & Thrun, S. (2003b). Policy-contingent abstraction for robust robot control. In Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI).
- (2003) Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI)
- Pineau, J.¹ Gordon, G.² Thrun, S.³

37
- 80052206200
- Belief space planning assuming maximum likelihood observations
- Platt, R., Tedrake, R., Lozano-Perez, T., & Kaelbling, L. (2010). Belief space planning assuming maximum likelihood observations. In Proceedings of Robotics: Science and Systems (RSS).
- (2010) Proceedings of Robotics: Science and Systems (RSS)
- Platt, R.¹ Tedrake, R.² Lozano-Perez, T.³ Kaelbling, L.⁴

38
- 33750724397
- Point-based value iteration for continuous POMDPs
- Porta, J., Vlassis, N., Spaan, M., & Poupart, P. (2006). Point-based value iteration for continuous POMDPs. Journal of Machine Learning Research, 7, 2329-2367. (Pubitemid 44708007)
- (2006) Journal of Machine Learning Research , vol.7 , pp. 2329-2367
- Porta, J.M.¹ Vlassis, N.² Spaan, M.T.J.³ Poupart, P.⁴

39
- 26944499915
- Exploiting Structure to Efficiently Solve Large Scale Partially Observable Markov Decision Processes
- Poupart, P. (2005). Exploiting Structure to Efficiently Solve Large Scale Partially Observable Markov Decision Processes. Ph.D. thesis, University of Toronto.
- (2005) Exploiting Structure to Efficiently Solve Large Scale Partially Observable Markov Decision Processes
- Poupart, P.¹

40
- 85088722557
- Experimental demonstrations of real-time MILP control
- Richards, A., Kuwata, Y., & How, J. (2003). Experimental demonstrations of real-time MILP control. In Proceeding of the AIAA Guidance, Navigation, and Control Conference (GNC).
- (2003) Proceeding of the AIAA Guidance, Navigation, and Control Conference (GNC)
- Richards, A.¹ Kuwata, Y.² How, J.³

41
- 84880870183
- AEMS: An anytime online search algorithm for approximate policy refinement in large POMDPs
- Ross, S., & Chaib-draa, B. (2007). AEMS: An anytime online search algorithm for approximate policy refinement in large POMDPs. In Proceedings of the International Joint Conference in Artificial Intelligence (IJCAI), pp. 2592-2598.
- (2007) Proceedings of the International Joint Conference in Artificial Intelligence (IJCAI) , pp. 2592-2598
- Ross, S.¹ Chaib-draa, B.²

42
- 52249086942
- Online planning algorithms for POMDPs
- Ross, S., Pineau, J., Paquet, S., & Chaib-draa, B. (2008a). Online planning algorithms for POMDPs. Journal of Artificial Intelligence Research, 32(1), 663-704.
- (2008) Journal of Artificial Intelligence Research , vol.32 , Issue.1 , pp. 663-704
- Ross, S.¹ Pineau, J.² Paquet, S.³ Chaib-draa, B.⁴

43
- 51649091499
- Bayesian reinforcement learning in continuous POMDPs with application to robot navigation
- Ross, S., Chaib-draa, B., & Pineau, J. (2008b). Bayesian reinforcement learning in continuous POMDPs with application to robot navigation. In Proceedings of the International Conference on Robotics and Automation (ICRA). IEEE.
- (2008) Proceedings of the International Conference on Robotics and Automation (ICRA)
- Ross, S.¹ Chaib-draa, B.² Pineau, J.³

44
- 63749125729
- A POMDP framework for coordinated guidance of autonomous UAVs for multitarget tracking
- Scott, A., Harris, Z., & Chong, E. (2009). A POMDP framework for coordinated guidance of autonomous UAVs for multitarget tracking. EURASIP Journal on Advances in Signal Processing, 2009, 1-17.
- (2009) EURASIP Journal on Advances in Signal Processing , vol.2009 , pp. 1-17
- Scott, A.¹ Harris, Z.² Chong, E.³

45
- 0015658957
- The optimal control of partially observable Markov processes over a finite horizon
- Smallwood, R., & Sondik, E. (1973). The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21(5), 1071-1088.
- (1973) Operations Research , vol.21 , Issue.5 , pp. 1071-1088
- Smallwood, R.¹ Sondik, E.²

46
- 80053262864
- Point-based POMDP algorithms: Improved analysis and implementation
- Smith, T., & Simmons, R. (2005). Point-based POMDP algorithms: Improved analysis and implementation. In Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI).
- (2005) Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI)
- Smith, T.¹ Simmons, R.²

47
- 84912073624
- Learning options in reinforcement learning
- Stolle, M., & Precup, D. (2002). Learning options in reinforcement learning. Lecture Notes in Computer Science, 212-223.
- (2002) Lecture Notes in Computer Science , pp. 212-223
- Stolle, M.¹ Precup, D.²

48
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- DOI 10.1016/S0004-3702(99)00052-1
- Sutton, R., Precup, D., & Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211. (Pubitemid 32079890)
- (1999) Artificial Intelligence , vol.112 , Issue.1 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

49
- 80052287410
- Approximate planning in POMDPs with macro-actions
- Theocharous, G., & Kaelbling, L. (2003). Approximate planning in POMDPs with macro-actions. In Advances in Neural Information Processing Systems (NIPS).
- (2003) Advances in Neural Information Processing Systems (NIPS)
- Theocharous, G.¹ Kaelbling, L.²

50
- 33645809087
- On the central moments of the multidimensional Gaussian distribution
- Triantafyllopoulos, K. (2003). On the central moments of the multidimensional Gaussian distribution. The Mathematical Scientist, 28, 125-128. (Pubitemid 38126665)
- (2003) MATHEMATICAL SCIENTIST , vol.28 , Issue.2 , pp. 125-128
- Triantafyllopoulos, K.¹

51
- 84919201360
- Dynamic generalized linear models and Bayesian forecasting
- West, M., Harrison, P., & Migon, H. (1985). Dynamic generalized linear models and Bayesian forecasting. Journal of the American Statistical Association, 80(389), 73-83.
- (1985) Journal of the American Statistical Association , vol.80 , Issue.389 , pp. 73-83
- West, M.¹ Harrison, P.² Migon, H.³

52
- 79956340208
- Open-loop plans in multi-robot POMDPs
- Yu, C., Chuang, J., Gerkey, B., Gordon, G., & Ng, A. (2005). Open-loop plans in multi-robot POMDPs. Tech. rep., Stanford University.
- (2005)
- Yu, C.¹ Chuang, J.² Gerkey, B.³ Gordon, G.⁴ Ng, A.⁵

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.