



Volume 40, 2011, Pages 523-570

Efficient planning under uncertainty with macro-actions

Author keywords

[No Author keywords available]

Indexed keywords

ACTIVE AREA; CONTROL PERFORMANCE; EFFICIENT PLANNING; FORMAL ANALYSIS; LOOK-AHEAD; MULTI-STEP; NOVEL METHODS; PARTIALLY OBSERVABLE ENVIRONMENTS; POSTERIOR DISTRIBUTIONS; PRIMITIVE ACTIONS; SCIENTIFIC EXPLORATION; SEQUENCE OF ACTIONS; SIMULATION EXPERIMENTS

EID: 79956361567     PISSN: None     EISSN: 10769757     Source Type: Journal    
DOI: 10.1613/jair.3171     Document Type: Article
Times cited : (74)

References (52)
  • 6
    • 33750007368
    • Parametric POMDPs for planning in continuous state spaces
    • DOI 10.1016/j.robot.2006.05.007, PII S0921889006000960
    • Brooks, A., Makarenko, A., Williams, S., & Durrant-Whyte, H. (2006). Parametric POMDPs for planning in continuous state spaces. Robotics and Autonomous Systems, 54(11), 887-897. (Pubitemid 44572669)
    • (2006) Robotics and Autonomous Systems , vol.54 , Issue.11 , pp. 887-897
    • Brooks, A.1    Makarenko, A.2    Williams, S.3    Durrant-Whyte, H.4
  • 9
    • 0034354798
    • Time series analysis of non-Gaussian observations based on state space models from both classical and Bayesian perspectives
    • Durbin, J., & Koopman, S. (2000). Time series analysis of non-Gaussian observations based on state space models from both classical and Bayesian perspectives. Journal of the Royal Statistical Society: Series B (Methodological), 62(1), 3-56.
    • (2000) Journal of the Royal Statistical Society: Series B (Methodological) , vol.62 , Issue.1 , pp. 3-56
    • Durbin, J.1    Koopman, S.2
  • 11
    • 34248656990
    • Real-time hierarchical POMDPs for autonomous robot navigation
    • DOI 10.1016/j.robot.2007.01.004, PII S0921889007000279
    • Foka, A., & Trahanias, P. (2007). Real-time hierarchical POMDPs for autonomous robot navigation. Robotics and Autonomous Systems, 55(7), 561-571. (Pubitemid 46777274)
    • (2007) Robotics and Autonomous Systems , vol.55 , Issue.7 , pp. 561-571
    • Foka, A.1    Trahanias, P.2
  • 16
    • 84947403595
    • Probability inequalities for sums of bounded random variables
    • Hoeffding, W. (1963). Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301), 13-30.
    • (1963) Journal of the American Statistical Association , vol.58 , Issue.301 , pp. 13-30
    • Hoeffding, W.1
  • 20
    • 85024429815
    • A new approach to linear filtering and prediction problems
    • Kalman, R. E. (1960). A new approach to linear filtering and prediction problems. Transactions of the ASME-Journal of Basic Engineering, 82(Series D), 35-45.
    • (1960) Transactions of the ASME-Journal of Basic Engineering , vol.82 , pp. 35-45
    • Kalman, R.E.1
  • 21
    • 0036832951
    • A sparse sampling algorithm for near-optimal planning in large Markov decision processes
    • Kearns, M., Mansour, Y., & Ng, A. (2002). A sparse sampling algorithm for near-optimal planning in large Markov decision processes. Machine Learning, 49(2-3), 193-209.
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 193-209
    • Kearns, M.1    Mansour, Y.2    Ng, A.3
  • 22
    • 36349024290
    • Near-optimal observation selection using submodular functions
    • AAAI-07/IAAI-07 Proceedings: 22nd AAAI Conference on Artificial Intelligence and the 19th Innovative Applications of Artificial Intelligence Conference
    • Krause, A., & Guestrin, C. (2007). Near-optimal observation selection using submodular functions. In Proceedings of the National Conference on Artificial Intelligence (AAAI), Vol. 22, pp. 1650-1654. (Pubitemid 350149805)
    • (2007) Proceedings of the National Conference on Artificial Intelligence , vol.2 , pp. 1650-1654
    • Krause, A.1    Guestrin, C.2
  • 23
    • 14344249315
    • Efficient methods of non-myopic sensor management for multitarget tracking
    • TuB08.3, 2004 43rd IEEE Conference on Decision and Control (CDC)
    • Kreucher, C., Hero III, A., Kastella, K., & Chang, D. (2004). Efficient methods of non-myopic sensor management for multitarget tracking. In Proceedings of the IEEE Conference on Decision and Control (CDC), Vol. 1, pp. 722-727. (Pubitemid 40291418)
    • (2004) Proceedings of the IEEE Conference on Decision and Control , vol.1 , pp. 722-727
    • Kreucher, C.1    Hero III, A.O.2    Kastella, K.3    Chang, D.4
  • 29
    • 0033876326
    • Constrained model predictive control: Stability and optimality
    • DOI 10.1016/S0005-1098(99)00214-9
    • Mayne, D. Q., Rawlings, J. B., Rao, C. V., & Scokaert, P. O. M. (2000). Constrained model predictive control: Stability and optimality. Automatica, 36, 789-814. (Pubitemid 30587683)
    • (2000) Automatica , vol.36 , Issue.6 , pp. 789-814
    • Mayne, D.Q.1    Rawlings, J.B.2    Rao, C.V.3    Scokaert, P.O.M.4
  • 44
    • 63749125729
    • A POMDP framework for coordinated guidance of autonomous UAVs for multitarget tracking
    • Scott, A., Harris, Z., & Chong, E. (2009). A POMDP framework for coordinated guidance of autonomous UAVs for multitarget tracking. EURASIP Journal on Advances in Signal Processing, 2009, 1-17.
    • (2009) EURASIP Journal on Advances in Signal Processing , vol.2009 , pp. 1-17
    • Scott, A.1    Harris, Z.2    Chong, E.3
  • 45
    • 0015658957
    • The optimal control of partially observable Markov processes over a finite horizon
    • Smallwood, R., & Sondik, E. (1973). The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21(5), 1071-1088.
    • (1973) Operations Research , vol.21 , Issue.5 , pp. 1071-1088
    • Smallwood, R.1    Sondik, E.2
  • 48
    • 0033170372
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • DOI 10.1016/S0004-3702(99)00052-1
    • Sutton, R., Precup, D., & Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211. (Pubitemid 32079890)
    • (1999) Artificial Intelligence , vol.112 , Issue.1 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 50
    • 33645809087
    • On the central moments of the multidimensional Gaussian distribution
    • Triantafyllopoulos, K. (2003). On the central moments of the multidimensional Gaussian distribution. The Mathematical Scientist, 28, 125-128. (Pubitemid 38126665)
    • (2003) MATHEMATICAL SCIENTIST , vol.28 , Issue.2 , pp. 125-128
    • Triantafyllopoulos, K.1
  • 52
    • 79956340208
    • Open-loop plans in multi-robot POMDPs
    • Yu, C., Chuang, J., Gerkey, B., Gordon, G., & Ng, A. (2005). Open-loop plans in multi-robot POMDPs. Tech. rep., Stanford University.
    • (2005)
    • Yu, C.1    Chuang, J.2    Gerkey, B.3    Gordon, G.4    Ng, A.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.