Volume 32, Issue 5, 2012, Pages 96-109

Robust adaptive Markov decision processes: Planning with model uncertainty

Author keywords

[No Author keywords available]

Indexed keywords

ANTENNAS; HUMAN RESOURCE MANAGEMENT; MARKOV PROCESSES; UNCERTAINTY ANALYSIS

EID: 84877947571     PISSN: 1066033X     EISSN: None     Source Type: Journal    
DOI: 10.1109/MCS.2012.2205478     Document Type: Article
Times cited: 27

References (31)
  • 2
    • 0034186843
    • Optimal electricity supply bidding by Markov decision process
    • H. Song, C. C. Liu, J. Lawarree, and R. Dahlgren, "Optimal electricity supply bidding by Markov decision process," IEEE Trans. Power Syst., vol. 15, no. 2, pp. 618-624, 2000.
    • (2000) IEEE Trans. Power Syst. , vol.15 , Issue.2 , pp. 618-624
    • Song, H.1    Liu, C.C.2    Lawarree, J.3    Dahlgren, R.4
  • 4
    • 52449093126
    • Group health management of UAV teams with applications to persistent surveillance
    • B. Bethke, J. How, and J. Vian, "Group health management of UAV teams with applications to persistent surveillance," in Proc. American Control Conf., 2008, pp. 3145-3150.
    • Proc. American Control Conf., 2008 , pp. 3145-3150
    • Bethke, B.1    How, J.2    Vian, J.3
  • 5
    • 42149156696
    • Collaborative distributed sensor management for multitarget tracking using hierarchical Markov decision processes
    • D. Akselrod, A. Sinha, and T. Kirubarajan, "Collaborative distributed sensor management for multitarget tracking using hierarchical Markov decision processes," Proc. SPIE, vol. 6699, pp. 1-14, 2007.
    • (2007) Proc. SPIE , vol.6699 , pp. 1-14
    • Akselrod, D.1    Sinha, A.2    Kirubarajan, T.3
  • 7
    • 33644691667
    • Performance prediction of an unmanned airborne vehicle multi-agent system
    • DOI 10.1016/j.ejor.2004.10.015, PII S0377221704008161
    • Z. Lian and A. Deshmukh, "Performance prediction of an unmanned airborne vehicle multiagent system," Eur. J. Oper. Res., vol. 172, no. 2, pp. 680-695, 2006. (Pubitemid 43332997)
    • (2006) European Journal of Operational Research , vol.172 , Issue.2 , pp. 680-695
    • Lian, Z.1    Deshmukh, A.2
  • 8
    • 33847336943
    • Bias and variance approximation in value function estimates
    • DOI 10.1287/mnsc.1060.0614
    • S. Mannor, D. Simester, P. Sun, and J. Tsitsiklis, "Bias and variance approximation in value function estimates," Manage. Sci., vol. 53, no. 2, pp. 308-322, 2007. (Pubitemid 46326182)
    • (2007) Management Science , vol.53 , Issue.2 , pp. 308-322
    • Mannor, S.1    Simester, D.2    Sun, P.3    Tsitsiklis, J.N.4
  • 9
    • 0028460403
    • Markov decision processes with imprecise transition probabilities
    • C. C. White and H. K. Eldeib, "Markov decision processes with imprecise transition probabilities," Oper. Res., vol. 42, no. 4, pp. 739-749, 1994.
    • (1994) Oper. Res. , vol.42 , Issue.4 , pp. 739-749
    • White, C.C.1    Eldeib, H.K.2
  • 11
    • 77249117255
    • Percentile optimization for Markov decision processes with parameter uncertainty
    • E. Delage and S. Mannor, "Percentile optimization for Markov decision processes with parameter uncertainty," Oper. Res., vol. 58, no. 1, pp. 203-213, 2010.
    • (2010) Oper. Res. , vol.58 , Issue.1 , pp. 203-213
    • Delage, E.1    Mannor, S.2
  • 13
    • 14344250395
    • Robust control of Markov decision processes with uncertain transition matrices
    • DOI 10.1287/opre.1050.0216
    • A. Nilim and L. El Ghaoui, "Robust control of Markov decision processes with uncertain transition matrices," Oper. Res., vol. 53, no. 5, pp. 780-798, 2005. (Pubitemid 41525849)
    • (2005) Operations Research , vol.53 , Issue.5 , pp. 780-798
    • Nilim, A.1    El Ghaoui, L.2
  • 14
    • 25444493818
    • Robust dynamic programming
    • G. Iyengar, "Robust dynamic programming," Math. Oper. Res., vol. 30, no. 2, pp. 257-280, 2005.
    • (2005) Math. Oper. Res. , vol.30 , Issue.2 , pp. 257-280
    • Iyengar, G.1
  • 16
    • 52649091902
    • Robust decision-making for uncertain Markov decision processes using sigma point sampling
    • L. F. Bertuccelli and J. P. How, "Robust decision-making for uncertain Markov decision processes using sigma point sampling," in Proc. American Control Conf., 2008, pp. 5003-5008.
    • Proc. American Control Conf., 2008 , pp. 5003-5008
    • Bertuccelli, L.F.1    How, J.P.2
  • 18
    • 0027684215
    • Prioritized sweeping: Reinforcement learning with less data and less time
    • A. Moore and C. Atkeson, "Prioritized sweeping: Reinforcement learning with less data and less time," Mach. Learn., vol. 13, no. 1, pp. 103-130, 1991.
    • (1991) Mach. Learn. , vol.13 , Issue.1 , pp. 103-130
    • Moore, A.1    Atkeson, C.2
  • 19
    • 0035592363
    • Finding generators for Markov chains via empirical transition matrices with applications to credit ratings
    • R. B. Israel, J. S. Rosenthal, and J. Z. Wei, "Finding generators for Markov chains via empirical transition matrices with applications to credit ratings," Math. Finance, vol. 11, no. 2, pp. 245-265, 2001.
    • (2001) Math. Finance , vol.11 , Issue.2 , pp. 245-265
    • Israel, R.B.1    Rosenthal, J.S.2    Wei, J.Z.3
  • 22
    • 0029210635
    • Learning to act using real-time dynamic programming
    • A. Barto, S. Bradtke, and S. Singh, "Learning to act using real-time dynamic programming," Artif. Intell., vol. 72, pp. 81-138, 1993.
    • (1993) Artif. Intell. , vol.72 , pp. 81-138
    • Barto, A.1    Bradtke, S.2    Singh, S.3
  • 24
    • 0344445520
    • Adapting the sample size in particle filters through KLD-sampling
    • D. Fox, "Adapting the sample size in particle filters through KLD-sampling," Int. J. Robot. Res., vol. 22, no. 12, pp. 985, 2003.
    • (2003) Int. J. Robot. Res. , vol.22 , Issue.12 , pp. 985
    • Fox, D.1
  • 26
    • 21244437999
    • Unscented filtering and nonlinear estimation
    • S. Julier and J. Uhlmann, "Unscented filtering and nonlinear estimation," Proc. IEEE, vol. 92, no. 3, pp. 401-422, 2004.
    • (2004) Proc. IEEE , vol.92 , Issue.3 , pp. 401-422
    • Julier, S.1    Uhlmann, J.2
  • 28
    • 2942619107
    • Online Bayesian estimation of transition probabilities for Markovian jump systems
    • V. Jilkov and X. Li, "Online Bayesian estimation of transition probabilities for Markovian jump systems," IEEE Trans. Signal Processing, vol. 52, no. 6, pp. 307-315, 2004.
    • (2004) IEEE Trans. Signal Processing , vol.52 , Issue.6 , pp. 307-315
    • Jilkov, V.1    Li, X.2
  • 30
    • 0015025294
    • Asymptotic behavior of the Kalman filter with exponential aging
    • R. W. Miller, "Asymptotic behavior of the Kalman filter with exponential aging," AIAA J., vol. 9, pp. 537-539, 1971.
    • (1971) AIAA J. , vol.9 , pp. 537-539
    • Miller, R.W.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.