SCOPUS Information Search Platform

IEEE Control Systems

Volume 32, Issue 5, 2012, Pages 96-109

Robust adaptive Markov decision processes: Planning with model uncertainty

(3) Bertuccelli, Luca F. a,b; Wu, Albert c; How, Jonathan P. d,e,f

a UNITED TECHNOLOGIES RESEARCH CENTER (United States)

b not available (United States)

c CARNEGIE MELLON UNIVERSITY (United States)

d MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

e AIAA (United States)

f IEEE (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ANTENNAS; HUMAN RESOURCE MANAGEMENT; MARKOV PROCESSES; UNCERTAINTY ANALYSIS;

AUTONOMOUS SYSTEMS; CO-OPERATIVE CONTROL; COMPLEX DECISION; MARKOV DECISION PROCESSES; MODEL UNCERTAINTIES; ROBUST ADAPTIVE; UNCERTAIN ENVIRONMENTS; VEHICLE SENSORS;

UNMANNED AERIAL VEHICLES (UAV);

EID: 84877947571 PISSN: 1066033X EISSN: None Source Type: Journal
DOI: 10.1109/MCS.2012.2205478 Document Type: Article

Times cited: (27)

References (31)

1
- 85102627959
- Hoboken, NJ: Wiley
- M. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming. Hoboken, NJ: Wiley, 2005.
- (2005) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.¹

2
- 0034186843
- Optimal electricity supply bidding by Markov decision process
- H. Song, C. C. Liu, J. Lawarree, and R. Dahlgren, "Optimal electricity supply bidding by Markov decision process," IEEE Trans. Power Syst., vol. 15, no. 2, pp. 618-624, 2000.
- (2000) IEEE Trans. Power Syst. , vol.15 , Issue.2 , pp. 618-624
- Song, H.¹ Liu, C.C.² Lawarree, J.³ Dahlgren, R.⁴

3
- 84883619811
- Norwell, MA: Kluwer
- M. Schal, Handbook of Markov Decision Processes: Methods and Applications, Chapter Markov Decision Processes in Finance and Dynamic Options. Norwell, MA: Kluwer, 2002.
- (2002) Handbook of Markov Decision Processes: Methods and Applications, Chapter Markov Decision Processes in Finance and Dynamic Options
- Schal, M.¹

4
- 52449093126
- Group health management of UAV teams with applications to persistent surveillance
- B. Bethke, J. How, and J. Vian, "Group health management of UAV teams with applications to persistent surveillance," in Proc. American Control Conf., 2008, pp. 3145-3150.
- Proc. American Control Conf., 2008 , pp. 3145-3150
- Bethke, B.¹ How, J.² Vian, J.³

5
- 42149156696
- Collaborative distributed sensor management for multitarget tracking using hierarchical Markov decision processes
- D. Akselrod, A. Sinha, and T. Kirubarajan, "Collaborative distributed sensor management for multitarget tracking using hierarchical Markov decision processes," Proc. SPIE, vol. 6699, pp. 1-14, 2007.
- (2007) Proc. SPIE , vol.6699 , pp. 1-14
- Akselrod, D.¹ Sinha, A.² Kirubarajan, T.³

6
- 84883619947
- Adapting an MDP planner to time-dependency: Case study on a UAV coordination problem
- E. Rachelson, P. Fabiani, and F. Garcia, "Adapting an MDP planner to time-dependency: Case study on a UAV coordination problem," in Proc. 4th Workshop Planning and Plan Execution for Real-World Systems: Principles and Practices for Planning in Execution, 2009, pp. 1-8.
- Proc. 4th Workshop Planning and Plan Execution for Real-World Systems: Principles and Practices for Planning in Execution, 2009 , pp. 1-8
- Rachelson, E.¹ Fabiani, P.² Garcia, F.³

7
- 33644691667
- Performance prediction of an unmanned airborne vehicle multi-agent system
- DOI 10.1016/j.ejor.2004.10.015, PII S0377221704008161
- Z. Lian and A. Deshmukh, "Performance prediction of an unmanned airborne vehicle multiagent system," Eur. J. Oper. Res., vol. 172, no. 2, pp. 680-695, 2006. (Pubitemid 43332997)
- (2006) European Journal of Operational Research , vol.172 , Issue.2 , pp. 680-695
- Lian, Z.¹ Deshmukh, A.²

8
- 33847336943
- Bias and variance approximation in value function estimates
- DOI 10.1287/mnsc.1060.0614
- S. Mannor, D. Simester, P. Sun, and J. Tsitsiklis, "Bias and variance approximation in value function estimates," Manage. Sci., vol. 53, no. 2, pp. 308-322, 2007. (Pubitemid 46326182)
- (2007) Management Science , vol.53 , Issue.2 , pp. 308-322
- Mannor, S.¹ Simester, D.² Sun, P.³ Tsitsiklis, J.N.⁴

9
- 0028460403
- Markov decision processes with imprecise transition probabilities
- C. C. White and H. K. Eldeib, "Markov decision processes with imprecise transition probabilities," Oper. Res., vol. 42, no. 4, pp. 739-749, 1994.
- (1994) Oper. Res. , vol.42 , Issue.4 , pp. 739-749
- White, C.C.¹ Eldeib, H.K.²

10
- 62949129319
- Ph.D. dissertation, MIT
- L. F. Bertuccelli, "Robust decision-making with model uncertainty in aerospace systems," Ph.D. dissertation, MIT, 2008.
- (2008) Robust Decision-Making with Model Uncertainty in Aerospace Systems
- Bertuccelli, L.F.¹

11
- 77249117255
- Percentile optimization for Markov decision processes with parameter uncertainty
- E. Delage and S. Mannor, "Percentile optimization for Markov decision processes with parameter uncertainty," Oper. Res., vol. 58, no. 1, pp. 203-213, 2010.
- (2010) Oper. Res. , vol.58 , Issue.1 , pp. 203-213
- Delage, E.¹ Mannor, S.²

12
- 0001916840
- Risk-sensitive Markov decision processes
- S. Marcus, E. Fernandez-Gaucherand, D. Hernandez-Hernandez, S. Coraluppi, and P. Fard, "Risk-sensitive Markov decision processes," in Systems and Control in the Twenty-First Century, 1997.
- (1997) Systems and Control in the Twenty-First Century
- Marcus, S.¹ Fernandez-Gaucherand, E.² Hernandez-Hernandez, D.³ Coraluppi, S.⁴ Fard, P.⁵

13
- 14344250395
- Robust control of Markov decision processes with uncertain transition matrices
- DOI 10.1287/opre.1050.0216
- A. Nilim and L. El Ghaoui, "Robust solutions to Markov decision problems with uncertain transition matrices," Oper. Res., vol. 53, no. 5, pp. 780-798, 2005. (Pubitemid 41525849)
- (2005) Operations Research , vol.53 , Issue.5 , pp. 780-798
- Nilim, A.¹ Ghaoui, L.E.²

14
- 25444493818
- Robust dynamic programming
- G. Iyengar, "Robust dynamic programming," Math. Oper. Res., vol. 30, no. 2, pp. 257-280, 2005.
- (2005) Math. Oper. Res. , vol.30 , Issue.2 , pp. 257-280
- Iyengar, G.¹

15
- 1942450194
- Robotics Inst., Carnegie Mellon Univ., Pittsburgh, PA, Tech. Rep. CMU-RI-TR-01-25
- J. Bagnell, A. Y. Ng, and J. Schneider, "Solving uncertain Markov decision problems," Robotics Inst., Carnegie Mellon Univ., Pittsburgh, PA, Tech. Rep. CMU-RI-TR-01-25, 2001.
- (2001) Solving Uncertain Markov Decision Problems
- Bagnell, J.¹ Ng, A.Y.² Schneider, J.³

16
- 52649091902
- Robust decision-making for uncertain Markov decision processes using sigma point sampling
- L. F. Bertuccelli and J. P. How, "Robust decision-making for uncertain Markov decision processes using sigma point sampling," in Proc. American Control Conf., 2008, pp. 5003-5008.
- Proc. American Control Conf., 2008 , pp. 5003-5008
- Bertuccelli, L.F.¹ How, J.P.²

17
- 0002357911
- Convergence of indirect adaptive asynchronous value iteration algorithms
- V. Gullapalli and A. G. Barto, "Convergence of indirect adaptive asynchronous value iteration algorithms," in Advances in Neural Information Processing Systems, 1994, pp. 695-695.
- (1994) Advances in Neural Information Processing Systems , pp. 695-695
- Gullapalli, V.¹ Barto, A.G.²

18
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less time
- A. Moore and C. Atkeson, "Prioritized sweeping: Reinforcement learning with less data and less time," Mach. Learn., vol. 13, no. 1, pp. 103-130, 1991.
- (1991) Mach. Learn. , vol.13 , Issue.1 , pp. 103-130
- Moore, A.¹ Atkeson, C.²

19
- 0035592363
- Finding generators for Markov chains via empirical transition matrices with applications to credit ratings
- R. B. Israel, J. S. Rosenthal, and J. Z. Wei, "Finding generators for Markov chains via empirical transition matrices with applications to credit ratings," Math. Finance, vol. 11, no. 2, pp. 245-265, 2001.
- (2001) Math. Finance , vol.11 , Issue.2 , pp. 245-265
- Israel, R.B.¹ Rosenthal, J.S.² Wei, J.Z.³

20
- 34548170726
- Pop-up threat models for persistent area denial
- DOI 10.1109/TAES.2007.4285350
- Y. Liu, J. B. Cruz, and C. J. Schumacher, "Pop-up threat models for persistent area denial," IEEE Trans. Aerosp. Electron. Syst., vol. 43, no. 2, pp. 509-521, 2007. (Pubitemid 47308045)
- (2007) IEEE Transactions on Aerospace and Electronic Systems , vol.43 , Issue.2 , pp. 509-521
- Liu, Y.¹ Cruz Jr., J.B.² Schumacher, C.J.³

21
- 77958493967
- Experimental demonstration of MDP-based planning with model uncertainty
- AIAA-2008-6322
- B. Bethke, L. Bertuccelli, and J. P. How, "Experimental demonstration of MDP-based planning with model uncertainty," in Proc. AIAA Guidance Navigation and Control Conf., Aug. 2008, AIAA-2008-6322.
- Proc. AIAA Guidance Navigation and Control Conf., Aug. 2008
- Bethke, B.¹ Bertuccelli, L.² How, J.P.³

22
- 0029210635
- Learning to act using real-time dynamic programming
- A. Barto, S. Bradtke, and S. Singh, "Learning to act using real-time dynamic programming," Artif. Intell., vol. 72, pp. 81-138, 1993.
- (1993) Artif. Intell. , vol.72 , pp. 81-138
- Barto, A.¹ Bradtke, S.² Singh, S.³

23
- 0004012196
- London, U.K.: Chapman and Hall
- A. Gelman, J. Carlin, H. Stern, and D. Rubin, Bayesian Data Analysis. London, U.K.: Chapman and Hall, 1995.
- (1995) Bayesian Data Analysis
- Gelman, A.¹ Carlin, J.² Stern, H.³ Rubin, D.⁴

24
- 0344445520
- Adapting the sample size in particle filters through KLD-sampling
- D. Fox, "Adapting the sample size in particle filters through KLD-sampling," Int. J. Robot. Res., vol. 22, no. 12, pp. 985, 2003.
- (2003) Int. J. Robot. Res. , vol.22 , Issue.12 , pp. 985
- Fox, D.¹

25
- 0003665481
- New York: Springer-Verlag
- A. Doucet, N. De Freitas, and N. Gordon, Sequential Monte Carlo Methods in Practice. New York: Springer-Verlag, 2001.
- (2001) Sequential Monte Carlo Methods in Practice
- Doucet, A.¹ De Freitas, N.² Gordon, N.³

26
- 21244437999
- Unscented filtering and nonlinear estimation
- S. Julier and J. Uhlmann, "Unscented filtering and nonlinear estimation," Proc. IEEE, vol. 92, no. 3, pp. 401-422, 2004.
- (2004) Proc. IEEE , vol.92 , Issue.3 , pp. 401-422
- Julier, S.¹ Uhlmann, J.²

27
- 39649090194
- Learning in non-stationary partially observable Markov decision processes
- R. Jaulmes, J. Pineau, and D. Precup, "Learning in non-stationary partially observable Markov decision processes," in Proc. ECML Workshop Reinforcement Learning in Non-Stationary Environments, 2005, vol. 2, pp. 26.
- Proc. ECML Workshop Reinforcement Learning in Non-Stationary Environments, 2005 , vol.2 , pp. 26
- Jaulmes, R.¹ Pineau, J.² Precup, D.³

28
- 2942619107
- Online Bayesian estimation of transition probabilities for Markovian jump systems
- V. Jilkov and X. Li, "Online Bayesian estimation of transition probabilities for Markovian jump systems," IEEE Trans. Signal Processing, vol. 52, no. 6, pp. 307-315, 2004.
- (2004) IEEE Trans. Signal Processing , vol.52 , Issue.6 , pp. 307-315
- Jilkov, V.¹ Li, X.²

29
- 35548994553
- New York: Wiley Interscience
- Y. Bar Shalom, X. Rong Li, and T. Kirubarajan, Estimation with Applications to Tracking and Navigation. New York: Wiley Interscience, 2001.
- (2001) Estimation with Applications to Tracking and Navigation
- Bar Shalom, Y.¹ Rong Li, X.² Kirubarajan, T.³

30
- 0015025294
- Asymptotic behavior of the Kalman filter with exponential aging
- R. W. Miller, "Asymptotic behavior of the Kalman filter with exponential aging," AIAA J., vol. 9, pp. 537-539, 1971.
- (1971) AIAA J. , vol.9 , pp. 537-539
- Miller, R.W.¹

31
- 0003565783
- Athena Scientific
- D. Bertsekas, Dynamic Programming and Optimal Control. Athena Scientific, 2005.
- (2005) Dynamic Programming and Optimal Control
- Bertsekas, D.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.