SCOPUS Information Retrieval Platform

Proceedings of the American Control Conference

2009, Pages 1304-1309

Robust adaptive Markov decision processes in multi-vehicle applications

Authors (3): Bertuccelli, Luca F. (a); Bethke, Brett (a); How, Jonathan P. (a)

(a) Massachusetts Institute of Technology (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACTUAL FLIGHT; ADAPTATION PROCESS; ADAPTIVE FRAMEWORK; ADAPTIVE POLICY; INDIVIDUAL STRENGTH; MARKOV DECISION PROCESSES; MISSION PERFORMANCE; MODEL UPDATING; MULTI-VEHICLES; NUMBER OF STATE; OPTIMAL VALUE FUNCTIONS; ROBUST ADAPTIVE; TRANSIENT BEHAVIOR; TRANSIENT PERFORMANCE; TRANSITION PROBABILITIES; WORST-CASE PERFORMANCE;

MARKOV PROCESSES;

SIMULATORS;

EID: 70449640592
PISSN: 07431619
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1109/ACC.2009.5160511
Document Type: Conference Paper

Times cited: 7

References (26)

1
- 14344250395
- Robust Solutions to Markov Decision Problems with Uncertain Transition Matrices
- A. Nilim and L. E. Ghaoui, "Robust Solutions to Markov Decision Problems with Uncertain Transition Matrices," Operations Research, vol. 53, no. 5, 2005.
- (2005) Operations Research , vol.53 , Issue.5
- Nilim, A.¹ Ghaoui, L.E.²

2
- 25444493818
- Robust Dynamic Programming
- G. Iyengar, "Robust Dynamic Programming," Math. Oper. Res., vol. 30, no. 2, pp. 257-280, 2005.
- (2005) Math. Oper. Res , vol.30 , Issue.2 , pp. 257-280
- Iyengar, G.¹

3
- 33847336943
- Bias and Variance Approximation in Value Function Estimates
- S. Mannor, D. Simester, P. Sun, and J. Tsitsiklis, "Bias and Variance Approximation in Value Function Estimates," Management Science, vol. 52, no. 2, pp. 308-322, 2007.
- (2007) Management Science , vol.52 , Issue.2 , pp. 308-322
- Mannor, S.¹ Simester, D.² Sun, P.³ Tsitsiklis, J.⁴

4
- 62949180684
- Robust Decision-Making for Uncertain Markov Decision Processes Using Sigma Point Sampling
- L. F. Bertuccelli and J. P. How, "Robust Decision-Making for Uncertain Markov Decision Processes Using Sigma Point Sampling," IEEE American Controls Conference, 2008.
- (2008) IEEE American Controls Conference
- Bertuccelli, L.F.¹ How, J.P.²

5
- 0025514707
- Methods for reasoning with imprecise probabilities in intelligent decision systems
- D. E. Brown and C. C. White, "Methods for reasoning with imprecise probabilities in intelligent decision systems," IEEE Conference on Systems, Man and Cybernetics, pp. 161-163, 1990.
- (1990) IEEE Conference on Systems, Man and Cybernetics , pp. 161-163
- Brown, D.E.¹ White, C.C.²

6
- 0015630091
- Markovian Decision Processes with Uncertain Transition Probabilities
- J. K. Satia and R. E. Lave, "Markovian Decision Processes with Uncertain Transition Probabilities," Operations Research, vol. 21, no. 3, 1973.
- (1973) Operations Research , vol.21 , Issue.3
- Satia, J.K.¹ Lave, R.E.²

7
- 0028460403
- Markov Decision Processes with Imprecise Transition Probabilities
- C. C. White and H. K. Eldeib, "Markov Decision Processes with Imprecise Transition Probabilities," Operations Research, vol. 42, no. 4, 1994.
- (1994) Operations Research , vol.42 , Issue.4
- White, C.C.¹ Eldeib, H.K.²

8
- 1942450194
- Solving Uncertain Markov Decision Processes
- A. Bagnell, A. Ng, and J. Schneider, "Solving Uncertain Markov Decision Processes," NIPS, 2001.
- (2001) NIPS
- Bagnell, A.¹ Ng, A.² Schneider, J.³

9
- 0004255876
- Adaptive Control
- K. J. Astrom and B. Wittenmark, Adaptive Control. Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc., 1994.
- (1994) Adaptive Control
- Astrom, K.J.¹ Wittenmark, B.²

10
- 0004102479
- Reinforcement Learning: An Introduction
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction (Adaptive Computation and Machine Learning). The MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction (Adaptive Computation and Machine Learning)
- Sutton, R.S.¹ Barto, A.G.²

11
- 39649090194
- Active Learning in Partially Observable Markov Decision Processes
- R. Jaulmes, J. Pineau, and D. Precup, "Active Learning in Partially Observable Markov Decision Processes," European Conference on Machine Learning (ECML), 2005.
- (2005) European Conference on Machine Learning (ECML)
- Jaulmes, R.¹ Pineau, J.² Precup, D.³

12
- 39649090194
- Learning in Non-Stationary Partially Observable Markov Decision Processes
- R. Jaulmes, J. Pineau, and D. Precup, "Learning in Non-Stationary Partially Observable Markov Decision Processes," ECML Workshop on Reinforcement Learning in Non-Stationary Environments, 2005.
- (2005) ECML Workshop on Reinforcement Learning in Non-Stationary Environments
- Jaulmes, R.¹ Pineau, J.² Precup, D.³

13
- 0009011171
- Simulation-based methods for Markov Decision Processes
- P. Marbach, Simulation-based methods for Markov Decision Processes. PhD thesis, MIT, 1998.
- (1998) Simulation-based methods for Markov Decision Processes
- Marbach, P.¹

14
- 0041510534
- Linear stochastic approximation driven by slowly varying Markov chains
- V. Konda and J. Tsitsiklis, "Linear stochastic approximation driven by slowly varying Markov chains," Systems and Control Letters, vol. 50, 2003.
- (2003) Systems and Control Letters , vol.50
- Konda, V.¹ Tsitsiklis, J.²

15
- 0020114278
- Learning Control of Finite Markov Chains with Unknown Transition Probabilities
- M. Sato, K. Abe, and H. Takeda, "Learning Control of Finite Markov Chains with Unknown Transition Probabilities," IEEE Trans. on Automatic Control, vol. AC-27, no. 2, 1982.
- (1982) IEEE Trans. on Automatic Control , vol.AC-27 , Issue.2
- Sato, M.¹ Abe, K.² Takeda, H.³

16
- 0020632587
- Simultaneous Identification and Adaptive Control of Unknown Systems over Finite Parameters Sets
- P. R. Kumar and W. Lin, "Simultaneous Identification and Adaptive Control of Unknown Systems over Finite Parameters Sets," IEEE Trans. on Automatic Control, vol. AC-28, no. 1, 1983.
- (1983) IEEE Trans. on Automatic Control , vol.AC-28 , Issue.1
- Kumar, P.R.¹ Lin, W.²

17
- 0032075655
- Adaptive Estimation of HMM Transition Probabilities
- J. Ford and J. Moore, "Adaptive Estimation of HMM Transition Probabilities," IEEE Transactions on Signal Processing, vol. 46, no. 5, 1998.
- (1998) IEEE Transactions on Signal Processing , vol.46 , Issue.5
- Ford, J.¹ Moore, J.²

18
- 0004106918
- Robust Adaptive Control
- P. A. Ioannou and J. Sun, Robust Adaptive Control. Prentice-Hall, 1996.
- (1996) Robust Adaptive Control
- Ioannou, P.A.¹ Sun, J.²

19
- 55349141438
- A Robust Approach to the UAV Task Assignment Problem
- M. Alighanbari and J. P. How, "A Robust Approach to the UAV Task Assignment Problem," International Journal of Robust and Nonlinear Control, vol. 18, no. 2, 2008.
- (2008) International Journal of Robust and Nonlinear Control , vol.18 , Issue.2
- Alighanbari, M.¹ How, J.P.²

20
- 85102627959
- Markov Decision Processes: Discrete Stochastic Dynamic Programming
- M. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, 2005.
- (2005) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.¹

21
- 62949129319
- Robust Decision-Making with Model Uncertainty in Aerospace Systems
- L. F. Bertuccelli, Robust Decision-Making with Model Uncertainty in Aerospace Systems. PhD thesis, MIT, 2008.
- (2008) Robust Decision-Making with Model Uncertainty in Aerospace Systems
- Bertuccelli, L.F.¹

22
- 52449093126
- Group Health Management of UAV Teams With Applications to Persistent Surveillance
- B. Bethke, J. How, and J. Vian, "Group Health Management of UAV Teams With Applications to Persistent Surveillance," IEEE American Controls Conference, 2008.
- (2008) IEEE American Controls Conference
- Bethke, B.¹ How, J.² Vian, J.³

23
- 62949122724
- Estimation of Non-Stationary Markov Chain Transition Models
- L. F. Bertuccelli and J. P. How, "Estimation of Non-Stationary Markov Chain Transition Models," IEEE Conference on Decision and Control, 2008.
- (2008) IEEE Conference on Decision and Control
- Bertuccelli, L.F.¹ How, J.P.²

24
- 0029210635
- Learning to Act using Real-Time Dynamic Programming
- A. Barto, S. Bradtke, and S. Singh, "Learning to Act using Real-Time Dynamic Programming," Artificial Intelligence, vol. 72, pp. 81-138, 1993.
- (1993) Artificial Intelligence , vol.72 , pp. 81-138
- Barto, A.¹ Bradtke, S.² Singh, S.³

25
- 0002357911
- Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms
- V. Gullapalli and A. Barto, "Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms," Advances in NIPS, 1994.
- (1994) Advances in NIPS
- Gullapalli, V.¹ Barto, A.²

26
- 77958493967
- Experimental Demonstration of MDP-Based Planning with Model Uncertainty
- B. Bethke, L. Bertuccelli, and J. P. How, "Experimental Demonstration of MDP-Based Planning with Model Uncertainty," in AIAA Guidance Navigation and Control Conference, Aug 2008. AIAA-2008-6322.
- (2008) AIAA Guidance Navigation and Control Conference
- Bethke, B.¹ Bertuccelli, L.² How, J.P.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.