-
1
-
-
14344250395
-
Robust Solutions to Markov Decision Problems with Uncertain Transition Matrices
-
A. Nilim and L. E. Ghaoui, "Robust Solutions to Markov Decision Problems with Uncertain Transition Matrices," Operations Research, vol. 53, no. 5, 2005.
-
(2005)
Operations Research
, vol.53
, Issue.5
-
-
Nilim, A.1
Ghaoui, L.E.2
-
2
-
-
25444493818
-
Robust Dynamic Programming
-
G. Iyengar, "Robust Dynamic Programming," Math. Oper. Res., vol. 30, no. 2, pp. 257-280, 2005.
-
(2005)
Math. Oper. Res
, vol.30
, Issue.2
, pp. 257-280
-
-
Iyengar, G.1
-
3
-
-
33847336943
-
Bias and Variance Approximation in Value Function Estimates
-
S. Mannor, D. Simester, P. Sun, and J. Tsitsiklis, "Bias and Variance Approximation in Value Function Estimates," Management Science, vol. 52, no. 2, pp. 308-322, 2007.
-
(2007)
Management Science
, vol.52
, Issue.2
, pp. 308-322
-
-
Mannor, S.1
Simester, D.2
Sun, P.3
Tsitsiklis, J.4
-
4
-
-
62949180684
-
Robust Decision-Making for Uncertain Markov Decision Processes Using Sigma Point Sampling
-
L. F. Bertuccelli and J. P. How, "Robust Decision-Making for Uncertain Markov Decision Processes Using Sigma Point Sampling," IEEE American Controls Conference, 2008.
-
(2008)
IEEE American Controls Conference
-
-
Bertuccelli, L.F.1
How, J.P.2
-
5
-
-
0025514707
-
Methods for reasoning with imprecise probabilities in intelligent decision systems
-
Man and Cybernetics, pp
-
D. E. Brown and C. C. White., "Methods for reasoning with imprecise probabilities in intelligent decision systems," IEEE Conference on Systems, Man and Cybernetics, pp. 161-163, 1990.
-
(1990)
IEEE Conference on Systems
, pp. 161-163
-
-
Brown, D.E.1
White, C.C.2
-
6
-
-
0015630091
-
Markovian Decision Processes with Uncertain Transition Probabilities
-
J. K. Satia and R. E. Lave., "Markovian Decision Processes with Uncertain Transition Probabilities," Operations Research, vol. 21, no. 3, 1973.
-
(1973)
Operations Research
, vol.21
, Issue.3
-
-
Satia, J.K.1
Lave, R.E.2
-
7
-
-
0028460403
-
Markov Decision Processes with Imprecise Transition Probabilities
-
C. C. White and H. K. Eldeib., "Markov Decision Processes with Imprecise Transition Probabilities," Operations Research, vol. 42, no. 4, 1994.
-
(1994)
Operations Research
, vol.42
, Issue.4
-
-
White, C.C.1
Eldeib, H.K.2
-
8
-
-
1942450194
-
Solving Uncertain Markov Decision Processes
-
A. Bagnell, A. Ng, and J. Schneider, "Solving Uncertain Markov Decision Processes," NIPS, 2001.
-
(2001)
NIPS
-
-
Bagnell, A.1
Ng, A.2
Schneider, J.3
-
9
-
-
0004255876
-
-
Boston, MA, USA: Addison-Wesley Longman Publishing Co, Inc
-
K. J. Astrom and B. Wittenmark, Adaptive Control. Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc., 1994.
-
(1994)
Adaptive Control
-
-
Astrom, K.J.1
Wittenmark, B.2
-
14
-
-
0041510534
-
Linear stochastic approximation driven by slowly varying Markov chains
-
V. Konda and J. Tsitsiklis, "Linear stochastic approximation driven by slowly varying Markov chains," Systems and Control Letters, vol. 50, 2003.
-
(2003)
Systems and Control Letters
, vol.50
-
-
Konda, V.1
Tsitsiklis, J.2
-
15
-
-
0020114278
-
Learning Control of Finite Markov Chains with Unknown Transition Probabilities
-
M. Sato, K. Abe, and H. Takeda., "Learning Control of Finite Markov Chains with Unknown Transition Probabilities," IEEE Trans. on Automatic Control, vol. AC-27, no. 2, 1982.
-
(1982)
IEEE Trans. on Automatic Control
, vol.AC-27
, Issue.2
-
-
Sato, M.1
Abe, K.2
Takeda, H.3
-
16
-
-
0020632587
-
Simultaneous Identification and Adaptive Control of Unknown Systems over Finite Parameters Sets
-
P. R. Kumar and W. Lin., "Simultaneous Identification and Adaptive Control of Unknown Systems over Finite Parameters Sets.," IEEE Trans. on Automatic Control, vol. AC-28, no. 1, 1983.
-
(1983)
IEEE Trans. on Automatic Control
, vol.AC-28
, Issue.1
-
-
Kumar, P.R.1
Lin, W.2
-
17
-
-
0032075655
-
Adaptive Estimation of HMM Transition Probabilities
-
J. Ford and J. Moore, "Adaptive Estimation of HMM Transition Probabilities," IEEE Transactions on Signal Processing, vol. 46, no. 5, 1998.
-
(1998)
IEEE Transactions on Signal Processing
, vol.46
, Issue.5
-
-
Ford, J.1
Moore, J.2
-
22
-
-
52449093126
-
Group Health Management of UAV Teams With Applications to Persistent Surveillance
-
B. Bethke, J. How, and J. Vian., " Group Health Management of UAV Teams With Applications to Persistent Surveillance," IEEE American Controls Conference, 2008.
-
(2008)
IEEE American Controls Conference
-
-
Bethke, B.1
How, J.2
Vian, J.3
-
24
-
-
0029210635
-
Learning to Act using Real-Time Dynamic Programming
-
A. Barto, S. Bradtke, and S. Singh., " Learning to Act using Real-Time Dynamic Programming," Artificial Intelligence, vol. 72, pp. 81-138, 1993.
-
(1993)
Artificial Intelligence
, vol.72
, pp. 81-138
-
-
Barto, A.1
Bradtke, S.2
Singh, S.3
-
25
-
-
0002357911
-
Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms
-
V. Gullapalli and A. Barto., "Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms," Advances in NIPS, 1994.
-
(1994)
Advances in NIPS
-
-
Gullapalli, V.1
Barto, A.2
|