메뉴 건너뛰기




Volumn , Issue , 2009, Pages 1304-1309

Robust adaptive Markov decision processes in multi-vehicle applications

Author keywords

[No Author keywords available]

Indexed keywords

ACTUAL FLIGHT; ADAPTATION PROCESS; ADAPTIVE FRAMEWORK; ADAPTIVE POLICY; INDIVIDUAL STRENGTH; MARKOV DECISION PROCESSES; MISSION PERFORMANCE; MODEL UPDATING; MULTI-VEHICLES; NUMBER OF STATE; OPTIMAL VALUE FUNCTIONS; ROBUST ADAPTIVE; TRANSIENT BEHAVIOR; TRANSIENT PERFORMANCE; TRANSITION PROBABILITIES; WORST-CASE PERFORMANCE;

EID: 70449640592     PISSN: 07431619     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ACC.2009.5160511     Document Type: Conference Paper
Times cited : (7)

References (26)
  • 1
    • 14344250395 scopus 로고    scopus 로고
    • Robust Solutions to Markov Decision Problems with Uncertain Transition Matrices
    • A. Nilim and L. E. Ghaoui, "Robust Solutions to Markov Decision Problems with Uncertain Transition Matrices," Operations Research, vol. 53, no. 5, 2005.
    • (2005) Operations Research , vol.53 , Issue.5
    • Nilim, A.1    Ghaoui, L.E.2
  • 2
    • 25444493818 scopus 로고    scopus 로고
    • Robust Dynamic Programming
    • G. Iyengar, "Robust Dynamic Programming," Math. Oper. Res., vol. 30, no. 2, pp. 257-280, 2005.
    • (2005) Math. Oper. Res , vol.30 , Issue.2 , pp. 257-280
    • Iyengar, G.1
  • 3
    • 33847336943 scopus 로고    scopus 로고
    • Bias and Variance Approximation in Value Function Estimates
    • S. Mannor, D. Simester, P. Sun, and J. Tsitsiklis, "Bias and Variance Approximation in Value Function Estimates," Management Science, vol. 52, no. 2, pp. 308-322, 2007.
    • (2007) Management Science , vol.52 , Issue.2 , pp. 308-322
    • Mannor, S.1    Simester, D.2    Sun, P.3    Tsitsiklis, J.4
  • 4
    • 62949180684 scopus 로고    scopus 로고
    • Robust Decision-Making for Uncertain Markov Decision Processes Using Sigma Point Sampling
    • L. F. Bertuccelli and J. P. How, "Robust Decision-Making for Uncertain Markov Decision Processes Using Sigma Point Sampling," IEEE American Controls Conference, 2008.
    • (2008) IEEE American Controls Conference
    • Bertuccelli, L.F.1    How, J.P.2
  • 5
    • 0025514707 scopus 로고
    • Methods for reasoning with imprecise probabilities in intelligent decision systems
    • Man and Cybernetics, pp
    • D. E. Brown and C. C. White., "Methods for reasoning with imprecise probabilities in intelligent decision systems," IEEE Conference on Systems, Man and Cybernetics, pp. 161-163, 1990.
    • (1990) IEEE Conference on Systems , pp. 161-163
    • Brown, D.E.1    White, C.C.2
  • 6
    • 0015630091 scopus 로고
    • Markovian Decision Processes with Uncertain Transition Probabilities
    • J. K. Satia and R. E. Lave., "Markovian Decision Processes with Uncertain Transition Probabilities," Operations Research, vol. 21, no. 3, 1973.
    • (1973) Operations Research , vol.21 , Issue.3
    • Satia, J.K.1    Lave, R.E.2
  • 7
    • 0028460403 scopus 로고
    • Markov Decision Processes with Imprecise Transition Probabilities
    • C. C. White and H. K. Eldeib., "Markov Decision Processes with Imprecise Transition Probabilities," Operations Research, vol. 42, no. 4, 1994.
    • (1994) Operations Research , vol.42 , Issue.4
    • White, C.C.1    Eldeib, H.K.2
  • 8
    • 1942450194 scopus 로고    scopus 로고
    • Solving Uncertain Markov Decision Processes
    • A. Bagnell, A. Ng, and J. Schneider, "Solving Uncertain Markov Decision Processes," NIPS, 2001.
    • (2001) NIPS
    • Bagnell, A.1    Ng, A.2    Schneider, J.3
  • 9
    • 0004255876 scopus 로고
    • Boston, MA, USA: Addison-Wesley Longman Publishing Co, Inc
    • K. J. Astrom and B. Wittenmark, Adaptive Control. Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc., 1994.
    • (1994) Adaptive Control
    • Astrom, K.J.1    Wittenmark, B.2
  • 14
    • 0041510534 scopus 로고    scopus 로고
    • Linear stochastic approximation driven by slowly varying Markov chains
    • V. Konda and J. Tsitsiklis, "Linear stochastic approximation driven by slowly varying Markov chains," Systems and Control Letters, vol. 50, 2003.
    • (2003) Systems and Control Letters , vol.50
    • Konda, V.1    Tsitsiklis, J.2
  • 15
    • 0020114278 scopus 로고
    • Learning Control of Finite Markov Chains with Unknown Transition Probabilities
    • M. Sato, K. Abe, and H. Takeda., "Learning Control of Finite Markov Chains with Unknown Transition Probabilities," IEEE Trans. on Automatic Control, vol. AC-27, no. 2, 1982.
    • (1982) IEEE Trans. on Automatic Control , vol.AC-27 , Issue.2
    • Sato, M.1    Abe, K.2    Takeda, H.3
  • 16
    • 0020632587 scopus 로고
    • Simultaneous Identification and Adaptive Control of Unknown Systems over Finite Parameters Sets
    • P. R. Kumar and W. Lin., "Simultaneous Identification and Adaptive Control of Unknown Systems over Finite Parameters Sets.," IEEE Trans. on Automatic Control, vol. AC-28, no. 1, 1983.
    • (1983) IEEE Trans. on Automatic Control , vol.AC-28 , Issue.1
    • Kumar, P.R.1    Lin, W.2
  • 17
    • 0032075655 scopus 로고    scopus 로고
    • Adaptive Estimation of HMM Transition Probabilities
    • J. Ford and J. Moore, "Adaptive Estimation of HMM Transition Probabilities," IEEE Transactions on Signal Processing, vol. 46, no. 5, 1998.
    • (1998) IEEE Transactions on Signal Processing , vol.46 , Issue.5
    • Ford, J.1    Moore, J.2
  • 22
    • 52449093126 scopus 로고    scopus 로고
    • Group Health Management of UAV Teams With Applications to Persistent Surveillance
    • B. Bethke, J. How, and J. Vian., " Group Health Management of UAV Teams With Applications to Persistent Surveillance," IEEE American Controls Conference, 2008.
    • (2008) IEEE American Controls Conference
    • Bethke, B.1    How, J.2    Vian, J.3
  • 24
    • 0029210635 scopus 로고
    • Learning to Act using Real-Time Dynamic Programming
    • A. Barto, S. Bradtke, and S. Singh., " Learning to Act using Real-Time Dynamic Programming," Artificial Intelligence, vol. 72, pp. 81-138, 1993.
    • (1993) Artificial Intelligence , vol.72 , pp. 81-138
    • Barto, A.1    Bradtke, S.2    Singh, S.3
  • 25
    • 0002357911 scopus 로고
    • Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms
    • V. Gullapalli and A. Barto., "Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms," Advances in NIPS, 1994.
    • (1994) Advances in NIPS
    • Gullapalli, V.1    Barto, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.