메뉴 건너뛰기




Volumn , Issue , 2011, Pages 2165-2171

Robust online optimization of reward-uncertain MDPs

Author keywords

[No Author keywords available]

Indexed keywords

ANY-TIME ALGORITHMS; APPROXIMATION SCHEME; COMPUTATIONAL TRACTABILITY; ERROR BOUND; MARKOV DECISION PROCESSES; MINIMAX REGRET; ONLINE OPTIMIZATION; REWARD FUNCTION;

EID: 84881084517     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.5591/978-1-57735-516-8/IJCAI11-361     Document Type: Conference Paper
Times cited : (31)

References (19)
  • 1
    • 0010606787 scopus 로고    scopus 로고
    • lrs: A revised implementation of the reverse search vertex enumeration algorithm
    • Birkhauser-Verlag
    • David Avis. lrs: A revised implementation of the reverse search vertex enumeration algorithm. In Polytopes-Combinatorics and Computation, pages 177-198. Birkhauser-Verlag, 2000.
    • (2000) Polytopes-Combinatorics and Computation , pp. 177-198
    • Avis, D.1
  • 5
    • 33646096015 scopus 로고    scopus 로고
    • Constraint-based optimization and utility elicitation using the minimax decision criterion
    • Craig Boutilier, Relu Patrascu, Pascal Poupart, and Dale Schuurmans. Constraint-based optimization and utility elicitation using the minimax decision criterion. Artifical Intelligence, 170(8-9):686-713, 2006.
    • (2006) Artifical Intelligence , vol.170 , Issue.8-9 , pp. 686-713
    • Boutilier, C.1    Patrascu, R.2    Poupart, P.3    Schuurmans, D.4
  • 12
    • 14344250395 scopus 로고    scopus 로고
    • Robust control of Markov decision processes with uncertain transition matrices
    • Arnab Nilim and Laurent El Ghaoui. Robust control of Markov decision processes with uncertain transition matrices. Operations Research, 53(1):780-798, 2005.
    • (2005) Operations Research , vol.53 , Issue.1 , pp. 780-798
    • Nilim, A.1    El Ghaoui, L.2
  • 19
    • 77950823530 scopus 로고    scopus 로고
    • Parametric regret in uncertain Markov decision processes
    • Shanghai
    • Huan Xu and Shie Mannor. Parametric regret in uncertain Markov decision processes. In 48th IEEE Conference on Decision and Control, pages 3606-3613, Shanghai, 2009.
    • (2009) 48th IEEE Conference on Decision and Control , pp. 3606-3613
    • Xu, H.1    Mannor, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.