Volume , Issue PART 3, 2013, Pages 1712-1720

Better rates for any adversarial deterministic MDP

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; LEARNING SYSTEMS; MARKOV PROCESSES

EID: 84897554269     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 17

References (17)
  • 1
    • 84862535425
    • Interior-point methods for full-information and bandit online learning
    • Abernethy, J., Hazan, E., and Rakhlin, A. Interior-point methods for full-information and bandit online learning. IEEE Transactions on Information Theory, 58(7):4164-4175, 2012.
    • (2012) IEEE Transactions on Information Theory , vol.58 , Issue.7 , pp. 4164-4175
    • Abernethy, J.1    Hazan, E.2    Rakhlin, A.3
  • 8
    • 0344560051
    • Periods of connected networks and powers of nonnegative matrices
    • Denardo, E. V. Periods of connected networks and powers of nonnegative matrices. Mathematics of Operations Research, 2(1):20-24, 1977.
    • (1977) Mathematics of Operations Research , vol.2 , Issue.1 , pp. 20-24
    • Denardo, E.V.1
  • 11
    • 50249167647
    • On polynomial cases of the unichain classification problem for Markov decision processes
    • Feinberg, E. A. and Yang, F. On polynomial cases of the unichain classification problem for Markov decision processes. Operations Research Letters, 36(5):527-530, 2008.
    • (2008) Operations Research Letters , vol.36 , Issue.5 , pp. 527-530
    • Feinberg, E.A.1    Yang, F.2
  • 13
    • 77953539718
    • Online regret bounds for Markov decision processes with deterministic transitions
    • Ortner, R. Online regret bounds for Markov decision processes with deterministic transitions. Theoretical Computer Science, 411(29-30):2684-2695, 2010.
    • (2010) Theoretical Computer Science , vol.411 , Issue.29-30 , pp. 2684-2695
    • Ortner, R.1
  • 16
    • 70349280578
    • Markov decision processes with arbitrary reward processes
    • Yu, J. Y., Mannor, S., and Shimkin, N. Markov decision processes with arbitrary reward processes. Mathematics of Operations Research, 34(3):737-757, 2009.
    • (2009) Mathematics of Operations Research , vol.34 , Issue.3 , pp. 737-757
    • Yu, J.Y.1    Mannor, S.2    Shimkin, N.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.