



Volume , Issue , 2010, Pages

Online Markov decision processes under bandit feedback

Author keywords

[No Author keywords available]

Indexed keywords

E-LEARNING; MARKOV PROCESSES; STOCHASTIC SYSTEMS

EID: 85162052729     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (154)

References (8)
  • 2
    • 84899000904
    • Experts in a Markov decision process
    • Saul, L. K., Weiss, Y., and Bottou, L., editors
    • Even-Dar, E., Kakade, S. M., and Mansour, Y. (2005). Experts in a Markov decision process. In Saul, L. K., Weiss, Y., and Bottou, L., editors, Advances in Neural Information Processing Systems 17, pages 401-408.
    • (2005) Advances in Neural Information Processing Systems , vol.17 , pp. 401-408
    • Even-Dar, E.1    Kakade, S.M.2    Mansour, Y.3
  • 4
    • 84898073198
    • The online loop-free stochastic shortest-path problem
    • Neu, G., György, A., and Szepesvári, C. (2010). The online loop-free stochastic shortest-path problem. In COLT-10.
    • (2010) COLT-10
    • Neu, G.1    György, A.2    Szepesvári, C.3
  • 8
    • 70349280578
    • Markov decision processes with arbitrary reward processes
    • Yu, J. Y., Mannor, S., and Shimkin, N. (2009). Markov decision processes with arbitrary reward processes. Mathematics of Operations Research, 34(3):737-757.
    • (2009) Mathematics of Operations Research , vol.34 , Issue.3 , pp. 737-757
    • Yu, J.Y.1    Mannor, S.2    Shimkin, N.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.