2014, Pages 459-467

Bandits with switching costs: T^{2/3} regret

Author keywords

Lower bounds; Multi-armed bandit; Online learning; Switching costs

Indexed keywords

COSTS; LEARNING ALGORITHMS; MARKOV PROCESSES; STATISTICS; TIME SWITCHES

EID: 84904307224     PISSN: 07378017     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2591796.2591868     Document Type: Conference Paper
Times cited: 113

References (17)
  • 7
    • F. R. K. Chung and P. D. Seymour. Graphs with small bandwidth and cutwidth. Discrete Mathematics, 75(1-3):113-119, 1989.
  • 10
    • Y. Freund and R. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119-139, 1997.
  • 17
    • J. Y. Yu, S. Mannor, and N. Shimkin. Markov decision processes with arbitrary reward processes. Mathematics of Operations Research, 34(3):737-757, 2009.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.