메뉴 건너뛰기




Volumn , Issue , 2011, Pages

Optimistic optimization of a deterministic function without the knowledge of its smoothness

(1)  Munos, Rémi a  

a INRIA   (France)

Author keywords

[No Author keywords available]

Indexed keywords

BUDGET CONTROL; SAMPLING;

EID: 85162504694     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (180)

References (27)
  • 2
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning Journal, 47(2-3):235-256, 2002.
    • (2002) Machine Learning Journal , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 7
    • 77952027689 scopus 로고    scopus 로고
    • Online optimization of X-armed bandits
    • D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, editors, MIT Press
    • S. Bubeck, R. Munos, G. Stoltz, and Cs. Szepesvári. Online optimization of X-armed bandits. In D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, editors, Advances in Neural Information Processing Systems, volume 22, pages 201-208. MIT Press, 2008.
    • (2008) Advances in Neural Information Processing Systems , vol.22 , pp. 201-208
    • Bubeck, S.1    Munos, R.2    Stoltz, G.3    Szepesvári, Cs.4
  • 16
    • 58449106591 scopus 로고    scopus 로고
    • Optimistic planning of deterministic systems
    • editor, Recent Advances in Reinforcement Learning
    • J-F. Hren and R.Munos. Optimistic planning of deterministic systems. In European Workshop on Reinforcement Learning Springer LNAI 5323, editor, Recent Advances in Reinforcement Learning, pages 151-164, 2008.
    • (2008) European Workshop on Reinforcement Learning Springer LNAI , vol.5323 , pp. 151-164
    • Hren, J.-F.1    Munos, R.2
  • 25
    • 77956501313 scopus 로고    scopus 로고
    • Gaussian process optimization in the bandit setting: No regret and experimental design
    • Niranjan Srinivas, Andreas Krause, Sham Kakade, and Matthias Seeger. Gaussian process optimization in the bandit setting: No regret and experimental design. In International Conference on Machine Learning, pages 1015-1022, 2010.
    • (2010) International Conference on Machine Learning , pp. 1015-1022
    • Srinivas, N.1    Krause, A.2    Kakade, S.3    Seeger, M.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.