SCOPUS 정보 검색 플랫폼

IFAC Proceedings Volumes (IFAC-PapersOnline)

Volumn 17, Issue 1 PART 1, 2008, Pages

Gap-free bounds for stochastic multi-armed bandit

(4) Juditsky, Anatoly a Nazin, Alexander V a Tsybakov, Alexander a Vayatis, Nicolas a

a NONE

Author keywords

Learning theory; Randomized methods; Stochastic control

Indexed keywords

GRADIENT TYPE; INSTANTANEOUS LOSS; LEARNING THEORY; MULTI ARMED BANDIT; MULTI-ARMED BANDIT PROBLEM; RANDOMIZED DECISIONS; RANDOMIZED METHODS; STOCHASTIC CONTROL;

RISK PERCEPTION; STOCHASTIC CONTROL SYSTEMS;

PROBABILITY DISTRIBUTIONS;

EID: 79961019787 PISSN: 14746670 EISSN: None Source Type: Conference Proceeding
DOI: 10.3182/20080706-5-KR-1001.2585 Document Type: Conference Paper

Times cited : (22)

References (15)

1
- 38149013086
- Tuning bandit algorithms in stochastic environments
- Sendai, 1-4 October
- J.Y. Audibert, Remi M., and Cs. Szepesvari. Tuning bandit algorithms in stochastic environments. In 18th International Conference on Algorithmic Learning Theory, pages 150-165, Sendai, 1-4 October 2007.
- (2007) 18th International Conference on Algorithmic Learning Theory , pp. 150-165
- Audibert, J.Y.¹ Remi, M.² Szepesvari, Cs.³

2
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47:235-256, 2002a.
- (2002) Machine Learning , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

3
- 0037709910
- The nonstochastic multiarmed bandit problem
- P. Auer, N. Cesa-Bianchi, Y. Freund, and R. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, 2002b.
- (2002) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.⁴

4
- 0004320602
- Minerva optimization center, Technion Institute of Technology
- A. Ben-Tal and A.S. Nemirovski. The conjugate barrier mirror descent method for non-smooth convex optimization. Minerva optimization center, Technion Institute of Technology, 1999.
- (1999) The Conjugate Barrier Mirror Descent Method for Non-smooth Convex Optimization
- Ben-Tal, A.¹ Nemirovski, A.S.²

5
- 84926078662
- Cambridge University Press
- N. Cesa-Bianchi and G. Lugosi. Prediction, Learning, and Games. Cambridge University Press, 2006.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

6
- 31344435933
- Recursive aggregation of estimators by the mirror descent algorithm with averaging
- A.B. Juditsky, A.V. Nazin, A.B. Tsybakov, and N. Vayatis. Recursive aggregation of estimators by the mirror descent algorithm with averaging. Problems of Information Transmission, 41(4):368-384, 2005.
- (2005) Problems of Information Transmission , vol.41 , Issue.4 , pp. 368-384
- Juditsky, A.B.¹ Nazin, A.V.² Tsybakov, A.B.³ Vayatis, N.⁴

7
- 0008815681
- Exponentiated gradient versus gradient descent for linear predictors
- J. Kivinen and M. Warmuth. Exponentiated gradient versus gradient descent for linear predictors. Information and Computation, 132(1):1-63, 1997.
- (1997) Information and Computation , vol.132 , Issue.1 , pp. 1-63
- Kivinen, J.¹ Warmuth, M.²

8
- 0002899547
- Asymptotic efficient adaptive allocation rules
- T.L. Lai and H. Robbins. Asymptotic efficient adaptive allocation rules. Advances in Applied Mathematics, 6: 4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

9
- 0003988124
- Pergamon Press, Inc., Elmsford, NY, USA, ISBN 0-08-042024-9
- K. Najim and A.S. Poznyak. Learning automata: theory and applications. Pergamon Press, Inc., Elmsford, NY, USA, 1994. ISBN 0-08-042024-9.
- (1994) Learning Automata: Theory and Applications
- Najim, K.¹ Poznyak, A.S.²

10
- 0004187109
- Nauka, Moscow
- A.V. Nazin and A.S. Poznyak. Adaptive Choice of Variants. Nauka, Moscow, 1986.
- (1986) Adaptive Choice of Variants
- Nazin, A.V.¹ Poznyak, A.S.²

11
- 0003692801
- Wiley-Interscience
- A.S. Nemirovski and D.B. Yudin. Problem Complexity and Method Efficiency in Optimization. Wiley-Interscience, 1983.
- (1983) Problem Complexity and Method Efficiency in Optimization
- Nemirovski, A.S.¹ Yudin, D.B.²

12
- 51849106154
- Louvain-la-Neuve, Belgium: Center for Operation Research and Econometrics
- Yu. Nesterov. Primal-dual subgradient methods for convex problems: Core discussion paper 2005/67. Louvain-la-Neuve, Belgium: Center for Operation Research and Econometrics, 2005.
- (2005) Primal-dual Subgradient Methods for Convex Problems: Core Discussion Paper 2005/67
- Nesterov, Yu.¹

13
- 84966203785
- Some aspects of the sequential design of experiments
- H. Robbins. Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society, 55:527-535, 1952.
- (1952) Bulletin of the American Mathematical Society , vol.55 , pp. 527-535
- Robbins, H.¹

14
- 34547275330
- PhD thesis, Universite Paris-Sud
- G. Stoltz. Incomplete information and internal regret in prediction of individual sequences. PhD thesis, Universite Paris-Sud, 2005.
- (2005) Incomplete Information and Internal Regret in Prediction of Individual Sequences
- Stoltz, G.¹

15
- 0038982800
- An asymptotic minimax theorem for the two armed bandit problem
- W. Vogel. An asymptotic minimax theorem for the two armed bandit problem. The Annals of Mathematical Statistics, 31:444-451, 1960.
- (1960) The Annals of Mathematical Statistics , vol.31 , pp. 444-451
- Vogel, W.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.