SCOPUS Information Search Platform

Volume 227, 2007, Pages 721-728

Multi-armed bandit problems with dependent arms

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION THEORY; CLUSTER ANALYSIS; ERRORS; KNOWLEDGE ACQUISITION; PROBLEM SOLVING; REAL TIME SYSTEMS;

ARMS; BANDIT PROBLEMS; SYNTHETIC DATA; THEORETICAL JUSTIFICATIONS;

LEARNING SYSTEMS;

EID: 34547966991 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1273496.1273587 Document Type: Conference Paper

Times cited : (117)

References (15)

1
- 0000616723
- Sample mean based index policies with O(log n) regret for the multi-armed bandit problem
- Agrawal, R. (1995). Sample mean based index policies with O(log n) regret for the multi-armed bandit problem. Advances in Applied Probability, 27, 1054-1078.
- (1995) Advances in Applied Probability , vol.27 , pp. 1054-1078
- Agrawal, R.¹

2
- 0009953451
- Optimal stopping and dynamic allocation
- Chang, F., & Lai, T. L. (1987). Optimal stopping and dynamic allocation. Advances in Applied Probability, 19, 829-853.
- (1987) Advances in Applied Probability , vol.19 , pp. 829-853
- Chang, F.¹ Lai, T.L.²

3
- 71049162986
- Coarse sample complexity bounds for active learning
- Dasgupta, S. (2005). Coarse sample complexity bounds for active learning. NIPS.
- (2005) NIPS
- Dasgupta, S.¹

4
- 10944259849
- Four proofs of Gittins' multiarmed bandit theorem
- Frostig, E., & Weiss, G. (1999). Four proofs of Gittins' multiarmed bandit theorem. Applied Probability Trust.
- (1999) Applied Probability Trust
- Frostig, E.¹ Weiss, G.²

5
- 0000169010
- Bandit processes and dynamic allocation indices
- Gittins, J. C. (1979). Bandit processes and dynamic allocation indices. Journal of the Royal Statistical Society, Series B, 41, 148-177.
- (1979) Journal of the Royal Statistical Society, Series B , vol.41 , pp. 148-177
- Gittins, J.C.¹

6
- 34547975806
- Bandit based monte-carlo planning
- Kocsis, L., & Szepesvári, C. (2006). Bandit based monte-carlo planning. ECML.
- (2006) ECML
- Kocsis, L.¹ Szepesvári, C.²

7
- 0000854435
- Adaptive treatment allocation and the multi-armed bandit problem
- Lai, T. L. (1987). Adaptive treatment allocation and the multi-armed bandit problem. Annals of Statistics, 15(3), 1091-1114.
- (1987) Annals of Statistics , vol.15 , Issue.3 , pp. 1091-1114
- Lai, T.L.¹

8
- 0002899547
- Asymptotically efficient adaptive allocation rules
- Lai, T. L., & Robbins, H. (1985). Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6, 4-22.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

9
- 0000695404
- Information-based objective functions for active data selection
- MacKay, D. (1992). Information-based objective functions for active data selection. Neural Computation, 4, 590-604.
- (1992) Neural Computation , vol.4 , pp. 590-604
- MacKay, D.¹

10
- 70049106076
- Bandits for taxonomies: A model-based approach
- Pandey, S., Agarwal, D., Chakrabarti, D., & Josifovski, V. (2007). Bandits for taxonomies: A model-based approach. SDM.
- (2007) SDM
- Pandey, S.¹ Agarwal, D.² Chakrabarti, D.³ Josifovski, V.⁴

11
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47, 235-256.
- (2002) Machine Learning , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

12
- 85102627959
- Markov decision processes: Discrete stochastic dynamic programming
- Puterman, M. L. (2005). Markov decision processes: Discrete stochastic dynamic programming. Wiley-Interscience. 2nd edition.
- (2005) Wiley-Interscience, 2nd edition
- Puterman, M.L.¹

13
- 34547977590
- Active learning in discrete input spaces
- Schneider, J., & Moore, A. (2002). Active learning in discrete input spaces. The 34th Interface Symposium.
- (2002) The 34th Interface Symposium
- Schneider, J.¹ Moore, A.²

14
- 15844389867
- Bandit problems with side observations
- Wang, C.-C., Kulkarni, S. R., & Poor, H. (2005). Bandit problems with side observations. IEEE Transactions on Automatic Control, 50(3), 338-355.
- (2005) IEEE Transactions on Automatic Control , vol.50 , Issue.3 , pp. 338-355
- Wang, C.-C.¹ Kulkarni, S.R.² Poor, H.³

15
- 0000248624
- Multi-armed bandits and the Gittins index
- Whittle, P. (1980). Multi-armed bandits and the Gittins index. Journal of the Royal Statistical Society, Series B, 42, 143-149.
- (1980) Journal of the Royal Statistical Society, Series B , vol.42 , pp. 143-149
- Whittle, P.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.