SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2008, Pages

The Epoch-Greedy algorithm for contextual multi-armed bandits

Author keywords

[No Author keywords available]

Indexed keywords

GREEDY ALGORITHMS; MULTIARMED BANDITS (MABS); PROPERTY; SAMPLE COMPLEXITY BOUNDS; SIDE INFORMATION; TIME HORIZONS;

EID: 85162018594 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (82)

References (10)

2
- 0029513526
- Gambling in a rigged casino: The adversarial multi-armed bandit problem
- Auer, P., Cesa-Bianchi, N., Freund, Y., & Schapire, R. E. (1995). Gambling in a rigged casino: The adversarial multi-armed bandit problem. FOCS.
- (1995) FOCS
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

3
- 33745295134
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- Even-dar, E., Mannor, S., & Mansour, Y. (2006). Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. JMLR, 7, 1079-1105. (Pubitemid 43938989)
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1079-1105
- Even-Bar, E.¹ Mannor, S.² Mansour, Y.³

4
- 0000125534
- Sample selection bias as a specification error
- Heckman, J. (1979). Sample selection bias as a specification error. Econometrica, 47, 153-161.
- (1979) Econometrica , vol.47 , pp. 153-161
- Heckman, J.¹

5
- 84898967749
- Approximate planning in large pomdps via reusable trajectories
- Kearns, M., Mansour, Y., & Ng, A. Y. (2000). Approximate planning in large pomdps via reusable trajectories. NIPS.
- (2000) NIPS
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

6
- 0002899547
- Asymptotically efficient adaptive allocation rules
- Lai, T., & Robbins, H. (1985). Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6, 4-22.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.¹ Robbins, H.²

7
- 0029344133
- Machine learning and nonparametric bandit theory
- Lai, T., & Yakowitz, S. (1995). Machine learning and nonparametric bandit theory. IEEE TAC, 40, 1199-1209.
- (1995) IEEE TAC , vol.40 , pp. 1199-1209
- Lai, T.¹ Yakowitz, S.²

8
- 70049106076
- Bandits for taxonomies: A modelbased approach
- Pandey, S., Agarwal, D., Chakrabarti, D., & Josifovski, V. (2007). Bandits for taxonomies: a modelbased approach. SIAM Data Mining Conference.
- (2007) SIAM Data Mining Conference
- Pandey, S.¹ Agarwal, D.² Chakrabarti, D.³ Josifovski, V.⁴

9
- 33749242078
- Experience-efficient learning in associative bandit problems
- Strehl, A. L., Mesterharm, C., Littman, M. L., & Hirsh, H. (2006). Experience-efficient learning in associative bandit problems. ICML.
- (2006) ICML
- Strehl, A.L.¹ Mesterharm, C.² Littman, M.L.³ Hirsh, H.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.