SCOPUS Information Search Platform

Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010

Volume , Issue , 2010, Pages

Learning from logged implicit exploration data

(4) Strehl, Alexander L. a; Langford, John b; Li, Lihong b; Kakade, Sham M. c

a FACEBOOK (United States)

b YAHOO RESEARCH (United States)

c UNIVERSITY OF PENNSYLVANIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CONTEXTUAL BANDITS; EXPLORATION DATA; EXPLORATION POLICY; GIVEN FEATURES; HISTORICAL DATA; LEARNING PROCESS; OFFLINE DATA; RANDOMISATION; REAL-WORLD

EID: 85162031443; PISSN: None; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper

Times cited : (208)

References (11)

1
- 0037709910
- The nonstochastic multiarmed bandit problem
- Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002.
- (2002) SIAM Journal on Computing , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

2
- 84947396376
- A generalization of sampling without replacement from a finite universe
- D. Horvitz and D. Thompson. A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association, 47, 1952.
- (1952) Journal of the American Statistical Association , vol.47
- Horvitz, D.¹ Thompson, D.²

3
- 84898967749
- Approximate planning in large POMDPs via reusable trajectories
- Michael Kearns, Yishay Mansour, and Andrew Y. Ng. Approximate planning in large POMDPs via reusable trajectories. In NIPS, 2000.
- (2000) NIPS
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

4
- 77953968105
- More bang for their bucks: Assessing new features for online advertisers
- Diane Lambert and Daryl Pregibon. More bang for their bucks: Assessing new features for online advertisers. In ADKDD 2007, 2007.
- (2007) ADKDD 2007
- Lambert, D.¹ Pregibon, D.²

5
- 56449124046
- Exploration scavenging
- John Langford, Alexander L. Strehl, and Jenn Wortman. Exploration scavenging. In ICML-08: Proceedings of the 25th International Conference on Machine Learning, 2008.
- (2008) ICML-08: Proceedings of the 25th International Conference on Machine Learning
- Langford, J.¹ Strehl, A.L.² Wortman, J.³

6
- 77956144722
- The epoch-greedy algorithm for multi-armed bandits with side information
- John Langford and Tong Zhang. The epoch-greedy algorithm for multi-armed bandits with side information. In Advances in Neural Information Processing Systems 20, pages 817-824, 2008.
- (2008) Advances in Neural Information Processing Systems , vol.20 , pp. 817-824
- Langford, J.¹ Zhang, T.²

7
- 0041609338
- Bounds on negative moments
- Robert A. Lew. Bounds on negative moments. SIAM Journal on Applied Mathematics, 30(4):728-731, 1976.
- (1976) SIAM Journal on Applied Mathematics , vol.30 , Issue.4 , pp. 728-731
- Lew, R.A.¹

8
- 77954641643
- A contextual-bandit approach to personalized news article recommendation
- Lihong Li, Wei Chu, John Langford, and Robert E. Schapire. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the Nineteenth International Conference on World Wide Web (WWW-10), pages 661-670, 2010.
- (2010) Proceedings of the Nineteenth International Conference on World Wide Web (WWW-10) , pp. 661-670
- Li, L.¹ Chu, W.² Langford, J.³ Schapire, R.E.⁴

9
- 79952384747
- Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms
- Lihong Li, Wei Chu, John Langford, and Xuanhui Wang. Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms. In Proceedings of the Fourth International Conference on Web Search and Web Data Mining (WSDM-11), 2011.
- (2011) Proceedings of the Fourth International Conference on Web Search and Web Data Mining (WSDM-11)
- Li, L.¹ Chu, W.² Langford, J.³ Wang, X.⁴

10
- 0442309556
- Safe and effective importance sampling
- Art Owen and Yi Zhou. Safe and effective importance sampling. Journal of the American Statistical Association, 95:135-143, 2000.
- (2000) Journal of the American Statistical Association , vol.95 , pp. 135-143
- Owen, A.¹ Zhou, Y.²

11
- 0242393653
- Eligibility traces for off-policy policy evaluation
- Doina Precup, Rich Sutton, and Satinder Singh. Eligibility traces for off-policy policy evaluation. In ICML, 2000.
- (2000) ICML
- Precup, D.¹ Sutton, R.² Singh, S.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.