SCOPUS Information Retrieval Platform

Information Retrieval

Volume 16, Issue 1, 2013, Pages 63-90

Balancing exploration and exploitation in listwise and pairwise online learning to rank for information retrieval

(3) Hofmann, Katja (a); Whiteson, Shimon (a); de Rijke, Maarten (a)

a UNIVERSITY OF AMSTERDAM (Netherlands)

Author keywords

Implicit feedback; Information retrieval; Learning to rank

Indexed keywords

EID: 84873525321 PISSN: 13864564 EISSN: 15737659 Source Type: Journal
DOI: 10.1007/s10791-012-9197-9 Document Type: Article

Times cited : (128)

References (47)

1
- 84863344448
- In: NIPS'08
- Agarwal, D., Chen, B., Elango, P., Motgi, N., Park, S., Ramakrishnan, et al. (2008). Online models for content optimization. In: NIPS'08, pp 17-24.
- (2008) Online models for content optimization , pp. 17-24
- Agarwal, D.¹ Chen, B.² Elango, P.³ Motgi, N.⁴ Park, S.⁵ Ramakrishnan⁶

2
- 0041966002
- Using confidence bounds for exploitation-exploration trade-offs
- Auer, P. (2003). Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research, 3, 397-422.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 397-422
- Auer, P.¹

3
- 0019519039
- Associative search network: A reinforcement learning associative memory
- Barto, A. G., Sutton, R. S., & Brouwer, P. S. (1981). Associative search network: A reinforcement learning associative memory. IEEE Transaction on System, Man, and Cybernetics, 40, 201-211.
- (1981) IEEE Transaction on System, Man, and Cybernetics , vol.40 , pp. 201-211
- Barto, A.G.¹ Sutton, R.S.² Brouwer, P.S.³

4
- 0142030258
- A taxonomy of web search
- Broder, A. (2002). A taxonomy of web search. SIGIR Forum, 36(2), 3-10.
- (2002) SIGIR Forum , vol.36 , Issue.2 , pp. 3-10
- Broder, A.¹

5
- 34250348767
- Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration
- Cohen, J. D., McClure, S. M., & Yu, A. J. (2007). Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philosophical Transactions of the Royal Society B: Biological Sciences, 362(1481), 933-942.
- (2007) Philosophical Transactions of the Royal Society B: Biological Sciences , vol.362 , Issue.1481 , pp. 933-942
- Cohen, J.D.¹ McClure, S.M.² Yu, A.J.³

6
- 42549140738
- In: WSDM '08
- Craswell, N., Zoeter, O., Taylor, M., & Ramsey, B. (2008). An experimental comparison of click position-bias models. In: WSDM '08, pp 87-94.
- (2008) An experimental comparison of click position-bias models , pp. 87-94
- Craswell, N.¹ Zoeter, O.² Taylor, M.³ Ramsey, B.⁴

7
- 67650696919
- In: ECIR'09
- Donmez, P., & Carbonell, J. (2009). Active sampling for rank learning via optimizing the area under the ROC curve. In: ECIR'09, pp 78-89.
- (2009) Active sampling for rank learning via optimizing the area under the ROC curve , pp. 78-89
- Donmez, P.¹ Carbonell, J.²

8
- 0000169010
- Bandit processes and dynamic allocation indices
- Gittins, J. C. (1979). Bandit processes and dynamic allocation indices. Journal of the Royal Statistical Society Series B (Methodological) 41(2), 148-177.
- (1979) Journal of the Royal Statistical Society Series B (Methodological) , vol.41 , Issue.2 , pp. 148-177
- Gittins, J.C.¹

9
- 70349117309
- In: WSDM '09
- Guo, F., Liu, C., & Wang, Y. M. (2009b). Efficient multiple-click models in web search. In: WSDM '09, pp 124-131.
- (2009) Efficient multiple-click models in web search , pp. 124-131
- Guo, F.¹ Liu, C.² Wang, Y.M.³

10
- 67650079933
- In: WSCD '09
- Guo, F., Li, L., & Faloutsos, C. (2009a). Tailoring click models to user goals. In: WSCD '09, pp 88-92.
- (2009) Tailoring click models to user goals , pp. 88-92
- Guo, F.¹ Li, L.² Faloutsos, C.³

11
- 74549144985
- In: CIKM '09
- He, J., Zhai, C., & Li, X. (2009). Evaluation of methods for relative comparison of retrieval systems based on clickthroughs. In: CIKM '09, pp 2029-2032.
- (2009) Evaluation of methods for relative comparison of retrieval systems based on clickthroughs , pp. 2029-2032
- He, J.¹ Zhai, C.² Li, X.³

12
- 0033322991
- In: ICANN '99
- Herbrich, R., Graepel, T., & Obermayer, K. (1999). Support vector learning for ordinal regression. In: ICANN '99, vol 1, pp 97-102.
- (1999) Support vector learning for ordinal regression , vol.1 , pp. 97-102
- Herbrich, R.¹ Graepel, T.² Obermayer, K.³

13
- 84996484566
- In: ECIR'11
- Hofmann, K., Whiteson, S., & de Rijke, M. (2011a). Balancing exploration and exploitation in learning to rank online. In: ECIR'11, pp 251-263.
- (2011) Balancing exploration and exploitation in learning to rank online , pp. 251-263
- Hofmann, K.¹ Whiteson, S.² de Rijke, M.³

14
- 83055168219
- In: CIKM '11
- Hofmann, K., Whiteson, S., & de Rijke, M. (2011b). A probabilistic method for inferring preferences from clicks. In: CIKM '11, pp 249-258.
- (2011) A probabilistic method for inferring preferences from clicks , pp. 249-258
- Hofmann, K.¹ Whiteson, S.² de Rijke, M.³

15
- 0242456822
- In: KDD '02
- Joachims, T. (2002). Optimizing search engines using clickthrough data. In: KDD '02, pp 133-142.
- (2002) Optimizing search engines using clickthrough data , pp. 133-142
- Joachims, T.¹

16
- 34247882698
- Evaluating the accuracy of implicit feedback from clicks and query reformulations in web search
- Joachims, T., Granka, L., Pan, B., Hembrooke, H., Radlinski, F., & Gay, G. (2007). Evaluating the accuracy of implicit feedback from clicks and query reformulations in web search. ACM Transactions on Information Systems 25(2): 7.
- (2007) ACM Transactions on Information Systems , vol.25 , Issue.2 , pp. 7
- Joachims, T.¹ Granka, L.² Pan, B.³ Hembrooke, H.⁴ Radlinski, F.⁵ Gay, G.⁶

17
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996) Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4(1), 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , Issue.1 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

18
- 77956526578
- In: ICML'10
- Kalyanakrishnan, S., & Stone, P. (2010). Efficient selection of multiple bandit arms: Theory and practice. In: ICML'10, pp 511-518.
- (2010) Efficient selection of multiple bandit arms: Theory and practice , pp. 511-518
- Kalyanakrishnan, S.¹ Stone, P.²

19
- 78651342495
- In: CIKM '10, ACM, New York, NY, USA
- Karimzadehgan, M., & Zhai, C. (2010). Exploration-exploitation tradeoff in interactive relevance feedback. In: CIKM '10, ACM, New York, NY, USA, pp 1397-1400.
- (2010) Exploration-exploitation tradeoff in interactive relevance feedback , pp. 1397-1400
- Karimzadehgan, M.¹ Zhai, C.²

20
- 77956144722
- In: NIPS'08
- Langford, J., & Zhang, T. (2008). The epoch-greedy algorithm for multi-armed bandits with side information. In: NIPS'08, pp 817-824.
- (2008) The epoch-greedy algorithm for multi-armed bandits with side information , pp. 817-824
- Langford, J.¹ Zhang, T.²

21
- 56449124046
- In: ICML '08
- Langford, J., Strehl, A., & Wortman, J. (2008). Exploration scavenging. In: ICML '08, pp 528-535.
- (2008) Exploration scavenging , pp. 528-535
- Langford, J.¹ Strehl, A.² Wortman, J.³

22
- 79952384747
- In: WSDM '11
- Li, L., Chu, W., Langford, J., & Wang, X. (2011). Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms. In: WSDM '11, pp 297-306.
- (2011) Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms , pp. 297-306
- Li, L.¹ Chu, W.² Langford, J.³ Wang, X.⁴

23
- 77954641643
- In: WWW '10
- Li, L., Chu, W., Langford, J., & Schapire, R. E. (2010). A contextual-bandit approach to personalized news article recommendation. In: WWW '10, pp 661-670.
- (2010) A contextual-bandit approach to personalized news article recommendation , pp. 661-670
- Li, L.¹ Chu, W.² Langford, J.³ Schapire, R.E.⁴

24
- 69249119464
- Learning to rank for information retrieval
- Liu, T. Y. (2009). Learning to rank for information retrieval. Foundations and Trends in Information Retrieval, 3(3), 225-331.
- (2009) Foundations and Trends in Information Retrieval , vol.3 , Issue.3 , pp. 225-331
- Liu, T.Y.¹

25
- 84873522690
- In: LR4IR '07
- Liu, T. Y., Xu, J., Qin, T., Xiong, W., & Li, H. (2007). Letor: Benchmark dataset for research on learning to rank for information retrieval. In: LR4IR '07.
- (2007) Letor: Benchmark dataset for research on learning to rank for information retrieval
- Liu, T.Y.¹ Xu, J.² Qin, T.³ Xiong, W.⁴ Li, H.⁵

26
- 61449109791
- Mahajan, A., & Teneketzis, D. (2008). Multi-armed bandit problems. Foundations and Applications of Sensor Management pp 121-151.
- (2008) Multi-armed bandit problems. Foundations and Applications of Sensor Management , pp. 121-151
- Mahajan, A.¹ Teneketzis, D.²

27
- 69249131366
- Selection bias in the LETOR datasets
- Minka, T., & Robertson, S. (2008). Selection bias in the LETOR datasets. In: SIGIR Workshop on Learning to Rank for Information Retrieval, pp 48-51.
- (2008) SIGIR Workshop on Learning to Rank for Information Retrieval , pp. 48-51
- Minka, T.¹ Robertson, S.²

28
- 77956014530
- In: SIGIR '10
- Radlinski, F., & Craswell, N. (2010). Comparing the sensitivity of information retrieval metrics. In: SIGIR '10, pp 667-674.
- (2010) Comparing the sensitivity of information retrieval metrics , pp. 667-674
- Radlinski, F.¹ Craswell, N.²

29
- 56449088596
- In: ICML '08, ACM
- Radlinski, F., Kleinberg, R., & Joachims, T. (2008a). Learning diverse rankings with multi-armed bandits. In: ICML '08, ACM, pp 784-791.
- (2008) Learning diverse rankings with multi-armed bandits , pp. 784-791
- Radlinski, F.¹ Kleinberg, R.² Joachims, T.³

30
- 67650085898
- In: CIKM '08
- Radlinski, F., Kurup, M., & Joachims, T. (2008b). How does clickthrough data reflect retrieval quality? In: CIKM '08, pp 43-52.
- (2008) How does clickthrough data reflect retrieval quality? , pp. 43-52
- Radlinski, F.¹ Kurup, M.² Joachims, T.³

31
- 84966203785
- Some aspects of the sequential design of experiments
- Robbins, H. (1952). Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society 58, 527-535.
- (1952) Bulletin of the American Mathematical Society , vol.58 , pp. 527-535
- Robbins, H.¹

32
- 77954220071
- Test collection based evaluation of information retrieval systems
- Sanderson, M. (2010). Test collection based evaluation of information retrieval systems. Foundations and Trends in Information Retrieval 4(4), 247-375.
- (2010) Foundations and Trends in Information Retrieval , vol.4 , Issue.4 , pp. 247-375
- Sanderson, M.¹

33
- 77956206413
- Large scale learning to rank
- Sculley, D. (2009). Large scale learning to rank. In: NIPS 2009 Workshop on Advances in Ranking.
- (2009) NIPS 2009 Workshop on Advances in Ranking
- Sculley, D.¹

34
- 3042687811
- Analysis of a very large web search engine query log
- Silverstein, C., Marais, H., Henzinger, M., & Moricz, M. (1999). Analysis of a very large web search engine query log. SIGIR Forum 33(1), 6-12.
- (1999) SIGIR Forum , vol.33 , Issue.1 , pp. 6-12
- Silverstein, C.¹ Marais, H.² Henzinger, M.³ Moricz, M.⁴

35
- 34250750797
- In: ICML '06
- Strehl, A., Mesterharm, C., Littman, M., & Hirsh, H. (2006). Experience-efficient learning in associative bandit problems. In: ICML '06, pp 889-896.
- (2006) Experience-efficient learning in associative bandit problems , pp. 889-896
- Strehl, A.¹ Mesterharm, C.² Littman, M.³ Hirsh, H.⁴

36
- 0004102479
- MIT Press, Cambridge, MA, USA
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. MIT Press, Cambridge, MA, USA.
- (1998) Reinforcement learning: An introduction
- Sutton, R.S.¹ Barto, A.G.²

37
- 42549161120
- In: WSDM '08, ACM
- Taylor, M., Guiver, J., Robertson, S., & Minka, T. (2008). Softrank: optimizing non-smooth rank metrics. In: WSDM '08, ACM, pp 77-86.
- (2008) Softrank: Optimizing non-smooth rank metrics , pp. 77-86
- Taylor, M.¹ Guiver, J.² Robertson, S.³ Minka, T.⁴

38
- 80052110979
- In: SIGIR '11
- Tian, A., & Lease, M. (2011). Active learning to maximize accuracy vs. effort in interactive information retrieval. In: SIGIR '11, pp 145-154.
- (2011) Active learning to maximize accuracy vs. effort in interactive information retrieval , pp. 145-154
- Tian, A.¹ Lease, M.²

39
- 0004049893
- PhD thesis, Cambridge University
- Watkins, C. (1989). Learning from delayed rewards. PhD thesis, Cambridge University.
- (1989) Learning from delayed rewards
- Watkins, C.¹

40
- 33750226914
- On-line evolutionary computation for reinforcement learning in stochastic domains
- Whiteson, S., & Stone, P. (2006). On-line evolutionary computation for reinforcement learning in stochastic domains. In: GECCO 2006: Proceedings of the genetic and evolutionary computation conference, pp 1577-1584.
- (2006) GECCO 2006: Proceedings of the genetic and evolutionary computation conference , pp. 1577-1584
- Whiteson, S.¹ Stone, P.²

41
- 57549100649
- In: SIGIR '08
- Xu, Z., & Akella, R. (2008). A Bayesian logistic regression model for active relevance feedback. In: SIGIR '08, pp 227-234.
- (2008) A Bayesian logistic regression model for active relevance feedback , pp. 227-234
- Xu, Z.¹ Akella, R.²

42
- 37149038648
- In: ECIR'07
- Xu, Z., Akella, R., & Zhang, Y. (2007). Incorporating diversity and density in active learning for relevance feedback. In: ECIR'07, pp 246-257.
- (2007) Incorporating diversity and density in active learning for relevance feedback , pp. 246-257
- Xu, Z.¹ Akella, R.² Zhang, Y.³

43
- 77958058026
- In: ECML PKDD'10
- Xu, Z., Kersting, K., & Joachims, T. (2010). Fast active exploration for link-based preference learning using gaussian processes. In: ECML PKDD'10, pp 499-514.
- (2010) Fast active exploration for link-based preference learning using gaussian processes , pp. 499-514
- Xu, Z.¹ Kersting, K.² Joachims, T.³

44
- 71149114227
- In: ICML'09
- Yue, Y., & Joachims, T. (2009). Interactively optimizing information retrieval systems as a dueling bandits problem. In: ICML'09, pp 1201-1208.
- (2009) Interactively optimizing information retrieval systems as a dueling bandits problem , pp. 1201-1208
- Yue, Y.¹ Joachims, T.²

45
- 84898077397
- In: COLT'09
- Yue, Y., Broder, J., Kleinberg, R., & Joachims, T. (2009). The k-armed dueling bandits problem. In: COLT'09.
- (2009) The k-armed dueling bandits problem
- Yue, Y.¹ Broder, J.² Kleinberg, R.³ Joachims, T.⁴

46
- 33749243068
- In: ICML '04, ACM
- Zhang, T. (2004). Solving large scale linear prediction problems using stochastic gradient descent algorithms. In: ICML '04, ACM, pp 116.
- (2004) Solving large scale linear prediction problems using stochastic gradient descent algorithms , pp. 116
- Zhang, T.¹

47
- 1942484427
- In: ICML '03
- Zhang, Y., Xu, W., & Callan, J. (2003). Exploration and exploitation in adaptive filtering based on bayesian active learning. In: ICML '03, pp 896-904.
- (2003) Exploration and exploitation in adaptive filtering based on bayesian active learning , pp. 896-904
- Zhang, Y.¹ Xu, W.² Callan, J.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.