-
1
-
-
84863344448
-
-
In: NIPS'08
-
Agarwal, D., Chen, B., Elango, P., Motgi, N., Park, S., Ramakrishnan, et al. (2008). Online models for content optimization. In: NIPS'08, pp 17-24.
-
(2008)
Online models for content optimization
, pp. 17-24
-
-
Agarwal, D.1
Chen, B.2
Elango, P.3
Motgi, N.4
Park, S.5
Ramakrishnan6
-
2
-
-
0041966002
-
Using confidence bounds for exploitation-exploration trade-offs
-
Auer, P. (2003). Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research, 3, 397-422.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 397-422
-
-
Auer, P.1
-
3
-
-
0019519039
-
Associative search network: A reinforcement learning associative memory
-
Barto, A. G., Sutton, R. S., & Brouwer, P. S. (1981). Associative search network: A reinforcement learning associative memory. Biological Cybernetics, 40, 201-211.
-
(1981)
Biological Cybernetics
, vol.40
, pp. 201-211
-
-
Barto, A.G.1
Sutton, R.S.2
Brouwer, P.S.3
-
4
-
-
0142030258
-
A taxonomy of web search
-
Broder, A. (2002). A taxonomy of web search. SIGIR Forum, 36(2), 3-10.
-
(2002)
SIGIR Forum
, vol.36
, Issue.2
, pp. 3-10
-
-
Broder, A.1
-
5
-
-
34250348767
-
Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration
-
Cohen, J. D., McClure, S. M., & Yu, A. J. (2007). Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philosophical Transactions of the Royal Society B: Biological Sciences, 362(1481), 933-942.
-
(2007)
Philosophical Transactions of the Royal Society B: Biological Sciences
, vol.362
, Issue.1481
, pp. 933-942
-
-
Cohen, J.D.1
McClure, S.M.2
Yu, A.J.3
-
6
-
-
42549140738
-
-
In: WSDM '08
-
Craswell, N., Zoeter, O., Taylor, M., & Ramsey, B. (2008). An experimental comparison of click position-bias models. In: WSDM '08, pp 87-94.
-
(2008)
An experimental comparison of click position-bias models
, pp. 87-94
-
-
Craswell, N.1
Zoeter, O.2
Taylor, M.3
Ramsey, B.4
-
9
-
-
70349117309
-
-
In: WSDM '09
-
Guo, F., Liu, C., & Wang, Y. M. (2009b). Efficient multiple-click models in web search. In: WSDM '09, pp 124-131.
-
(2009)
Efficient multiple-click models in web search
, pp. 124-131
-
-
Guo, F.1
Liu, C.2
Wang, Y.M.3
-
10
-
-
67650079933
-
-
In: WSCD '09
-
Guo, F., Li, L., & Faloutsos, C. (2009a). Tailoring click models to user goals. In: WSCD '09, pp 88-92.
-
(2009)
Tailoring click models to user goals
, pp. 88-92
-
-
Guo, F.1
Li, L.2
Faloutsos, C.3
-
11
-
-
74549144985
-
-
In: CIKM '09
-
He, J., Zhai, C., & Li, X. (2009). Evaluation of methods for relative comparison of retrieval systems based on clickthroughs. In: CIKM '09, pp 2029-2032.
-
(2009)
Evaluation of methods for relative comparison of retrieval systems based on clickthroughs
, pp. 2029-2032
-
-
He, J.1
Zhai, C.2
Li, X.3
-
12
-
-
0033322991
-
-
In: ICANN '99
-
Herbrich, R., Graepel, T., & Obermayer, K. (1999). Support vector learning for ordinal regression. In: ICANN '99, vol 1, pp 97-102.
-
(1999)
Support vector learning for ordinal regression
, vol.1
, pp. 97-102
-
-
Herbrich, R.1
Graepel, T.2
Obermayer, K.3
-
13
-
-
84996484566
-
-
In: ECIR'11
-
Hofmann, K., Whiteson, S., & de Rijke, M. (2011a). Balancing exploration and exploitation in learning to rank online. In: ECIR'11, pp 251-263.
-
(2011)
Balancing exploration and exploitation in learning to rank online
, pp. 251-263
-
-
Hofmann, K.1
Whiteson, S.2
de Rijke, M.3
-
14
-
-
83055168219
-
-
In: CIKM '11
-
Hofmann, K., Whiteson, S., & de Rijke, M. (2011b). A probabilistic method for inferring preferences from clicks. In: CIKM '11, pp 249-258.
-
(2011)
A probabilistic method for inferring preferences from clicks
, pp. 249-258
-
-
Hofmann, K.1
Whiteson, S.2
de Rijke, M.3
-
16
-
-
34247882698
-
Evaluating the accuracy of implicit feedback from clicks and query reformulations in web search
-
Joachims, T., Granka, L., Pan, B., Hembrooke, H., Radlinski, F., & Gay, G. (2007). Evaluating the accuracy of implicit feedback from clicks and query reformulations in web search. ACM Transactions on Information Systems, 25(2), 7.
-
(2007)
ACM Transactions on Information Systems
, vol.25
, Issue.2
, pp. 7
-
-
Joachims, T.1
Granka, L.2
Pan, B.3
Hembrooke, H.4
Radlinski, F.5
Gay, G.6
-
17
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4(1), 237-285.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, Issue.1
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
19
-
-
78651342495
-
-
In: CIKM '10, ACM, New York, NY, USA
-
Karimzadehgan, M., & Zhai, C. (2010). Exploration-exploitation tradeoff in interactive relevance feedback. In: CIKM '10, ACM, New York, NY, USA, pp 1397-1400.
-
(2010)
Exploration-exploitation tradeoff in interactive relevance feedback
, pp. 1397-1400
-
-
Karimzadehgan, M.1
Zhai, C.2
-
21
-
-
56449124046
-
-
In: ICML '08
-
Langford, J., Strehl, A., & Wortman, J. (2008). Exploration scavenging. In: ICML '08, pp 528-535.
-
(2008)
Exploration scavenging
, pp. 528-535
-
-
Langford, J.1
Strehl, A.2
Wortman, J.3
-
22
-
-
79952384747
-
-
In: WSDM '11
-
Li, L., Chu, W., Langford, J., & Wang, X. (2011). Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms. In: WSDM '11, pp 297-306.
-
(2011)
Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms
, pp. 297-306
-
-
Li, L.1
Chu, W.2
Langford, J.3
Wang, X.4
-
23
-
-
77954641643
-
-
In: WWW '10
-
Li, L., Chu, W., Langford, J., & Schapire, R. E. (2010). A contextual-bandit approach to personalized news article recommendation. In: WWW '10, pp 661-670.
-
(2010)
A contextual-bandit approach to personalized news article recommendation
, pp. 661-670
-
-
Li, L.1
Chu, W.2
Langford, J.3
Schapire, R.E.4
-
25
-
-
84873522690
-
-
In: LR4IR '07
-
Liu, T. Y., Xu, J., Qin, T., Xiong, W., & Li, H. (2007). Letor: Benchmark dataset for research on learning to rank for information retrieval. In: LR4IR '07.
-
(2007)
Letor: Benchmark dataset for research on learning to rank for information retrieval
-
-
Liu, T.Y.1
Xu, J.2
Qin, T.3
Xiong, W.4
Li, H.5
-
29
-
-
56449088596
-
-
In: ICML '08, ACM
-
Radlinski, F., Kleinberg, R., & Joachims, T. (2008a). Learning diverse rankings with multi-armed bandits. In: ICML '08, ACM, pp 784-791.
-
(2008)
Learning diverse rankings with multi-armed bandits
, pp. 784-791
-
-
Radlinski, F.1
Kleinberg, R.2
Joachims, T.3
-
30
-
-
67650085898
-
-
In: CIKM '08
-
Radlinski, F., Kurup, M., & Joachims, T. (2008b). How does clickthrough data reflect retrieval quality? In: CIKM '08, pp 43-52.
-
(2008)
How does clickthrough data reflect retrieval quality?
, pp. 43-52
-
-
Radlinski, F.1
Kurup, M.2
Joachims, T.3
-
31
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
Robbins, H. (1952). Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society 58, 527-535.
-
(1952)
Bulletin of the American Mathematical Society
, vol.58
, pp. 527-535
-
-
Robbins, H.1
-
32
-
-
77954220071
-
Test collection based evaluation of information retrieval systems
-
Sanderson, M. (2010). Test collection based evaluation of information retrieval systems. Foundations and Trends in Information Retrieval 4(4), 247-375.
-
(2010)
Foundations and Trends in Information Retrieval
, vol.4
, Issue.4
, pp. 247-375
-
-
Sanderson, M.1
-
34
-
-
3042687811
-
Analysis of a very large web search engine query log
-
Silverstein, C., Marais, H., Henzinger, M., & Moricz, M. (1999). Analysis of a very large web search engine query log. SIGIR Forum 33(1), 6-12.
-
(1999)
SIGIR Forum
, vol.33
, Issue.1
, pp. 6-12
-
-
Silverstein, C.1
Marais, H.2
Henzinger, M.3
Moricz, M.4
-
35
-
-
34250750797
-
-
In: ICML '06
-
Strehl, A., Mesterharm, C., Littman, M., & Hirsh, H. (2006). Experience-efficient learning in associative bandit problems. In: ICML '06, pp 889-896.
-
(2006)
Experience-efficient learning in associative bandit problems
, pp. 889-896
-
-
Strehl, A.1
Mesterharm, C.2
Littman, M.3
Hirsh, H.4
-
36
-
-
0004102479
-
-
MIT Press, Cambridge, MA, USA
-
Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. MIT Press, Cambridge, MA, USA.
-
(1998)
Reinforcement learning: An introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
37
-
-
42549161120
-
-
In: WSDM '08, ACM
-
Taylor, M., Guiver, J., Robertson, S., & Minka, T. (2008). Softrank: Optimizing non-smooth rank metrics. In: WSDM '08, ACM, pp 77-86.
-
(2008)
Softrank: Optimizing non-smooth rank metrics
, pp. 77-86
-
-
Taylor, M.1
Guiver, J.2
Robertson, S.3
Minka, T.4
-
42
-
-
37149038648
-
-
In: ECIR'07
-
Xu, Z., Akella, R., & Zhang, Y. (2007). Incorporating diversity and density in active learning for relevance feedback. In: ECIR'07, pp 246-257.
-
(2007)
Incorporating diversity and density in active learning for relevance feedback
, pp. 246-257
-
-
Xu, Z.1
Akella, R.2
Zhang, Y.3
-
43
-
-
77958058026
-
-
In: ECML PKDD'10
-
Xu, Z., Kersting, K., & Joachims, T. (2010). Fast active exploration for link-based preference learning using gaussian processes. In: ECML PKDD'10, pp 499-514.
-
(2010)
Fast active exploration for link-based preference learning using gaussian processes
, pp. 499-514
-
-
Xu, Z.1
Kersting, K.2
Joachims, T.3
-
45
-
-
84898077397
-
-
In: COLT'09
-
Yue, Y., Broder, J., Kleinberg, R., & Joachims, T. (2009). The k-armed dueling bandits problem. In: COLT'09.
-
(2009)
The k-armed dueling bandits problem
-
-
Yue, Y.1
Broder, J.2
Kleinberg, R.3
Joachims, T.4
-
47
-
-
1942484427
-
-
In: ICML '03
-
Zhang, Y., Xu, W., & Callan, J. (2003). Exploration and exploitation in adaptive filtering based on bayesian active learning. In: ICML '03, pp 896-904.
-
(2003)
Exploration and exploitation in adaptive filtering based on bayesian active learning
, pp. 896-904
-
-
Zhang, Y.1
Xu, W.2
Callan, J.3