-
1
-
-
70349138027
-
Diversifying search results
-
Agrawal, R., Sreenivas, G., Halverson, A., Leong, S.: Diversifying search results. In: Proceedings of ACM WSDM 2009, pp. 5-14 (2009)
-
(2009)
Proceedings of ACM WSDM 2009
, pp. 5-14
-
-
Agrawal, R.1
Sreenivas, G.2
Halverson, A.3
Leong, S.4
-
2
-
-
34547647257
-
Retrieval evaluation with incomplete relevance data: A comparative study of three measures
-
Ahlgren, P., Grönqvist, L.: Retrieval evaluation with incomplete relevance data: A comparative study of three measures. In: Proceedings of ACM CIKM 2006, pp. 872-873 (2006)
-
(2006)
Proceedings of ACM CIKM 2006
, pp. 872-873
-
-
Ahlgren, P.1
Grönqvist, L.2
-
3
-
-
84871533644
-
Frontiers, challenges and opportunities for information retrieval: Report from SWIRL 2012
-
Allan, J., Aslam, J., Azzopardi, L., Belkin, N., Borlund, P., Bruza, P., Callan, J., Carman, M., Clarke, C.L., Craswell, N., Croft, W.B., Culpepper, J.S., Diaz, F., Dumais, S., Ferro, N., Geva, S., Gonzalo, J., Hawking, D., Jarvelin, K., Jones, G., Jones, R., Kamps, J., Kando, N., Kanoulas, E., Karlgren, J., Kelly, D., Lease, M., Lin, J., Mizzaro, S., Moffat, A., Murdock, V., Oard, D.W., de Rijke, M., Sakai, T., Sanderson, M., Scholer, F., Si, L., Thom, J.A., Thomas, P., Trotman, A., Turpin, A., de Vries, A.P., Webber, W., Zhang, X., Zhang, Y.: Frontiers, challenges and opportunities for information retrieval: Report from SWIRL 2012. SIGIR Forum 46(1), 2-32 (2012)
-
(2012)
SIGIR Forum
, vol.46
, Issue.1
, pp. 2-32
-
-
Allan, J.1
Aslam, J.2
Azzopardi, L.3
Belkin, N.4
Borlund, P.5
Bruza, P.6
Callan, J.7
Carman, M.8
Clarke, C.L.9
Craswell, N.10
Croft, W.B.11
Culpepper, J.S.12
Diaz, F.13
Dumais, S.14
Ferro, N.15
Geva, S.16
Gonzalo, J.17
Hawking, D.18
Jarvelin, K.19
Jones, G.20
Jones, R.21
Kamps, J.22
Kando, N.23
Kanoulas, E.24
Karlgren, J.25
Kelly, D.26
Lease, M.27
Lin, J.28
Mizzaro, S.29
Moffat, A.30
Murdock, V.31
Oard, D.W.32
De Rijke, M.33
Sakai, T.34
Sanderson, M.35
Scholer, F.36
Si, L.37
Thom, J.A.38
Thomas, P.39
Trotman, A.40
Turpin, A.41
De Vries, A.P.42
Webber, W.43
Zhang, X.44
Zhang, Y.45
more..
-
4
-
-
84885591216
-
When will information retrieval be "good enough"?
-
Allan, J., Carterette, B., Lewis, J.: When will information retrieval be "good enough"? In: Proceedings of ACM SIGIR 2005, pp. 433-440 (2005)
-
(2005)
Proceedings of ACM SIGIR 2005
, pp. 433-440
-
-
Allan, J.1
Carterette, B.2
Lewis, J.3
-
5
-
-
85015342057
-
A methodology for evaluating aggregated search results
-
Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. Springer, Heidelberg
-
Arguello, J., Diaz, F., Callan, J., Carterette, B.: A methodology for evaluating aggregated search results. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 141- 152. Springer, Heidelberg (2011)
-
(2011)
LNCS
, vol.6611
, pp. 141-152
-
-
Arguello, J.1
Diaz, F.2
Callan, J.3
Carterette, B.4
-
6
-
-
77957131892
-
Expected reading effort in focused retrieval evaluation
-
Arvola, P., Kekäläinen, J., Junkkari, M.: Expected reading effort in focused retrieval evaluation. Information Retrieval 13(5), 460-484 (2010)
-
(2010)
Information Retrieval
, vol.13
, Issue.5
, pp. 460-484
-
-
Arvola, P.1
Kekäläinen, J.2
Junkkari, M.3
-
7
-
-
84871631227
-
On the informativeness of cascade and intent-aware effectiveness measures
-
Ashkan, A., Clarke, C.L.: On the informativeness of cascade and intent-aware effectiveness measures. In: Proceedings of ACM SIGIR 2013, pp. 407-416 (2011)
-
(2011)
Proceedings of ACM SIGIR 2013
, pp. 407-416
-
-
Ashkan, A.1
Clarke, C.L.2
-
8
-
-
33746264710
-
The maximum entropy method for analyzing retrieval measures
-
Aslam, J.A., Yilmaz, E., Pavlu, V.: The maximum entropy method for analyzing retrieval measures. In: Proceedings of ACM SIGIR 2005, pp. 27-34 (2005)
-
(2005)
Proceedings of ACM SIGIR 2005
, pp. 27-34
-
-
Aslam, J.A.1
Yilmaz, E.2
Pavlu, V.3
-
9
-
-
74549121417
-
Usage based effectiveness measures
-
Azzopardi, L.: Usage based effectiveness measures. In: Proceedings of ACM CIKM 2009, pp. 631-640 (2009)
-
(2009)
Proceedings of ACM CIKM 2009
, pp. 631-640
-
-
Azzopardi, L.1
-
10
-
-
84866615871
-
Time drives interaction: Simulating sessions in diverse searching environments
-
Baskaya, F., Keskustalo, H., Järvelin, K.: Time drives interaction: Simulating sessions in diverse searching environments. In: Proceedings of ACM SIGIR 2012, pp. 105-114 (2012)
-
(2012)
Proceedings of ACM SIGIR 2012
, pp. 105-114
-
-
Baskaya, F.1
Keskustalo, H.2
Järvelin, K.3
-
11
-
-
36448947171
-
Test theory for assessing IR test collections
-
Bodoff, D., Li, P.: Test theory for assessing IR test collections. In: Proceedings of ACM SIGIR 2007, pp. 367-374 (2007)
-
(2007)
Proceedings of ACM SIGIR 2007
, pp. 367-374
-
-
Bodoff, D.1
Li, P.2
-
13
-
-
79952370249
-
Dynamic ranked retrieval
-
Brandt, C., Joachims, T., Yue, Y., Bank, J.: Dynamic ranked retrieval. In: Proceedings of ACM WSDM 2011, pp. 247-256 (2011)
-
(2011)
Proceedings of ACM WSDM 2011
, pp. 247-256
-
-
Brandt, C.1
Joachims, T.2
Yue, Y.3
Bank, J.4
-
14
-
-
0142030258
-
A taxonomy of web search
-
Broder, A.: A taxonomy of web search. SIGIR Forum 36(2) (2002)
-
(2002)
SIGIR Forum
, vol.36
, Issue.2
-
-
Broder, A.1
-
17
-
-
31844446958
-
Learning to rank using gradient descent
-
Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M., Hamilton, N., Hullender, G.: Learning to rank using gradient descent. In: Proceedings of ICML 2005, pp. 89-96 (2005)
-
(2005)
Proceedings of ICML 2005
, pp. 89-96
-
-
Burges, C.1
Shaked, T.2
Renshaw, E.3
Lazier, A.4
Deeds, M.5
Hamilton, N.6
Hullender, G.7
-
18
-
-
36448986732
-
Reliable information retrieval evaluation with incomplete and biased judgements
-
Büttcher, S., Clarke, C.L., Yeung, P.C., Soboroff, I.: Reliable information retrieval evaluation with incomplete and biased judgements. In: Proceedings of ACM SIGIR 2007, pp. 63-70 (2007)
-
(2007)
Proceedings of ACM SIGIR 2007
, pp. 63-70
-
-
Büttcher, S.1
Clarke, C.L.2
Yeung, P.C.3
Soboroff, I.4
-
19
-
-
72449194290
-
On rank correlation and the distance between rankings
-
Carterette, B.: On rank correlation and the distance between rankings. In: Proceedings of ACM SIGIR 2009, pp. 436-443 (2009)
-
(2009)
Proceedings of ACM SIGIR 2009
, pp. 436-443
-
-
Carterette, B.1
-
20
-
-
80052123590
-
System effectiveness, user models, and user utility: A conceptual framework for investigation
-
Carterette, B.: System effectiveness, user models, and user utility: A conceptual framework for investigation. In: Proceedings of ACM SIGIR 2011, pp. 903-912 (2011)
-
(2011)
Proceedings of ACM SIGIR 2011
, pp. 903-912
-
-
Carterette, B.1
-
21
-
-
84859011471
-
Multiple testing in statistical analysis of systems-based information retrieval experiments
-
Carterette, B.: Multiple testing in statistical analysis of systems-based information retrieval experiments. ACM TOIS 30(1) (2012)
-
(2012)
ACM TOIS
, vol.30
, Issue.1
-
-
Carterette, B.1
-
22
-
-
41849104667
-
Here or there preference judgments for relevance
-
DOI 10.1007/978-3-540-78646-7-5, Advances in Information Retrieval - 30th European Conference on IR Research, ECIR 2008, Proceedings
-
Carterette, B., Bennett, P.N., Chickering, D.M., Dumais, S.T.: Here or there: Preference judgments for relevance. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I.,White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 16-27. Springer, Heidelberg (2008) (Pubitemid 351499055)
-
(2008)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
, vol.4956 LNCS
, pp. 16-27
-
-
Carterette, B.1
Bennett, P.N.2
Chickering, D.M.3
Dumais, S.T.4
-
23
-
-
57349133736
-
Evaluation over thousands of queries
-
Carterette, B., Pavlu, V., Kanoulas, E., Aslam, J.A., Allan, J.: Evaluation over thousands of queries. In: Proceedings of ACM SIGIR 2008, pp. 651-658 (2008)
-
(2008)
Proceedings of ACM SIGIR 2008
, pp. 651-658
-
-
Carterette, B.1
Pavlu, V.2
Kanoulas, E.3
Aslam, J.A.4
Allan, J.5
-
24
-
-
84880779561
-
Analysis of various evaluation measures for diversity
-
Chandar, P., Carterette, B.: Analysis of various evaluation measures for diversity. In: Proceedings of DDR 2011, pp. 21-28 (2011)
-
(2011)
Proceedings of DDR 2011
, pp. 21-28
-
-
Chandar, P.1
Carterette, B.2
-
26
-
-
80255123851
-
Intent-based diversification of web search results: Metrics and algorithms
-
Chapelle, O., Ji, S., Liao, C., Velipasaoglu, E., Lai, L., Wu, S.L.: Intent-based diversification of web search results: Metrics and algorithms. Information Retrieval 14(6), 572-592 (2011)
-
(2011)
Information Retrieval
, vol.14
, Issue.6
, pp. 572-592
-
-
Chapelle, O.1
Ji, S.2
Liao, C.3
Velipasaoglu, E.4
Lai, L.5
Wu, S.L.6
-
27
-
-
74549208546
-
Expected reciprocal rank for graded relevance
-
Chapelle, O., Metzler, D., Zhang, Y., Grinspan, P.: Expected reciprocal rank for graded relevance. In: Proceedings of ACM CIKM 2009, pp. 621-630 (2009)
-
(2009)
Proceedings of ACM CIKM 2009
, pp. 621-630
-
-
Chapelle, O.1
Metzler, D.2
Zhang, Y.3
Grinspan, P.4
-
28
-
-
85107916112
-
MUC-4 evaluation metrics
-
Chinchor, N.: MUC-4 evaluation metrics. In: Proceedings of MUC-4, pp. 22-29 (1992)
-
(1992)
Proceedings of MUC-4
, pp. 22-29
-
-
Chinchor, N.1
-
30
-
-
79952370248
-
A comparative analysis of cascade measures for novelty and diversity
-
Clarke, C.L., Craswell, N., Soboroff, I., Ashkan, A.: A comparative analysis of cascade measures for novelty and diversity. In: Proceedings of ACM WSDM 2011, pp. 75-84 (2011)
-
(2011)
Proceedings of ACM WSDM 2011
, pp. 75-84
-
-
Clarke, C.L.1
Craswell, N.2
Soboroff, I.3
Ashkan, A.4
-
31
-
-
85175623957
-
Overview of the TREC 2011 web track
-
Clarke, C.L., Craswell, N., Soboroff, I., Voorhees, E.: Overview of the TREC 2011 web track. In: Proceedings of TREC 2011 (2012)
-
(2012)
Proceedings of TREC 2011
-
-
Clarke, C.L.1
Craswell, N.2
Soboroff, I.3
Voorhees, E.4
-
32
-
-
57349111122
-
Novelty and diversity in information retrieval evaluation
-
Clarke, C.L., Kolla, M., Cormack, G.V., Vechtomova, O., Ashkan, A., Büttcher, S., MacKinnon, I.: Novelty and diversity in information retrieval evaluation. In: Proceedings of ACM SIGIR 2008, pp. 659-666 (2009)
-
(2009)
Proceedings of ACM SIGIR 2008
, pp. 659-666
-
-
Clarke, C.L.1
Kolla, M.2
Cormack, G.V.3
Vechtomova, O.4
Ashkan, A.5
Büttcher, S.6
MacKinnon, I.7
-
33
-
-
70350576772
-
An effectiveness measure for ambiguous and underspecified queries
-
Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. Springer, Heidelberg
-
Clarke, C.L.A., Kolla, M., Vechtomova, O.: An effectiveness measure for ambiguous and underspecified queries. In: Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. LNCS, vol. 5766, pp. 188-199. Springer, Heidelberg (2009)
-
(2009)
LNCS
, vol.5766
, pp. 188-199
-
-
Clarke, C.L.A.1
Kolla, M.2
Vechtomova, O.3
-
35
-
-
0002064245
-
Expected search length: A single measure of retrieval effectiveness based on the weak ordering action of retrieval systems
-
Cooper, W.S.: Expected search length: A single measure of retrieval effectiveness based on the weak ordering action of retrieval systems. JASIS 19(1), 30-41 (1968)
-
(1968)
JASIS
, vol.19
, Issue.1
, pp. 30-41
-
-
Cooper, W.S.1
-
36
-
-
0015604498
-
On selecting a measure of retrieval effectiveness
-
Cooper, W.S.: On selecting a measure of retrieval effectiveness. JASIS 24(2), 87-100 (1973)
-
(1973)
JASIS
, vol.24
, Issue.2
, pp. 87-100
-
-
Cooper, W.S.1
-
37
-
-
0015681463
-
On selecting a measure of retrieval effectiveness: Part II. Implementation of the philosophy
-
Cooper, W.S.: On selecting a measure of retrieval effectiveness: Part II. Implementation of the philosophy. JASIS 24(6), 413-424 (1973)
-
(1973)
JASIS
, vol.24
, Issue.6
, pp. 413-424
-
-
Cooper, W.S.1
-
39
-
-
44649166130
-
Different structures for evaluating answers to complex questions: Pyramids won't topple, and neither will human assessors
-
Dang, H., Lin, J.: Different structures for evaluating answers to complex questions: Pyramids won't topple, and neither will human assessors. In: Proceedings of ACL 2007, pp. 768-775 (2007)
-
(2007)
Proceedings of ACL 2007
, pp. 768-775
-
-
Dang, H.1
Lin, J.2
-
40
-
-
33750363214
-
Rpref: A generalization of bpref towards graded relevance judgments
-
De Beer, J., Moens, M.F.: Rpref: A generalization of bpref towards graded relevance judgments. In: Proceedings of ACM SIGIR 2006, pp. 637-638 (2006)
-
(2006)
Proceedings of ACM SIGIR 2006
, pp. 637-638
-
-
De Beer, J.1
Moens, M.F.2
-
41
-
-
1842680293
-
Measuring retrieval effectiveness: A new proposal and a first experimental validation
-
Della Mea, V., Mizzaro, S.: Measuring retrieval effectiveness: A new proposal and a first experimental validation. JASIST 55(6), 503-543 (2004)
-
(2004)
JASIST
, vol.55
, Issue.6
, pp. 503-543
-
-
Della Mea, V.1
Mizzaro, S.2
-
42
-
-
0030650239
-
Time, relevance and interaction modelling for information retrieval
-
Dunlop,M.D.: Time, relevance and interaction modelling for information retrieval. In: Proceedings of ACM SIGIR 1997, pp. 206-213 (1997)
-
(1997)
Proceedings of ACM SIGIR 1997
, pp. 206-213
-
-
Dunlop, M.D.1
-
44
-
-
55449094674
-
Overview of the web retrieval task at the third NTCIR workshop
-
Eguchi, K., Oyama, K., Ishida, E., Kando, N., Kuriyama, K.: Overview of the web retrieval task at the third NTCIR workshop. NII Technical Reports NII-2003-002E (2003)
-
(2003)
NII Technical Reports NII-2003-002E
-
-
Eguchi, K.1
Oyama, K.2
Ishida, E.3
Kando, N.4
Kuriyama, K.5
-
45
-
-
84886070538
-
NTCIR9-GeoTime overview - Evaluating geographic and temporal search: Round 2
-
Gey, F., Larson, R., Machado, J., Yoshioka, M.: NTCIR9-GeoTime overview - evaluating geographic and temporal search: Round 2. In: Proceedings of NTCIR-9, pp. 9-17 (2011)
-
(2011)
Proceedings of NTCIR-9
, pp. 9-17
-
-
Gey, F.1
Larson, R.2
Machado, J.3
Yoshioka, M.4
-
47
-
-
0027725490
-
Using statistical testing in the evaluation of retrieval experiments
-
Hull, D.: Using statistical testing in the evaluation of retrieval experiments. In: Proceedings of ACM SIGIR 1993. pp. 329-338 (1993)
-
(1993)
Proceedings of ACM SIGIR 1993
, pp. 329-338
-
-
Hull, D.1
-
48
-
-
33846563409
-
Why most published research findings are false
-
Ioannidis, J.P.: Why most published research findings are false. PLoS Med. 2(8) (2005)
-
(2005)
PLoS Med
, vol.2
, Issue.8
-
-
Ioannidis, J.P.1
-
50
-
-
41849104668
-
Discounted cumulated gain based evaluation of multiple-query IR sessions
-
DOI 10.1007/978-3-540-78646-7-4, Advances in Information Retrieval - 30th European Conference on IR Research, ECIR 2008, Proceedings
-
Järvelin, K., Price, S.L., Delcambre, L.M.L., Nielsen, M.L.: Discounted cumulated gain based evaluation of multiple-query IR sessions. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 4-15. Springer, Heidelberg (2008) (Pubitemid 351499054)
-
(2008)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
, vol.4956 LNCS
, pp. 4-15
-
-
Jarvelin, K.1
Price, S.L.2
Delcambre, L.M.L.3
Nielsen, M.L.4
-
51
-
-
0032808670
-
The insignificance of statistical significance testing
-
Johnson, D.H.: The insignificance of statistical significance testing. The Journal of Wildlife Management 63(3), 763-772 (1999)
-
(1999)
The Journal of Wildlife Management
, vol.63
, Issue.3
, pp. 763-772
-
-
Johnson, D.H.1
-
52
-
-
51849123809
-
INEX 2007 evaluation measures
-
Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. Springer, Heidelberg
-
Kamps, J., Pehcevski, J., Kazai, G., Lalmas, M., Robertson, S.: INEX 2007 evaluation measures. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 24-33. Springer, Heidelberg (2008)
-
(2008)
LNCS
, vol.4862
, pp. 24-33
-
-
Kamps, J.1
Pehcevski, J.2
Kazai, G.3
Lalmas, M.4
Robertson, S.5
-
53
-
-
74549195035
-
Empirical justification of the gain and discount function for nDCG
-
Kanoulas, E., Aslam, J.A.: Empirical justification of the gain and discount function for nDCG. In: ACM CIKM 2009, pp. 611-620 (2009)
-
(2009)
ACM CIKM 2009
, pp. 611-620
-
-
Kanoulas, E.1
Aslam, J.A.2
-
54
-
-
80052112994
-
Evaluating multi-query sessions
-
Kanoulas, E., Carterette, B., Clough, P.D., Sanderson, M.: Evaluating multi-query sessions. In: Proceedings of ACM SIGIR 2011, pp. 1053-1062 (2011)
-
(2011)
Proceedings of ACM SIGIR 2011
, pp. 1053-1062
-
-
Kanoulas, E.1
Carterette, B.2
Clough, P.D.3
Sanderson, M.4
-
55
-
-
84883070185
-
Report from the NTCIR-10 1CLICK-2 Japanese subtask: Baselines, upperbounds and evaluation robustness
-
Kato, M.P., Sakai, T., Yamamoto, T., Iwata, M.: Report from the NTCIR-10 1CLICK-2 Japanese subtask: Baselines, upperbounds and evaluation robustness. In: Proceedings of ACM SIGIR 2013 (2013)
-
(2013)
Proceedings of ACM SIGIR 2013
-
-
Kato, M.P.1
Sakai, T.2
Yamamoto, T.3
Iwata, M.4
-
56
-
-
0036851939
-
Using graded relevance assessments in IR evaluation
-
Kekäläinen, J., Järvelin, K.: Using graded relevance assessments in IR evaluation. JASIST 53(13), 1120-1129 (2002)
-
(2002)
JASIST
, vol.53
, Issue.13
, pp. 1120-1129
-
-
Kekäläinen, J.1
Järvelin, K.2
-
57
-
-
33749530226
-
Property of average precision and its generalization: An examination of evaluation indicator for information retrieval
-
Kishida, K.: Property of average precision and its generalization: An examination of evaluation indicator for information retrieval. NII Technical Reports NII-2005-014E (2005)
-
(2005)
NII Technical Reports NII-2005-014E
-
-
Kishida, K.1
-
58
-
-
33751359439
-
Overview of CLIR task at the sixth NTCIR workshop
-
Kishida,K., Chen,K.H., Lee, S., Kuriyama,K., Kando, N., Chen, H.H.:Overview of CLIR task at the sixth NTCIR workshop. In: Proceedings of NTCIR-6, pp. 1-19 (2007)
-
(2007)
Proceedings of NTCIR-6
, pp. 1-19
-
-
Kishida, K.1
Chen, K.H.2
Lee, S.3
Kuriyama, K.4
Kando, N.5
Chen, H.H.6
-
59
-
-
84871050880
-
A comprehensive analysis of parameter settings for novelty-biased cumulative gain
-
Leenanupab, T., Zuccon, G., Jose, J.M.: A comprehensive analysis of parameter settings for novelty-biased cumulative gain. In: Proceedings of ACM CIKM 2012, pp. 1950-1954 (2012)
-
(2012)
Proceedings of ACM CIKM 2012
, pp. 1950-1954
-
-
Leenanupab, T.1
Zuccon, G.2
Jose, J.M.3
-
61
-
-
33748772836
-
Methods for automatically evaluating answers to complex questions
-
Lin, J., Demner-Fushman, D.: Methods for automatically evaluating answers to complex questions. Information Retrieval 9(5), 565-587 (2006)
-
(2006)
Information Retrieval
, vol.9
, Issue.5
, pp. 565-587
-
-
Lin, J.1
Demner-Fushman, D.2
-
62
-
-
77956042082
-
PRES: A score metric for evaluating recall-oriented information retrieval applications
-
Magdy, W., Jones, G.J.: PRES: A score metric for evaluating recall-oriented information retrieval applications. In: Proceedings of ACM SIGIR 2010, pp. 611-618 (2010)
-
(2010)
Proceedings of ACM SIGIR 2010
, pp. 611-618
-
-
Magdy, W.1
Jones, G.J.2
-
63
-
-
66949147248
-
Rank-biased precision for measurement of retrieval effectiveness
-
Moffat, A., Zobel, J.: Rank-biased precision for measurement of retrieval effectiveness. ACM TOIS 27(1) (2008)
-
(2008)
ACM TOIS
, vol.27
, Issue.1
-
-
Moffat, A.1
Zobel, J.2
-
65
-
-
34249275304
-
The pyramid method: Incorporating human content selection variation in summarization evaluation
-
Nenkova, A., Passonneau, R., McKeown, K.: The pyramid method: Incorporating human content selection variation in summarization evaluation. ACM Transactions on Speech and Language Processing 4(2), Article 4 (2007)
-
(2007)
ACM Transactions on Speech and Language Processing
, vol.4
, Issue.2
, pp. 4
-
-
Nenkova, A.1
Passonneau, R.2
McKeown, K.3
-
66
-
-
0141524308
-
Bleu: A method for automatic evaluation of machine translation
-
Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. IBM Research Report RC22176 (2001)
-
(2001)
IBM Research Report RC22176
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.J.4
-
67
-
-
16244364485
-
Measures for the comparison of information retrieval systems
-
Pollock, S.M.: Measures for the comparison of information retrieval systems. American Documentation 19(4), 387-397 (1968)
-
(1968)
American Documentation
, vol.19
, Issue.4
, pp. 387-397
-
-
Pollock, S.M.1
-
69
-
-
0017630891
-
The probability ranking principle in IR
-
Robertson, S.E.: The probability ranking principle in IR. Journal of Documentation 33, 130-137 (1977)
-
(1977)
Journal of Documentation
, vol.33
, pp. 130-137
-
-
Robertson, S.E.1
-
71
-
-
57349087085
-
A new interpretation of average precision
-
Robertson, S.E.: A new interpretation of average precision. In: Proceedings of ACM SIGIR 2008, pp. 689-690 (2008)
-
(2008)
Proceedings of ACM SIGIR 2008
, pp. 689-690
-
-
Robertson, S.E.1
-
73
-
-
77956029237
-
Extending average precision to graded relevance judgments
-
Robertson, S.E., Kanoulas, E., Yilmaz, E.: Extending average precision to graded relevance judgments. In: Proceedings of ACM SIGIR 2010, pp. 603-610 (2010)
-
(2010)
Proceedings of ACM SIGIR 2010
, pp. 603-610
-
-
Robertson, S.E.1
Kanoulas, E.2
Yilmaz, E.3
-
74
-
-
84901381577
-
New performance metrics based on multigrade relevance: Their application to question answering
-
Sakai, T.: New performance metrics based on multigrade relevance: Their application to question answering. In: Proceedings of NTCIR-4 (Open Submission Session) (2004)
-
Proceedings of NTCIR-4 (Open Submission Session) (2004)
-
-
Sakai, T.1
-
75
-
-
24344471313
-
Ranking the NTCIR systems based on multigrade relevance
-
Myaeng, S.-H., Zhou, M., Wong, K.-F., Zhang, H.-J. (eds.) AIRS 2004. Springer, Heidelberg
-
Sakai, T.: Ranking the NTCIR systems based on multigrade relevance. In: Myaeng, S.-H., Zhou, M., Wong, K.-F., Zhang, H.-J. (eds.) AIRS 2004. LNCS, vol. 3411, pp. 251-262. Springer, Heidelberg (2005)
-
(2005)
LNCS
, vol.3411
, pp. 251-262
-
-
Sakai, T.1
-
76
-
-
33751354079
-
Bootstrap-based comparisons of IR metrics for finding one relevant document
-
Ng, H.T., Leong, M.-K., Kan, M.-Y., Ji, D. (eds.) AIRS 2006. Springer, Heidelberg
-
Sakai, T.: Bootstrap-based comparisons of IR metrics for finding one relevant document. In: Ng, H.T., Leong, M.-K., Kan, M.-Y., Ji, D. (eds.) AIRS 2006. LNCS, vol. 4182, pp. 374-389. Springer, Heidelberg (2006)
-
(2006)
LNCS
, vol.4182
, pp. 374-389
-
-
Sakai, T.1
-
77
-
-
33750340100
-
Evaluating evaluation metrics based on the bootstrap
-
Sakai, T.: Evaluating evaluation metrics based on the bootstrap. In: Proceedings of ACM SIGIR 2006, pp. 525-532 (2006)
-
(2006)
Proceedings of ACM SIGIR 2006
, pp. 525-532
-
-
Sakai, T.1
-
78
-
-
38149100610
-
For building better retrieval systems: Trends in information retrieval evaluation based on graded relevance
-
in Japanese
-
Sakai, T.: For building better retrieval systems: Trends in information retrieval evaluation based on graded relevance (in Japanese). IPSJ Magazine 47(2), 147-158 (2006)
-
(2006)
IPSJ Magazine
, vol.47
, Issue.2
, pp. 147-158
-
-
Sakai, T.1
-
80
-
-
57349094546
-
On penalising late arrival of relevant documents in information retrieval evaluation with graded relevance
-
Sakai, T.: On penalising late arrival of relevant documents in information retrieval evaluation with graded relevance. In: Proceedings of EVIA 2007, pp. 32-43 (2007)
-
(2007)
Proceedings of EVIA 2007
, pp. 32-43
-
-
Sakai, T.1
-
81
-
-
70349242289
-
Comparing metrics across TREC and NTCIR: The robustness to system bias
-
Sakai, T.: Comparing metrics across TREC and NTCIR: The robustness to system bias. In: Proceedings of ACM CIKM 2008, pp. 581-590 (2008)
-
(2008)
Proceedings of ACM CIKM 2008
, pp. 581-590
-
-
Sakai, T.1
-
82
-
-
84860858401
-
Evaluation with informational and navigational intents
-
Sakai, T.: Evaluation with informational and navigational intents. In: Proceedings of WWW 2012, pp. 499-508 (2012)
-
(2012)
Proceedings of WWW 2012
, pp. 499-508
-
-
Sakai, T.1
-
83
-
-
84901374927
-
How intuitive are diversified search metrics? Concordance test results for the diversity U-measures
-
Sakai, T.: How intuitive are diversified search metrics? Concordance test results for the diversity U-measures. IPSJ SIG Technical Report 2013-IFAT-111 (2013)
-
(2013)
IPSJ SIG Technical Report 2013-IFAT-111
-
-
Sakai, T.1
-
84
-
-
84883084005
-
The unreusability of diversified test collections
-
Sakai, T.: The unreusability of diversified test collections. In: Proceedings of EVIA 2013 (2013)
-
(2013)
Proceedings of EVIA 2013
-
-
Sakai, T.1
-
85
-
-
84883092343
-
Summaries, ranked retrieval and sessions: A unified framework for information access evaluation
-
Sakai, T., Dou, Z.: Summaries, ranked retrieval and sessions: A unified framework for information access evaluation. In: Proceedings of ACM SIGIR 2013, pp. 473-482 (2013)
-
(2013)
Proceedings of ACM SIGIR 2013
, pp. 473-482
-
-
Sakai, T.1
Dou, Z.2
-
87
-
-
84871603799
-
The reusability of a diversified search test collection
-
Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. Springer, Heidelberg
-
Sakai, T., Dou, Z., Song, R., Kando, N.: The reusability of a diversified search test collection. In: Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. LNCS, vol. 7675, pp. 26-38. Springer, Heidelberg (2012)
-
(2012)
LNCS
, vol.7675
, pp. 26-38
-
-
Sakai, T.1
Dou, Z.2
Song, R.3
Kando, N.4
-
88
-
-
84883063122
-
Summary of the NTCIR-10 INTENT-2 task: Subtopic mining and search result diversification
-
Sakai, T., Dou, Z., Yamamoto, T., Liu, Y., Zhang, M., Kato, M.P., Song, R., Iwata, M.: Summary of the NTCIR-10 INTENT-2 task: Subtopic mining and search result diversification. In: Proceedings of ACM SIGIR 2013 (2013)
-
(2013)
Proceedings of ACM SIGIR 2013
-
-
Sakai, T.1
Dou, Z.2
Yamamoto, T.3
Liu, Y.4
Zhang, M.5
Kato, M.P.6
Song, R.7
Iwata, M.8
-
89
-
-
50849122035
-
On information retrieval metrics designed for evaluation with incomplete relevance assessments
-
Sakai, T., Kando, N.: On information retrieval metrics designed for evaluation with incomplete relevance assessments. Information Retrieval 11, 447-470 (2008)
-
(2008)
Information Retrieval
, vol.11
, pp. 447-470
-
-
Sakai, T.1
Kando, N.2
-
90
-
-
84871582413
-
One click one revisited: Enhancing evaluation based on information units
-
Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. Springer, Heidelberg
-
Sakai, T., Kato, M.P.: One click one revisited: Enhancing evaluation based on information units. In: Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. LNCS, vol. 7675, pp. 39-51. Springer, Heidelberg (2012)
-
(2012)
LNCS
, vol.7675
, pp. 39-51
-
-
Sakai, T.1
Kato, M.P.2
-
91
-
-
83055168077
-
Click the search button and be happy: Evaluating direct and immediate information access
-
Sakai, T., Kato, M.P., Song, Y.I.: Click the search button and be happy: Evaluating direct and immediate information access. In: Proceedings of ACM CIKM 2011, pp. 621-630 (2011)
-
(2011)
Proceedings of ACM CIKM 2011
, pp. 621-630
-
-
Sakai, T.1
Kato, M.P.2
Song, Y.I.3
-
92
-
-
76349094521
-
Modelling a user population for designing information retrieval metrics
-
Sakai, T., Robertson, S.: Modelling a user population for designing information retrieval metrics. In: Proceedings of EVIA 2008, pp. 30-41 (2008)
-
(2008)
Proceedings of EVIA 2008
, pp. 30-41
-
-
Sakai, T.1
Robertson, S.2
-
93
-
-
78650890564
-
Overview of NTCIR-8 ACLIA IR4QA
-
Sakai, T., Shima, H., Kando, N., Song, R., Lin, C.J., Mitamura, T., Sugimoto, M., Lee, C.W.: Overview of NTCIR-8 ACLIA IR4QA. In: Proceedings of NTCIR-8, pp. 63-93 (2010)
-
(2010)
Proceedings of NTCIR-8
, pp. 63-93
-
-
Sakai, T.1
Shima, H.2
Kando, N.3
Song, R.4
Lin, C.J.5
Mitamura, T.6
Sugimoto, M.7
Lee, C.W.8
-
94
-
-
80052111133
-
Evaluating diversified search results using per-intent graded relevance
-
Sakai, T., Song, R.: Evaluating diversified search results using per-intent graded relevance. In: Proceedings of ACM SIGIR 2011 (2011)
-
(2011)
Proceedings of ACM SIGIR 2011
-
-
Sakai, T.1
Song, R.2
-
95
-
-
84880838418
-
Diversified search evaluation: Lessons from the NTCIR-9 INTENT task
-
Sakai, T., Song, R.: Diversified search evaluation: Lessons from the NTCIR-9 INTENT task. Information Retrieval (2013)
-
(2013)
Information Retrieval
-
-
Sakai, T.1
Song, R.2
-
97
-
-
77954220071
-
Test collection based evaluation of information retrieval systems
-
Sanderson, M.: Test collection based evaluation of information retrieval systems. Foundations and Trends in Information Retrieval 4, 247-375 (2010)
-
(2010)
Foundations and Trends in Information Retrieval
, vol.4
, pp. 247-375
-
-
Sanderson, M.1
-
98
-
-
77956037058
-
Do user preferences and evaluation measures line up?
-
Sanderson, M., Paramita, M.L., Clough, P., Kanoulas, E.: Do user preferences and evaluation measures line up? In: Proceedings of ACM SIGIR 2010, pp. 555-562 (2010)
-
(2010)
Proceedings of ACM SIGIR 2010
, pp. 555-562
-
-
Sanderson, M.1
Paramita, M.L.2
Clough, P.3
Kanoulas, E.4
-
99
-
-
84885608872
-
Information retrieval system evaluation: Effort, sensitivity, and reliability
-
Sanderson, M., Zobel, J.: Information retrieval system evaluation: Effort, sensitivity, and reliability. In: Proceedings of ACM SIGIR 2005, pp. 162-169 (2005)
-
(2005)
Proceedings of ACM SIGIR 2005
, pp. 162-169
-
-
Sanderson, M.1
Zobel, J.2
-
100
-
-
0031193029
-
Statistical inference in retrieval effectiveness evaluation
-
Savoy, J.: Statistical inference in retrieval effectiveness evaluation. Information Processing and Management 33(4), 495-512 (1997)
-
(1997)
Information Processing and Management
, vol.33
, Issue.4
, pp. 495-512
-
-
Savoy, J.1
-
101
-
-
63449088172
-
A comparison of statistical significance tests for information retrieval evaluation
-
Smucker, M.D., Allan, J., Carterette, B.: A comparison of statistical significance tests for information retrieval evaluation. In: Proceedings of ACM CIKM 2007, pp. 623-632 (2007)
-
(2007)
Proceedings of ACM CIKM 2007
, pp. 623-632
-
-
Smucker, M.D.1
Allan, J.2
Carterette, B.3
-
105
-
-
72449172028
-
Including summaries in system evaluation
-
Turpin, A., Scholer, F., Järvelin, K., Wu, M., Culpepper, J.S.: Including summaries in system evaluation. In: Proceedings of ACM SIGIR 2009, pp. 508-515 (2009)
-
(2009)
Proceedings of ACM SIGIR 2009
, pp. 508-515
-
-
Turpin, A.1
Scholer, F.2
Järvelin, K.3
Wu, M.4
Culpepper, J.S.5
-
106
-
-
8644262918
-
The philosophy of information retrieval evaluation
-
Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. Springer, Heidelberg
-
Voorhees, E.M.: The philosophy of information retrieval evaluation. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 355-370. Springer, Heidelberg (2002)
-
(2002)
LNCS
, vol.2406
, pp. 355-370
-
-
Voorhees, E.M.1
-
107
-
-
0036993119
-
The effect of topic set size on retrieval experiment error
-
Voorhees, E.M., Buckley, C.: The effect of topic set size on retrieval experiment error. In: Proceedings of ACM SIGIR 2002, pp. 316-323 (2002)
-
(2002)
Proceedings of ACM SIGIR 2002
, pp. 316-323
-
-
Voorhees, E.M.1
Buckley, C.2
-
109
-
-
57349160444
-
Score standardization for inter-collection comparison of retrieval systems
-
Webber, W., Moffat, A., Zobel, J.: Score standardization for inter-collection comparison of retrieval systems. In: Proceedings of ACM SIGIR 2008, pp. 51-58 (2008)
-
(2008)
Proceedings of ACM SIGIR 2008
, pp. 51-58
-
-
Webber, W.1
Moffat, A.2
Zobel, J.3
-
110
-
-
70349250276
-
Statistical power in retrieval experimentation
-
Webber, W., Moffat, A., Zobel, J.: Statistical power in retrieval experimentation. In: Proceedings of ACM CIKM 2008, pp. 571-580 (2008)
-
(2008)
Proceedings of ACM CIKM 2008
, pp. 571-580
-
-
Webber, W.1
Moffat, A.2
Zobel, J.3
-
111
-
-
80052117619
-
The effect of pooling and evaluation depth on metric stability
-
Webber, W., Moffat, A., Zobel, J.: The effect of pooling and evaluation depth on metric stability. In: Proceedings of EVIA 2010, pp. 7-15 (2010)
-
(2010)
Proceedings of EVIA 2010
, pp. 7-15
-
-
Webber, W.1
Moffat, A.2
Zobel, J.3
-
112
-
-
80051482719
-
A similarity measure for indefinite rankings
-
Webber, W., Moffat, A., Zobel, J.: A similarity measure for indefinite rankings. ACM TOIS 28(4) (2010)
-
(2010)
ACM TOIS
, vol.28
, Issue.4
-
-
Webber, W.1
Moffat, A.2
Zobel, J.3
-
113
-
-
72449157957
-
Score adjustment for correction of pooling bias
-
Webber, W., Park, L.A.: Score adjustment for correction of pooling bias. In: Proceedings of ACM SIGIR 2009, pp. 444-451 (2009)
-
(2009)
Proceedings of ACM SIGIR 2009
, pp. 444-451
-
-
Webber, W.1
Park, L.A.2
-
114
-
-
70449488015
-
Modeling expected utility of multi-session information distillation
-
Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. Springer, Heidelberg
-
Yang, Y., Lad, A.: Modeling expected utility of multi-session information distillation. In: Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. LNCS, vol. 5766, pp. 164-175. Springer, Heidelberg (2009)
-
(2009)
LNCS
, vol.5766
, pp. 164-175
-
-
Yang, Y.1
Lad, A.2
-
115
-
-
57349152359
-
A new rank correlation coefficient for information retrieval
-
Yilmaz, E., Aslam, J., Robertson, S.: A new rank correlation coefficient for information retrieval. In: Proceedings of ACM SIGIR 2008, pp. 587-594 (2008)
-
(2008)
Proceedings of ACM SIGIR 2008
, pp. 587-594
-
-
Yilmaz, E.1
Aslam, J.2
Robertson, S.3
-
116
-
-
34547632535
-
Estimating average precision with incomplete and imperfect judgments
-
Yilmaz, E., Aslam, J.A.: Estimating average precision with incomplete and imperfect judgments. In: ACM CIKM 2006 Proceedings, pp. 102-111 (2006)
-
(2006)
ACM CIKM 2006 Proceedings
, pp. 102-111
-
-
Yilmaz, E.1
Aslam, J.A.2
-
117
-
-
78651268113
-
Expected browsing utility for web search evaluation
-
Yilmaz, E., Shokouhi, M., Craswell, N., Robertson, S.: Expected browsing utility for web search evaluation. In: Proceedings of ACM CIKM 2010, pp. 1561-1564 (2010)
-
(2010)
Proceedings of ACM CIKM 2010
, pp. 1561-1564
-
-
Yilmaz, E.1
Shokouhi, M.2
Craswell, N.3
Robertson, S.4
-
118
-
-
1542347826
-
Beyond independent relevance: Methods and evaluation metrics for subtopic retrieval
-
Zhai, C., Cohen, W.W., Lafferty, J.: Beyond independent relevance: Methods and evaluation metrics for subtopic retrieval. In: Proceedings of ACM SIGIR 2003, pp. 10-17 (2003)
-
(2003)
Proceedings of ACM SIGIR 2003
, pp. 10-17
-
-
Zhai, C.1
Cohen, W.W.2
Lafferty, J.3
-
119
-
-
76349123386
-
Click-based evidence for decaying weight distributions in search effectiveness metrics
-
Zhang, Y., Park, L.A.F., Moffat, A.: Click-based evidence for decaying weight distributions in search effectiveness metrics. Information Retrieval 13(1), 46-69 (2010)
-
(2010)
Information Retrieval
, vol.13
, Issue.1
, pp. 46-69
-
-
Zhang, Y.1
Park, L.A.F.2
Moffat, A.3
-
120
-
-
84866618066
-
Evaluating aggregated search pages
-
Zhou, K., Cummins, R., Lalmas, M., Jose, J.M.: Evaluating aggregated search pages. In: Proceedings of ACM SIGIR 2012, pp. 115-124 (2012)
-
(2012)
Proceedings of ACM SIGIR 2012
, pp. 115-124
-
-
Zhou, K.1
Cummins, R.2
Lalmas, M.3
Jose, J.M.4
-
121
-
-
0032272626
-
How reliable are the results of large-scale information retrieval experiments?
-
Zobel, J.: How reliable are the results of large-scale information retrieval experiments? In: Proceedings of ACM SIGIR 1998, pp. 307-314 (1998)
-
(1998)
Proceedings of ACM SIGIR 1998
, pp. 307-314
-
-
Zobel, J.1
|