SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 8173 LNCS, Issue , 2014, Pages 116-163

Metrics, statistics, tests

(1) Sakai, Tetsuya a

a WASEDA UNIVERSITY (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

INFORMATION RETRIEVAL;

EFFECTIVENESS METRICS; EVALUATION METRICS; STATISTICAL SIGNIFICANCE; STATISTICAL SIGNIFICANCE TEST; TECHNOLOGICAL ADVANCES; TEST COLLECTION; XML RETRIEVAL;

STATISTICS;

EID: 84901306933 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-54798-0_6 Document Type: Conference Paper

Times cited : (50)

References (121)

1
- 70349138027
- Diversifying search results
- Agrawal, R., Sreenivas, G., Halverson, A., Leong, S.: Diversifying search results. In: Proceedings of ACM WSDM 2009, pp. 5-14 (2009)
- (2009) Proceedings of ACM WSDM 2009 , pp. 5-14
- Agrawal, R.¹ Sreenivas, G.² Halverson, A.³ Leong, S.⁴

2
- 34547647257
- Retrieval evaluation with incomplete relevance data: A comparative study of three measures
- Ahlgren, P., Grönqvist, L.: Retrieval evaluation with incomplete relevance data: A comparative study of three measures. In: Proceedings of ACM CIKM 2006, pp. 872-873 (2006)
- (2006) Proceedings of ACM CIKM 2006 , pp. 872-873
- Ahlgren, P.¹ Grönqvist, L.²

3
- 84871533644
- Frontiers, challenges and opportunities for information retrieval: Report from SWIRL 2012
- Allan, J., Aslam, J., Azzopardi, L., Belkin, N., Borlund, P., Bruza, P., Callan, J., Carman, M., Clarke, C.L., Craswell, N., Croft, W.B., Culpepper, J.S., Diaz, F., Dumais, S., Ferro, N., Geva, S., Gonzalo, J., Hawking, D., Jarvelin, K., Jones, G., Jones, R., Kamps, J., Kando, N., Kanoulas, E., Karlgren, J., Kelly, D., Lease, M., Lin, J., Mizzaro, S., Moffat, A., Murdock, V., Oard, D.W., de Rijke, M., Sakai, T., Sanderson, M., Scholer, F., Si, L., Thom, J.A., Thomas, P., Trotman, A., Turpin, A., de Vries, A.P., Webber, W., Zhang, X., Zhang, Y.: Frontiers, challenges and opportunities for information retrieval: Report from SWIRL 2012. SIGIR Forum 46(1), 2-32 (2012)
- (2012) SIGIR Forum , vol.46 , Issue.1 , pp. 2-32
- Allan, J.¹ Aslam, J.² Azzopardi, L.³ Belkin, N.⁴ Borlund, P.⁵ Bruza, P.⁶ Callan, J.⁷ Carman, M.⁸ Clarke, C.L.⁹ Craswell, N.¹⁰ Croft, W.B.¹¹ Culpepper, J.S.¹² Diaz, F.¹³ Dumais, S.¹⁴ Ferro, N.¹⁵ Geva, S.¹⁶ Gonzalo, J.¹⁷ Hawking, D.¹⁸ Jarvelin, K.¹⁹ Jones, G.²⁰ more..

4
- 84885591216
- When will information retrieval be "good enough"?
- Allan, J., Carterette, B., Lewis, J.: When will information retrieval be "good enough"? In: Proceedings of ACM SIGIR 2005, pp. 433-440 (2005)
- (2005) Proceedings of ACM SIGIR 2005 , pp. 433-440
- Allan, J.¹ Carterette, B.² Lewis, J.³

5
- 85015342057
- A methodology for evaluating aggregated search results
- Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. Springer, Heidelberg
- Arguello, J., Diaz, F., Callan, J., Carterette, B.: A methodology for evaluating aggregated search results. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 141- 152. Springer, Heidelberg (2011)
- (2011) LNCS , vol.6611 , pp. 141-152
- Arguello, J.¹ Diaz, F.² Callan, J.³ Carterette, B.⁴

6
- 77957131892
- Expected reading effort in focused retrieval evaluation
- Arvola, P., Kekäläinen, J., Junkkari, M.: Expected reading effort in focused retrieval evaluation. Information Retrieval 13(5), 460-484 (2010)
- (2010) Information Retrieval , vol.13 , Issue.5 , pp. 460-484
- Arvola, P.¹ Kekäläinen, J.² Junkkari, M.³

7
- 84871631227
- On the informativeness of cascade and intent-aware effectiveness measures
- Ashkan, A., Clarke, C.L.: On the informativeness of cascade and intent-aware effectiveness measures. In: Proceedings of ACM SIGIR 2013, pp. 407-416 (2011)
- (2011) Proceedings of ACM SIGIR 2013 , pp. 407-416
- Ashkan, A.¹ Clarke, C.L.²

8
- 33746264710
- The maximum entropy method for analyzing retrieval measures
- Aslam, J.A., Yilmaz, E., Pavlu, V.: The maximum entropy method for analyzing retrieval measures. In: Proceedings of ACM SIGIR 2005, pp. 27-34 (2005)
- (2005) Proceedings of ACM SIGIR 2005 , pp. 27-34
- Aslam, J.A.¹ Yilmaz, E.² Pavlu, V.³

9
- 74549121417
- Usage based effectiveness measures
- Azzopardi, L.: Usage based effectiveness measures. In: Proceedings of ACM CIKM 2009, pp. 631-640 (2009)
- (2009) Proceedings of ACM CIKM 2009 , pp. 631-640
- Azzopardi, L.¹

10
- 84866615871
- Time drives interaction: Simulating sessions in diverse searching environments
- Baskaya, F., Keskustalo, H., Järvelin, K.: Time drives interaction: Simulating sessions in diverse searching environments. In: Proceedings of ACM SIGIR 2012, pp. 105-114 (2012)
- (2012) Proceedings of ACM SIGIR 2012 , pp. 105-114
- Baskaya, F.¹ Keskustalo, H.² Järvelin, K.³

11
- 36448947171
- Test theory for assessing IR test collections
- Bodoff, D., Li, P.: Test theory for assessing IR test collections. In: Proceedings of ACM SIGIR 2007, pp. 367-374 (2007)
- (2007) Proceedings of ACM SIGIR 2007 , pp. 367-374
- Bodoff, D.¹ Li, P.²

12
- 0019670083
- Measurement-theoretical investigation of the MZ-metric
- Bollman, P., Cherniavsky, V.S.: Measurement-theoretical investigation of the MZ-metric. In: Proceedings of ACM SIGIR 1980, pp. 256-267 (1980)
- (1980) Proceedings of ACM SIGIR 1980 , pp. 256-267
- Bollman, P.¹ Cherniavsky, V.S.²

13
- 79952370249
- Dynamic ranked retrieval
- Brandt, C., Joachims, T., Yue, Y., Bank, J.: Dynamic ranked retrieval. In: Proceedings of ACM WSDM 2011, pp. 247-256 (2011)
- (2011) Proceedings of ACM WSDM 2011 , pp. 247-256
- Brandt, C.¹ Joachims, T.² Yue, Y.³ Bank, J.⁴

14
- 0142030258
- A taxonomy of web search
- Broder, A.: A taxonomy of web search. SIGIR Forum 36(2) (2002)
- (2002) SIGIR Forum , vol.36 , Issue.2
- Broder, A.¹

15
- 0033650323
- Evaluating evaluation measure stability
- Buckley, C., Voorhees, E.M.: Evaluating evaluation measure stability. In: Proceedings of ACM SIGIR 2000, pp. 33-40 (2000)
- (2000) Proceedings of ACM SIGIR 2000 , pp. 33-40
- Buckley, C.¹ Voorhees, E.M.²

16
- 8644251996
- Retrieval evaluation with incomplete information
- Buckley, C., Voorhees, E.M.: Retrieval evaluation with incomplete information. In: Proceedings of ACM SIGIR 2004, pp. 25-32 (2004)
- (2004) Proceedings of ACM SIGIR 2004 , pp. 25-32
- Buckley, C.¹ Voorhees, E.M.²

17
- 31844446958
- Learning to rank using gradient descent
- Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M., Hamilton, N., Hullender, G.: Learning to rank using gradient descent. In: Proceedings of ICML 2005, pp. 89-96 (2005)
- (2005) Proceedings of ICML 2005 , pp. 89-96
- Burges, C.¹ Shaked, T.² Renshaw, E.³ Lazier, A.⁴ Deeds, M.⁵ Hamilton, N.⁶ Hullender, G.⁷

18
- 36448986732
- Reliable information retrieval evaluation with incomplete and biased judgements
- Büttcher, S., Clarke, C.L., Yeung, P.C., Soboroff, I.: Reliable information retrieval evaluation with incomplete and biased judgements. In: Proceedings of ACM SIGIR 2007, pp. 63-70 (2007)
- (2007) Proceedings of ACM SIGIR 2007 , pp. 63-70
- Büttcher, S.¹ Clarke, C.L.² Yeung, P.C.³ Soboroff, I.⁴

19
- 72449194290
- On rank correlation and the distance between rankings
- Carterette, B.: On rank correlation and the distance between rankings. In: Proceedings of ACM SIGIR 2009, pp. 436-443 (2009)
- (2009) Proceedings of ACM SIGIR 2009 , pp. 436-443
- Carterette, B.¹

20
- 80052123590
- System effectiveness, user models, and user utility: A conceptual framework for investigation
- Carterette, B.: System effectiveness, user models, and user utility: A conceptual framework for investigation. In: Proceedings of ACM SIGIR 2011, pp. 903-912 (2011)
- (2011) Proceedings of ACM SIGIR 2011 , pp. 903-912
- Carterette, B.¹

21
- 84859011471
- Multiple testing in statistical analysis of systems-based information retrieval experiments
- Carterette, B.: Multiple testing in statistical analysis of systems-based information retrieval experiments. ACM TOIS 30(1) (2012)
- (2012) ACM TOIS , vol.30 , Issue.1
- Carterette, B.¹

22
- 41849104667
- Here or there preference judgments for relevance
- DOI 10.1007/978-3-540-78646-7-5, Advances in Information Retrieval - 30th European Conference on IR Research, ECIR 2008, Proceedings
- Carterette, B., Bennett, P.N., Chickering, D.M., Dumais, S.T.: Here or there: Preference judgments for relevance. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I.,White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 16-27. Springer, Heidelberg (2008) (Pubitemid 351499055)
- (2008) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.4956 LNCS , pp. 16-27
- Carterette, B.¹ Bennett, P.N.² Chickering, D.M.³ Dumais, S.T.⁴

23
- 57349133736
- Evaluation over thousands of queries
- Carterette, B., Pavlu, V., Kanoulas, E., Aslam, J.A., Allan, J.: Evaluation over thousands of queries. In: Proceedings of ACM SIGIR 2008, pp. 651-658 (2008)
- (2008) Proceedings of ACM SIGIR 2008 , pp. 651-658
- Carterette, B.¹ Pavlu, V.² Kanoulas, E.³ Aslam, J.A.⁴ Allan, J.⁵

24
- 84880779561
- Analysis of various evaluation measures for diversity
- Chandar, P., Carterette, B.: Analysis of various evaluation measures for diversity. In: Proceedings of DDR 2011, pp. 21-28 (2011)
- (2011) Proceedings of DDR 2011 , pp. 21-28
- Chandar, P.¹ Carterette, B.²

25
- 84901306970
- What qualities do users prefer in diversity rankings?
- Chandar, P., Carterette, B.: What qualities do users prefer in diversity rankings? In: Proceedings of DDR 2012 (2012)
- (2012) Proceedings of DDR 2012
- Chandar, P.¹ Carterette, B.²

26
- 80255123851
- Intent-based diversification of web search results: Metrics and algorithms
- Chapelle, O., Ji, S., Liao, C., Velipasaoglu, E., Lai, L., Wu, S.L.: Intent-based diversification of web search results: Metrics and algorithms. Information Retrieval 14(6), 572-592 (2011)
- (2011) Information Retrieval , vol.14 , Issue.6 , pp. 572-592
- Chapelle, O.¹ Ji, S.² Liao, C.³ Velipasaoglu, E.⁴ Lai, L.⁵ Wu, S.L.⁶

27
- 74549208546
- Expected reciprocal rank for graded relevance
- Chapelle, O., Metzler, D., Zhang, Y., Grinspan, P.: Expected reciprocal rank for graded relevance. In: Proceedings of ACM CIKM 2009, pp. 621-630 (2009)
- (2009) Proceedings of ACM CIKM 2009 , pp. 621-630
- Chapelle, O.¹ Metzler, D.² Zhang, Y.³ Grinspan, P.⁴

28
- 85107916112
- MUC-4 evaluation metrics
- Chinchor, N.: MUC-4 evaluation metrics. In: Proceedings of MUC-4, pp. 22-29 (1992)
- (1992) Proceedings of MUC-4 , pp. 22-29
- Chinchor, N.¹

29
- 77956044089
- Overview of the TREC 2009 web track
- Clarke, C.L., Craswell, N., Soboroff, I.: Overview of the TREC 2009 web track. In: Proceedings of TREC 2009 (2009)
- (2009) Proceedings of TREC 2009
- Clarke, C.L.¹ Craswell, N.² Soboroff, I.³

30
- 79952370248
- A comparative analysis of cascade measures for novelty and diversity
- Clarke, C.L., Craswell, N., Soboroff, I., Ashkan, A.: A comparative analysis of cascade measures for novelty and diversity. In: Proceedings of ACM WSDM 2011, pp. 75-84 (2011)
- (2011) Proceedings of ACM WSDM 2011 , pp. 75-84
- Clarke, C.L.¹ Craswell, N.² Soboroff, I.³ Ashkan, A.⁴

31
- 85175623957
- Overview of the TREC 2011 web track
- Clarke, C.L., Craswell, N., Soboroff, I., Voorhees, E.: Overview of the TREC 2011 web track. In: Proceedings of TREC 2011 (2012)
- (2012) Proceedings of TREC 2011
- Clarke, C.L.¹ Craswell, N.² Soboroff, I.³ Voorhees, E.⁴

32
- 57349111122
- Novelty and diversity in information retrieval evaluation
- Clarke, C.L., Kolla, M., Cormack, G.V., Vechtomova, O., Ashkan, A., Büttcher, S., MacKinnon, I.: Novelty and diversity in information retrieval evaluation. In: Proceedings of ACM SIGIR 2008, pp. 659-666 (2009)
- (2009) Proceedings of ACM SIGIR 2008 , pp. 659-666
- Clarke, C.L.¹ Kolla, M.² Cormack, G.V.³ Vechtomova, O.⁴ Ashkan, A.⁵ Büttcher, S.⁶ MacKinnon, I.⁷

33
- 70350576772
- An effectiveness measure for ambiguous and underspecified queries
- Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. Springer, Heidelberg
- Clarke, C.L.A., Kolla, M., Vechtomova, O.: An effectiveness measure for ambiguous and underspecified queries. In: Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. LNCS, vol. 5766, pp. 188-199. Springer, Heidelberg (2009)
- (2009) LNCS , vol.5766 , pp. 188-199
- Clarke, C.L.A.¹ Kolla, M.² Vechtomova, O.³

34
- 85175623957
- Overview of the TREC 2012 web track
- Clarke, C.L., Craswell, N., Voorhees, E.: Overview of the TREC 2012 web track. In: Proceedings of TREC 2012 (2013)
- (2013) Proceedings of TREC 2012
- Clarke, C.L.¹ Craswell, N.² Voorhees, E.³

35
- 0002064245
- Expected search length: A single measure of retrieval effectiveness based on the weak ordering action of retrieval systems
- Cooper, W.S.: Expected search length: A single measure of retrieval effectiveness based on the weak ordering action of retrieval systems. JASIS 19(1), 30-41 (1968)
- (1968) JASIS , vol.19 , Issue.1 , pp. 30-41
- Cooper, W.S.¹

36
- 0015604498
- On selecting a measure of retrieval effectiveness
- Cooper, W.S.: On selecting a measure of retrieval effectiveness. JASIS 24(2), 87-100 (1973)
- (1973) JASIS , vol.24 , Issue.2 , pp. 87-100
- Cooper, W.S.¹

37
- 0015681463
- On selecting a measure of retrieval effectiveness: Part II. Implementation of the philosophy
- Cooper, W.S.: On selecting a measure of retrieval effectiveness: Part II. Implementation of the philosophy. JASIS 24(6), 413-424 (1973)
- (1973) JASIS , vol.24 , Issue.6 , pp. 413-424
- Cooper, W.S.¹

38
- 33750336173
- Statistical precision of information retrieval evaluation
- Cormack, G.V., Lynam, T.R.: Statistical precision of information retrieval evaluation. In: Proceedings of ACM SIGIR 2006 (2006)
- (2006) Proceedings of ACM SIGIR 2006
- Cormack, G.V.¹ Lynam, T.R.²

39
- 44649166130
- Different structures for evaluating answers to complex questions: Pyramids won't topple, and neither will human assessors
- Dang, H., Lin, J.: Different structures for evaluating answers to complex questions: Pyramids won't topple, and neither will human assessors. In: Proceedings of ACL 2007, pp. 768-775 (2007)
- (2007) Proceedings of ACL 2007 , pp. 768-775
- Dang, H.¹ Lin, J.²

40
- 33750363214
- Rpref: A generalization of bpref towards graded relevance judgments
- De Beer, J., Moens, M.F.: Rpref: A generalization of bpref towards graded relevance judgments. In: Proceedings of ACM SIGIR 2006, pp. 637-638 (2006)
- (2006) Proceedings of ACM SIGIR 2006 , pp. 637-638
- De Beer, J.¹ Moens, M.F.²

41
- 1842680293
- Measuring retrieval effectiveness: A new proposal and a first experimental validation
- Della Mea, V., Mizzaro, S.: Measuring retrieval effectiveness: A new proposal and a first experimental validation. JASIST 55(6), 503-543 (2004)
- (2004) JASIST , vol.55 , Issue.6 , pp. 503-543
- Della Mea, V.¹ Mizzaro, S.²

42
- 0030650239
- Time, relevance and interaction modelling for information retrieval
- Dunlop,M.D.: Time, relevance and interaction modelling for information retrieval. In: Proceedings of ACM SIGIR 1997, pp. 206-213 (1997)
- (1997) Proceedings of ACM SIGIR 1997 , pp. 206-213
- Dunlop, M.D.¹

43
- 0003991665
- Chapman & Hall/CRC
- Efron, B., Tibshirani, R.J.: An Introduction to the Bootstrap. Chapman & Hall/CRC (1993)
- (1993) An Introduction to the Bootstrap
- Efron, B.¹ Tibshirani, R.J.²

44
- 55449094674
- Overview of the web retrieval task at the third NTCIR workshop
- Eguchi, K., Oyama, K., Ishida, E., Kando, N., Kuriyama, K.: Overview of the web retrieval task at the third NTCIR workshop. NII Technical Reports NII-2003-002E (2003)
- (2003) NII Technical Reports NII-2003-002E
- Eguchi, K.¹ Oyama, K.² Ishida, E.³ Kando, N.⁴ Kuriyama, K.⁵

45
- 84886070538
- NTCIR9-GeoTime overview - Evaluating geographic and temporal search: Round 2
- Gey, F., Larson, R., Machado, J., Yoshioka, M.: NTCIR9-GeoTime overview - evaluating geographic and temporal search: Round 2. In: Proceedings of NTCIR-9, pp. 9-17 (2011)
- (2011) Proceedings of NTCIR-9 , pp. 9-17
- Gey, F.¹ Larson, R.² Machado, J.³ Yoshioka, M.⁴

46
- 84880823081
- Increasing evaluation sensitivity to diversity
- Golbus, P.B., Aslam, J.A., Clarke, C.L.: Increasing evaluation sensitivity to diversity. Information Retrieval (2013)
- (2013) Information Retrieval
- Golbus, P.B.¹ Aslam, J.A.² Clarke, C.L.³

47
- 0027725490
- Using statistical testing in the evaluation of retrieval experiments
- Hull, D.: Using statistical testing in the evaluation of retrieval experiments. In: Proceedings of ACM SIGIR 1993. pp. 329-338 (1993)
- (1993) Proceedings of ACM SIGIR 1993 , pp. 329-338
- Hull, D.¹

48
- 33846563409
- Why most published research findings are false
- Ioannidis, J.P.: Why most published research findings are false. PLoS Med. 2(8) (2005)
- (2005) PLoS Med , vol.2 , Issue.8
- Ioannidis, J.P.¹

49
- 1842637192
- Cumulated gain-based evaluation of IR techniques
- Järvelin, K., Kekäläinen, J.: Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems 20(4), 422-446 (2002)
- (2002) ACM Transactions on Information Systems , vol.20 , Issue.4 , pp. 422-446
- Järvelin, K.¹ Kekäläinen, J.²

50
- 41849104668
- Discounted cumulated gain based evaluation of multiple-query IR sessions
- DOI 10.1007/978-3-540-78646-7-4, Advances in Information Retrieval - 30th European Conference on IR Research, ECIR 2008, Proceedings
- Järvelin, K., Price, S.L., Delcambre, L.M.L., Nielsen, M.L.: Discounted cumulated gain based evaluation of multiple-query IR sessions. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 4-15. Springer, Heidelberg (2008) (Pubitemid 351499054)
- (2008) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.4956 LNCS , pp. 4-15
- Jarvelin, K.¹ Price, S.L.² Delcambre, L.M.L.³ Nielsen, M.L.⁴

51
- 0032808670
- The insignificance of statistical significance testing
- Johnson, D.H.: The insignificance of statistical significance testing. The Journal of Wildlife Management 63(3), 763-772 (1999)
- (1999) The Journal of Wildlife Management , vol.63 , Issue.3 , pp. 763-772
- Johnson, D.H.¹

52
- 51849123809
- INEX 2007 evaluation measures
- Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. Springer, Heidelberg
- Kamps, J., Pehcevski, J., Kazai, G., Lalmas, M., Robertson, S.: INEX 2007 evaluation measures. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 24-33. Springer, Heidelberg (2008)
- (2008) LNCS , vol.4862 , pp. 24-33
- Kamps, J.¹ Pehcevski, J.² Kazai, G.³ Lalmas, M.⁴ Robertson, S.⁵

53
- 74549195035
- Empirical justification of the gain and discount function for nDCG
- Kanoulas, E., Aslam, J.A.: Empirical justification of the gain and discount function for nDCG. In: ACM CIKM 2009, pp. 611-620 (2009)
- (2009) ACM CIKM 2009 , pp. 611-620
- Kanoulas, E.¹ Aslam, J.A.²

54
- 80052112994
- Evaluating multi-query sessions
- Kanoulas, E., Carterette, B., Clough, P.D., Sanderson, M.: Evaluating multi-query sessions. In: Proceedings of ACM SIGIR 2011, pp. 1053-1062 (2011)
- (2011) Proceedings of ACM SIGIR 2011 , pp. 1053-1062
- Kanoulas, E.¹ Carterette, B.² Clough, P.D.³ Sanderson, M.⁴

55
- 84883070185
- Report from the NTCIR-10 1CLICK-2 Japanese subtask: Baselines, upperbounds and evaluation robustness
- Kato, M.P., Sakai, T., Yamamoto, T., Iwata, M.: Report from the NTCIR-10 1CLICK-2 Japanese subtask: Baselines, upperbounds and evaluation robustness. In: Proceedings of ACM SIGIR 2013 (2013)
- (2013) Proceedings of ACM SIGIR 2013
- Kato, M.P.¹ Sakai, T.² Yamamoto, T.³ Iwata, M.⁴

56
- 0036851939
- Using graded relevance assessments in IR evaluation
- Kekäläinen, J., Järvelin, K.: Using graded relevance assessments in IR evaluation. JASIST 53(13), 1120-1129 (2002)
- (2002) JASIST , vol.53 , Issue.13 , pp. 1120-1129
- Kekäläinen, J.¹ Järvelin, K.²

57
- 33749530226
- Property of average precision and its generalization: An examination of evaluation indicator for information retrieval
- Kishida, K.: Property of average precision and its generalization: An examination of evaluation indicator for information retrieval. NII Technical Reports NII-2005-014E (2005)
- (2005) NII Technical Reports NII-2005-014E
- Kishida, K.¹

58
- 33751359439
- Overview of CLIR task at the sixth NTCIR workshop
- Kishida,K., Chen,K.H., Lee, S., Kuriyama,K., Kando, N., Chen, H.H.:Overview of CLIR task at the sixth NTCIR workshop. In: Proceedings of NTCIR-6, pp. 1-19 (2007)
- (2007) Proceedings of NTCIR-6 , pp. 1-19
- Kishida, K.¹ Chen, K.H.² Lee, S.³ Kuriyama, K.⁴ Kando, N.⁵ Chen, H.H.⁶

59
- 84871050880
- A comprehensive analysis of parameter settings for novelty-biased cumulative gain
- Leenanupab, T., Zuccon, G., Jose, J.M.: A comprehensive analysis of parameter settings for novelty-biased cumulative gain. In: Proceedings of ACM CIKM 2012, pp. 1950-1954 (2012)
- (2012) Proceedings of ACM CIKM 2012 , pp. 1950-1954
- Leenanupab, T.¹ Zuccon, G.² Jose, J.M.³

60
- 26944501715
- ROUGE: A package for automatic evaluation of summaries
- Lin, C.Y.: ROUGE: A package for automatic evaluation of summaries. In: Proceedings of the ACL 2004 Workshop on Text Summarization Branches Out (2004)
- (2004) Proceedings of the ACL 2004 Workshop on Text Summarization Branches Out
- Lin, C.Y.¹

61
- 33748772836
- Methods for automatically evaluating answers to complex questions
- Lin, J., Demner-Fushman, D.: Methods for automatically evaluating answers to complex questions. Information Retrieval 9(5), 565-587 (2006)
- (2006) Information Retrieval , vol.9 , Issue.5 , pp. 565-587
- Lin, J.¹ Demner-Fushman, D.²

62
- 77956042082
- PRES: A score metric for evaluating recall-oriented information retrieval applications
- Magdy, W., Jones, G.J.: PRES: A score metric for evaluating recall-oriented information retrieval applications. In: Proceedings of ACM SIGIR 2010, pp. 611-618 (2010)
- (2010) Proceedings of ACM SIGIR 2010 , pp. 611-618
- Magdy, W.¹ Jones, G.J.²

63
- 66949147248
- Rank-biased precision for measurement of retrieval effectiveness
- Moffat, A., Zobel, J.: Rank-biased precision for measurement of retrieval effectiveness. ACM TOIS 27(1) (2008)
- (2008) ACM TOIS , vol.27 , Issue.1
- Moffat, A.¹ Zobel, J.²

64
- 84901382132
- Automatic evaluation in text summarization
- in Japanese
- Nanba, H., Hirao, T.: Automatic evaluation in text summarization (in Japanese). Transactions of the Japanese Society for Artificial Intelligence 22(1), 10-16 (2008)
- (2008) Transactions of the Japanese Society for Artificial Intelligence , vol.22 , Issue.1 , pp. 10-16
- Nanba, H.¹ Hirao, T.²

65
- 34249275304
- The pyramid method: Incorporating human content selection variation in summarization evaluation
- Nenkova, A., Passonneau, R., McKeown, K.: The pyramid method: Incorporating human content selection variation in summarization evaluation. ACM Transactions on Speech and Language Processing 4(2), Article 4 (2007)
- (2007) ACM Transactions on Speech and Language Processing , vol.4 , Issue.2 , pp. 4
- Nenkova, A.¹ Passonneau, R.² McKeown, K.³

66
- 0141524308
- Bleu: A method for automatic evaluation of machine translation
- Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. IBM Research Report RC22176 (2001)
- (2001) IBM Research Report RC22176
- Papineni, K.¹ Roukos, S.² Ward, T.³ Zhu, W.J.⁴

67
- 16244364485
- Measures for the comparison of information retrieval systems
- Pollock, S.M.: Measures for the comparison of information retrieval systems. American Documentation 19(4), 387-397 (1968)
- (1968) American Documentation , vol.19 , Issue.4 , pp. 387-397
- Pollock, S.M.¹

68
- 0004217877
- 2nd edn. Butterworths
- Rijsbergen, C.J.V.: Information Retrieval, 2nd edn. Butterworths (1979)
- (1979) Information Retrieval
- Rijsbergen, C.J.V.¹

69
- 0017630891
- The probability ranking principle in IR
- Robertson, S.E.: The probability ranking principle in IR. Journal of Documentation 33, 130-137 (1977)
- (1977) Journal of Documentation , vol.33 , pp. 130-137
- Robertson, S.E.¹

70
- 34547631051
- On GMAP: And other transformations
- Robertson, S.E.: On GMAP: and other transformations. In: Proceedings of ACM CIKM 2006, pp. 78-83 (2006)
- (2006) Proceedings of ACM CIKM 2006 , pp. 78-83
- Robertson, S.E.¹

71
- 57349087085
- A new interpretation of average precision
- Robertson, S.E.: A new interpretation of average precision. In: Proceedings of ACM SIGIR 2008, pp. 689-690 (2008)
- (2008) Proceedings of ACM SIGIR 2008 , pp. 689-690
- Robertson, S.E.¹

72
- 84866617782
- On per-topic variance in IR evaluation
- Robertson, S.E., Kanoulas, E.: On per-topic variance in IR evaluation. In: Proceedings of ACM SIGIR 2012, pp. 891-900 (2012)
- (2012) Proceedings of ACM SIGIR 2012 , pp. 891-900
- Robertson, S.E.¹ Kanoulas, E.²

73
- 77956029237
- Extending average precision to graded relevance judgments
- Robertson, S.E., Kanoulas, E., Yilmaz, E.: Extending average precision to graded relevance judgments. In: Proceedings of ACM SIGIR 2010, pp. 603-610 (2010)
- (2010) Proceedings of ACM SIGIR 2010 , pp. 603-610
- Robertson, S.E.¹ Kanoulas, E.² Yilmaz, E.³

74
- 84901381577
- New performance metrics based on multigrade relevance: Their application to question answering
- Sakai, T.: New performance metrics based on multigrade relevance: Their application to question answering. In: Proceedings of NTCIR-4 (Open Submission Session) (2004)
- Proceedings of NTCIR-4 (Open Submission Session) (2004)
- Sakai, T.¹

75
- 24344471313
- Ranking the NTCIR systems based on multigrade relevance
- Myaeng, S.-H., Zhou, M., Wong, K.-F., Zhang, H.-J. (eds.) AIRS 2004. Springer, Heidelberg
- Sakai, T.: Ranking the NTCIR systems based on multigrade relevance. In: Myaeng, S.-H., Zhou, M., Wong, K.-F., Zhang, H.-J. (eds.) AIRS 2004. LNCS, vol. 3411, pp. 251-262. Springer, Heidelberg (2005)
- (2005) LNCS , vol.3411 , pp. 251-262
- Sakai, T.¹

76
- 33751354079
- Bootstrap-based comparisons of IR metrics for finding one relevant document
- Ng, H.T., Leong, M.-K., Kan, M.-Y., Ji, D. (eds.) AIRS 2006. Springer, Heidelberg
- Sakai, T.: Bootstrap-based comparisons of IR metrics for finding one relevant document. In: Ng, H.T., Leong, M.-K., Kan, M.-Y., Ji, D. (eds.) AIRS 2006. LNCS, vol. 4182, pp. 374-389. Springer, Heidelberg (2006)
- (2006) LNCS , vol.4182 , pp. 374-389
- Sakai, T.¹

77
- 33750340100
- Evaluating evaluation metrics based on the bootstrap
- Sakai, T.: Evaluating evaluation metrics based on the bootstrap. In: Proceedings of ACM SIGIR 2006, pp. 525-532 (2006)
- (2006) Proceedings of ACM SIGIR 2006 , pp. 525-532
- Sakai, T.¹

78
- 38149100610
- For building better retrieval systems: Trends in information retrieval evaluation based on graded relevance
- in Japanese
- Sakai, T.: For building better retrieval systems: Trends in information retrieval evaluation based on graded relevance (in Japanese). IPSJ Magazine 47(2), 147-158 (2006)
- (2006) IPSJ Magazine , vol.47 , Issue.2 , pp. 147-158
- Sakai, T.¹

79
- 36448993626
- Alternatives to bpref
- Sakai, T.: Alternatives to bpref. In: Proceedings of ACM SIGIR 2007, pp. 71-78 (2007)
- (2007) Proceedings of ACM SIGIR 2007 , pp. 71-78
- Sakai, T.¹

80
- 57349094546
- On penalising late arrival of relevant documents in information retrieval evaluation with graded relevance
- Sakai, T.: On penalising late arrival of relevant documents in information retrieval evaluation with graded relevance. In: Proceedings of EVIA 2007, pp. 32-43 (2007)
- (2007) Proceedings of EVIA 2007 , pp. 32-43
- Sakai, T.¹

81
- 70349242289
- Comparing metrics across TREC and NTCIR: The robustness to system bias
- Sakai, T.: Comparing metrics across TREC and NTCIR: The robustness to system bias. In: Proceedings of ACM CIKM 2008, pp. 581-590 (2008)
- (2008) Proceedings of ACM CIKM 2008 , pp. 581-590
- Sakai, T.¹

82
- 84860858401
- Evaluation with informational and navigational intents
- Sakai, T.: Evaluation with informational and navigational intents. In: Proceedings of WWW 2012, pp. 499-508 (2012)
- (2012) Proceedings of WWW 2012 , pp. 499-508
- Sakai, T.¹

83
- 84901374927
- How intuitive are diversified search metrics? Concordance test results for the diversity U-measures
- Sakai, T.: How intuitive are diversified search metrics? Concordance test results for the diversity U-measures. IPSJ SIG Technical Report 2013-IFAT-111 (2013)
- (2013) IPSJ SIG Technical Report 2013-IFAT-111
- Sakai, T.¹

84
- 84883084005
- The unreusability of diversified test collections
- Sakai, T.: The unreusability of diversified test collections. In: Proceedings of EVIA 2013 (2013)
- (2013) Proceedings of EVIA 2013
- Sakai, T.¹

85
- 84883092343
- Summaries, ranked retrieval and sessions: A unified framework for information access evaluation
- Sakai, T., Dou, Z.: Summaries, ranked retrieval and sessions: A unified framework for information access evaluation. In: Proceedings of ACM SIGIR 2013, pp. 473-482 (2013)
- (2013) Proceedings of ACM SIGIR 2013 , pp. 473-482
- Sakai, T.¹ Dou, Z.²

86
- 84883100429
- The impact of intent selection on diversified search evaluation
- Sakai, T., Dou, Z., Clarke, C.L.: The impact of intent selection on diversified search evaluation. In: Proceedings of ACM SIGIR 2013 (2013)
- (2013) Proceedings of ACM SIGIR 2013
- Sakai, T.¹ Dou, Z.² Clarke, C.L.³

87
- 84871603799
- The reusability of a diversified search test collection
- Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. Springer, Heidelberg
- Sakai, T., Dou, Z., Song, R., Kando, N.: The reusability of a diversified search test collection. In: Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. LNCS, vol. 7675, pp. 26-38. Springer, Heidelberg (2012)
- (2012) LNCS , vol.7675 , pp. 26-38
- Sakai, T.¹ Dou, Z.² Song, R.³ Kando, N.⁴

88
- 84883063122
- Summary of the NTCIR-10 INTENT-2 task: Subtopic mining and search result diversification
- Sakai, T., Dou, Z., Yamamoto, T., Liu, Y., Zhang, M., Kato, M.P., Song, R., Iwata, M.: Summary of the NTCIR-10 INTENT-2 task: Subtopic mining and search result diversification. In: Proceedings of ACM SIGIR 2013 (2013)
- (2013) Proceedings of ACM SIGIR 2013
- Sakai, T.¹ Dou, Z.² Yamamoto, T.³ Liu, Y.⁴ Zhang, M.⁵ Kato, M.P.⁶ Song, R.⁷ Iwata, M.⁸

89
- 50849122035
- On information retrieval metrics designed for evaluation with incomplete relevance assessments
- Sakai, T., Kando, N.: On information retrieval metrics designed for evaluation with incomplete relevance assessments. Information Retrieval 11, 447-470 (2008)
- (2008) Information Retrieval , vol.11 , pp. 447-470
- Sakai, T.¹ Kando, N.²

90
- 84871582413
- One click one revisited: Enhancing evaluation based on information units
- Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. Springer, Heidelberg
- Sakai, T., Kato, M.P.: One click one revisited: Enhancing evaluation based on information units. In: Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. LNCS, vol. 7675, pp. 39-51. Springer, Heidelberg (2012)
- (2012) LNCS , vol.7675 , pp. 39-51
- Sakai, T.¹ Kato, M.P.²

91
- 83055168077
- Click the search button and be happy: Evaluating direct and immediate information access
- Sakai, T., Kato, M.P., Song, Y.I.: Click the search button and be happy: Evaluating direct and immediate information access. In: Proceedings of ACM CIKM 2011, pp. 621-630 (2011)
- (2011) Proceedings of ACM CIKM 2011 , pp. 621-630
- Sakai, T.¹ Kato, M.P.² Song, Y.I.³

92
- 76349094521
- Modelling a user population for designing information retrieval metrics
- Sakai, T., Robertson, S.: Modelling a user population for designing information retrieval metrics. In: Proceedings of EVIA 2008, pp. 30-41 (2008)
- (2008) Proceedings of EVIA 2008 , pp. 30-41
- Sakai, T.¹ Robertson, S.²

93
- 78650890564
- Overview of NTCIR-8 ACLIA IR4QA
- Sakai, T., Shima, H., Kando, N., Song, R., Lin, C.J., Mitamura, T., Sugimoto, M., Lee, C.W.: Overview of NTCIR-8 ACLIA IR4QA. In: Proceedings of NTCIR-8, pp. 63-93 (2010)
- (2010) Proceedings of NTCIR-8 , pp. 63-93
- Sakai, T.¹ Shima, H.² Kando, N.³ Song, R.⁴ Lin, C.J.⁵ Mitamura, T.⁶ Sugimoto, M.⁷ Lee, C.W.⁸

94
- 80052111133
- Evaluating diversified search results using per-intent graded relevance
- Sakai, T., Song, R.: Evaluating diversified search results using per-intent graded relevance. In: Proceedings of ACM SIGIR 2011 (2011)
- (2011) Proceedings of ACM SIGIR 2011
- Sakai, T.¹ Song, R.²

95
- 84880838418
- Diversified search evaluation: Lessons from the NTCIR-9 INTENT task
- Sakai, T., Song, R.: Diversified search evaluation: Lessons from the NTCIR-9 INTENT task. Information Retrieval (2013)
- (2013) Information Retrieval
- Sakai, T.¹ Song, R.²

96
- 84901366523
- On evaluation environments for web search result diversification
- Sakai, T., Song, Y.I.: On evaluation environments for web search result diversification. In: Forum on Information Technology 2013 (2013)
- (2013) Forum on Information Technology 2013
- Sakai, T.¹ Song, Y.I.²

97
- 77954220071
- Test collection based evaluation of information retrieval systems
- Sanderson, M.: Test collection based evaluation of information retrieval systems. Foundations and Trends in Information Retrieval 4, 247-375 (2010)
- (2010) Foundations and Trends in Information Retrieval , vol.4 , pp. 247-375
- Sanderson, M.¹

98
- 77956037058
- Do user preferences and evaluation measures line up?
- Sanderson, M., Paramita, M.L., Clough, P., Kanoulas, E.: Do user preferences and evaluation measures line up? In: Proceedings of ACM SIGIR 2010, pp. 555-562 (2010)
- (2010) Proceedings of ACM SIGIR 2010 , pp. 555-562
- Sanderson, M.¹ Paramita, M.L.² Clough, P.³ Kanoulas, E.⁴

99
- 84885608872
- Information retrieval system evaluation: Effort, sensitivity, and reliability
- Sanderson, M., Zobel, J.: Information retrieval system evaluation: Effort, sensitivity, and reliability. In: Proceedings of ACM SIGIR 2005, pp. 162-169 (2005)
- (2005) Proceedings of ACM SIGIR 2005 , pp. 162-169
- Sanderson, M.¹ Zobel, J.²

100
- 0031193029
- Statistical inference in retrieval effectiveness evaluation
- Savoy, J.: Statistical inference in retrieval effectiveness evaluation. Information Processing and Management 33(4), 495-512 (1997)
- (1997) Information Processing and Management , vol.33 , Issue.4 , pp. 495-512
- Savoy, J.¹

101
- 63449088172
- A comparison of statistical significance tests for information retrieval evaluation
- Smucker, M.D., Allan, J., Carterette, B.: A comparison of statistical significance tests for information retrieval evaluation. In: Proceedings of ACM CIKM 2007, pp. 623-632 (2007)
- (2007) Proceedings of ACM CIKM 2007 , pp. 623-632
- Smucker, M.D.¹ Allan, J.² Carterette, B.³

102
- 84870917792
- Modeling user variance in time-biased gain
- Smucker, M.D., Clarke, C.L.A.: Modeling user variance in time-biased gain. In: Proceedings of ACM HCIR 2012 (2012)
- (2012) Proceedings of ACM HCIR 2012
- Smucker, M.D.¹ Clarke, C.L.A.²

103
- 84871057318
- Stochastic simulation of time-biased gain
- Smucker, M.D., Clarke, C.L.A.: Stochastic simulation of time-biased gain. In: Proceedings of ACM CIKM 2012, pp. 2040-2044 (2012)
- (2012) Proceedings of ACM CIKM 2012 , pp. 2040-2044
- Smucker, M.D.¹ Clarke, C.L.A.²

104
- 84866603223
- Time-based calibration of effectiveness measures
- Smucker, M.D., Clarke, C.L.A.: Time-based calibration of effectiveness measures. In: Proceedings of ACM SIGIR 2012, pp. 95-104 (2012)
- (2012) Proceedings of ACM SIGIR 2012 , pp. 95-104
- Smucker, M.D.¹ Clarke, C.L.A.²

105
- 72449172028
- Including summaries in system evaluation
- Turpin, A., Scholer, F., Järvelin, K., Wu, M., Culpepper, J.S.: Including summaries in system evaluation. In: Proceedings of ACM SIGIR 2009, pp. 508-515 (2009)
- (2009) Proceedings of ACM SIGIR 2009 , pp. 508-515
- Turpin, A.¹ Scholer, F.² Järvelin, K.³ Wu, M.⁴ Culpepper, J.S.⁵

106
- 8644262918
- The philosophy of information retrieval evaluation
- Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. Springer, Heidelberg
- Voorhees, E.M.: The philosophy of information retrieval evaluation. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 355-370. Springer, Heidelberg (2002)
- (2002) LNCS , vol.2406 , pp. 355-370
- Voorhees, E.M.¹

107
- 0036993119
- The effect of topic set size on retrieval experiment error
- Voorhees, E.M., Buckley, C.: The effect of topic set size on retrieval experiment error. In: Proceedings of ACM SIGIR 2002, pp. 316-323 (2002)
- (2002) Proceedings of ACM SIGIR 2002 , pp. 316-323
- Voorhees, E.M.¹ Buckley, C.²

108
- 8844267001
- The MIT Press
- Voorhees, E.M., Harman, D.K. (eds.): TREC: Experiment and Evaluation in Information Retrieval. The MIT Press (2005)
- (2005) TREC: Experiment and Evaluation in Information Retrieval
- Voorhees, E.M.¹ Harman, D.K.²

109
- 57349160444
- Score standardization for inter-collection comparison of retrieval systems
- Webber, W., Moffat, A., Zobel, J.: Score standardization for inter-collection comparison of retrieval systems. In: Proceedings of ACM SIGIR 2008, pp. 51-58 (2008)
- (2008) Proceedings of ACM SIGIR 2008 , pp. 51-58
- Webber, W.¹ Moffat, A.² Zobel, J.³

110
- 70349250276
- Statistical power in retrieval experimentation
- Webber, W., Moffat, A., Zobel, J.: Statistical power in retrieval experimentation. In: Proceedings of ACM CIKM 2008, pp. 571-580 (2008)
- (2008) Proceedings of ACM CIKM 2008 , pp. 571-580
- Webber, W.¹ Moffat, A.² Zobel, J.³

111
- 80052117619
- The effect of pooling and evaluation depth on metric stability
- Webber, W., Moffat, A., Zobel, J.: The effect of pooling and evaluation depth on metric stability. In: Proceedings of EVIA 2010, pp. 7-15 (2010)
- (2010) Proceedings of EVIA 2010 , pp. 7-15
- Webber, W.¹ Moffat, A.² Zobel, J.³

112
- 80051482719
- A similarity measure for indefinite rankings
- Webber, W., Moffat, A., Zobel, J.: A similarity measure for indefinite rankings. ACM TOIS 28(4) (2010)
- (2010) ACM TOIS , vol.28 , Issue.4
- Webber, W.¹ Moffat, A.² Zobel, J.³

113
- 72449157957
- Score adjustment for correction of pooling bias
- Webber, W., Park, L.A.: Score adjustment for correction of pooling bias. In: Proceedings of ACM SIGIR 2009, pp. 444-451 (2009)
- (2009) Proceedings of ACM SIGIR 2009 , pp. 444-451
- Webber, W.¹ Park, L.A.²

114
- 70449488015
- Modeling expected utility of multi-session information distillation
- Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. Springer, Heidelberg
- Yang, Y., Lad, A.: Modeling expected utility of multi-session information distillation. In: Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. LNCS, vol. 5766, pp. 164-175. Springer, Heidelberg (2009)
- (2009) LNCS , vol.5766 , pp. 164-175
- Yang, Y.¹ Lad, A.²

115
- 57349152359
- A new rank correlation coefficient for information retrieval
- Yilmaz, E., Aslam, J., Robertson, S.: A new rank correlation coefficient for information retrieval. In: Proceedings of ACM SIGIR 2008, pp. 587-594 (2008)
- (2008) Proceedings of ACM SIGIR 2008 , pp. 587-594
- Yilmaz, E.¹ Aslam, J.² Robertson, S.³

116
- 34547632535
- Estimating average precision with incomplete and imperfect judgments
- Yilmaz, E., Aslam, J.A.: Estimating average precision with incomplete and imperfect judgments. In: ACM CIKM 2006 Proceedings, pp. 102-111 (2006)
- (2006) ACM CIKM 2006 Proceedings , pp. 102-111
- Yilmaz, E.¹ Aslam, J.A.²

117
- 78651268113
- Expected browsing utility for web search evaluation
- Yilmaz, E., Shokouhi, M., Craswell, N., Robertson, S.: Expected browsing utility for web search evaluation. In: Proceedings of ACM CIKM 2010, pp. 1561-1564 (2010)
- (2010) Proceedings of ACM CIKM 2010 , pp. 1561-1564
- Yilmaz, E.¹ Shokouhi, M.² Craswell, N.³ Robertson, S.⁴

118
- 1542347826
- Beyond independent relevance: Methods and evaluation metrics for subtopic retrieval
- Zhai, C., Cohen, W.W., Lafferty, J.: Beyond independent relevance: Methods and evaluation metrics for subtopic retrieval. In: Proceedings of ACM SIGIR 2003, pp. 10-17 (2003)
- (2003) Proceedings of ACM SIGIR 2003 , pp. 10-17
- Zhai, C.¹ Cohen, W.W.² Lafferty, J.³

119
- 76349123386
- Click-based evidence for decaying weight distributions in search effectiveness metrics
- Zhang, Y., Park, L.A.F., Moffat, A.: Click-based evidence for decaying weight distributions in search effectiveness metrics. Information Retrieval 13(1), 46-69 (2010)
- (2010) Information Retrieval , vol.13 , Issue.1 , pp. 46-69
- Zhang, Y.¹ Park, L.A.F.² Moffat, A.³

120
- 84866618066
- Evaluating aggregated search pages
- Zhou, K., Cummins, R., Lalmas, M., Jose, J.M.: Evaluating aggregated search pages. In: Proceedings of ACM SIGIR 2012, pp. 115-124 (2012)
- (2012) Proceedings of ACM SIGIR 2012 , pp. 115-124
- Zhou, K.¹ Cummins, R.² Lalmas, M.³ Jose, J.M.⁴

121
- 0032272626
- How reliable are the results of large-scale information retrieval experiments?
- Zobel, J.: How reliable are the results of large-scale information retrieval experiments? In: Proceedings of ACM SIGIR 1998, pp. 307-314 (1998)
- (1998) Proceedings of ACM SIGIR 1998 , pp. 307-314
- Zobel, J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.