SCOPUS 정보 검색 플랫폼

International Conference on Information and Knowledge Management, Proceedings

Volumn , Issue , 2008, Pages 581-590

Comparing metrics across TREC and NTCIR: The robustness to system bias

(1) Sakai, Tetsuya a

a NewsWatch Inc (Japan)

Author keywords

Evaluation metrics; Graded relevance; Test collection

Indexed keywords

EVALUATION METRICS; GRADED RELEVANCE; NEW SYSTEM; PAIRWISE STATISTICAL SIGNIFICANCE; Q-MEASURES; RANDOM SAMPLE; SYSTEM BIAS; TEST COLLECTION; UNBIASED CONDITIONS;

KNOWLEDGE MANAGEMENT;

METRIC SYSTEM;

EID: 70349242289 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1458082.1458159 Document Type: Conference Paper

Times cited : (29)

References (31)

1
- 35548951792
- Evaluation of Retrieval Effectiveness with Incomplete Relevance Data: Theoretical and Experimental Comparison of Three Measures
- Ahlgren, P. and Gröonqvist, L.: Evaluation of Retrieval Effectiveness with Incomplete Relevance Data: Theoretical and Experimental Comparison of Three Measures, Information Processing and Management, Volume 44, pp. 212-225, 2008.
- (2008) Information Processing and Management , vol.44 , pp. 212-225
- Ahlgren, P.¹ Gröonqvist, L.²

2
- 63449125656
- Inferring Document Relevance from Incomplete Information
- Aslam, J. A. and Yilmaz, E.: Inferring Document Relevance from Incomplete Information, ACM CIKM 2007 Proceedings, pp. 633-642, 2007.
- (2007) ACM CIKM 2007 Proceedings , pp. 633-642
- Aslam, J.A.¹ Yilmaz, E.²

3
- 39649091430
- Evaluating Epistemic Uncertainty under Incomplete Assessments
- Baillie, M., Azzopardi, L. and Ruthven, I.: Evaluating Epistemic Uncertainty under Incomplete Assessments, Information Processing and Management, 44(2), pp. 811-837, 2008.
- (2008) Information Processing and Management , vol.44 , Issue.2 , pp. 811-837
- Baillie, M.¹ Azzopardi, L.² Ruthven, I.³

4
- 36448951542
- On the Robustness of Relevance Measures with Incomplete Judgments
- Bompada, T. et al.: On the Robustness of Relevance Measures with Incomplete Judgments, ACM SIGIR 2007 Proceedings, pp. 359-366, 2007.
- (2007) ACM SIGIR 2007 Proceedings , pp. 359-366
- Bompada, T.¹

5
- 0033650323
- Evaluating Evaluation Measure Stability
- Buckley, C. and Voorhees, E. M.: Evaluating Evaluation Measure Stability, ACM SIGIR 2000 Proceedings, pp. 33-40, 2000.
- (2000) ACM SIGIR 2000 Proceedings , pp. 33-40
- Buckley, C.¹ Voorhees, E.M.²

6
- 8644251996
- Retrieval Evaluation with Incomplete Information
- Buckley, C. and Voorhees, E. M.: Retrieval Evaluation with Incomplete Information, ACM SIGIR 2004 Proceedings, pp. 25-32, 2004.
- (2004) ACM SIGIR 2004 Proceedings , pp. 25-32
- Buckley, C.¹ Voorhees, E.M.²

7
- 35548987507
- Bias and the Limits of Pooling for Large Collections
- Buckley, C. et al.: Bias and the Limits of Pooling for Large Collections, Information Retrieval, Vol. 10, Number 6, pp. 491-508, 2007.
- (2007) Information Retrieval , vol.10 , Issue.6 , pp. 491-508
- Buckley, C.¹

8
- 31844446958
- Learning to Rank using Gradient Descent
- Burges, C. et al.: Learning to Rank using Gradient Descent, ACM ICML 2005 Proceedings, pp. 89-96, 2005.
- (2005) ACM ICML 2005 Proceedings , pp. 89-96
- Burges, C.¹

9
- 36448986732
- Reliable Information Retrieval Evaluation with Incomplete and Biased Judgements
- Büttcher et al.: Reliable Information Retrieval Evaluation with Incomplete and Biased Judgements, ACM SIGIR 2007 Proceedings., pp. 63-70, 2007.
- (2007) ACM SIGIR 2007 Proceedings , pp. 63-70
- Büttcher¹

10
- 36448969717
- Robust Test Collections for Retrieval Evaluation
- Carterette, B.: Robust Test Collections for Retrieval Evaluation, ACM SIGIR 2007 Proceedings, pp. 55-62, 2007.
- (2007) ACM SIGIR 2007 Proceedings , pp. 55-62
- Carterette, B.¹

11
- 57349133736
- Evaluation Over Thousands of Queries
- Carterette, B. et al.: Evaluation Over Thousands of Queries, ACM SIGIR 2008 Proceedings, pp. 651-658, 2008.
- (2008) ACM SIGIR 2008 Proceedings , pp. 651-658
- Carterette, B.¹

12
- 1842637192
- Cumulated Gain-Based Evaluation of IR Techniques
- Järvelin, K. and Kekäläinen, J.: Cumulated Gain-Based Evaluation of IR Techniques, ACM TOIS, Vol. 20, No. 4, pp. 422-446, 2002.
- (2002) ACM TOIS , vol.20 , Issue.4 , pp. 422-446
- Järvelin, K.¹ Kekäläinen, J.²

13
- 70349245905
- Overview of the Sixth NTCIR Workshop
- Kando, N.: Overview of the Sixth NTCIR Workshop, NTCIR-6 Proceedings, pp. i-ix, 2007.
- (2007) NTCIR-6 Proceedings
- Kando, N.¹

14
- 70349255143
- Rank-Biased Precision for Measurement of Retrieval Effectiveness
- to appear
- Moffat, A. and Zobel, J.: Rank-Biased Precision for Measurement of Retrieval Effectiveness, ACM TOIS, to appear, 2008.
- (2008) ACM TOIS
- Moffat, A.¹ Zobel, J.²

15
- 57349087085
- A New Interpretation of Average Precision
- Robertson, S.: A New Interpretation of Average Precision, ACM SIGIR 2008 Proceedings, pp. 689-690, 2008.
- (2008) ACM SIGIR 2008 Proceedings , pp. 689-690
- Robertson, S.¹

16
- 0034795978
- Generic Summaries for Indexing in Information Retrieval
- Sakai, T. and Sparck Jones, K.: Generic Summaries for Indexing in Information Retrieval, ACM SIGIR 2001 Proceedings, pp.190-198, 2001.
- (2001) ACM SIGIR 2001 Proceedings , pp. 190-198
- Sakai, T.¹ Sparck Jones, K.²

17
- 33750340100
- Evaluating Evaluation Metrics based on the Bootstrap
- Sakai, T.: Evaluating Evaluation Metrics based on the Bootstrap, ACM SIGIR 2006 Proceedings, pp. 525-532, 2006.
- (2006) ACM SIGIR 2006 Proceedings , pp. 525-532
- Sakai, T.¹

18
- 33750437740
- On the Reliability of Information Retrieval Metrics based on Graded Relevance
- Sakai, T.: On the Reliability of Information Retrieval Metrics based on Graded Relevance, Information Processing and Management, 43(2), pp. 531-548, 2007.
- (2007) Information Processing and Management , vol.43 , Issue.2 , pp. 531-548
- Sakai, T.¹

19
- 70349231816
- Sakai, T.: On Penalising Late Arrival of Relevant Documents in Information Retrieval Evaluation with Graded Relevance, Proceedings of the First International Workshop on Evaluating Information Acess (EVIA 2007), pp. 32-43, 2007. http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings6/ EVIA/1.pdf
- Sakai, T.: On Penalising Late Arrival of Relevant Documents in Information Retrieval Evaluation with Graded Relevance, Proceedings of the First International Workshop on Evaluating Information Acess (EVIA 2007), pp. 32-43, 2007. http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings6/ EVIA/1.pdf

20
- 36448993626
- Alternatives to Bpref
- Sakai, T.: Alternatives to Bpref, ACM SIGIR 2007 Proceedings, pp. 71-78, 2007.
- (2007) ACM SIGIR 2007 Proceedings , pp. 71-78
- Sakai, T.¹

21
- 70349231817
- IPSJ Transactions on Databases, Vol.48, No.SIG 9 (TOD35), pp.11-28, 2007. Also available in IPSJ Digital Courier
- Sakai, T.: Evaluating Information Retrieval Metrics based on Bootstrap Hypothesis Tests, IPSJ Transactions on Databases, Vol.48, No.SIG 9 (TOD35), pp.11-28, 2007. Also available in IPSJ Digital Courier, Vol.3, pp.625-642, 2007. http://www.jstage.jst.go.jp/article/ipsjdc/3/0/625/-pdf
- (2007) , vol.3 , pp. 625-642
- Sakai, T.¹

22
- 50849122035
- On Information Retrieval Metrics Designed for Evaluation with Incomplete Relevance Assessments
- open access
- Sakai, T. and Kando, N.: On Information Retrieval Metrics Designed for Evaluation with Incomplete Relevance Assessments, Information Retrieval, 2008. http://www.springerlink.com/content/k41j1152140326l4/fulltext.pdf (open access)
- (2008) Information Retrieval
- Sakai, T.¹ Kando, N.²

23
- 57349141449
- Comparing Metrics across TREC and NTCIR: The Robustness to Pool Depth Bias
- Sakai, T.: Comparing Metrics across TREC and NTCIR: The Robustness to Pool Depth Bias, ACM SIGIR 2008, pp. 691-692, 2008.
- (2008) ACM SIGIR 2008 , pp. 691-692
- Sakai, T.¹

24
- 8644220612
- Forming Test Collections with No System Pooling
- Sanderson, M. and Joho, H.: Forming Test Collections with No System Pooling, ACM SIGIR 2004 Proceedings, pp. 33-40, 2004.
- (2004) ACM SIGIR 2004 Proceedings , pp. 33-40
- Sanderson, M.¹ Joho, H.²

25
- 0036989640
- Liberal Relevance Criteria of TREC - Counting on Negligible Documents?
- Sormunen, E.: Liberal Relevance Criteria of TREC - Counting on Negligible Documents? ACM SIGIR 2002 Proceedings, pp. 324-330, 2002.
- (2002) ACM SIGIR 2002 Proceedings , pp. 324-330
- Sormunen, E.¹

26
- 8644262918
- The Philosophy of Information Retrieval Evaluation
- CLEF 2001 Proceedings
- Voorhees, E. M.: The Philosophy of Information Retrieval Evaluation, CLEF 2001 Proceedings, LNCS 2406, pp. 355-370, 2002.
- (2002) LNCS , vol.2406 , pp. 355-370
- Voorhees, E.M.¹

27
- 24644514267
- Overview of the TREC 2003 Robust Retrieval Track
- Voorhees, E. M.: Overview of the TREC 2003 Robust Retrieval Track, TREC 2003 Proceedings, 2004.
- (2004) TREC 2003 Proceedings
- Voorhees, E.M.¹

28
- 8644250683
- Overview of the TREC 2004 Robust Retrieval Track
- Voorhees, E. M.: Overview of the TREC 2004 Robust Retrieval Track, TREC 2004 Proceedings, 2005.
- (2005) TREC 2004 Proceedings
- Voorhees, E.M.¹

29
- 57349093460
- Precision-At-Ten Considered Redundant
- Webber, W., Moffat, A., Zobel, J. and Sakai, T.: Precision-At-Ten Considered Redundant, ACM SIGIR 2008 Proceedings, pp. 695-696, 2008.
- (2008) ACM SIGIR 2008 Proceedings , pp. 695-696
- Webber, W.¹ Moffat, A.² Zobel, J.³ Sakai, T.⁴

30
- 34547632535
- Estimating Average Precision with Incomplete and Imperfect Judgments
- Yilmaz, E. and Aslam, J. A.: Estimating Average Precision with Incomplete and Imperfect Judgments, CIKM 2006 Proceedings, 2006.
- (2006) CIKM 2006 Proceedings
- Yilmaz, E.¹ Aslam, J.A.²

31
- 0032272626
- How Reliable are the Results of Large-Scale Information Retrieval Experiments?
- Zobel, J.: How Reliable are the Results of Large-Scale Information Retrieval Experiments? ACM SIGIR '98 Proceedings, pp. 307-314, 1998.
- (1998) ACM SIGIR '98 Proceedings , pp. 307-314
- Zobel, J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.