메뉴 건너뛰기




Volumn , Issue , 2008, Pages 581-590

Comparing metrics across TREC and NTCIR: The robustness to system bias

Author keywords

Evaluation metrics; Graded relevance; Test collection

Indexed keywords

EVALUATION METRICS; GRADED RELEVANCE; NEW SYSTEM; PAIRWISE STATISTICAL SIGNIFICANCE; Q-MEASURES; RANDOM SAMPLE; SYSTEM BIAS; TEST COLLECTION; UNBIASED CONDITIONS;

EID: 70349242289     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1458082.1458159     Document Type: Conference Paper
Times cited : (29)

References (31)
  • 1
    • 35548951792 scopus 로고    scopus 로고
    • Evaluation of Retrieval Effectiveness with Incomplete Relevance Data: Theoretical and Experimental Comparison of Three Measures
    • Ahlgren, P. and Gröonqvist, L.: Evaluation of Retrieval Effectiveness with Incomplete Relevance Data: Theoretical and Experimental Comparison of Three Measures, Information Processing and Management, Volume 44, pp. 212-225, 2008.
    • (2008) Information Processing and Management , vol.44 , pp. 212-225
    • Ahlgren, P.1    Gröonqvist, L.2
  • 2
    • 63449125656 scopus 로고    scopus 로고
    • Inferring Document Relevance from Incomplete Information
    • Aslam, J. A. and Yilmaz, E.: Inferring Document Relevance from Incomplete Information, ACM CIKM 2007 Proceedings, pp. 633-642, 2007.
    • (2007) ACM CIKM 2007 Proceedings , pp. 633-642
    • Aslam, J.A.1    Yilmaz, E.2
  • 4
    • 36448951542 scopus 로고    scopus 로고
    • On the Robustness of Relevance Measures with Incomplete Judgments
    • Bompada, T. et al.: On the Robustness of Relevance Measures with Incomplete Judgments, ACM SIGIR 2007 Proceedings, pp. 359-366, 2007.
    • (2007) ACM SIGIR 2007 Proceedings , pp. 359-366
    • Bompada, T.1
  • 6
    • 8644251996 scopus 로고    scopus 로고
    • Retrieval Evaluation with Incomplete Information
    • Buckley, C. and Voorhees, E. M.: Retrieval Evaluation with Incomplete Information, ACM SIGIR 2004 Proceedings, pp. 25-32, 2004.
    • (2004) ACM SIGIR 2004 Proceedings , pp. 25-32
    • Buckley, C.1    Voorhees, E.M.2
  • 7
    • 35548987507 scopus 로고    scopus 로고
    • Bias and the Limits of Pooling for Large Collections
    • Buckley, C. et al.: Bias and the Limits of Pooling for Large Collections, Information Retrieval, Vol. 10, Number 6, pp. 491-508, 2007.
    • (2007) Information Retrieval , vol.10 , Issue.6 , pp. 491-508
    • Buckley, C.1
  • 8
    • 31844446958 scopus 로고    scopus 로고
    • Learning to Rank using Gradient Descent
    • Burges, C. et al.: Learning to Rank using Gradient Descent, ACM ICML 2005 Proceedings, pp. 89-96, 2005.
    • (2005) ACM ICML 2005 Proceedings , pp. 89-96
    • Burges, C.1
  • 9
    • 36448986732 scopus 로고    scopus 로고
    • Reliable Information Retrieval Evaluation with Incomplete and Biased Judgements
    • Büttcher et al.: Reliable Information Retrieval Evaluation with Incomplete and Biased Judgements, ACM SIGIR 2007 Proceedings., pp. 63-70, 2007.
    • (2007) ACM SIGIR 2007 Proceedings , pp. 63-70
    • Büttcher1
  • 10
    • 36448969717 scopus 로고    scopus 로고
    • Robust Test Collections for Retrieval Evaluation
    • Carterette, B.: Robust Test Collections for Retrieval Evaluation, ACM SIGIR 2007 Proceedings, pp. 55-62, 2007.
    • (2007) ACM SIGIR 2007 Proceedings , pp. 55-62
    • Carterette, B.1
  • 11
    • 57349133736 scopus 로고    scopus 로고
    • Evaluation Over Thousands of Queries
    • Carterette, B. et al.: Evaluation Over Thousands of Queries, ACM SIGIR 2008 Proceedings, pp. 651-658, 2008.
    • (2008) ACM SIGIR 2008 Proceedings , pp. 651-658
    • Carterette, B.1
  • 12
    • 1842637192 scopus 로고    scopus 로고
    • Cumulated Gain-Based Evaluation of IR Techniques
    • Järvelin, K. and Kekäläinen, J.: Cumulated Gain-Based Evaluation of IR Techniques, ACM TOIS, Vol. 20, No. 4, pp. 422-446, 2002.
    • (2002) ACM TOIS , vol.20 , Issue.4 , pp. 422-446
    • Järvelin, K.1    Kekäläinen, J.2
  • 13
    • 70349245905 scopus 로고    scopus 로고
    • Overview of the Sixth NTCIR Workshop
    • Kando, N.: Overview of the Sixth NTCIR Workshop, NTCIR-6 Proceedings, pp. i-ix, 2007.
    • (2007) NTCIR-6 Proceedings
    • Kando, N.1
  • 14
    • 70349255143 scopus 로고    scopus 로고
    • Rank-Biased Precision for Measurement of Retrieval Effectiveness
    • to appear
    • Moffat, A. and Zobel, J.: Rank-Biased Precision for Measurement of Retrieval Effectiveness, ACM TOIS, to appear, 2008.
    • (2008) ACM TOIS
    • Moffat, A.1    Zobel, J.2
  • 15
    • 57349087085 scopus 로고    scopus 로고
    • A New Interpretation of Average Precision
    • Robertson, S.: A New Interpretation of Average Precision, ACM SIGIR 2008 Proceedings, pp. 689-690, 2008.
    • (2008) ACM SIGIR 2008 Proceedings , pp. 689-690
    • Robertson, S.1
  • 16
    • 0034795978 scopus 로고    scopus 로고
    • Generic Summaries for Indexing in Information Retrieval
    • Sakai, T. and Sparck Jones, K.: Generic Summaries for Indexing in Information Retrieval, ACM SIGIR 2001 Proceedings, pp.190-198, 2001.
    • (2001) ACM SIGIR 2001 Proceedings , pp. 190-198
    • Sakai, T.1    Sparck Jones, K.2
  • 17
    • 33750340100 scopus 로고    scopus 로고
    • Evaluating Evaluation Metrics based on the Bootstrap
    • Sakai, T.: Evaluating Evaluation Metrics based on the Bootstrap, ACM SIGIR 2006 Proceedings, pp. 525-532, 2006.
    • (2006) ACM SIGIR 2006 Proceedings , pp. 525-532
    • Sakai, T.1
  • 18
    • 33750437740 scopus 로고    scopus 로고
    • On the Reliability of Information Retrieval Metrics based on Graded Relevance
    • Sakai, T.: On the Reliability of Information Retrieval Metrics based on Graded Relevance, Information Processing and Management, 43(2), pp. 531-548, 2007.
    • (2007) Information Processing and Management , vol.43 , Issue.2 , pp. 531-548
    • Sakai, T.1
  • 19
    • 70349231816 scopus 로고    scopus 로고
    • Sakai, T.: On Penalising Late Arrival of Relevant Documents in Information Retrieval Evaluation with Graded Relevance, Proceedings of the First International Workshop on Evaluating Information Acess (EVIA 2007), pp. 32-43, 2007. http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings6/ EVIA/1.pdf
    • Sakai, T.: On Penalising Late Arrival of Relevant Documents in Information Retrieval Evaluation with Graded Relevance, Proceedings of the First International Workshop on Evaluating Information Acess (EVIA 2007), pp. 32-43, 2007. http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings6/ EVIA/1.pdf
  • 21
    • 70349231817 scopus 로고    scopus 로고
    • IPSJ Transactions on Databases, Vol.48, No.SIG 9 (TOD35), pp.11-28, 2007. Also available in IPSJ Digital Courier
    • Sakai, T.: Evaluating Information Retrieval Metrics based on Bootstrap Hypothesis Tests, IPSJ Transactions on Databases, Vol.48, No.SIG 9 (TOD35), pp.11-28, 2007. Also available in IPSJ Digital Courier, Vol.3, pp.625-642, 2007. http://www.jstage.jst.go.jp/article/ipsjdc/3/0/625/-pdf
    • (2007) , vol.3 , pp. 625-642
    • Sakai, T.1
  • 22
    • 50849122035 scopus 로고    scopus 로고
    • On Information Retrieval Metrics Designed for Evaluation with Incomplete Relevance Assessments
    • open access
    • Sakai, T. and Kando, N.: On Information Retrieval Metrics Designed for Evaluation with Incomplete Relevance Assessments, Information Retrieval, 2008. http://www.springerlink.com/content/k41j1152140326l4/fulltext.pdf (open access)
    • (2008) Information Retrieval
    • Sakai, T.1    Kando, N.2
  • 23
    • 57349141449 scopus 로고    scopus 로고
    • Comparing Metrics across TREC and NTCIR: The Robustness to Pool Depth Bias
    • Sakai, T.: Comparing Metrics across TREC and NTCIR: The Robustness to Pool Depth Bias, ACM SIGIR 2008, pp. 691-692, 2008.
    • (2008) ACM SIGIR 2008 , pp. 691-692
    • Sakai, T.1
  • 24
    • 8644220612 scopus 로고    scopus 로고
    • Forming Test Collections with No System Pooling
    • Sanderson, M. and Joho, H.: Forming Test Collections with No System Pooling, ACM SIGIR 2004 Proceedings, pp. 33-40, 2004.
    • (2004) ACM SIGIR 2004 Proceedings , pp. 33-40
    • Sanderson, M.1    Joho, H.2
  • 25
    • 0036989640 scopus 로고    scopus 로고
    • Liberal Relevance Criteria of TREC - Counting on Negligible Documents?
    • Sormunen, E.: Liberal Relevance Criteria of TREC - Counting on Negligible Documents? ACM SIGIR 2002 Proceedings, pp. 324-330, 2002.
    • (2002) ACM SIGIR 2002 Proceedings , pp. 324-330
    • Sormunen, E.1
  • 26
    • 8644262918 scopus 로고    scopus 로고
    • The Philosophy of Information Retrieval Evaluation
    • CLEF 2001 Proceedings
    • Voorhees, E. M.: The Philosophy of Information Retrieval Evaluation, CLEF 2001 Proceedings, LNCS 2406, pp. 355-370, 2002.
    • (2002) LNCS , vol.2406 , pp. 355-370
    • Voorhees, E.M.1
  • 27
    • 24644514267 scopus 로고    scopus 로고
    • Overview of the TREC 2003 Robust Retrieval Track
    • Voorhees, E. M.: Overview of the TREC 2003 Robust Retrieval Track, TREC 2003 Proceedings, 2004.
    • (2004) TREC 2003 Proceedings
    • Voorhees, E.M.1
  • 28
    • 8644250683 scopus 로고    scopus 로고
    • Overview of the TREC 2004 Robust Retrieval Track
    • Voorhees, E. M.: Overview of the TREC 2004 Robust Retrieval Track, TREC 2004 Proceedings, 2005.
    • (2005) TREC 2004 Proceedings
    • Voorhees, E.M.1
  • 30
    • 34547632535 scopus 로고    scopus 로고
    • Estimating Average Precision with Incomplete and Imperfect Judgments
    • Yilmaz, E. and Aslam, J. A.: Estimating Average Precision with Incomplete and Imperfect Judgments, CIKM 2006 Proceedings, 2006.
    • (2006) CIKM 2006 Proceedings
    • Yilmaz, E.1    Aslam, J.A.2
  • 31
    • 0032272626 scopus 로고    scopus 로고
    • How Reliable are the Results of Large-Scale Information Retrieval Experiments?
    • Zobel, J.: How Reliable are the Results of Large-Scale Information Retrieval Experiments? ACM SIGIR '98 Proceedings, pp. 307-314, 1998.
    • (1998) ACM SIGIR '98 Proceedings , pp. 307-314
    • Zobel, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.