메뉴 건너뛰기




Volumn , Issue , 2013, Pages 925-928

A comparison of the optimality of statistical significance tests for information retrieval evaluation

Author keywords

Bootstrap; Evaluation; Permutation; Randomization; Sign test; Statistical significance; Student's t test; Wilcoxon test

Indexed keywords

BOOTSTRAP; EVALUATION; PERMUTATION; RANDOMIZATION; SIGN TEST; STATISTICAL SIGNIFICANCE; STUDENT'S T TESTS; WILCOXON TEST;

EID: 84883083423     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2484028.2484163     Document Type: Conference Paper
Times cited : (32)

References (7)
  • 1
    • 36448963050 scopus 로고    scopus 로고
    • Validity and power of t-test for comparing MAP and GMAP
    • G. V. Cormack and T. R. Lynam. Validity and Power of t-test for Comparing MAP and GMAP. In ACM SIGIR, pages 753-754, 2007.
    • (2007) ACM SIGIR , pp. 753-754
    • Cormack, G.V.1    Lynam, T.R.2
  • 2
    • 33750340100 scopus 로고    scopus 로고
    • Evaluating evaluation metrics based on the bootstrap
    • T. Sakai. Evaluating Evaluation Metrics Based on the Bootstrap. In ACM SIGIR, pages 525-532, 2006.
    • (2006) ACM SIGIR , pp. 525-532
    • Sakai, T.1
  • 3
    • 84885608872 scopus 로고    scopus 로고
    • Information retrieval system evaluation: Effort, sensitivity, and reliability
    • M. Sanderson and J. Zobel. Information Retrieval System Evaluation: Effort, Sensitivity, and Reliability. In ACM SIGIR, pages 162-169, 2005.
    • (2005) ACM SIGIR , pp. 162-169
    • Sanderson, M.1    Zobel, J.2
  • 4
    • 63449088172 scopus 로고    scopus 로고
    • A comparison of statistical significance tests for information retrieval evaluation
    • M. D. Smucker, J. Allan, and B. Carterette. A Comparison of Statistical Significance Tests for Information Retrieval Evaluation. In ACM CIKM, pages 623-632, 2007.
    • (2007) ACM CIKM , pp. 623-632
    • Smucker, M.D.1    Allan, J.2    Carterette, B.3
  • 5
    • 72449159192 scopus 로고    scopus 로고
    • Agreement among statistical significance tests for information retrieval evaluation at varying sample sizes
    • M. D. Smucker, J. Allan, and B. Carterette. Agreement Among Statistical Significance Tests for Information Retrieval Evaluation at Varying Sample Sizes. In ACM SIGIR, pages 630-631, 2009.
    • (2009) ACM SIGIR , pp. 630-631
    • Smucker, M.D.1    Allan, J.2    Carterette, B.3
  • 6
    • 72449211066 scopus 로고    scopus 로고
    • Topic set size redux
    • E. M. Voorhees. Topic Set Size Redux. In ACM SIGIR, pages 806-807, 2009.
    • (2009) ACM SIGIR , pp. 806-807
    • Voorhees, E.M.1
  • 7
    • 0032272626 scopus 로고    scopus 로고
    • How reliable are the results of large-scale information retrieval experiments?
    • J. Zobel. How Reliable are the Results of Large-Scale Information Retrieval Experiments? In ACM SIGIR, pages 307-314, 1998.
    • (1998) ACM SIGIR , pp. 307-314
    • Zobel, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.