메뉴 건너뛰기




Volumn , Issue , 2007, Pages 623-632

A comparison of statistical significance tests for information retrieval evaluation

Author keywords

Bootstrap; Hypothesis test; Permutation; Randomization; Sign; Statistical significance; Student's t test; Wilcoxon

Indexed keywords

BOOTSTRAP; HYPOTHESIS TEST; PERMUTATION; RANDOMIZATION; SIGN; STATISTICAL SIGNIFICANCE; WILCOXON;

EID: 63449088172     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1321440.1321528     Document Type: Conference Paper
Times cited : (642)

References (22)
  • 3
    • 63449098297 scopus 로고    scopus 로고
    • C. Buckley. trec-eval. http://trec.nist.gov/trec-eval/trec-eval.8.0.tar. gz.
    • C. Buckley. trec-eval. http://trec.nist.gov/trec-eval/trec-eval.8.0.tar. gz.
  • 5
    • 36448963050 scopus 로고    scopus 로고
    • Validity and power of t-test for comparing map and gmap
    • ACM Press
    • G. Cormack and T. Lynam. Validity and power of t-test for comparing map and gmap. In SIGIR '07. ACM Press, 2007.
    • (2007) SIGIR '07
    • Cormack, G.1    Lynam, T.2
  • 6
    • 33750336173 scopus 로고    scopus 로고
    • Statistical precision of information retrieval evaluation
    • ACM Press
    • G. V. Cormack and T. R. Lynam. Statistical precision of information retrieval evaluation. In SIGIR '06, pages 533-540. ACM Press, 2006.
    • (2006) SIGIR '06 , pp. 533-540
    • Cormack, G.V.1    Lynam, T.R.2
  • 10
    • 0027725490 scopus 로고
    • Using statistical testing in the evaluation of retrieval experiments
    • New York, NY, USA, ACM Press
    • D. Hull. Using statistical testing in the evaluation of retrieval experiments. In SIGIR '93, pages 329 - 338, New York, NY, USA, 1993. ACM Press.
    • (1993) SIGIR '93 , pp. 329-338
    • Hull, D.1
  • 11
    • 0001251775 scopus 로고
    • The behavior of some significance tests under experimental randomization
    • August
    • O. Kempthorne and T. E. Doerfler. The behavior of some significance tests under experimental randomization. Biometrika, 56(2):231 - 248, August 1969.
    • (1969) Biometrika , vol.56 , Issue.2 , pp. 231-248
    • Kempthorne, O.1    Doerfler, T.E.2
  • 12
    • 0031599142 scopus 로고    scopus 로고
    • Mersenne twister: A 623-dimensionally equidistributed uniform pseudo-random number generator
    • M. Matsumoto and T. Nishimura. Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator. ACM Trans. Model. Comput. Simul., 8(1):3 - 30, 1998.
    • (1998) ACM Trans. Model. Comput. Simul , vol.8 , Issue.1 , pp. 3-30
    • Matsumoto, M.1    Nishimura, T.2
  • 15
    • 1842607847 scopus 로고    scopus 로고
    • R Development Core Team, R Foundation for Statistical Computing, Vienna, Austria
    • R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, 2004. 3-900051-07-0.
    • (2004) R: A language and environment for statistical computing
  • 16
    • 33750340100 scopus 로고    scopus 로고
    • Evaluating evaluation metrics based on the bootstrap
    • ACM Press
    • T. Sakai. Evaluating evaluation metrics based on the bootstrap. In SIGIR '06, pages 525 - 532. ACM Press, 2006.
    • (2006) SIGIR '06 , pp. 525-532
    • Sakai, T.1
  • 17
    • 84885608872 scopus 로고    scopus 로고
    • Information retrieval system evaluation: Effort, sensitivity, and reliability
    • ACM Press
    • M. Sanderson and J. Zobel. Information retrieval system evaluation: effort, sensitivity, and reliability. In SIGIR '05, pages 162 - 169. ACM Press, 2005.
    • (2005) SIGIR '05 , pp. 162-169
    • Sanderson, M.1    Zobel, J.2
  • 18
    • 0031193029 scopus 로고    scopus 로고
    • Statistical inference in retrieval effectiveness evaluation
    • J. Savoy. Statistical inference in retrieval effectiveness evaluation. IPM, 33(4):495 - 512, 1997.
    • (1997) IPM , vol.33 , Issue.4 , pp. 495-512
    • Savoy, J.1
  • 19
    • 0004217877 scopus 로고
    • Butterworths, second edition
    • C. J. van Rijsbergen. Information Retrieval. Butterworths, second edition, 1979. http://www.dcs.gla.ac.uk/Keith/Preface.html.
    • (1979) Information Retrieval
    • van Rijsbergen, C.J.1
  • 20
    • 63449114476 scopus 로고    scopus 로고
    • E. M. Voorhees and D. K. Harman, editors, MIT Press
    • E. M. Voorhees and D. K. Harman, editors. TREC. MIT Press, 2005.
    • (2005) TREC
  • 21
    • 0027927918 scopus 로고
    • Non-parametric significance tests of retrieval performance comparisons
    • W. J. Wilbur. Non-parametric significance tests of retrieval performance comparisons. J. Inf. Sci., 20(4):270 - 284, 1994.
    • (1994) J. Inf. Sci , vol.20 , Issue.4 , pp. 270-284
    • Wilbur, W.J.1
  • 22
    • 0001884644 scopus 로고
    • Individual comparisons by ranking methods
    • December
    • F. Wilcoxon. Individual comparisons by ranking methods. Biometrics Bulletin, 1(6):80 - 83, December 1945.
    • (1945) Biometrics Bulletin , vol.1 , Issue.6 , pp. 80-83
    • Wilcoxon, F.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.