메뉴 건너뛰기




Volumn , Issue , 2014, Pages 61-70

Designing test collections for comparing many systems

Author keywords

Effect sizes; Evaluation; Evaluation measures; Power; Sample sizes; Statistical significance; Test collections; Variances

Indexed keywords

BUDGET CONTROL; COST BENEFIT ANALYSIS; DESIGN; ERROR ANALYSIS; KNOWLEDGE MANAGEMENT; SAMPLING; SEARCH ENGINES;

EID: 84925449110     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2661829.2661893     Document Type: Conference Paper
Times cited : (16)

References (27)
  • 1
    • 36448947171 scopus 로고    scopus 로고
    • Test theory for assessing IR test collections
    • D. Bodoff and P. Li. Test theory for assessing IR test collections. In Proceedings of ACM SIGIR 2007, pages 367-374, 2007.
    • (2007) Proceedings of ACM SIGIR 2007 , pp. 367-374
    • Bodoff, D.1    Li, P.2
  • 2
    • 80053037341 scopus 로고    scopus 로고
    • Model-based inference about IR systems
    • B. Carterette. Model-based inference about IR systems. In ICTIR 2011 (LNCS 6931), pages 101-112, 2011.
    • (2011) ICTIR 2011 (LNCS 6931) , pp. 101-112
    • Carterette, B.1
  • 4
    • 57349104848 scopus 로고    scopus 로고
    • Hypothesis testing with incomplete relevance judgments
    • B. Carterette and M. D. Smucker. Hypothesis testing with incomplete relevance judgments. In Proceedings of ACM CIKM 2007, pages 643-652, 2007.
    • (2007) Proceedings of ACM CIKM 2007 , pp. 643-652
    • Carterette, B.1    Smucker, M.D.2
  • 5
    • 80255123851 scopus 로고    scopus 로고
    • Intent-based diversification of web search results: Metrics and algorithms
    • O. Chapelle, S. Ji, C. Liao, E. Velipasaoglu, L. Lai, and S.-L. Wu. Intent-based diversification of web search results: Metrics and algorithms. Information Retrieval, 14(6): 572-592, 2011.
    • (2011) Information Retrieval , vol.14 , Issue.6 , pp. 572-592
    • Chapelle, O.1    Ji, S.2    Liao, C.3    Velipasaoglu, E.4    Lai, L.5    Wu, S.-L.6
  • 14
    • 19844373641 scopus 로고    scopus 로고
    • An alternative to null hypothesis significance tests
    • P. R. Killeen. An alternative to null hypothesis significance tests. Psychological Science, 16: 345-353, 2005.
    • (2005) Psychological Science , vol.16 , pp. 345-353
    • Killeen, P.R.1
  • 16
    • 84937551641 scopus 로고    scopus 로고
    • Statistical power and effect size in information retrieval experiments
    • M. J. Nelson. Statistical power and effect size in information retrieval experiments. In Proceedings of CAIS/ASCI'98, pages 393-400, 1998.
    • (1998) Proceedings of CAIS/ASCI'98 , pp. 393-400
    • Nelson, M.J.1
  • 20
    • 84925396934 scopus 로고    scopus 로고
    • Statistical reform in information retrieval?
    • T. Sakai. Statistical reform in information retrieval? SIGIR Forum, 48(1): 3-12, 2014.
    • (2014) SIGIR Forum , vol.48 , Issue.1 , pp. 3-12
    • Sakai, T.1
  • 21
    • 80052111133 scopus 로고    scopus 로고
    • Evaluating diversified search results using per-intent graded relevance
    • T. Sakai and R. Song. Evaluating diversified search results using per-intent graded relevance. In Proceedings of ACM SIGIR 2011, pages 1043-1042, 2011.
    • (2011) Proceedings of ACM SIGIR 2011 , pp. 1043-1042
    • Sakai, T.1    Song, R.2
  • 22
    • 84880838418 scopus 로고    scopus 로고
    • Diversified search evaluation: Lessons from the NTCIR-9 INTENT task
    • T. Sakai and R. Song. Diversified search evaluation: Lessons from the NTCIR-9 INTENT task. Information Retrieval, 16(4): 504-529, 2013.
    • (2013) Information Retrieval , vol.16 , Issue.4 , pp. 504-529
    • Sakai, T.1    Song, R.2
  • 24
    • 8644250683 scopus 로고    scopus 로고
    • Overview of the TREC 2003 robust retrieval track
    • E. M. Voorhees. Overview of the TREC 2003 robust retrieval track. In Proceeings of TREC 2003, 2004.
    • (2004) Proceeings of TREC 2003
    • Voorhees, E.M.1
  • 25
    • 8644250683 scopus 로고    scopus 로고
    • Overview of the TREC 2004 robust retrieval track
    • E. M. Voorhees. Overview of the TREC 2004 robust retrieval track. In Proceeings of TREC 2004, 2005.
    • (2005) Proceeings of TREC 2004
    • Voorhees, E.M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.