메뉴 건너뛰기




Volumn , Issue , 2010, Pages 539-546

The effect of assessor errors on IR system evaluation

Author keywords

Assessor error; Retrieval test collections

Indexed keywords

AMAZON'S MECHANICAL TURKS; ASSESSOR ERROR; CRANFIELD; RELEVANCE JUDGMENT; SCALING-UP; SYSTEM EVALUATION; SYSTEM RANKINGS; TEST COLLECTION;

EID: 77956024152     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1835449.1835540     Document Type: Conference Paper
Times cited : (80)

References (17)
  • 2
    • 72449173733 scopus 로고    scopus 로고
    • A practical sampling strategy for efficient retrieval evaluat ion
    • Javed A. Aslam and Virgil Pavlu. A practical sampling strategy for efficient retrieval evaluat ion, technical report.
    • Technical Report
    • Aslam, J.A.1    Pavlu, V.2
  • 3
    • 33750288965 scopus 로고    scopus 로고
    • A statistical method for system evaluation using incomplete judgments
    • Javed A. Aslam, Virgil Pavlu, and Emine Yilmaz. A statistical method for system evaluation using incomplete judgments. In Proceedings of SIGIR, pages 541-548, 2006.
    • (2006) Proceedings of SIGIR , pp. 541-548
    • Aslam, J.A.1    Pavlu, V.2    Yilmaz, E.3
  • 5
    • 77956049388 scopus 로고    scopus 로고
    • Robust evaluation of information retrieval systems
    • Ben Carterette. Robust evaluation of information retrieval systems. In Proceedings of SIGIR, 2007.
    • (2007) Proceedings of SIGIR
    • Carterette, B.1
  • 6
    • 33750359727 scopus 로고    scopus 로고
    • Sitaraman. Minimal test collections for retrieval evaluation
    • Ben Carterette, James Allan, and Ramesh K. Sitaraman. Minimal test collections for retrieval evaluation. In Proceedings of SIGIR, pages 268-275, 2006.
    • (2006) Proceedings of SIGIR , pp. 268-275
    • Carterette, B.1    Allan, J.2    Ramesh, K.3
  • 8
    • 0032259402 scopus 로고    scopus 로고
    • Efficient construction of large test collections
    • Gordon V. Cormack, Christopher R. Palmer, and Charles L.A. Clarke. Efficient construction of large test collections. In Proceedings of SIGIR, pages 282-289, 1998.
    • (1998) Proceedings of SIGIR , pp. 282-289
    • Cormack, G.V.1    Palmer, C.R.2    Clarke, C.L.A.3
  • 9
    • 0002565067 scopus 로고
    • Overview of the fourth text REtrieval conference
    • NIST Special Publication 500-236
    • Donna Harman. Overview of the fourth Text REtrieval Conference. In Proceedings of the Fourth Text REtrieval Conference (TREC-4), pages 1-24, 1995. NIST Special Publication 500-236.
    • (1995) Proceedings of the Fourth Text REtrieval Conference (TREC-4) , pp. 1-24
    • Harman, D.1
  • 10
    • 67650075369 scopus 로고    scopus 로고
    • How evaluator domain expertise affects search result relevance judgments
    • Kenneth A. Kinney, Scott Huffman, and Juting Zhai. How evaluator domain expertise affects search result relevance judgments. In Proceedings of CIKM, pages 591-598, 2008.
    • (2008) Proceedings of CIKM , pp. 591-598
    • Kinney, K.A.1    Huffman, S.2    Zhai, J.3
  • 11
    • 84885608872 scopus 로고    scopus 로고
    • Information retrieval system evaluation: Effort, sensitivity, and reliability
    • Mark Sanderson and Justin Zobel. Information retrieval system evaluation: Effort, sensitivity, and reliability. In Proceedings of SIGIR, pages 186-193, 2005.
    • (2005) Proceedings of SIGIR , pp. 186-193
    • Sanderson, M.1    Zobel, J.2
  • 12
    • 0034790621 scopus 로고    scopus 로고
    • Ranking retrieval systems without relevance judgments
    • Ian Soboroff, Charles Nicholas, and Patrick Cahan. Ranking Retrieval Systems without Relevance Judgments. In Proceedings of SIGIR, pages 66-73, 2001.
    • (2001) Proceedings of SIGIR , pp. 66-73
    • Soboroff, I.1    Nicholas, C.2    Cahan, P.3
  • 13
    • 0032264624 scopus 로고    scopus 로고
    • Variations in relevance judgments and the measurement of retrieval effectiveness
    • Ellen Voorhees. Variations in relevance judgments and the measurement of retrieval effectiveness. In Proceedings of SIGIR, pages 315-323, 1998.
    • (1998) Proceedings of SIGIR , pp. 315-323
    • Voorhees, E.1
  • 16
    • 34547632535 scopus 로고    scopus 로고
    • Estimating average precision with incomplete and imperfect relevance judgments
    • Emine Yilmaz and Javed Aslam. Estimating average precision with incomplete and imperfect relevance judgments. In Proceedings of CIKM, pages 102-111, 2006.
    • (2006) Proceedings of CIKM , pp. 102-111
    • Yilmaz, E.1    Aslam, J.2
  • 17
    • 0032272626 scopus 로고    scopus 로고
    • How reliable are the results of large-scale information retrieval experiments?
    • Justin Zobel. How Reliable are the Results of Large-Scale Information Retrieval Experiments? In Proceedings of SIGIR, pages 307-314, 1998.
    • (1998) Proceedings of SIGIR , pp. 307-314
    • Zobel, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.