Volume 48, Issue 6, 2012, Pages 1053-1066

Using crowdsourcing for TREC relevance assessment

Author keywords

Amazon Mechanical Turk; Crowdsourcing; Experimental design; IR evaluation; Relevance assessment; Test collections; TREC

Indexed keywords

CROWDSOURCING; MECHANICAL TURKS; RELEVANCE ASSESSMENTS; TEST COLLECTION; TREC;

EID: 84865695467     PISSN: 03064573     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.ipm.2012.01.004     Document Type: Article
Times cited: 119

References (40)
  • 2
    • 80052128947
    • Crowdsourcing for information retrieval: Principles, methods and applications, SIGIR tutorial
    • Alonso, O.; & Lease, M. (2011). Crowdsourcing for information retrieval: Principles, methods and applications, SIGIR tutorial. In: Proceedings of the 34th ACM SIGIR conference (pp. 1299-1300).
    • (2011) Proceedings of the 34th ACM SIGIR Conference , pp. 1299-1300
    • Alonso, O.1    Lease, M.2
  • 3
    • 72449180422
    • Relevance criteria for E-commerce. A crowdsourcing-based experimental analysis
    • Alonso, O.; & Mizzaro, S. (2009a). Relevance criteria for E-commerce. A crowdsourcing-based experimental analysis. In Proceedings of the 32nd ACM SIGIR conference (pp. 760-761).
    • (2009) Proceedings of the 32nd ACM SIGIR Conference , pp. 760-761
    • Alonso, O.1    Mizzaro, S.2
  • 5
    • 65249129950
    • Crowdsourcing for relevance evaluation
    • Alonso, O.; Rose, D.; & Stewart, B. (2008). Crowdsourcing for relevance evaluation. SIGIR Forum, 42(2), 9-15.
    • (2008) SIGIR Forum , vol.42 , Issue.2 , pp. 9-15
    • Alonso, O.1    Rose, D.2    Stewart, B.3
  • 16
  • 18
    • 3343019470
    • Measuring nominal scale agreement among many raters
    • Fleiss, J.L. (1971). Measuring nominal scale agreement among many raters. Psychological Bulletin, 76(5), 378-382.
    • (1971) Psychological Bulletin , vol.76 , Issue.5 , pp. 378-382
    • Fleiss, J.L.1
  • 20
    • 75149143907
    • A few good topics: Experiments in topic set reduction for retrieval evaluation
    • Guiver, J.; Mizzaro, S.; & Robertson, S. (2009). A few good topics: Experiments in topic set reduction for retrieval evaluation. ACM Transactions on Information Systems, 27(4), 1-26.
    • (2009) ACM Transactions on Information Systems , vol.27 , Issue.4 , pp. 1-26
    • Guiver, J.1    Mizzaro, S.2    Robertson, S.3
  • 22
  • 23
    • 80052132873
    • Crowdsourcing for book search evaluation: Impact of HIT design on comparative system ranking
    • Beijing, China, ACM
    • Kazai, G.; Kamps, J.; Koolen, M.; & Milic-Frayling, N. (2011). Crowdsourcing for book search evaluation: Impact of HIT design on comparative system ranking. In Proceedings of the 34th ACM SIGIR conference (pp. 205-214). Beijing, China, ACM.
    • (2011) Proceedings of the 34th ACM SIGIR Conference , pp. 205-214
    • Kazai, G.1    Kamps, J.2    Koolen, M.3    Milic-Frayling, N.4
  • 25
    • 0008500240
    • Estimating the reliability, systematic error, and random error of interval data
    • Krippendorff, K. (1970). Estimating the reliability, systematic error, and random error of interval data. Educational and Psychological Measurement, 30(1), 61-70.
    • (1970) Educational and Psychological Measurement , vol.30 , Issue.1 , pp. 61-70
    • Krippendorff, K.1
  • 32
    • 84885608872
    • Information retrieval system evaluation: Effort, sensitivity, and reliability
    • Sanderson, M. & Zobel, J. (2005). Information retrieval system evaluation: Effort, sensitivity, and reliability. In Proceedings of the 28th ACM SIGIR conference (pp. 162-169).
    • (2005) Proceedings of the 28th ACM SIGIR Conference , pp. 162-169
    • Sanderson, M.1    Zobel, J.2
  • 33
    • 77954220071
    • Test collection based evaluation of information retrieval systems
    • Sanderson, M. (2010). Test collection based evaluation of information retrieval systems. Foundations and Trends in Information Retrieval, 4(4), 247-375.
    • (2010) Foundations and Trends in Information Retrieval , vol.4 , Issue.4 , pp. 247-375
    • Sanderson, M.1
  • 34
    • 80052119348
    • Measuring assessor accuracy: A comparison of NIST assessors and user study participants
    • Smucker, M.; & Prakash Jethani, C. (2011). Measuring assessor accuracy: A comparison of NIST assessors and user study participants. In: Proceedings of the 34th ACM SIGIR conference (pp. 1231-1232).
    • (2011) Proceedings of the 34th ACM SIGIR Conference , pp. 1231-1232
    • Smucker, M.1    Prakash Jethani, C.2
  • 37
    • 0036989640
    • Liberal relevance criteria of TREC: Counting on negligible documents?
    • Sormunen, E. (2002). Liberal relevance criteria of TREC: Counting on negligible documents? In Proceedings of the 25th ACM SIGIR conference (pp. 324-330).
    • (2002) Proceedings of the 25th ACM SIGIR Conference , pp. 324-330
    • Sormunen, E.1
  • 38
    • 84874593076
    • A comparison of consensus, consistency, and measurement approaches to estimating interrater reliability
    • Retrieved 01.10.10
    • Stemler, S.E. (2004). A comparison of consensus, consistency, and measurement approaches to estimating interrater reliability. Practical Assessment, Research & Evaluation, 9(4). <http://PAREonline.net/getvn.asp?v=9&n=4> Retrieved 01.10.10.
    • (2004) Practical Assessment, Research & Evaluation , vol.9 , Issue.4
    • Stemler, S.E.1
  • 39
    • 0033733783
    • Variations in relevance judgments and the measurement of retrieval effectiveness
    • Voorhees, E. (2000). Variations in relevance judgments and the measurement of retrieval effectiveness. Information Processing and Management, 36(5), 697-716.
    • (2000) Information Processing and Management , vol.36 , Issue.5 , pp. 697-716
    • Voorhees, E.1
  • 40
    • 8644262918
    • The philosophy of information retrieval evaluation
    • Voorhees, E. (2001). The philosophy of information retrieval evaluation. In CLEF '01 proceedings (pp. 355-370).
    • (2001) CLEF '01 Proceedings , pp. 355-370
    • Voorhees, E.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.