



2008, Pages 667-674

Relevance assessment: Are judges exchangeable and does it matter?

Author keywords

Experimentation; Measurement; Performance

Indexed keywords

BRONZE; COPPER ALLOYS; INFORMATION RETRIEVAL; INFORMATION RETRIEVAL SYSTEMS; INFORMATION SERVICES; RESEARCH AND DEVELOPMENT MANAGEMENT; SILVER

EID: 57349188929     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1390334.1390447     Document Type: Conference Paper
Times cited: 166

References (25)
  • 1
    • 84892523862
    • Inter-coder agreement for computational linguistics
    • R. Artstein and M. Poesio. Inter-coder agreement for computational linguistics. Computational Linguistics, to appear.
    • Computational Linguistics
    • Artstein, R.1    Poesio, M.2
  • 2
    • 33750288965
    • A statistical method for system evaluation using incomplete judgments
    • J. A. Aslam, V. Pavlu, and E. Yilmaz. A statistical method for system evaluation using incomplete judgments. In Proc. SIGIR, 2006.
    • (2006) Proc. SIGIR
    • Aslam, J.A.1    Pavlu, V.2    Yilmaz, E.3
  • 4
    • 8644251996
    • Retrieval evaluation with incomplete information
    • C. Buckley and E. M. Voorhees. Retrieval evaluation with incomplete information. In Proc. SIGIR, 2004.
    • (2004) Proc. SIGIR
    • Buckley, C.1    Voorhees, E.M.2
  • 5
    • 0013287054
    • Variations in relevance judgments and the evaluation of retrieval performance
    • R. Burgin. Variations in relevance judgments and the evaluation of retrieval performance. Information Processing & Management, 28(5):619-627, Sep-Oct 1992.
    • (1992) Information Processing & Management , vol.28 , Issue.5 , pp. 619-627
    • Burgin, R.1
  • 6
    • 84937275232
    • Assessing agreement on classification tasks: The kappa statistic
    • J. Carletta. Assessing agreement on classification tasks: The kappa statistic. Computational Linguistics, 22(2):249-254, 1996.
    • (1996) Computational Linguistics , vol.22 , Issue.2 , pp. 249-254
    • Carletta, J.1
  • 7
    • 0005655729
    • The effect of variations in relevance assessments in comparative experimental tests of index languages
    • C. W. Cleverdon. The effect of variations in relevance assessments in comparative experimental tests of index languages. Technical Report ASLIB part 2, Cranfield Institute of Technology, 1970.
    • (1970) Technical Report ASLIB part 2
    • Cleverdon, C.W.1
  • 10
    • 2142668188
    • The kappa statistic: A second look
    • B. D. Eugenio and M. Glass. The kappa statistic: a second look. Computational Linguistics, 30(1):95-101, 2004.
    • (2004) Computational Linguistics , vol.30 , Issue.1 , pp. 95-101
    • Eugenio, B.D.1    Glass, M.2
  • 11
    • 0001769424
    • Variations in relevance assessments and the measurement of retrieval effectiveness
    • S. P. Harter. Variations in relevance assessments and the measurement of retrieval effectiveness. JASIS, 47(1):37-49, 1996.
    • (1996) JASIS , vol.47 , Issue.1 , pp. 37-49
    • Harter, S.P.1
  • 12
    • 0033645041
    • IR evaluation methods for retrieving highly relevant documents
    • K. Järvelin and J. Kekäläinen. IR evaluation methods for retrieving highly relevant documents. In Proc. SIGIR, 2000.
    • (2000) Proc. SIGIR
    • Järvelin, K.1    Kekäläinen, J.2
  • 14
    • 0009233105
    • Relevance assessments and retrieval system evaluation
    • M. E. Lesk and G. Salton. Relevance assessments and retrieval system evaluation. Information Storage and Retrieval, 4:343-359, 1969.
    • (1969) Information Storage and Retrieval , vol.4 , pp. 343-359
    • Lesk, M.E.1    Salton, G.2
  • 16
    • 85050172503
    • Statistical Techniques for the Study of Language and Language Behaviour
    • R. Rietveld and R. van Hout. Statistical Techniques for the Study of Language and Language Behaviour. Mouton de Gruyter, 1993.
    • (1993) Mouton de Gruyter
    • Rietveld, R.1    van Hout, R.2
  • 18
    • 36448954593
    • A comparison of pooled and sampled relevance judgments
    • I. Soboroff. A comparison of pooled and sampled relevance judgments. In Proc. SIGIR, 2007.
    • (2007) Proc. SIGIR
    • Soboroff, I.1
  • 19
    • 0036989640
    • Liberal relevance criteria of TREC: Counting on negligible documents?
    • E. Sormunen. Liberal relevance criteria of TREC: counting on negligible documents? In Proc. SIGIR, 2002.
    • (2002) Proc. SIGIR
    • Sormunen, E.1
  • 20
    • 84876705138
    • IR Evaluation Using Multiple Assessors per Topic
    • A. Trotman and D. Jenkinson. IR Evaluation Using Multiple Assessors per Topic. In Proc. ADCS, 2007.
    • (2007) Proc. ADCS
    • Trotman, A.1    Jenkinson, D.2
  • 22
    • 0032264624
    • Variations in relevance judgments and the measurement of retrieval effectiveness
    • E. M. Voorhees. Variations in relevance judgments and the measurement of retrieval effectiveness. In Proc. SIGIR, 1998.
    • (1998) Proc. SIGIR
    • Voorhees, E.M.1
  • 24
    • 34547632535
    • Estimating average precision with incomplete and imperfect judgments
    • E. Yilmaz and J. A. Aslam. Estimating average precision with incomplete and imperfect judgments. In Proc. CIKM, 2006.
    • (2006) Proc. CIKM
    • Yilmaz, E.1    Aslam, J.A.2
  • 25
    • 57349107098
    • A simple and efficient sampling method for estimating AP and NDCG
    • E. Yilmaz, E. Kanoulas, and J. Aslam. A simple and efficient sampling method for estimating AP and NDCG. In Proc. SIGIR, 2008.
    • (2008) Proc. SIGIR
    • Yilmaz, E.1    Kanoulas, E.2    Aslam, J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.