



Volume 6, Issue 1, 2007, Pages

On the reliability of factoid question answering evaluation

Author keywords

Evaluation metrics; Question answering

Indexed keywords

EVALUATION METRICS; QUESTION ANSWERING (QA)

EID: 34247192084     PISSN: 15300226     EISSN: 15583430     Source Type: Journal    
DOI: 10.1145/1227850.1227853     Document Type: Article
Times cited : (3)

References (23)
  • 3
    • 34247182321
    • FUKUMOTO, J., KATO, T., AND MASUI, F. 2004. Question answering challenge for five ranked answers and list answers - overview of NTCIR4 QAC2 subtask 1 and 2-. In Working Notes of NTCIR-4. 283-290.
  • 7
    • 8644222913
    • New performance metrics based on multigrade relevance: Their application to question answering
    • SAKAI, T. 2004a. New performance metrics based on multigrade relevance: Their application to question answering. In Proceedings of NTCIR-4.
    • (2004) Proceedings of NTCIR-4
    • SAKAI, T.1
  • 9
    • 33750369717
    • The effect of topics sampling in sensitivity comparisons of information retrieval metrics
    • SAKAI, T. 2005. The effect of topics sampling in sensitivity comparisons of information retrieval metrics. In NTCIR-5 Proceedings. 505-512.
    • (2005) NTCIR-5 Proceedings , pp. 505-512
    • SAKAI, T.1
  • 10
    • 33751354079
    • Bootstrap-based comparisons of IR metrics for finding one relevant document
    • Proceedings of Asia Information Retrieval Symposium (AIRS) 2006
    • SAKAI, T. 2006a. Bootstrap-based comparisons of IR metrics for finding one relevant document. In Proceedings of Asia Information Retrieval Symposium (AIRS) 2006, Lecture Notes in Computer Science 4182. 374-389.
    • (2006) Lecture Notes in Computer Science , vol.4182 , pp. 374-389
    • SAKAI, T.1
  • 11
    • 33750340100
    • Evaluating evaluation metrics based on the bootstrap
    • SAKAI, T. 2006b. Evaluating evaluation metrics based on the bootstrap. In Proceedings of ACM SIGIR 2006. 525-532.
    • (2006) Proceedings of ACM SIGIR 2006 , pp. 525-532
    • SAKAI, T.1
  • 12
    • 33750307579
    • Give me just one highly relevant document: P-measure
    • SAKAI, T. 2006c. Give me just one highly relevant document: P-measure. In Proceedings of ACM SIGIR 2006. 695-696.
    • (2006) Proceedings of ACM SIGIR 2006 , pp. 695-696
    • SAKAI, T.1
  • 13
    • 33750437740
    • On the reliability of information retrieval metrics based on graded relevance
    • SAKAI, T. 2006d. On the reliability of information retrieval metrics based on graded relevance. Information Processing and Management. 531-548.
    • (2006) Information Processing and Management , pp. 531-548
    • SAKAI, T.1
  • 14
    • 33750298184
    • On the task of finding one highly relevant document with high precision
    • SAKAI, T. 2006e. On the task of finding one highly relevant document with high precision. Information Processing Society of Japan Digital Courier 2, 174-188.
    • (2006) Information Processing Society of Japan Digital Courier , vol.2 , pp. 174-188
    • SAKAI, T.1
  • 18
    • 8644236798
    • On evaluating web search with very few relevant documents
    • SOBOROFF, I. 2004. On evaluating web search with very few relevant documents. In Proceedings of ACM SIGIR 2004. 530-531.
    • (2004) Proceedings of ACM SIGIR 2004 , pp. 530-531
    • SOBOROFF, I.1
  • 19
    • 0012435995
    • A probabilistic model of information retrieval: Development and comparative experiments
    • SPARCK JONES, K., WALKER, S., AND ROBERTSON, S. E. 2000. A probabilistic model of information retrieval: development and comparative experiments. Information Processing and Management 36, 779-808 (Part I) and 809-840 (Part II).
    • (2000) Information Processing and Management , vol.36 , pp. 779-808
    • SPARCK JONES, K.1    WALKER, S.2    ROBERTSON, S.E.3
  • 20
    • 1542370065
    • Overview of the TREC 2001 question answering track
    • VOORHEES, E. M. 2002. Overview of the TREC 2001 question answering track. In Proceedings of TREC 2001.
    • (2002) Proceedings of TREC 2001
    • VOORHEES, E.M.1
  • 21
    • 24644514267
    • Overview of the TREC 2003 question answering track
    • VOORHEES, E. M. 2004. Overview of the TREC 2003 question answering track. In Proceedings of TREC 2003.
    • (2004) Proceedings of TREC 2003
    • VOORHEES, E.M.1
  • 22
    • 34247190428
    • Overview of the TREC 2004 question answering track
    • VOORHEES, E. M. 2005. Overview of the TREC 2004 question answering track. In Proceedings of TREC 2004.
    • (2005) Proceedings of TREC 2004
    • VOORHEES, E.M.1
  • 23
    • 0036993119
    • The effect of topic set size on retrieval experiment error
    • VOORHEES, E. M. AND BUCKLEY, C. 2002. The effect of topic set size on retrieval experiment error. In Proceedings of ACM SIGIR 2002. 316-323.
    • (2002) Proceedings of ACM SIGIR 2002 , pp. 316-323
    • VOORHEES, E.M.1    BUCKLEY, C.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.