SCOPUS 정보 검색 플랫폼

ACM Transactions on Asian Language Information Processing

Volumn 6, Issue 1, 2007, Pages

On the reliability of factoid question answering evaluation

(1) Sakai, Tetsuya a

a TOSHIBA CORPORATION (Japan)

Author keywords

Evaluation metrics; Question answering

Indexed keywords

EVALUATION METRICS; QUESTION ANSWERING (QA);

DATA PROCESSING; QUERY LANGUAGES; RELIABILITY THEORY;

COMPUTER SCIENCE;

EID: 34247192084 PISSN: 15300226 EISSN: 15583430 Source Type: Journal
DOI: 10.1145/1227850.1227853 Document Type: Article

Times cited : (3)

References (23)

1
- 0033650323
- Evaluating evaluation measure stability
- BUCKLEY, C. AND VOORHEES, E. M. 2000. Evaluating evaluation measure stability. In Proceedings of ACM SIGIR 2000. 33-40.
- (2000) Proceedings of ACM SIGIR 2000 , pp. 33-40
- BUCKLEY, C.¹ VOORHEES, E.M.²

2
- 0003991665
- Chapman & Hall/CRC, Boca Raton, FL
- EFRON, B. AND TIBSHIRANI, R. 1993. An Introduction to the Bootstrap. Chapman & Hall/CRC, Boca Raton, FL.
- (1993) An Introduction to the Bootstrap
- EFRON, B.¹ TIBSHIRANI, R.²

3
- 34247182321
- FUKUMOTO, J., KATO, T., AND MASUI, F. 2004. Question answering challenge for five ranked answers and list answers - overview of NTCIR4 QAC2 subtask 1 and 2-. In Working Notes of NTCIR-4. 283-290.
- FUKUMOTO, J., KATO, T., AND MASUI, F. 2004. Question answering challenge for five ranked answers and list answers - overview of NTCIR4 QAC2 subtask 1 and 2-. In Working Notes of NTCIR-4. 283-290.

4
- 1842637192
- Cumulated gain-based evaluation of IR techniques
- JÄRVELIN, K. AND KEKÄLÄINEN, J. 2002. Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems 20, 4, 422-446.
- (2002) ACM Transactions on Information Systems , vol.20 , Issue.4 , pp. 422-446
- JÄRVELIN, K.¹ KEKÄLÄINEN, J.²

5
- 34247193849
- Characterization of list-type question answering and its evaluation measures (in Japanese)
- FI-76-16, NL-163-16. 115-122
- KATO, T., MASUI, F., FUKUMOTO, J., AND KANDO, N. 2004. Characterization of list-type question answering and its evaluation measures (in Japanese). In Information Processing Society of Japan SIG Technical Reports FI-76-16 / NL-163-16. 115-122.
- (2004) Information Processing Society of Japan SIG Technical Reports
- KATO, T.¹ MASUI, F.² FUKUMOTO, J.³ KANDO, N.⁴

6
- 34247240253
- multilingual question answering track
- MAGNINI, B., VALLIN, A., AYACHE, C., ERBACH, G., PEÑAS, A., DE RIJKE, M., ROCHA, P., SIMOV, K., AND SUTCLIFFE, R. 2004. Overview of the CLEF 2004 multilingual question answering track.
- (2004) Overview of the CLEF 2004
- MAGNINI, B.¹ VALLIN, A.² AYACHE, C.³ ERBACH, G.⁴ PEÑAS, A.⁵ DE RIJKE, M.⁶ ROCHA, P.⁷ SIMOV, K.⁸ SUTCLIFFE, R.⁹

7
- 8644222913
- New performance metrics based on multigrade relevance: Their application to question answering
- SAKAI, T. 2004a. New performance metrics based on multigrade relevance: Their application to question answering. In Proceedings of NTCIR-4.
- (2004) Proceedings of NTCIR-4
- SAKAI, T.¹

8
- 33646164697
- A note on the reliability of Japanese question answering evaluation
- FI-77-7. 57-64
- SAKAI, T. 2004b. A note on the reliability of Japanese question answering evaluation. In Information Processing Society of Japan SIG Technical Reports FI-77-7. 57-64.
- (2004) Information Processing Society of Japan SIG Technical Reports
- SAKAI, T.¹

9
- 33750369717
- The effect of topics sampling in sensitivity comparisons of information retrieval metrics
- SAKAI, T. 2005. The effect of topics sampling in sensitivity comparisons of information retrieval metrics. In NTCIR-5 Proceedings. 505-512.
- (2005) NTCIR-5 Proceedings , pp. 505-512
- SAKAI, T.¹

10
- 33751354079
- Bootstrap-based comparisons of IR metrics for finding one relevant document
- Proceedings of Asia Information Retrieval Symposium (AIRS) 2006
- SAKAI, T. 2006a. Bootstrap-based comparisons of IR metrics for finding one relevant document. In Proceedings of Asia Information Retrieval Symposium (AIRS) 2006, Lecture Notes in Computer Science 4182. 374-389.
- (2006) Lecture Notes in Computer Science , vol.4182 , pp. 374-389
- SAKAI, T.¹

11
- 33750340100
- Evaluating evaluation metrics based on the bootstrap
- SAKAI, T. 2006b. Evaluating evaluation metrics based on the bootstrap. In Proceedings of ACM SIOIR 2006. 525-532.
- (2006) Proceedings of ACM SIOIR 2006 , pp. 525-532
- SAKAI, T.¹

12
- 33750307579
- Give me just one highly relevant document: P-measure
- SAKAI, T. 2006c. Give me just one highly relevant document: P-measure. In Proceedings of ACM SIGIR 2006. 695-696.
- (2006) Proceedings of ACM SIGIR 2006 , pp. 695-696
- SAKAI, T.¹

13
- 33750437740
- On the reliability of information retrieval metrics based on graded relevance
- SAKAI, T. 2006d. On the reliability of information retrieval metrics based on graded relevance. Information Processing and Management. 531-548.
- (2006) Information Processing and Management , pp. 531-548
- SAKAI, T.¹

14
- 33750298184
- On the task of finding one highly relevant document with high precision
- SAKAI, T. 2006e. On the task of finding one highly relevant document with high precision. Information Processing Society of Japan Digital Courier 2, 174-188.
- (2006) Information Processing Society of Japan Digital Courier , vol.2 , pp. 174-188
- SAKAI, T.¹

15
- 33750298184
- On the task of finding one highly relevant document with high precision
- SAKAI, T. 2006f. On the task of finding one highly relevant document with high precision. Information Processing Society of Japan Transactions on Databases TOD29, 13-27.
- (2006) Information Processing Society of Japan Transactions on Databases TOD29 , pp. 13-27
- SAKAI, T.¹

16
- 34247250005
- Toshiba ASKMi at NTCIR-4 QAC2
- SAKAI, T., SAITO, Y., ICHIMURA, Y., KOYAMA, M., AND KOKUBU, T. 2004a. Toshiba ASKMi at NTCIR-4 QAC2. In Proceedings of NTCIR-4.
- (2004) Proceedings of NTCIR-4
- SAKAI, T.¹ SAITO, Y.² ICHIMURA, Y.³ KOYAMA, M.⁴ KOKUBU, T.⁵

17
- 33751361636
- ASKMi: A Japanese question answering system based on semantic role analysis
- SAKAI, T., SAITO, Y., ICHIMURA, Y., KOYAMA, M., KOKUBU, T., AND MANABE, T. 2004b. ASKMi: A Japanese question answering system based on semantic role analysis. In Proceedings of RIAO 2004. 215-231.
- (2004) Proceedings of RIAO 2004 , pp. 215-231
- SAKAI, T.¹ SAITO, Y.² ICHIMURA, Y.³ KOYAMA, M.⁴ KOKUBU, T.⁵ MANABE, T.⁶

18
- 8644236798
- On evaluating web search with very few relevant documents
- SOBOROFF, I. 2004. On evaluating web search with very few relevant documents. In Proceedings of ACM SIGIR 2004. 530-531.
- (2004) Proceedings of ACM SIGIR 2004 , pp. 530-531
- SOBOROFF, I.¹

19
- 0012435995
- A probabilistic model of information retrieval: Development and comparative experiments
- Part I) and 809-840 Part II
- SPARCK JONES, K., WALKER, S., AND ROBERTSON, S. E. 2000. A probabilistic model of information retrieval: development and comparative experiments. Information Processing and Management 36, 779-808 (Part I) and 809-840 (Part II).
- (2000) Information Processing and Management , vol.36 , pp. 779-808
- SPARCK JONES, K.¹ WALKER, S.² ROBERTSON, S.E.³

20
- 1542370065
- Overview of the TREC 2001 question answering track
- VOORHEES, E. M. 2002. Overview of the TREC 2001 question answering track. In Proceedings of TREC 2001.
- (2002) Proceedings of TREC 2001
- VOORHEES, E.M.¹

21
- 24644514267
- Overview of the TREC 2003 question answering track
- VOORHEES, E. M. 2004. Overview of the TREC 2003 question answering track. In Proceedings of TREC 2003.
- (2004) Proceedings of TREC 2003
- VOORHEES, E.M.¹

22
- 34247190428
- Overview of the TREC 2004 question answering track
- VOORHEES, E. M. 2005. Overview of the TREC 2004 question answering track. In Proceedings of TREC 2004.
- (2005) Proceedings of TREC 2004
- VOORHEES, E.M.¹

23
- 0036993119
- The effect of topic set size on retrieval experiment error
- VOORHEES, E. M. AND BUCKLEY, C. 2002. The effect of topic set size on retrieval experiment error. In Proceedings of ACM SIGIR 2002. 316-323.
- (2002) Proceedings of ACM SIGIR 2002 , pp. 316-323
- VOORHEES, E.M.¹ BUCKLEY, C.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.