메뉴 건너뛰기




Volumn 39, Issue 2, 2013, Pages 267-300

Automatically assessing machine summary content without a gold standard

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC EVALUATION; GOLD STANDARDS; HUMAN ASSESSMENT; HUMAN JUDGMENTS; HUMAN MODEL; PSEUDO-MODELS; SINGLE MODELS; SOURCE TEXT;

EID: 84877033915     PISSN: 08912017     EISSN: 15309312     Source Type: Journal    
DOI: 10.1162/COLI_a_00123     Document Type: Article
Times cited : (170)

References (48)
  • 1
    • 49449097329 scopus 로고    scopus 로고
    • Regression for sentence-level MT evaluation with pseudo references
    • Prague
    • Albrecht, Joshua and Rebecca Hwa. 2007. Regression for sentence-level MT evaluation with pseudo references. In Proceedings of ACL, pages 296-303, Prague.
    • (2007) Proceedings of ACL , pp. 296-303
    • Albrecht, J.1    Hwa, R.2
  • 6
    • 0032270694 scopus 로고    scopus 로고
    • The use of MMR, diversity-based reranking for reordering documents and producing summaries
    • Melbourne
    • Carbonell, Jaime and Jade Goldstein. 1998. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of SIGIR, pages 335-336, Melbourne.
    • (1998) Proceedings of SIGIR , pp. 335-336
    • Carbonell, J.1    Goldstein, J.2
  • 8
    • 0034785142 scopus 로고    scopus 로고
    • Text summarization via hidden Markov models
    • New Orleans, LA
    • Conroy, John M. and Dianne P. O'Leary. 2001. Text summarization via hidden Markov models. In Proceedings of SIGIR, pages 406-407, New Orleans, LA.
    • (2001) Proceedings of SIGIR , pp. 406-407
    • Conroy, J.M.1    O'leary, D.P.2
  • 9
    • 84876786218 scopus 로고    scopus 로고
    • Classy 2011 at TAC: Guided and multi-lingual summaries and evaluation metrics
    • MD. Available at
    • Conroy, John M., Judith D. Schlesinger, Jeff Kubina, Peter A. Rankel, and Dianne P. O'Leary. 2011. Classy 2011 at TAC: Guided and multi-lingual summaries and evaluation metrics. In Proceedings of TAC, Gaithersburg, MD. Available at: http://www.nist.gov/tac/publications/2011/participant.papers/CLASSY.proceedings.pdf.
    • (2011) Proceedings of TAC, Gaithersburg
    • Conroy, J.M.1    Schlesinger, J.D.2    Kubina, J.3    Rankel, P.A.4    O'Leary, D.P.5
  • 10
    • 85119957662 scopus 로고    scopus 로고
    • Topic-focused multi-document summarization using an approximate oracle score
    • Sydney
    • Conroy, John M., Judith D. Schlesinger, and Dianne P. O'Leary. 2006. Topic-focused multi-document summarization using an approximate oracle score. In Proceedings of the COLING-ACL, pages 152-159, Sydney.
    • (2006) Proceedings of the COLING-ACL , pp. 152-159
    • Conroy, J.M.1    Schlesinger, J.D.2    O'Leary, D.P.3
  • 13
    • 79952259163 scopus 로고    scopus 로고
    • Lexpagerank: Prestige in multi-document text summarization
    • Barcelona
    • Erkan, Günȩ and Dragomir R. Radev. 2004. Lexpagerank: Prestige in multi-document text summarization. In Proceedings of EMNLP, pages 365-371, Barcelona.
    • (2004) Proceedings of EMNLP , pp. 365-371
    • Erkan, G.1    Radev, D.R.2
  • 16
    • 0034795521 scopus 로고    scopus 로고
    • Generic text summarization using relevance measure and latent semantic analysis
    • New Orleans, LA
    • Gong, Yihong and Xin Liu. 2001. Generic text summarization using relevance measure and latent semantic analysis. In Proceedings of SIGIR, pages 19-25, New Orleans, LA.
    • (2001) Proceedings of SIGIR , pp. 19-25
    • Gong, Y.1    Liu, X.2
  • 17
    • 77956957790 scopus 로고    scopus 로고
    • Exploring content models for multi-document summarization
    • Boulder, CO
    • Haghighi, Aria and Lucy Vanderwende. 2009. Exploring content models for multi-document summarization. In Proceedings of HLT-NAACL, pages 362-370, Boulder, CO.
    • (2009) Proceedings of HLT-NAACL , pp. 362-370
    • Haghighi, A.1    Vanderwende, L.2
  • 20
    • 85117703506 scopus 로고    scopus 로고
    • Minimum Bayes-risk decoding for statistical machine translation
    • Boston, MA
    • Kumar, Shankar and William Byrne. 2004. Minimum Bayes-risk decoding for statistical machine translation. In Proceedings of HLT-NAACL, pages 169-176, Boston, MA.
    • (2004) Proceedings of HLT-NAACL , pp. 169-176
    • Kumar, S.1    Byrne, W.2
  • 21
    • 34547487207 scopus 로고    scopus 로고
    • Looking for a few good metrics: Automatic summarization evaluation-how many samples are enough
    • Tokyo
    • Lin, Chin-Yew. 2004a. Looking for a few good metrics: Automatic summarization evaluation-how many samples are enough. In Proceedings of the NTCIR Workshop, volume 4, pages 1-10, Tokyo.
    • (2004) Proceedings of the NTCIR Workshop , vol.4 , pp. 1-10
    • Lin, C.-Y.1
  • 22
    • 26944501715 scopus 로고    scopus 로고
    • ROUGE: A package for automatic evaluation of summaries
    • Barcelona
    • Lin, Chin-Yew. 2004b. ROUGE: A package for automatic evaluation of summaries. In Proceedings of the ACL Text Summarization Workshop, pages 74-81, Barcelona.
    • (2004) Proceedings of the ACL Text Summarization Workshop , pp. 74-81
    • Lin, C.-Y.1
  • 23
    • 84863337556 scopus 로고    scopus 로고
    • An information-theoretic approach to automatic evaluation of summaries
    • New York, NY
    • Lin, Chin-Yew, Guihong Cao, Jianfeng Gao, and Jian-Yun Nie. 2006. An information-theoretic approach to automatic evaluation of summaries. In Proceedings of HLT-NAACL, pages 463-470, New York, NY.
    • (2006) Proceedings of HLT-NAACL , pp. 463-470
    • Lin, C.-Y.1    Cao, G.2    Gao, J.3    Nie, J.-Y.4
  • 24
    • 0038375892 scopus 로고    scopus 로고
    • The automated acquisition of topic signatures for text summarization
    • Saarbrücken
    • Lin, Chin-Yew and Eduard Hovy. 2000. The automated acquisition of topic signatures for text summarization. In Proceedings of COLING, pages 495-501, Saarbrücken.
    • (2000) Proceedings of COLING , pp. 495-501
    • Lin, C.-Y.1    Hovy, E.2
  • 25
    • 85016508365 scopus 로고    scopus 로고
    • Automatic evaluation of summaries using n-gram co-occurrence statistics
    • Edmonton
    • Lin, Chin-Yew and Eduard Hovy. 2003. Automatic evaluation of summaries using n-gram co-occurrence statistics. In Proceedings of HLT-NAACL, pages 71-78, Edmonton.
    • (2003) Proceedings of HLT-NAACL , pp. 71-78
    • Lin, C.-Y.1    Hovy, E.2
  • 26
    • 84926402338 scopus 로고    scopus 로고
    • Automatic summary evaluation without human models
    • Gaithersburg, MD. Available at:
    • Louis, Annie and Ani Nenkova. 2008. Automatic summary evaluation without human models. In Proceedings of TAC, Gaithersburg, MD. Available at: http://www.nist.gov/tac/publications/2008/additional.papers/Penn.proceedings.pdf.
    • (2008) Proceedings of TAC
    • Louis, A.1    Nenkova, A.2
  • 27
    • 78650467116 scopus 로고    scopus 로고
    • Automatically evaluating content selection in summarization without human models
    • Singapore
    • Louis, Annie and Ani Nenkova. 2009a. Automatically evaluating content selection in summarization without human models. In Proceedings of EMNLP, pages 306-314, Singapore.
    • (2009) Proceedings of EMNLP , pp. 306-314
    • Louis, A.1    Nenkova, A.2
  • 28
    • 77955311450 scopus 로고    scopus 로고
    • Performance confidence estimation for automatic summarization
    • Athens
    • Louis, Annie and Ani Nenkova. 2009b. Performance confidence estimation for automatic summarization. In Proceedings of EACL, pages 541-548, Athens.
    • (2009) Proceedings of EACL , pp. 541-548
    • Louis, A.1    Nenkova, A.2
  • 29
    • 84877057710 scopus 로고    scopus 로고
    • Predicting summary quality using limited human input
    • Gaithersburg, MD. Available at
    • Louis, Annie and Ani Nenkova. 2009c. Predicting summary quality using limited human input. In Proceedings of TAC, Gaithersburg, MD. Available at: http://www.nist.gov/tac/publications/2009/participant.papers/UPenn.proceedings.pdf.
    • (2009) Proceedings of TAC
    • Louis, A.1    Nenkova, A.2
  • 31
    • 37149002864 scopus 로고    scopus 로고
    • A study of global inference algorithms in multi-document summarization
    • Rome
    • McDonald, Ryan. 2007. A study of global inference algorithms in multi-document summarization. In Proceedings of ECIR, pages 557-564, Rome.
    • (2007) Proceedings of ECIR , pp. 557-564
    • McDonald, R.1
  • 32
    • 14844364264 scopus 로고    scopus 로고
    • Columbia multi-document summarization: Approach and evaluation
    • New Orleans, LA. Available at
    • McKeown, Kathy, Regina Barzilay, David Evans, Vasileios Hatzivassiloglou, Barry Schiffman, and Simone Teufel. 2001. Columbia multi-document summarization: Approach and evaluation. In Proceedings of DUC, New Orleans, LA. Available at: http://www-nlpir.nist.gov/projects/duc/pubs/2001papers/columbiaredo.pdf.
    • (2001) Proceedings of DUC
    • McKeown, K.1    Barzilay, R.2    Evans, D.3    Hatzivassiloglou, V.4    Schiffman, B.5    Teufel, S.6
  • 34
    • 84859899789 scopus 로고    scopus 로고
    • Can you summarize this? Identifying correlates of input difficulty for multi-document summarization
    • Columbus, OH
    • Nenkova, Ani and Annie Louis. 2008. Can you summarize this? Identifying correlates of input difficulty for multi-document summarization. In Proceedings of ACL-HLT, pages 825-833, Columbus, OH.
    • (2008) Proceedings of ACL-HLT , pp. 825-833
    • Nenkova, A.1    Louis, A.2
  • 35
    • 85013202438 scopus 로고    scopus 로고
    • Evaluating content selection in summarization: The pyramid method
    • Boston, MA
    • Nenkova, Ani and Rebecca Passonneau. 2004. Evaluating content selection in summarization: The pyramid method. In Proceedings of HLT-NAACL, pages 145-152, Boston, MA.
    • (2004) Proceedings of HLT-NAACL , pp. 145-152
    • Nenkova, A.1    Passonneau, R.2
  • 36
    • 34249275304 scopus 로고    scopus 로고
    • The pyramid method: Incorporating human content selection variation in summarization evaluation
    • Nenkova, Ani, Rebecca Passonneau, and Kathleen McKeown. 2007. The pyramid method: Incorporating human content selection variation in summarization evaluation. ACM Transactions on Speech and Language Processing, 4(2):4.
    • (2007) ACM Transactions On Speech and Language Processing , vol.4 , Issue.2 , pp. 4
    • Nenkova, A.1    Passonneau, R.2    McKeown, K.3
  • 37
    • 33750346745 scopus 로고    scopus 로고
    • A compositional context sensitive multi-document summarizer: Exploring the factors that influence summarization
    • Seattle, WA
    • Nenkova, Ani, Lucy Vanderwende, and Kathleen McKeown. 2006. A compositional context sensitive multi-document summarizer: Exploring the factors that influence summarization. In Proceedings of SIGIR, pages 573-580, Seattle, WA.
    • (2006) Proceedings of SIGIR , pp. 573-580
    • Nenkova, A.1    Vanderwende, L.2    McKeown, K.3
  • 39
    • 84907095419 scopus 로고    scopus 로고
    • R: A Language and Environment For Statistical Computing
    • R Development Core Team, Vienna
    • R Development Core Team. 2011. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna.
    • (2011) R Foundation for Statistical Computing
  • 42
    • 18744387235 scopus 로고    scopus 로고
    • Single-document and multi-document summary evaluation via relative utility
    • New Orleans, LA
    • Radev, Dragomir and Daniel Tam. 2003. Single-document and multi-document summary evaluation via relative utility. In Proceedings of CIKM, pages 508-511, New Orleans, LA.
    • (2003) Proceedings of CIKM , pp. 508-511
    • Radev, D.1    Tam, D.2
  • 43
    • 0003172123 scopus 로고
    • The formation of abstracts by the selection of sentences: Part 1: Sentence selection by man and machines
    • Rath, G. J., A. Resnick, and R. Savage. 1961. The formation of abstracts by the selection of sentences: Part 1: Sentence selection by man and machines. American Documentation, 2(12):139-208.
    • (1961) American Documentation , vol.2 , Issue.12 , pp. 139-208
    • Rath, G.J.1    Resnick, A.2    Savage, R.3
  • 45
    • 0034790621 scopus 로고    scopus 로고
    • Ranking retrieval systems without relevance judgments
    • New Orleans, LA
    • Soboroff, Ian, Charles Nicholas, and Patrick Cahan. 2001. Ranking retrieval systems without relevance judgments. In Proceedings of SIGIR, pages 66-73, New Orleans, LA.
    • (2001) Proceedings of SIGIR , pp. 66-73
    • Soboroff, I.1    Nicholas, C.2    Cahan, P.3
  • 46
    • 80053375451 scopus 로고    scopus 로고
    • Lattice minimum Bayes-risk decoding for statistical machine translation
    • Honolulu, HI
    • Tromble, Roy W., Shankar Kumar, Franz Och, and Wolfgang Macherey. 2008. Lattice minimum Bayes-risk decoding for statistical machine translation. In Proceedings of EMNLP, pages 620-629, Honolulu, HI.
    • (2008) Proceedings of EMNLP , pp. 620-629
    • Tromble, R.W.1    Kumar, S.2    Och, F.3    Macherey, W.4
  • 47
    • 23044445534 scopus 로고    scopus 로고
    • Examining the consensus between human summaries: Initial experiments with factoid analysis
    • Edmonton
    • van Halteren Hans and Simone Teufel. 2003. Examining the consensus between human summaries: Initial experiments with factoid analysis. In Proceedings of the HLT-NAACL DUC on Text Summarization Workshop, pages 57-64, Edmonton.
    • (2003) Proceedings of the HLT-NAACL DUC On Text Summarization Workshop , pp. 57-64
    • van Hans, H.1    Teufel, S.2
  • 48
    • 78650039224 scopus 로고    scopus 로고
    • Significance tests of automatic machine translation evaluation metrics
    • Zhang, Ying and Stephan Vogel. 2010. Significance tests of automatic machine translation evaluation metrics. Machine Translation, 24(1):51-65.
    • (2010) Machine Translation , vol.24 , Issue.1 , pp. 51-65
    • Zhang, Y.1    Vogel, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.