SCOPUS 정보 검색 플랫폼

Computational Linguistics

Volumn 39, Issue 2, 2013, Pages 267-300

Automatically assessing machine summary content without a gold standard

(2) Louis, Annie a Nenkova, Ani a

a UNIVERSITY OF PENNSYLVANIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC EVALUATION; GOLD STANDARDS; HUMAN ASSESSMENT; HUMAN JUDGMENTS; HUMAN MODEL; PSEUDO-MODELS; SINGLE MODELS; SOURCE TEXT;

QUALITY CONTROL;

EID: 84877033915 PISSN: 08912017 EISSN: 15309312 Source Type: Journal
DOI: 10.1162/COLI_a_00123 Document Type: Article

Times cited : (170)

References (48)

1
- 49449097329
- Regression for sentence-level MT evaluation with pseudo references
- Prague
- Albrecht, Joshua and Rebecca Hwa. 2007. Regression for sentence-level MT evaluation with pseudo references. In Proceedings of ACL, pages 296-303, Prague.
- (2007) Proceedings of ACL , pp. 296-303
- Albrecht, J.¹ Hwa, R.²

2
- 85120067450
- The role of pseudo references in MT evaluation
- Columbus, OH
- Albrecht, Joshua and Rebecca Hwa. 2008. The role of pseudo references in MT evaluation. In Proceedings of the Third Workshop on Statistical Machine Translation, ACL, pages 187-190, Columbus, OH.
- (2008) Proceedings of the Third Workshop On Statistical Machine Translation, ACL , pp. 187-190
- Albrecht, J.¹ Hwa, R.²

3
- 0042005465
- Algorithm as 89: The upper tail probabilities of Spearman's rho
- Best, D. J. and D. E. Roberts. 1975. Algorithm as 89: The upper tail probabilities of Spearman's rho. Journal of the Royal Statistical Society. Series C (Applied Statistics), 24(3):377-379.
- (1975) Journal of the Royal Statistical Society. Series C (Applied Statistics) , vol.24 , Issue.3 , pp. 377-379
- Best, D.J.¹ Roberts, D.E.²

4
- 84926313951
- Findings of the 2010 joint workshop on statistical machine translation and metrics for machine translation
- Uppsala
- Callison-Burch, Chris, Philipp Koehn, Christof Monz, Kay Peterson, Mark Przybocki, and Omar Zaidan. 2010. Findings of the 2010 joint workshop on statistical machine translation and metrics for machine translation. In Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pages 17-53, Uppsala.
- (2010) Proceedings of the Joint Fifth Workshop On Statistical Machine Translation and MetricsMATR , pp. 17-53
- Callison-Burch, C.¹ Koehn, P.² Monz, C.³ Peterson, K.⁴ Przybocki, M.⁵ Zaidan, O.⁶

5
- 85122015378
- Findings of the 2011 workshop on statistical machine translation
- Edinburgh
- Callison-Burch, Chris, Philipp Koehn, Christof Monz, and Omar Zaidan. 2011. Findings of the 2011 workshop on statistical machine translation. In Proceedings of the Sixth Workshop on Statistical Machine Translation, pages 22-64, Edinburgh.
- (2011) Proceedings of the Sixth Workshop On Statistical Machine Translation , pp. 22-64
- Callison-Burch, C.¹ Koehn, P.² Monz, C.³ Zaidan, O.⁴

6
- 0032270694
- The use of MMR, diversity-based reranking for reordering documents and producing summaries
- Melbourne
- Carbonell, Jaime and Jade Goldstein. 1998. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of SIGIR, pages 335-336, Melbourne.
- (1998) Proceedings of SIGIR , pp. 335-336
- Carbonell, J.¹ Goldstein, J.²

7
- 33750366655
- Left-brain/right-brain multi-document summarization
- Boston, MA. Available at
- Conroy, John M., Jade Goldstein, Judith D. Schlesinger, and Dianne P. O'Leary. 2004. Left-brain/right-brain multi-document summarization. In Proceedings of the 4th Document Understanding Conference (DUC'04), Boston, MA. Available at: http://duc.nist.gov/pubs/2004papers/ida.conroy.ps.
- (2004) Proceedings of the 4th Document Understanding Conference (DUC'04)
- Conroy, J.M.¹ Goldstein, J.² Schlesinger, J.D.³ O'Leary, D.P.⁴

8
- 0034785142
- Text summarization via hidden Markov models
- New Orleans, LA
- Conroy, John M. and Dianne P. O'Leary. 2001. Text summarization via hidden Markov models. In Proceedings of SIGIR, pages 406-407, New Orleans, LA.
- (2001) Proceedings of SIGIR , pp. 406-407
- Conroy, J.M.¹ O'leary, D.P.²

9
- 84876786218
- Classy 2011 at TAC: Guided and multi-lingual summaries and evaluation metrics
- MD. Available at
- Conroy, John M., Judith D. Schlesinger, Jeff Kubina, Peter A. Rankel, and Dianne P. O'Leary. 2011. Classy 2011 at TAC: Guided and multi-lingual summaries and evaluation metrics. In Proceedings of TAC, Gaithersburg, MD. Available at: http://www.nist.gov/tac/publications/2011/participant.papers/CLASSY.proceedings.pdf.
- (2011) Proceedings of TAC, Gaithersburg
- Conroy, J.M.¹ Schlesinger, J.D.² Kubina, J.³ Rankel, P.A.⁴ O'Leary, D.P.⁵

10
- 85119957662
- Topic-focused multi-document summarization using an approximate oracle score
- Sydney
- Conroy, John M., Judith D. Schlesinger, and Dianne P. O'Leary. 2006. Topic-focused multi-document summarization using an approximate oracle score. In Proceedings of the COLING-ACL, pages 152-159, Sydney.
- (2006) Proceedings of the COLING-ACL , pp. 152-159
- Conroy, J.M.¹ Schlesinger, J.D.² O'Leary, D.P.³

11
- 84989525001
- Indexing by latent semantic analysis
- Deerwester, Scott, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, and Richard Harshman. 1990. Indexing by latent semantic analysis. Journal of the Americal Society for Information Science, 41(6):391-407.
- (1990) Journal of the Americal Society For Information Science , vol.41 , Issue.6 , pp. 391-407
- Deerwester, S.¹ Dumais, S.T.² Furnas, G.W.³ Landauer, T.K.⁴ Harshman, R.⁵

12
- 18744384440
- A comparison of rankings produced by summarization evaluation measures
- Seattle, WA
- Donaway, Robert L., Kevin W. Drummey, and Laura A. Mather. 2000. A comparison of rankings produced by summarization evaluation measures. In Proceedings of the NAACL-ANLP Workshop on Automatic Summarization, pages 69-78, Seattle, WA.
- (2000) Proceedings of the NAACL-ANLP Workshop On Automatic Summarization , pp. 69-78
- Donaway, R.L.¹ Drummey, K.W.² Mather, L.A.³

13
- 79952259163
- Lexpagerank: Prestige in multi-document text summarization
- Barcelona
- Erkan, Günȩ and Dragomir R. Radev. 2004. Lexpagerank: Prestige in multi-document text summarization. In Proceedings of EMNLP, pages 365-371, Barcelona.
- (2004) Proceedings of EMNLP , pp. 365-371
- Erkan, G.¹ Radev, D.R.²

14
- 85119267705
- A scalable global model for summarization
- Boulder, CO
- Gillick, Dan and Benoit Favre. 2009. A scalable global model for summarization. In Proceedings of the Workshop on Integer Linear Programming for Natural Language Processing, pages 10-18, Boulder, CO.
- (2009) Proceedings of the Workshop On Integer Linear Programming For Natural Language Processing , pp. 10-18
- Gillick, D.¹ Favre, B.²

15
- 85083134080
- Non-expert evaluation of summarization systems is risky
- Los Angeles, CA
- Gillick, Dan and Yang Liu. 2010. Non-expert evaluation of summarization systems is risky. In Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, pages 148-151, Los Angeles, CA.
- (2010) Proceedings of the NAACL HLT 2010 Workshop On Creating Speech and Language Data With Amazon's Mechanical Turk , pp. 148-151
- Gillick, D.¹ Liu, Y.²

16
- 0034795521
- Generic text summarization using relevance measure and latent semantic analysis
- New Orleans, LA
- Gong, Yihong and Xin Liu. 2001. Generic text summarization using relevance measure and latent semantic analysis. In Proceedings of SIGIR, pages 19-25, New Orleans, LA.
- (2001) Proceedings of SIGIR , pp. 19-25
- Gong, Y.¹ Liu, X.²

17
- 77956957790
- Exploring content models for multi-document summarization
- Boulder, CO
- Haghighi, Aria and Lucy Vanderwende. 2009. Exploring content models for multi-document summarization. In Proceedings of HLT-NAACL, pages 362-370, Boulder, CO.
- (2009) Proceedings of HLT-NAACL , pp. 362-370
- Haghighi, A.¹ Vanderwende, L.²

18
- 33745812461
- The effects of human variation in DUC summarization evaluation
- Barcelona
- Harman, Donna and Paul Over. 2004. The effects of human variation in DUC summarization evaluation. In Proceedings of the ACL-04 Workshop: Text Summarization Branches Out, pages 10-17, Barcelona.
- (2004) Proceedings of the ACL-04 Workshop: Text Summarization Branches Out , pp. 10-17
- Harman, D.¹ Over, P.²

19
- 0002854749
- Summarization evaluation methods: Experiments and analysis
- Palo Alto, CA
- Jing, Hongyan, Regina Barzilay, Kathleen Mckeown, and Michael Elhadad. 1998. Summarization evaluation methods: Experiments and analysis. In AAAI Symposium on Intelligent Summarization, pages 60-68, Palo Alto, CA.
- (1998) AAAI Symposium On Intelligent Summarization , pp. 60-68
- Jing, H.¹ Barzilay, R.² McKeown, K.³ Elhadad, M.⁴

20
- 85117703506
- Minimum Bayes-risk decoding for statistical machine translation
- Boston, MA
- Kumar, Shankar and William Byrne. 2004. Minimum Bayes-risk decoding for statistical machine translation. In Proceedings of HLT-NAACL, pages 169-176, Boston, MA.
- (2004) Proceedings of HLT-NAACL , pp. 169-176
- Kumar, S.¹ Byrne, W.²

21
- 34547487207
- Looking for a few good metrics: Automatic summarization evaluation-how many samples are enough
- Tokyo
- Lin, Chin-Yew. 2004a. Looking for a few good metrics: Automatic summarization evaluation-how many samples are enough. In Proceedings of the NTCIR Workshop, volume 4, pages 1-10, Tokyo.
- (2004) Proceedings of the NTCIR Workshop , vol.4 , pp. 1-10
- Lin, C.-Y.¹

22
- 26944501715
- ROUGE: A package for automatic evaluation of summaries
- Barcelona
- Lin, Chin-Yew. 2004b. ROUGE: A package for automatic evaluation of summaries. In Proceedings of the ACL Text Summarization Workshop, pages 74-81, Barcelona.
- (2004) Proceedings of the ACL Text Summarization Workshop , pp. 74-81
- Lin, C.-Y.¹

23
- 84863337556
- An information-theoretic approach to automatic evaluation of summaries
- New York, NY
- Lin, Chin-Yew, Guihong Cao, Jianfeng Gao, and Jian-Yun Nie. 2006. An information-theoretic approach to automatic evaluation of summaries. In Proceedings of HLT-NAACL, pages 463-470, New York, NY.
- (2006) Proceedings of HLT-NAACL , pp. 463-470
- Lin, C.-Y.¹ Cao, G.² Gao, J.³ Nie, J.-Y.⁴

24
- 0038375892
- The automated acquisition of topic signatures for text summarization
- Saarbrücken
- Lin, Chin-Yew and Eduard Hovy. 2000. The automated acquisition of topic signatures for text summarization. In Proceedings of COLING, pages 495-501, Saarbrücken.
- (2000) Proceedings of COLING , pp. 495-501
- Lin, C.-Y.¹ Hovy, E.²

25
- 85016508365
- Automatic evaluation of summaries using n-gram co-occurrence statistics
- Edmonton
- Lin, Chin-Yew and Eduard Hovy. 2003. Automatic evaluation of summaries using n-gram co-occurrence statistics. In Proceedings of HLT-NAACL, pages 71-78, Edmonton.
- (2003) Proceedings of HLT-NAACL , pp. 71-78
- Lin, C.-Y.¹ Hovy, E.²

26
- 84926402338
- Automatic summary evaluation without human models
- Gaithersburg, MD. Available at:
- Louis, Annie and Ani Nenkova. 2008. Automatic summary evaluation without human models. In Proceedings of TAC, Gaithersburg, MD. Available at: http://www.nist.gov/tac/publications/2008/additional.papers/Penn.proceedings.pdf.
- (2008) Proceedings of TAC
- Louis, A.¹ Nenkova, A.²

27
- 78650467116
- Automatically evaluating content selection in summarization without human models
- Singapore
- Louis, Annie and Ani Nenkova. 2009a. Automatically evaluating content selection in summarization without human models. In Proceedings of EMNLP, pages 306-314, Singapore.
- (2009) Proceedings of EMNLP , pp. 306-314
- Louis, A.¹ Nenkova, A.²

28
- 77955311450
- Performance confidence estimation for automatic summarization
- Athens
- Louis, Annie and Ani Nenkova. 2009b. Performance confidence estimation for automatic summarization. In Proceedings of EACL, pages 541-548, Athens.
- (2009) Proceedings of EACL , pp. 541-548
- Louis, A.¹ Nenkova, A.²

29
- 84877057710
- Predicting summary quality using limited human input
- Gaithersburg, MD. Available at
- Louis, Annie and Ani Nenkova. 2009c. Predicting summary quality using limited human input. In Proceedings of TAC, Gaithersburg, MD. Available at: http://www.nist.gov/tac/publications/2009/participant.papers/UPenn.proceedings.pdf.
- (2009) Proceedings of TAC
- Louis, A.¹ Nenkova, A.²

30
- 84880192089
- Using paraphrases for parameter tuning in statistical machine translation
- Prague
- Madnani, Nitin, Necip Fazil Ayan, Philip Resnik, and Bonnie J. Dorr. 2007. Using paraphrases for parameter tuning in statistical machine translation. In Proceedings of the Second Workshop on Statistical Machine Translation, pages 120-127, Prague.
- (2007) Proceedings of the Second Workshop On Statistical Machine Translation , pp. 120-127
- Madnani, N.¹ Ayan, N.F.² Resnik, P.³ Dorr, B.J.⁴

31
- 37149002864
- A study of global inference algorithms in multi-document summarization
- Rome
- McDonald, Ryan. 2007. A study of global inference algorithms in multi-document summarization. In Proceedings of ECIR, pages 557-564, Rome.
- (2007) Proceedings of ECIR , pp. 557-564
- McDonald, R.¹

32
- 14844364264
- Columbia multi-document summarization: Approach and evaluation
- New Orleans, LA. Available at
- McKeown, Kathy, Regina Barzilay, David Evans, Vasileios Hatzivassiloglou, Barry Schiffman, and Simone Teufel. 2001. Columbia multi-document summarization: Approach and evaluation. In Proceedings of DUC, New Orleans, LA. Available at: http://www-nlpir.nist.gov/projects/duc/pubs/2001papers/columbiaredo.pdf.
- (2001) Proceedings of DUC
- McKeown, K.¹ Barzilay, R.² Evans, D.³ Hatzivassiloglou, V.⁴ Schiffman, B.⁵ Teufel, S.⁶

33
- 54249136128
- Multi-document summarization with iterative graph-based algorithms
- McLean, VA
- Mihalcea, Rada and Paul Tarau. 2005. Multi-document summarization with iterative graph-based algorithms. In Proceedings of the First International Conference on Intelligent Analysis Methods and Tools (IA 2005), McLean, VA.
- (2005) Proceedings of the First International Conference On Intelligent Analysis Methods and Tools (IA 2005)
- Mihalcea, R.¹ Tarau, P.²

34
- 84859899789
- Can you summarize this? Identifying correlates of input difficulty for multi-document summarization
- Columbus, OH
- Nenkova, Ani and Annie Louis. 2008. Can you summarize this? Identifying correlates of input difficulty for multi-document summarization. In Proceedings of ACL-HLT, pages 825-833, Columbus, OH.
- (2008) Proceedings of ACL-HLT , pp. 825-833
- Nenkova, A.¹ Louis, A.²

35
- 85013202438
- Evaluating content selection in summarization: The pyramid method
- Boston, MA
- Nenkova, Ani and Rebecca Passonneau. 2004. Evaluating content selection in summarization: The pyramid method. In Proceedings of HLT-NAACL, pages 145-152, Boston, MA.
- (2004) Proceedings of HLT-NAACL , pp. 145-152
- Nenkova, A.¹ Passonneau, R.²

36
- 34249275304
- The pyramid method: Incorporating human content selection variation in summarization evaluation
- Nenkova, Ani, Rebecca Passonneau, and Kathleen McKeown. 2007. The pyramid method: Incorporating human content selection variation in summarization evaluation. ACM Transactions on Speech and Language Processing, 4(2):4.
- (2007) ACM Transactions On Speech and Language Processing , vol.4 , Issue.2 , pp. 4
- Nenkova, A.¹ Passonneau, R.² McKeown, K.³

37
- 33750346745
- A compositional context sensitive multi-document summarizer: Exploring the factors that influence summarization
- Seattle, WA
- Nenkova, Ani, Lucy Vanderwende, and Kathleen McKeown. 2006. A compositional context sensitive multi-document summarizer: Exploring the factors that influence summarization. In Proceedings of SIGIR, pages 573-580, Seattle, WA.
- (2006) Proceedings of SIGIR , pp. 573-580
- Nenkova, A.¹ Vanderwende, L.² McKeown, K.³

38
- 80053407364
- Evaluation of automatic summaries: Metrics under varying data conditions
- Singapore
- Owkzarzak, Karolina and Hoa Trang Dang. 2009. Evaluation of automatic summaries: Metrics under varying data conditions. In Proceedings of the Workshop on Language Generation and Summarisation, pages 23-30, Singapore.
- (2009) Proceedings of the Workshop On Language Generation and Summarisation , pp. 23-30
- Owkzarzak, K.¹ Dang, H.T.²

39
- 84907095419
- R: A Language and Environment For Statistical Computing
- R Development Core Team, Vienna
- R Development Core Team. 2011. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna.
- (2011) R Foundation for Statistical Computing

40
- 33751356877
- MEAD-A platform for multidocument multilingual text summarization
- Lisbon
- Radev, Dragomir, Timothy Allison, Sasha Blair-Goldensohn, John Blitzer, Arda Çelebi, Stanko Dimitrov, Elliott Drabek, Ali Hakim, Wai Lam, Danyu Liu, Jahna Otterbacher, Hong Qi, Horacio Saggion, Simone Teufel, Michael Topper, Adam Winkel, and Zhu Zhang. 2004a. MEAD-A platform for multidocument multilingual text summarization. In Proceedings of LREC 2004, pages 1-4, Lisbon.
- (2004) Proceedings of LREC 2004 , pp. 1-4
- Radev, D.¹ Allison, T.² Blair-Goldensohn, S.³ Blitzer, J.⁴ Çelebi, A.⁵ Dimitrov, S.⁶ Drabek, E.⁷ Hakim, A.⁸ Lam, W.⁹ Liu, D.¹⁰ Otterbacher, J.¹¹ Qi, H.¹² Saggion, H.¹³ Teufel, S.¹⁴ Topper, M.¹⁵ Winkel, A.¹⁶ Zhang, Z.¹⁷

41
- 4243064897
- Centroid-based summarization of multiple documents
- Radev, Dragomir, Hongyan Jing, Malgorzata Sty, and Daniel Tam. 2004b. Centroid-based summarization of multiple documents. Information Processing and Management, 40:919-938.
- (2004) Information Processing and Management , vol.40 , pp. 919-938
- Radev, D.¹ Jing, H.² Sty, M.³ Tam, D.⁴

42
- 18744387235
- Single-document and multi-document summary evaluation via relative utility
- New Orleans, LA
- Radev, Dragomir and Daniel Tam. 2003. Single-document and multi-document summary evaluation via relative utility. In Proceedings of CIKM, pages 508-511, New Orleans, LA.
- (2003) Proceedings of CIKM , pp. 508-511
- Radev, D.¹ Tam, D.²

43
- 0003172123
- The formation of abstracts by the selection of sentences: Part 1: Sentence selection by man and machines
- Rath, G. J., A. Resnick, and R. Savage. 1961. The formation of abstracts by the selection of sentences: Part 1: Sentence selection by man and machines. American Documentation, 2(12):139-208.
- (1961) American Documentation , vol.2 , Issue.12 , pp. 139-208
- Rath, G.J.¹ Resnick, A.² Savage, R.³

44
- 80053423337
- Multilingual summarization evaluation without human models
- Beijing
- Saggion, Horacio, Juan-Manuel, Torres Moreno, Iria da Cunha, Eric SanJuan, and Patricia Velazquez-Morales. 2010. Multilingual summarization evaluation without human models. In Proceedings of COLING, pages 1059-1067, Beijing.
- (2010) Proceedings of COLING , pp. 1059-1067
- Saggion, H.¹ Juan-Manuel, T.M.² da Cunha, I.³ Sanjuan, E.⁴ Velazquez-Morales, P.⁵

45
- 0034790621
- Ranking retrieval systems without relevance judgments
- New Orleans, LA
- Soboroff, Ian, Charles Nicholas, and Patrick Cahan. 2001. Ranking retrieval systems without relevance judgments. In Proceedings of SIGIR, pages 66-73, New Orleans, LA.
- (2001) Proceedings of SIGIR , pp. 66-73
- Soboroff, I.¹ Nicholas, C.² Cahan, P.³

46
- 80053375451
- Lattice minimum Bayes-risk decoding for statistical machine translation
- Honolulu, HI
- Tromble, Roy W., Shankar Kumar, Franz Och, and Wolfgang Macherey. 2008. Lattice minimum Bayes-risk decoding for statistical machine translation. In Proceedings of EMNLP, pages 620-629, Honolulu, HI.
- (2008) Proceedings of EMNLP , pp. 620-629
- Tromble, R.W.¹ Kumar, S.² Och, F.³ Macherey, W.⁴

47
- 23044445534
- Examining the consensus between human summaries: Initial experiments with factoid analysis
- Edmonton
- van Halteren Hans and Simone Teufel. 2003. Examining the consensus between human summaries: Initial experiments with factoid analysis. In Proceedings of the HLT-NAACL DUC on Text Summarization Workshop, pages 57-64, Edmonton.
- (2003) Proceedings of the HLT-NAACL DUC On Text Summarization Workshop , pp. 57-64
- van Hans, H.¹ Teufel, S.²

48
- 78650039224
- Significance tests of automatic machine translation evaluation metrics
- Zhang, Ying and Stephan Vogel. 2010. Significance tests of automatic machine translation evaluation metrics. Machine Translation, 24(1):51-65.
- (2010) Machine Translation , vol.24 , Issue.1 , pp. 51-65
- Zhang, Y.¹ Vogel, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.