-
1
-
-
84893365704
-
Comparing automatic and human evaluation of NLG systems
-
Trento, Italy
-
Anja Belz and Ehud Reiter. 2006. Comparing automatic and human evaluation of NLG systems. In Proceedings of EACL 2006, pages 313-320, Trento, Italy.
-
(2006)
Proceedings of EACL 2006
, pp. 313-320
-
-
Belz, A.1
Reiter, E.2
-
2
-
-
84859893222
-
Human evaluation of a German surface realisation ranker
-
Athens, Greece, March
-
Aoife Cahill and Martin Forst. 2009. Human Evaluation of a German Surface Realisation Ranker. In Proceedings of EACL 2009, pages 112-120, Athens, Greece, March.
-
(2009)
Proceedings of EACL 2009
, pp. 112-120
-
-
Cahill, A.1
Forst, M.2
-
3
-
-
77956315385
-
Stochastic realisation ranking for a free word order language
-
Saarbrücken, Germany, June
-
Aoife Cahill, Martin Forst, and Christian Rohrer. 2007. Stochastic Realisation Ranking for a Free Word Order Language. In Proceedings of ENLG-07, pages 17-24, Saarbrücken, Germany, June.
-
(2007)
Proceedings of ENLG-07
, pp. 17-24
-
-
Cahill, A.1
Forst, M.2
Rohrer, C.3
-
4
-
-
28944439707
-
A comparison of evaluation metrics for a broad coverage parser
-
Las Palmas, Spain
-
Richard Crouch, Ron Kaplan, Tracy Holloway King, and Stefan Riezler. 2002. A comparison of evaluation metrics for a broad coverage parser. In Proceedings of the LREC Workshop: Beyond PARSEVAL, pages 67-74, Las Palmas, Spain.
-
(2002)
Proceedings of the LREC Workshop: Beyond PARSEVAL
, pp. 67-74
-
-
Crouch, R.1
Kaplan, R.2
King, T.H.3
Riezler, S.4
-
5
-
-
77956312829
-
A dependency-driven parser for German dependency and constituency representations
-
Columbus, Ohio, June
-
Johan Hall and Joakim Nivre. 2008. A dependency-driven parser for German dependency and constituency representations. In Proceedings of the Workshop on Parsing German, pages 47-54, Columbus, Ohio, June.
-
(2008)
Proceedings of the Workshop on Parsing German
, pp. 47-54
-
-
Hall, J.1
Nivre, J.2
-
7
-
-
26944501715
-
Rouge: A package for automatic evaluation of summaries
-
Barcelona, Spain, July
-
Chin-Yew Lin. 2004. Rouge: A package for automatic evaluation of summaries. In Stan Szpakowicz and Marie-Francine Moens, editors, Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, pages 74-81, Barcelona, Spain, July.
-
(2004)
Text Summarization Branches Out: Proceedings of the ACL-04 Workshop
, pp. 74-81
-
-
Lin, C.-Y.1
-
8
-
-
85118650171
-
Precision and recall of machine translation
-
NJ, USA
-
I. Dan Melamed, Ryan Green, and Joseph P. Turian. 2003. Precision and recall of machine translation. In Proceedings of NAACL-03, pages 61-63, NJ, USA.
-
(2003)
Proceedings of NAACL-03
, pp. 61-63
-
-
Melamed, I.D.1
Green, R.2
Turian, J.P.3
-
9
-
-
49549118420
-
Evaluating machine translation with LFG dependencies
-
Karolina Owczarzak, Josef van Genabith, and Andy Way. 2008. Evaluating machine translation with LFG dependencies. Machine Translation, 21:95-119.
-
(2008)
Machine Translation
, vol.21
, pp. 95-119
-
-
Owczarzak, K.1
Van Genabith, J.2
Way, A.3
-
10
-
-
77956310517
-
DEPEVAL(summ): Dependency-based evaluation for automatic summaries
-
Singapore
-
Karolina Owczarzak. 2009. DEPEVAL(summ): Dependency-based Evaluation for Automatic Summaries. In Proceedings of ACL-IJCNLP 2009, Singapore.
-
(2009)
Proceedings of ACL-IJCNLP 2009
-
-
Owczarzak, K.1
-
11
-
-
85133336275
-
Bleu: A method for automatic evaluation of machine translation
-
NJ, USA
-
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2001. Bleu: a method for automatic evaluation of machine translation. In Proceedings of ACL-02, pages 311-318, NJ, USA.
-
(2001)
Proceedings of ACL-02
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.-J.4
-
12
-
-
71749094730
-
An investigation into the validity of some metrics for automatically evaluating natural language generation systems
-
Ehud Reiter and Anja Belz. 2009. An Investigation into the Validity of Some Metrics for Automatically Evaluating Natural Language Generation Systems. Computational Linguistics, 35.
-
(2009)
Computational Linguistics
, vol.35
-
-
Reiter, E.1
Belz, A.2
-
13
-
-
84893320674
-
Improving coverage and parsing quality of a large-scale LFG for German
-
Genoa, Italy
-
Christian Rohrer and Martin Forst. 2006. Improving Coverage and Parsing Quality of a Large-Scale LFG for German. In Proceedings of LREC 2006, Genoa, Italy.
-
(2006)
Proceedings of LREC 2006
-
-
Rohrer, C.1
Forst, M.2
-
14
-
-
84857522507
-
A study of translation error rate with targeted human annotation
-
Matthew Snover, Bonnie Dorr, Richard Schwartz, Linnea Micciulla, and Ralph Weischedel. 2006. A study of translation error rate with targeted human annotation. In Proceedings of AMTA 2006, pages 223-231.
-
(2006)
Proceedings of AMTA 2006
, pp. 223-231
-
-
Snover, M.1
Dorr, B.2
Schwartz, R.3
Micciulla, L.4
Weischedel, R.5
-
15
-
-
24344465910
-
Evaluating evaluation methods for generation in the presence of variation
-
Amanda Stent, Matthew Marge, and Mohit Singhai. 2005. Evaluating evaluation methods for generation in the presence of variation. In Proceedings of CICLING, pages 341-351.
-
(2005)
Proceedings of CICLING
, pp. 341-351
-
-
Stent, A.1
Marge, M.2
Singhai, M.3