-
2
-
-
84893361786
-
Re-evaluating the role of bleu in machine translation research
-
Chris Callison-Burch, Miles Osborne, and Philipp Koehn. 2006. Re-evaluating the role of bleu in machine translation research. In EACL, pages 249-256
-
(2006)
EACL
, pp. 249-256
-
-
Callison-Burch, C.1
Osborne, M.2
Koehn, P.3
-
3
-
-
33748682610
-
Correlating automated and human assessments of machine translation quality
-
Deborah Coughlin. 2003. Correlating automated and human assessments of machine translation quality. In Proc. ofMT Summit IX, pages 63-70
-
(2003)
Proc. OfMT Summit IX
, pp. 63-70
-
-
Coughlin, D.1
-
4
-
-
15744403523
-
Automatic evaluation of machine translation quality using n-gram co-occurrence statistics
-
George Doddington. 2002. Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In Proc. of HLT, pages 138-145
-
(2002)
Proc. of HLT
, pp. 138-145
-
-
Doddington, G.1
-
5
-
-
84884925008
-
HyTER: Meaning-equivalent semantics for translation evalu-ation
-
Markus Dreyer and Daniel Marcu. 2012. HyTER: Meaning-equivalent semantics for translation evalu-ation. In Proc. of HLT-NAACL, pages 162-171
-
(2012)
Proc. of HLT-NAACL
, pp. 162-171
-
-
Dreyer, M.1
Marcu, D.2
-
6
-
-
84926052089
-
Testing for significance of increased correlation with human judgment
-
Yvette Graham and Timothy Baldwin. 2014. Testing for significance of increased correlation with human judgment. In Proc. of EMNLP, pages 172-176
-
(2014)
Proc. of EMNLP
, pp. 172-176
-
-
Graham, Y.1
Baldwin, T.2
-
7
-
-
84944091060
-
Accurate evaluation of segment-level ma-chine translation metrics
-
Yvette Graham, Timothy Baldwin, and Nitika Mathur. 2015. Accurate evaluation of segment-level ma-chine translation metrics. In Proc. of NAACL-HLT, pages 1183-1191
-
(2015)
Proc. of NAACL-HLT
, pp. 1183-1191
-
-
Graham, Y.1
Baldwin, T.2
Mathur, N.3
-
8
-
-
84883394520
-
Framing image description as a ranking task: Data, models and evaluation metrics
-
Micah Hodosh, Peter Young, and Mia Hockenmaier. 2013. Framing image description as a ranking task: Data, models and evaluation metrics. J. Artif. Int. Res., 47(1):853-899
-
(2013)
J. Artif. Int. Res.
, vol.47
, Issue.1
, pp. 853-899
-
-
Hodosh, M.1
Young, P.2
Hockenmaier, M.3
-
9
-
-
84889566627
-
Learning deep structured semantic models for web search using clickthrough data
-
Po-Sen Huang, Xiaodong He, Lianfeng Gao, Li Deng, Alex Acero, and Larry Heck. 2013. Learning deep structured semantic models for web search using clickthrough data. In Proc. of the 22nd ACM Inter-national Conference on Information & Knowledge Management, pages 2333-2338
-
(2013)
Proc. of the 22nd ACM Inter-national Conference on Information & Knowledge Management
, pp. 2333-2338
-
-
Huang, P.1
He, X.2
Gao, L.3
Deng, L.4
Acero, A.5
Heck, L.6
-
10
-
-
85120046073
-
Meteor: An automatic metric for mt evaluation with high lev-els of correlation with human judgments
-
Alon Lavie and Abhaya Agarwal. 2007. meteor: An automatic metric for mt evaluation with high lev-els of correlation with human judgments. In Proc. of the Workshop on Statistical Machine Translation (StatMT), pages 228-231
-
(2007)
Proc. of the Workshop on Statistical Machine Translation (StatMT)
, pp. 228-231
-
-
Lavie, A.1
Agarwal, A.2
-
11
-
-
84876788727
-
Optimizing for sentence-level BLEU+1 yields short translations
-
Preslav Nakov, Francisco Guzman, and Stephan Vo-gel. 2012. Optimizing for Sentence-Level BLEU+1 Yields Short Translations. In Proc. of COLING
-
(2012)
Proc. of COLING
-
-
Nakov, P.1
Guzman, F.2
Vo-Gel, S.3
-
12
-
-
22944447077
-
Minimum error rate training in statistical machine translation
-
Franz Losef Och. 2003. Minimum error rate training in statistical machine translation. In Proc. of ACL, pages 160-167
-
(2003)
Proc. of ACL
, pp. 160-167
-
-
Losef Och, F.1
-
13
-
-
85133336275
-
Bleu: A method for automatic evaluation of machine translation
-
Kishore Papineni, Salim Roukos, Todd Ward, and Weiling Zhu. 2002. Bleu: a method for automatic evaluation of machine translation. In Proc. of ACL, pages 311-318
-
(2002)
Proc. of ACL
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.4
-
15
-
-
80053292690
-
Data-driven response generation in social media
-
Alan Ritter, Colin Cherry, and William B. Dolan. 2011. Data-driven response generation in social media. In Proc. of EMNLP, pages 583-593
-
(2011)
Proc. of EMNLP
, pp. 583-593
-
-
Ritter, A.1
Cherry, C.2
Dolan, W.B.3
-
17
-
-
84960121226
-
A neural network approach to context-sensitive generation of conversational responses
-
Alessandro Sordoni, Michel Galley, Michael Auli, Chris Brockett, Yangfeng Li, Meg Mitchell, Lian-Yun Nie, Lianfeng Gao, and Bill Dolan. 2015. A neural network approach to context-sensitive generation of conversational responses. In Proc. of NAACL-HLT
-
(2015)
Proc. of NAACL-HLT
-
-
Sordoni, A.1
Galley, M.2
Auli, M.3
Brockett, C.4
Li, Y.5
Mitchell, M.6
Nie, L.7
Gao, L.8
Dolan, B.9
-
18
-
-
84878177385
-
Loint learning of a dual SMT system for paraphrase generation
-
Hong Sun and Ming Zhou. 2012. loint learning of a dual SMT system for paraphrase generation. In ACL, pages 38-12
-
(2012)
ACL
, pp. 38-12
-
-
Sun, H.1
Zhou, M.2
-
19
-
-
84956980995
-
CIDEr: Consensus-based image description evaluation
-
Ramakrishna Vedantam, C. Lawrence Zitnick, and Devi Parikh. 2015. CIDEr: Consensus-based image description evaluation. In CVPR
-
(2015)
CVPR
-
-
Vedantam, R.1
Lawrence Zitnick, C.2
Parikh, D.3
|