Volume 2, 2015, Pages 445-450

ΔBLEU: A discriminative metric for generation tasks with intrinsically diverse targets

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; LINGUISTICS; SCALES (WEIGHING INSTRUMENTS)

EID: 84944036795     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 116

References (19)
  • 2
    • 84893361786
    • Re-evaluating the role of BLEU in machine translation research
    • Chris Callison-Burch, Miles Osborne, and Philipp Koehn. 2006. Re-evaluating the role of BLEU in machine translation research. In EACL, pages 249-256
    • (2006) EACL , pp. 249-256
    • Callison-Burch, C.1    Osborne, M.2    Koehn, P.3
  • 3
    • 33748682610
    • Correlating automated and human assessments of machine translation quality
    • Deborah Coughlin. 2003. Correlating automated and human assessments of machine translation quality. In Proc. of MT Summit IX, pages 63-70
    • (2003) Proc. of MT Summit IX , pp. 63-70
    • Coughlin, D.1
  • 4
    • 15744403523
    • Automatic evaluation of machine translation quality using n-gram co-occurrence statistics
    • George Doddington. 2002. Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In Proc. of HLT, pages 138-145
    • (2002) Proc. of HLT , pp. 138-145
    • Doddington, G.1
  • 5
    • 84884925008
    • HyTER: Meaning-equivalent semantics for translation evaluation
    • Markus Dreyer and Daniel Marcu. 2012. HyTER: Meaning-equivalent semantics for translation evaluation. In Proc. of HLT-NAACL, pages 162-171
    • (2012) Proc. of HLT-NAACL , pp. 162-171
    • Dreyer, M.1    Marcu, D.2
  • 6
    • 84926052089
    • Testing for significance of increased correlation with human judgment
    • Yvette Graham and Timothy Baldwin. 2014. Testing for significance of increased correlation with human judgment. In Proc. of EMNLP, pages 172-176
    • (2014) Proc. of EMNLP , pp. 172-176
    • Graham, Y.1    Baldwin, T.2
  • 7
    • 84944091060
    • Accurate evaluation of segment-level machine translation metrics
    • Yvette Graham, Timothy Baldwin, and Nitika Mathur. 2015. Accurate evaluation of segment-level machine translation metrics. In Proc. of NAACL-HLT, pages 1183-1191
    • (2015) Proc. of NAACL-HLT , pp. 1183-1191
    • Graham, Y.1    Baldwin, T.2    Mathur, N.3
  • 8
    • 84883394520
    • Framing image description as a ranking task: Data, models and evaluation metrics
    • Micah Hodosh, Peter Young, and Julia Hockenmaier. 2013. Framing image description as a ranking task: Data, models and evaluation metrics. J. Artif. Int. Res., 47(1):853-899
    • (2013) J. Artif. Int. Res. , vol.47 , Issue.1 , pp. 853-899
    • Hodosh, M.1    Young, P.2    Hockenmaier, J.3
  • 10
    • 85120046073
    • METEOR: An automatic metric for MT evaluation with high levels of correlation with human judgments
    • Alon Lavie and Abhaya Agarwal. 2007. METEOR: An automatic metric for MT evaluation with high levels of correlation with human judgments. In Proc. of the Workshop on Statistical Machine Translation (StatMT), pages 228-231
    • (2007) Proc. of the Workshop on Statistical Machine Translation (StatMT) , pp. 228-231
    • Lavie, A.1    Agarwal, A.2
  • 11
    • 84876788727
    • Optimizing for sentence-level BLEU+1 yields short translations
    • Preslav Nakov, Francisco Guzman, and Stephan Vogel. 2012. Optimizing for Sentence-Level BLEU+1 Yields Short Translations. In Proc. of COLING
    • (2012) Proc. of COLING
    • Nakov, P.1    Guzman, F.2    Vogel, S.3
  • 12
    • 22944447077
    • Minimum error rate training in statistical machine translation
    • Franz Josef Och. 2003. Minimum error rate training in statistical machine translation. In Proc. of ACL, pages 160-167
    • (2003) Proc. of ACL , pp. 160-167
    • Och, F.J.1
  • 13
    • 85133336275
    • Bleu: A method for automatic evaluation of machine translation
    • Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. Bleu: a method for automatic evaluation of machine translation. In Proc. of ACL, pages 311-318
    • (2002) Proc. of ACL , pp. 311-318
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 14
    • 77954762161
    • Official results of the NIST 2008 "Metrics for MAchine TRanslation" challenge (MetricsMATR08)
    • M. Przybocki, K. Peterson, and S. Bronsart. 2008. Official results of the NIST 2008 "Metrics for MAchine TRanslation" challenge (MetricsMATR08). http://nist.gov/speech/tests/metricsmatr/2008/results/
    • (2008) Metrics for MAchine TRanslation
    • Przybocki, M.1    Peterson, K.2    Bronsart, S.3
  • 15
    • 80053292690
    • Data-driven response generation in social media
    • Alan Ritter, Colin Cherry, and William B. Dolan. 2011. Data-driven response generation in social media. In Proc. of EMNLP, pages 583-593
    • (2011) Proc. of EMNLP , pp. 583-593
    • Ritter, A.1    Cherry, C.2    Dolan, W.B.3
  • 18
    • 84878177385
    • Joint learning of a dual SMT system for paraphrase generation
    • Hong Sun and Ming Zhou. 2012. Joint learning of a dual SMT system for paraphrase generation. In ACL, pages 38-42
    • (2012) ACL , pp. 38-42
    • Sun, H.1    Zhou, M.2
  • 19
    • 84956980995
    • CIDEr: Consensus-based image description evaluation
    • Ramakrishna Vedantam, C. Lawrence Zitnick, and Devi Parikh. 2015. CIDEr: Consensus-based image description evaluation. In CVPR
    • (2015) CVPR
    • Vedantam, R.1    Zitnick, C.L.2    Parikh, D.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.