Volume 2013-August, 2013, Pages 121-126

DKPro similarity: An open source framework for text similarity

Author keywords

[No Author keywords available]

Indexed keywords

COMMON SUBSEQUENCE; DIMENSIONAL VECTORS; HIGH-DIMENSIONAL; HIGHER-DIMENSIONAL; N-GRAMS; OPEN SOURCE FRAMEWORKS; SIMILARITY MEASURE; SIMPLE++; STANDARDIZED INTERFACES; TEXT SIMILARITY;

EID: 84960103968; PISSN: 0736587X; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper
Times cited: 61

References (28)
  • 9
    • 7444236194
    • David Ferrucci and Adam Lally. 2004. UIMA: An Architectural Approach to Unstructured Information Processing in the Corporate Research Environment. Natural Language Engineering, 10(3-4):327–348.
  • 10
    • 84880915872
    • Evgeniy Gabrilovich and Shaul Markovitch. 2007. Computing Semantic Relatedness using Wikipedia-based Explicit Semantic Analysis. In Proceedings of IJCAI, pages 1606–1611, Hyderabad, India.
  • 13
    • 84863347445
    • Vasileios Hatzivassiloglou, Judith L. Klavans, and Eleazar Eskin. 1999. Detecting text similarity over short passages: Exploring linguistic feature combinations via machine learning. In Proceedings of EMNLP/VLC, pages 203–212.
  • 14
    • 85087282802
    • Jay J. Jiang and David W. Conrath. 1997. Semantic similarity based on corpus statistics and lexical taxonomy. In Proceedings of ROCLING, pages 19–33.
  • 18
    • 0001116877
    • Vladimir I. Levenshtein. 1966. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady, 10(8):707–710.
  • 19
    • 77955897943
    • Philip M. McCarthy and Scott Jarvis. 2010. MTLD, vocd-D, and HD-D: A validation study of sophisticated approaches to lexical diversity assessment. Behavior Research Methods, 42(2):381–392.
  • 20
    • 33750693384
    • Rada Mihalcea, Courtney Corley, and Carlo Strapparava. 2006. Corpus-based and Knowledge-based Measures of Text Semantic Similarity. In Proceedings of AAAI-06, pages 775–780, Boston, MA, USA.
  • 23
    • 13344267227
    • Lawrence Philips. 2000. The double metaphone search algorithm. C/C++ Users Journal, 18(6):38–43.
  • 24
    • 0003033112
    • Philip Resnik. 1995. Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In Proceedings of the IJCAI, pages 448–453.
  • 26
    • 79952051431
    • Dominic Widdows and Trevor Cohen. 2010. The Semantic Vectors Package: New Algorithms and Public Tools for Distributional Semantics. In Proceedings of IEEE-ICSC, pages 9–15.
  • 27
    • 0008976521
    • William E. Winkler. 1990. String Comparator Metrics and Enhanced Decision Rules in the Fellegi-Sunter Model of Record Linkage. In Proceedings of the Survey Research Methods Section, pages 354–359.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.