Volume 11, Issue 2, 2012, Pages 180-199

Baselines for natural language processing tasks based on soft cardinality spectra

Author keywords

Entity resolution; G Grams; Information retrieval; NLP baselines; Paraphrase recognition; SC spectra; Soft cardinality; Text similarity; Textual entailment recognition

EID: 84865216291     PISSN: 16833511     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Conference Paper
Times cited : (7)

References (45)
  • 2
    • 44949156668
    • Using machine translation evaluation techniques to determine sentence-level semantic equivalence
    • Jeju Island, Korea
    • Andrew Finch, Young-Sook Hwang, and Eiichiro Sumita. Using machine translation evaluation techniques to determine sentence-level semantic equivalence, In Proceedings of the 3rd Int. Workshop on Paraphrasing, Jeju Island, Korea, 2005.
    • (2005) Proceedings of the 3rd Int. Workshop on Paraphrasing
    • Finch, A.1    Hwang, Y.-S.2    Sumita, E.3
  • 3
    • 85105937549
    • Unsupervised construction of large paraphrase corpora: Exploiting massively parallel news sources
    • Stroudsburg, PA, USA. Association for Computational Linguistics
    • Bill Dolan, Chris Quirk, and Chris Brockett. Unsupervised construction of large paraphrase corpora: exploiting massively parallel news sources, In Proceedings of the 20th international conference on Computational Linguistics, COLING '04, Stroudsburg, PA, USA, 2004. Association for Computational Linguistics.
    • (2004) Proceedings of the 20th International Conference on Computational Linguistics, COLING '04
    • Dolan, B.1    Quirk, C.2    Brockett, C.3
  • 4
    • 0036358995
    • The spectrum kernel: A string kernel for SVM protein classification
    • Kauai, Hawaii, USA
    • Christina Leslie, Eleazar Eskin, and William Stafford Noble. The spectrum kernel: A string kernel for SVM protein classification, In Biocomputing 2002 - Proceedings of the Pacific Symposium, Kauai, Hawaii, USA, 2001, pp.564-575.
    • (2001) Biocomputing 2002 - Proceedings of the Pacific Symposium , pp. 564-575
    • Leslie, C.1    Eskin, E.2    Noble, W.S.3
  • 10
    • 80052900431
    • Robust similarity measures for named entities matching
    • Manchester, United Kingdom, Association for Computational Linguistics
    • Erwan Moreau, François Yvon, and Olivier Cappé. Robust similarity measures for named entities matching, In Proceedings of the 22nd International Conference on Computational Linguistics, V.1, Manchester, United Kingdom, Association for Computational Linguistics, 2008, pp.593-600.
    • (2008) Proceedings of the 22nd International Conference on Computational Linguistics , vol.1 , pp. 593-600
    • Moreau, E.1    Yvon, F.2    Cappé, O.3
  • 11
    • 0016572913
    • A vector space model for automatic indexing
    • Gerard Salton, Andrew K. C. Wong, and Chung-Shu Yang. A vector space model for automatic indexing, Commun. ACM, V.18, N.11, 1975, pp.613-620.
    • (1975) Commun. ACM , vol.18 , Issue.11 , pp. 613-620
    • Salton, G.1    Wong, A.K.C.2    Yang, C.-S.3
  • 15
    • 77957887924
    • A survey of paraphrasing and textual entailment methods
    • May
    • Ion Androutsopoulos and Prodromos Malakasiotis, A survey of paraphrasing and textual entailment methods, J. Artif. Int. Res., V.38, N.1, May 2010, pp.135-187.
    • (2010) J. Artif. Int. Res. , vol.38 , Issue.1 , pp. 135-187
    • Androutsopoulos, I.1    Malakasiotis, P.2
  • 17
    • 0010111194
    • A binary N-Gram technique for automatic correction of substitution, deletion, insertion and reversal errors in words
    • January
    • Julian R. Ullmann. A binary N-Gram technique for automatic correction of substitution, deletion, insertion and reversal errors in words, The Computer Journal, V.20, N.2, January 1977, pp.141-147.
    • (1977) The Computer Journal , vol.20 , Issue.2 , pp. 141-147
    • Ullmann, J.R.1
  • 18
    • 0026979939
    • Techniques for automatically correcting words in text
    • DOI 10.1145/146370.146380
    • Karen Kukich, Techniques for automatically correcting words in text, ACM Computing Surveys, V.24, December 1992, pp.377-439. (Pubitemid 23687641)
    • (1992) ACM Computing Surveys , vol.24 , Issue.4 , pp. 377-439
    • Kukich Karen1
  • 19
    • 0142218940
    • Non-adjacent digrams improve matching of cross-lingual spelling variants
    • Manaus, Brazil
    • Keskustalo, H., Pirkola, A., Visala, K. and Leppanen, E. Non-adjacent digrams improve matching of cross-lingual spelling variants, In LNCS 2857, Manaus, Brazil, 2003, pp.252-265.
    • (2003) LNCS , vol.2857 , pp. 252-265
    • Keskustalo, H.1    Pirkola, A.2    Visala, K.3    Leppanen, E.4
  • 20
    • 0000250265
    • Measures of the amount of ecologic association between species
    • Lee R. Dice, Measures of the amount of ecologic association between species, Ecology, 1945, pp.297-302.
    • (1945) Ecology , pp. 297-302
    • Dice, L.R.1
  • 22
    • 85157048678
    • An algorithm for suffix stripping
    • October
    • Martin Porter. An algorithm for suffix stripping, Program, V.14, N.3, October 1980, pp.130-137.
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.1
  • 25
    • 0001368373
    • Étude comparative de la distribution florale dans une portion des Alpes et des Jura
    • Paul Jaccard. Étude comparative de la distribution florale dans une portion des Alpes et des Jura, Bulletin de la Société Vaudoise des Sciences Naturelles, 1901, pp.547-579.
    • (1901) Bulletin de la Société Vaudoise des Sciences Naturelles , pp. 547-579
    • Jaccard, P.1
  • 26
    • 78449293191
    • A comparison of personal name matching: Techniques and practical issues
    • Los Alamitos, CA, USA, IEEE Computer Society
    • Peter Christen, A comparison of personal name matching: Techniques and practical issues, In Data Mining Workshops, International Conference on, Los Alamitos, CA, USA, IEEE Computer Society, 2006, pp.290-294.
    • (2006) Data Mining Workshops, International Conference on , pp. 290-294
    • Christen, P.1
  • 27
    • 84857178726
    • Paraphrase recognition using machine learning to combine similarity measures
    • Stroudsburg, PA, USA, Association for Computational Linguistics
    • Prodromos Malakasiotis, Paraphrase recognition using machine learning to combine similarity measures, In Proceedings of the ACL-IJCNLP 2009 Student Research Workshop, ACLstudent '09, Stroudsburg, PA, USA, Association for Computational Linguistics, 2009, pp.27-35.
    • (2009) Proceedings of the ACL-IJCNLP 2009 Student Research Workshop, ACLstudent '09 , pp. 27-35
    • Malakasiotis, P.1
  • 31
    • 0029732591
    • The resemblance coefficients in group technology: A survey and comparative study of relational metrics
    • DOI 10.1016/0360-8352(95)00024-0
    • Sarker, B. The resemblance coefficients in group technology: A survey and comparative study of relational metrics, Computers & Industrial Engineering, V.30, N.1, January 1996, pp.103-116. (Pubitemid 126397726)
    • (1996) Computers and Industrial Engineering , vol.30 , Issue.1 , pp. 103-116
    • Sarker, B.R.1
  • 34
    • 82555180517
    • SC spectra: A Linear-Time soft cardinality approximation for text comparison
    • volume 7095 of Lecture Notes in Computer Science, Springer Berlin / Heidelberg
    • Sergio Jimenez Vargas and Alexander Gelbukh. SC spectra: A Linear-Time soft cardinality approximation for text comparison, In Advances in Soft Computing, volume 7095 of Lecture Notes in Computer Science, Springer Berlin / Heidelberg, 2011, pp.213-224.
    • (2011) Advances in Soft Computing , pp. 213-224
    • Vargas, S.J.1    Gelbukh, A.2
  • 38
    • 85081941118
    • WordNet::Similarity: Measuring the relatedness of concepts
    • Stroudsburg, PA, USA, Association for Computational Linguistics
    • Ted Pedersen, Siddharth Patwardhan, and Jason Michelizzi. WordNet::Similarity: measuring the relatedness of concepts, In Proceedings HLT-NAACL-Demonstration Papers, Stroudsburg, PA, USA, Association for Computational Linguistics, 2004.
    • (2004) Proceedings HLT-NAACL-Demonstration Papers
    • Pedersen, T.1    Patwardhan, S.2    Michelizzi, J.3
  • 39
    • 80053403826
    • Ensemble methods in machine learning
    • volume 1857 of Lecture Notes in Computer Science, Springer Berlin / Heidelberg
    • Thomas Dietterich. Ensemble methods in machine learning, In Multiple Classifier Systems, volume 1857 of Lecture Notes in Computer Science, Springer Berlin / Heidelberg, 2000, pp.1-15.
    • (2000) Multiple Classifier Systems , pp. 1-15
    • Dietterich, T.1
  • 40
    • 0001116877
    • Binary codes capable of correcting deletions, insertions, and reversals
    • Vladimir I. Levenshtein. Binary codes capable of correcting deletions, insertions, and reversals, Soviet Physics Doklady, V.10, N.8, 1966, pp.707-710.
    • (1966) Soviet Physics Doklady , vol.10 , Issue.8 , pp. 707-710
    • Levenshtein, V.I.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.