-
1
-
-
85018108837
-
The field matching problem: Algorithms and applications
-
Portland, OR
-
Alvaro, E. Monge and Charles Elkan. The field matching problem: Algorithms and applications, In Proceeding of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD-96), Portland, OR, 1996, pp.267-270.
-
(1996)
Proceeding of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD-96)
, pp. 267-270
-
-
Monge, A.E.1
Elkan, C.2
-
2
-
-
44949156668
-
Using machine translation evaluation techniques to determine sentence-level semantic equivalence
-
Jeju Island, Korea
-
Andrew Finch, Young-Sook Hwang, and Eiichiro Sumita. Using machine translation evaluation techniques to determine sentence-level semantic equivalence, In Proceedings of the 3rd Int. Workshop on Paraphrasing, Jeju Island, Korea, 2005.
-
(2005)
Proceedings of the 3rd Int. Workshop on Paraphrasing
-
-
Finch, A.1
Hwang, Y.-S.2
Sumita, E.3
-
3
-
-
85105937549
-
Unsupervised construction of large paraphrase corpora: Exploiting massively parallel news sources
-
Stroudsburg, PA, USA. Association for Computational Linguistics
-
Bill Dolan, Chris Quirk, and Chris Brockett. Unsupervised construction of large paraphrase corpora: exploiting massively parallel news sources, In Proceedings of the 20th international conference on Computational Linguistics, COLING '04, Stroudsburg, PA, USA, 2004. Association for Computational Linguistics.
-
(2004)
Proceedings of the 20th International Conference on Computational Linguistics, COLING '04
-
-
Dolan, B.1
Quirk, C.2
Brockett, C.3
-
4
-
-
0036358995
-
The spectrum kernel: A string kernel for SVM protein classification
-
Kauai, Hawaii, USA
-
Christina Leslie, Eleazar Eskin, and William Stafford Noble. The spectrum kernel: A string kernel for SVM protein classification, In Biocomputing 2002 - Proceedings of the Pacific Symposium, Kauai, Hawaii, USA, 2001, pp.564-575.
-
(2001)
Biocomputing 2002 - Proceedings of the Pacific Symposium
, pp. 564-575
-
-
Leslie, C.1
Eskin, E.2
Noble, W.S.3
-
6
-
-
85049063686
-
Measuring the semantic similarity of texts
-
Stroudsburg, PA, USA, Association for Computational Linguistics
-
Courtney Cor ley and Rada Mihalcea. Measuring the semantic similarity of texts, In Proceedings of the ACL Workshop on Empirical Modeling of Semantic Equivalence and Entailment, EMSEE '05, Stroudsburg, PA, USA, Association for Computational Linguistics, 2005, pp.13-18.
-
(2005)
Proceedings of the ACL Workshop on Empirical Modeling of Semantic Equivalence and Entailment, EMSEE '05
, pp. 13-18
-
-
Ley, C.C.1
Mihalcea, R.2
-
7
-
-
85120048801
-
The third PASCAL recognizing textual entailment challenge
-
Stroudsburg, PA, USA, Association for Computational Linguistics
-
Danilo Giampiccolo, Bernardo Magnini, Ido Dagan, and Bill Dolan. The third PASCAL recognizing textual entailment challenge, In Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, RTE '07, Stroudsburg, PA, USA, Association for Computational Linguistics, 2007, pp.1-9.
-
(2007)
Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, RTE '07
, pp. 1-9
-
-
Giampiccolo, D.1
Magnini, B.2
Dagan, I.3
Dolan, B.4
-
9
-
-
84859902208
-
Paraphrase identification as probabilistic quasi-synchronous recognition
-
Stroudsburg, PA, USA, Association for Computational Linguistics
-
Dipanjan Das and Noah A. Smith. Paraphrase identification as probabilistic quasi-synchronous recognition, In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1, ACL '09, Stroudsburg, PA, USA, Association for Computational Linguistics, 2009, pp.468-476.
-
(2009)
Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1, ACL '09
, pp. 468-476
-
-
Das, D.1
Smith, N.A.2
-
10
-
-
80052900431
-
Robust similarity measures for named entities matching
-
Manchester, United Kingdom, Association for Computational Linguistics
-
Erwan Moreau, Francois Yvon, and Olivier Cappe1. Robust similarity measures for named entities matching, In Proceedings of the 22nd International Conference on Computational Linguistics, V.l, Manchester, United Kingdom, Association for Computational Linguistics, 2008, pp.593-600.
-
(2008)
Proceedings of the 22nd International Conference on Computational Linguistics
, vol.1
, pp. 593-600
-
-
Moreau, E.1
Yvon, F.2
Cappel, O.3
-
11
-
-
0016572913
-
A vector space model for automatic indexing
-
Gerard Salton, Andrew K. C. Wong, and Chung-Shu Yang. A vector space model for automatic indexing, Commun. ACM, V.18, N.1l, 1975, pp.613-620.
-
(1975)
Commun. ACM
, vol.518
, Issue.2
, pp. 613-620
-
-
Salton, G.1
Andrew, K.2
Wong, C.3
Yang, C.-S.4
-
12
-
-
85157226943
-
-
Gravano, L., Panagiotis, G. Ipeirotis, Nick Koudas, and Divesh Srivastava, Text joins in a RDBMS for web data integration, 2003.
-
(2003)
Text Joins in A RDBMS for Web Data Integration
-
-
Gravano, L.1
Ipeirotis, P.G.2
Koudas, N.3
Srivastava, D.4
-
13
-
-
82555200592
-
English-Spanish large statistical dictionary of inflectional forms
-
Valletta, Malta, European Language Resources Association (ELRA)
-
Grigori Sidorov, Alberto Barrón-Cedeño, and Paolo Rosso. English-Spanish large statistical dictionary of inflectional forms, In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10), Valletta, Malta, European Language Resources Association (ELRA), 2010, pp.277-281.
-
(2010)
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)
, pp. 277-281
-
-
Sidorov, G.1
Barrón-Cedeño, A.2
Rosso, P.3
-
14
-
-
34147121577
-
Moving approximation transform and local trend associations in time series data bases
-
Ildar Batyrshin, Janusz Kacprzyk, Leonid Sheremetov, and Lotfi Zadeh, editors, Springer Berlin / Heidelberg
-
Ildar Batyrshin, Raul Herrera-Avelar, Leonid Sheremetov, and Aleksandra Panova. Moving approximation transform and local trend associations in time series data bases, In Ildar Batyrshin, Janusz Kacprzyk, Leonid Sheremetov, and Lotfi Zadeh, editors, Perception-based Data Mining and Decision Making in Economics and Finance, of Studies in Computational Intelligence, V.36, Springer Berlin / Heidelberg, 2007, pp.55-83.
-
(2007)
Perception-based Data Mining and Decision Making in Economics and Finance, of Studies in Computational Intelligence
, vol.36
, pp. 55-83
-
-
Batyrshin, I.1
Herrera-Avelar, R.2
Sheremetov, L.3
Panova, A.4
-
15
-
-
77957887924
-
A survey of paraphrasing and textual entailment methods
-
May
-
Ion Androutsopoulos and Prodromos Malakasiotis, A survey of paraphrasing and textual entailment methods, J. Artif. Int. Res., V.38, N.1, May 2010, pp.135-187.
-
(2010)
J. Artif. Int. Res.
, vol.38
, Issue.1
, pp. 135-187
-
-
Androutsopoulos, I.1
Malakasiotis, P.2
-
17
-
-
0010111194
-
A binary N-Gram technique for automatic correction of substitution, deletion, insertion and reversal errors in words
-
January
-
Julian R. Ullmann. A binary N-Gram technique for automatic correction of substitution, deletion, insertion and reversal errors in words, The Computer Journal, V.20, N.2, January 1977, pp.141-147.
-
(1977)
The Computer Journal
, vol.20
, Issue.2
, pp. 141-147
-
-
Ullmann, J.R.1
-
18
-
-
0026979939
-
Techniques for automatically correcting words in text
-
DOI 10.1145/146370.146380
-
Karen Kukich, Techniques for automatically correcting words in text, ACM Computing Surveys, V.24, December 1992, pp.377-439. (Pubitemid 23687641)
-
(1992)
ACM Computing Surveys
, vol.24
, Issue.4
, pp. 377-439
-
-
Kukich Karen1
-
19
-
-
0142218940
-
Non-adjacent digrams improve matching of cross-lingual spelling variants
-
Manaus, Brazil
-
Keskustalo, H., Pirkola, A., Visala, K. and Leppanen, E. Non-adjacent digrams improve matching of cross-lingual spelling variants, In LNCS 2857, Manaus, Brazil, 2003, pp.252-265.
-
(2003)
LNCS
, vol.2857
, pp. 252-265
-
-
Keskustalo, H.1
Pirkola, A.2
Visala, K.3
Leppanen, E.4
-
20
-
-
0000250265
-
Measures of the amount of ecologic association between species
-
Lee R. Dice, Measures of the amount of ecologic association between species, Ecology, 1945, pp.297-302.
-
(1945)
Ecology
, pp. 297-302
-
-
Dice, L.R.1
-
21
-
-
55849090098
-
Paraphrase recognition via dissimilarity significance classification
-
Stroudsburg, PA, USA, Association for Computational Linguistics
-
Long Qiu, Min-Yen Kan, and Tat-Seng Chua Paraphrase recognition via dissimilarity significance classification, In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, EMNLP '06, Stroudsburg, PA, USA, Association for Computational Linguistics, 2006, pp.18-26.
-
(2006)
Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, EMNLP '06
, pp. 18-26
-
-
Qiu, L.1
Kan, M.-Y.2
Chua, T.-S.3
-
22
-
-
85157048678
-
Martin Porter an algorithm for suffix stripping
-
October
-
Martin Porter An algorithm for suffix stripping, Program, V.3, N.14, October 1980, pp.130-137.
-
(1980)
Program
, vol.3
, Issue.14
, pp. 130-137
-
-
-
24
-
-
2342447399
-
Adaptive name matching in information integration
-
Mikhail Bilenko, Raymond Mooney, William Cohen, Pradeep Ravikumar, and Stephen Fienberg Adaptive name matching in information integration, IEEE Intelligent Systems, V.18, N.5, 2003, pp.16-23.
-
(2003)
IEEE Intelligent Systems
, vol.518
, Issue.5
, pp. 16-23
-
-
Bilenko, M.1
Mooney, R.2
Cohen, W.3
Ravikumar, P.4
Fienberg, S.5
-
25
-
-
0001368373
-
Etude comparative de la distribution florare dans une portion des alpes et des jura
-
Paul Jaccard. Etude comparative de la distribution florare dans une portion des alpes et des jura, Bulletin de la Société Vaudoise des Sciences Naturelles, 1901, pp.547-579.
-
(1901)
Bulletin de la Société Vaudoise des Sciences Naturelles
, pp. 547-579
-
-
Jaccard, P.1
-
26
-
-
78449293191
-
A comparison of personal name matching: Techniques and practical issues
-
Los Alamitos, CA, USA, IEEE Computer Society
-
Peter Christen, A comparison of personal name matching: Techniques and practical issues, In Data Mining Workshops, International Conference on, Los Alamitos, CA, USA, IEEE Computer Society, 2006, pp.290-294.
-
(2006)
Data Mining Workshops, International Conference on
, pp. 290-294
-
-
Christen, P.1
-
27
-
-
84857178726
-
Paraphrase recognition using machine learning to combine similarity measures
-
Stroudsburg, PA, USA, Association for Computational Linguistics
-
Prodromos Malakasiotis, Paraphrase recognition using machine learning to combine similarity measures, In Proceedings of the ACL-IJCNLP 2009 Student Research Workshop, ACLstudent '09, Stroudsburg, PA, USA, Association for Computational Linguistics, 2009, pp.27-35.
-
(2009)
Proceedings of the ACL-IJCNLP 2009 Student Research Workshop, ACLstudent '09
, pp. 27-35
-
-
Malakasiotis, P.1
-
28
-
-
85157233902
-
-
Quang Xuan Do, Dan Roth, Mark Sammons, Yuancheng Tu, and Vinod V. G. Vydiswaran. Robust, lightweight approaches to compute lexical similarity, Technical report, 2010.
-
(2010)
Robust, Lightweight Approaches to Compute Lexical Similarity, Technical Report
-
-
Do, Q.X.1
Roth, D.2
Sammons, M.3
Tu, Y.4
Vinod, V.5
Vydiswaran, G.6
-
31
-
-
0029732591
-
The resemblance coefficients in group technology: A survey and comparative study of relational metrics
-
DOI 10.1016/0360-8352(95)00024-0
-
Sarker, B. The resemblance coefficients in group technology: A survey and comparative study of relational metrics, Computers & Industrial Engineering, V.30, N.1, January 1996, pp.103-116. (Pubitemid 126397726)
-
(1996)
Computers and Industrial Engineering
, vol.30
, Issue.1
, pp. 103-116
-
-
Sarker, B.R.1
-
32
-
-
84870725740
-
Soft cardinality: A parameterized similarity function for text comparison
-
Montreal, Canada
-
Sergio Jimenez, Claudia Becerra, and Alexander Gelbukh. Soft cardinality: A parameterized similarity function for text comparison, In In Proceedings of the 6th International Workshop on Semantic Evaluation (Se-mEval 2012), in conjunction with the First Joint Conference on Lexical and Computational Semantics (*SEM 2012), Montreal, Canada, 2012.
-
(2012)
Proceedings of the 6th International Workshop on Semantic Evaluation (Se-mEval 2012), in Conjunction with the First Joint Conference on Lexical and Computational Semantics (*SEM 2012)
-
-
Jimenez, S.1
Becerra, C.2
Gelbukh, A.3
-
33
-
-
78449297542
-
Text comparison using soft cardinality
-
volume 6393 of LNCS, Springer Berlin Heidelberg, Berlin, Heidelberg
-
Sergio Jimenez, Fabio Gonzalez, and Alexander Gelbukh. Text comparison using soft cardinality, In Edgar Chavez and Stefano Lonardi, editors, String Processing and Information Retrieval, volume 6393 of LNCS, Springer Berlin Heidelberg, Berlin, Heidelberg, 2010, pp.297-302.
-
(2010)
Edgar Chavez and Stefano Lonardi, Editors, String Processing and Information Retrieval
, pp. 297-302
-
-
Jimenez, S.1
Gonzalez, F.2
Gelbukh, A.3
-
34
-
-
82555180517
-
SC spectra: A Linear-Time soft cardinality approximation for text comparison
-
volume 7095 of Lecture Notes in Computer Science, Springer Berlin / Heidelberg
-
Sergio Jimenez Vargas and Alexander Gelbukh. SC spectra: A Linear-Time soft cardinality approximation for text comparison, In Advances in Soft Computing, volume 7095 of Lecture Notes in Computer Science, Springer Berlin / Heidelberg, 2011, pp.213-224.
-
(2011)
Advances in Soft Computing
, pp. 213-224
-
-
Vargas, S.J.1
Gelbukh, A.2
-
35
-
-
77952713589
-
Using dependency-based features to take the para-farce out of paraphrase
-
Sydney, Australia
-
Stephen Wan, Mark Dras, Robert Dale, and Cecile Paris. Using Dependency-Based Features to Take the Para-Farce Out of Paraphrase, In Proceedings of the Australasian Language Technology Workshop, Sydney, Australia, 2006.
-
(2006)
Proceedings of the Australasian Language Technology Workshop
-
-
Wan, S.1
Dras, M.2
Dale, R.3
Paris, C.4
-
36
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
San Diego, California
-
Surajit Chaudhuri, Kris Ganjam, Venkatesh Ganti, and Rajeev Motwani. Robust and efficient fuzzy match for online data cleaning, In Proceedings of the 2003 ACM SIGMOD international conference on management of data, San Diego, California, 2003, pp.313-324.
-
(2003)
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data
, pp. 313-324
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
38
-
-
85081941118
-
WordNet::Similarity: Measuring the relatedness of concepts
-
Stroudsburg, PA, USA, Association for Computational Linguistics
-
Ted Pedersen, Siddharth Patwardhan, and Jason Michelizzi. WordNet::Similarity: measuring the relatedness of concepts, In Proceedings HLT-NAACL-Demonstration Papers, Stroudsburg, PA, USA, Association for Computational Linguistics, 2004.
-
(2004)
Proceedings HLT-NAACL-Demonstration Papers
-
-
Pedersen, T.1
Patwardhan, S.2
Michelizzi, J.3
-
39
-
-
80053403826
-
Ensemble methods in machine learning
-
volume 1857 of Lecture Notes in Computer Science, Springer Berlin / Heidelberg
-
Thomas Dietterich. Ensemble methods in machine learning, In Multiple Classifier Systems, volume 1857 of Lecture Notes in Computer Science, Springer Berlin / Heidelberg, 2000, pp.1-15.
-
(2000)
Multiple Classifier Systems
, pp. 1-15
-
-
Dietterich, T.1
-
40
-
-
0001116877
-
Binary codes capable of correcting deletions, insertions, and reversals
-
Vladimir I. Levenshtein. Binary codes capable of correcting deletions, insertions, and reversals, Soviet Physics Doklady, V.10, N.8, 1966, pp.707-710.
-
(1966)
Soviet Physics Doklady
, vol.10
, Issue.8
, pp. 707-710
-
-
Levenshtein, V.I.1
-
41
-
-
77954932775
-
Human assessments of document similarity
-
April
-
Westerman, S.J, Cribbin, T. and Collins, J. Human assessments of document similarity, Journal of the American Society for Information Science and Technology, V.61, N.8, April 2010, pp.1535-1542.
-
(2010)
Journal of the American Society for Information Science and Technology
, vol.61
, Issue.8
, pp. 1535-1542
-
-
Westerman, S.J.1
Cribbin, T.2
Collins, J.3
-
45
-
-
77951970723
-
Recent advances in computational linguistics, Informatica
-
Yulia Ledeneva and Grigori Sidorov, Recent advances in computational linguistics, Informatica. International Journal of Computing and Informatics, V.34, N.1, 2010, pp.3-18.
-
(2010)
International Journal of Computing and Informatics
, vol.34
, Issue.1
, pp. 3-18
-
-
Ledeneva, Y.1
Sidorov, G.2
|