-
1
-
-
34250638291
-
A Web-based Kernel Function for Measuring the Similarity of Short Text Snippets
-
Sahami M. and Heilman T. D., "A Web-based Kernel Function for Measuring the Similarity of Short Text Snippets," in proceedings of WWW,2006.
-
Proceedings of WWW,2006
-
-
Sahami, M.1
Heilman, T.D.2
-
2
-
-
0037319544
-
Methods for Identifying Versioned and Plagiarized Documents
-
Hoad T. C. and Zobel. J., "Methods for Identifying Versioned and Plagiarized Documents," in JASIST,vol. 54,2003,pp. 203-215.
-
(2003)
JASIST
, vol.54
, pp. 203-215
-
-
Hoad, T.C.1
Zobel, J.2
-
3
-
-
32344452531
-
Evaluating Similarity Measures: A Large-scale Study in the Orkut Social Network
-
Spertus E.,Sahami M. and Buyukkokten O., "Evaluating Similarity Measures: A Large-scale Study in the Orkut Social Network," in proceedings of KDD,2005.
-
Proceedings of KDD,2005
-
-
Spertus, E.1
Buyukkokten O, S.M.2
-
4
-
-
79952095979
-
Features Based Text Similarity Detection
-
Kent C.W. and Salim N., "Features Based Text Similarity Detection," in Journal of Computing,vol. 2,2010,pp. 53-57.
-
(2010)
Journal of Computing
, vol.2
, pp. 53-57
-
-
Kent, C.W.1
Salim, N.2
-
8
-
-
85094045221
-
Document Representation and Multilevel Measures of Document Similarity
-
Matveeva I., "Document Representation and Multilevel Measures of Document Similarity," in proceedings of ACL-HLT, 2006.
-
Proceedings of ACL-HLT, 2006
-
-
Matveeva, I.1
-
9
-
-
84863347445
-
Detecting Text Similarity over Short Passages: Exploring Linguistic Feature Cominations via Machine Learning
-
Hatzivassiloglou V.,Klavans J.L. and Eskin E., "Detecting Text Similarity over Short Passages: Exploring Linguistic Feature Cominations via Machine Learning," in proceedings of SIGDAT,1999.
-
Proceedings of SIGDAT,1999
-
-
Hatzivassiloglou, V.1
Klavans, J.L.2
Eskin, E.3
-
10
-
-
79957966387
-
Learning Term-weighting Functions for Similarity Measures
-
Yih W., "Learning Term-weighting Functions for Similarity Measures," in proceedings of EMNLP,2009.
-
Proceedings of EMNLP,2009
-
-
Yih, W.1
-
12
-
-
4944224800
-
Identifying and Filtering Near Duplicate Documents
-
Broder A. Z., "Identifying and Filtering Near Duplicate Documents," in processing of COM,2000.
-
(2000)
Processing of COM
-
-
Broder, A.Z.1
-
13
-
-
0001368373
-
Comparative de la Distribution Florale Dans Une Portion des Alpes et des Jura
-
Jaccard P. E., "Comparative de la Distribution Florale Dans Une Portion Des Alpes et Des Jura," in Bulletin del la Socit Vaudoise des Sciences Naturelles,vol. 37,1901,pp. 547-579.
-
(1901)
Bulletin del la Socit Vaudoise des Sciences Naturelles
, vol.37
, pp. 547-579
-
-
Jaccard, P.E.1
-
14
-
-
77956051500
-
Efficient Partial-Duplicate Detection Based on Sequence Matching
-
Qi Zhang,Yue Zhang,Hao. Yu and Xuan. Huang, "Efficient Partial-Duplicate Detection Based on Sequence Matching," in proceedings of SIGIR,2010.
-
Proceedings of SIGIR,2010
-
-
Zhang, Q.1
Zhang, Y.2
Yu, H.3
Huang, X.4
-
16
-
-
81255176821
-
XML Structural Similarity Search Using MapReduce
-
Pei. Yuan,Chao. Sha,Xiao. Wang,Bin Yang,Ao. Zhou and Su Yang, "XML Structural Similarity Search Using MapReduce,"in proceedings of WAIM,2010.
-
Proceedings of WAIM,2010
-
-
Yuan, P.1
Sha, C.2
Wang, X.3
Yang, B.4
Zhou, A.5
Yang, S.6
-
18
-
-
84859921422
-
Pairwise Document Similarity in Large Collections with MapReduce
-
Elsayed t.,Lin J. and Oard D.W., "Pairwise Document Similarity in Large Collections with MapReduce," in proceedings of ACL-HLT,2008.
-
Proceedings of ACL-HLT,2008
-
-
T, E.1
Lin, J.2
Oard, D.W.3
-
20
-
-
37549003336
-
MapReduce:Simplified Data Processing on Large Clusters
-
Dean,J. and Ghemawat,S., "MapReduce:Simplified Data Processing on Large Clusters," in Commun. ACM,vol. 51,2008,pp 107-113.
-
(2008)
Commun. ACM
, vol.51
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
21
-
-
81255211426
-
-
website
-
Hadoop website, http://hadoop.apache.org/, 2011.
-
(2011)
-
-
|