-
1
-
-
0008989930
-
Special issue on methods and tools for the automatic construction of hypertext
-
March
-
M. Agosti and J. Allan. Special issue on methods and tools for the automatic construction of hypertext. Information Processing and Management, 33(2), March 1997.
-
(1997)
Information Processing and Management
, vol.33
, Issue.2
-
-
Agosti, M.1
Allan, J.2
-
2
-
-
0031098964
-
On the use of information retrieval techniques for the automatic construction of hypertext
-
March
-
M. Agosti, F. Crestani, and M. Melucci. On the use of information retrieval techniques for the automatic construction of hypertext. Information Processing and Management, 33(2): 133-144, March 1997.
-
(1997)
Information Processing and Management
, vol.33
, Issue.2
, pp. 133-144
-
-
Agosti, M.1
Crestani, F.2
Melucci, M.3
-
3
-
-
84976810280
-
Copy detection mechanisms for digital documents
-
San Jose, California, USA, May
-
S. Brin, J. Davis, and H. García-Molina. Copy detection mechanisms for digital documents. In Proceedings of the A CM SIGMOD International Conference of Managemen of Data, pages 398-409, San Jose, California, USA, May 1995.
-
(1995)
Proceedings of the A CM SIGMOD International Conference of Managemen of Data
, pp. 398-409
-
-
Brin, S.1
Davis, J.2
García-Molina, H.3
-
4
-
-
0002698813
-
On the resemblance and containment of documents
-
A. Z. Broder. On the resemblance and containment of documents. In SEQS: Sequences '91, 1998.
-
(1998)
SEQS: Sequences '91
-
-
Broder, A.Z.1
-
5
-
-
0004278262
-
Syntactic clustering of the web
-
Technical report, Digial Systems Research Center, Palo Alto, California, USA, July
-
A. Z. Broder, S. C. Glassman, M. S. Manasse, and G. Zweig. Syntactic clustering of the web. Technical report, Digial Systems Research Center, Palo Alto, California, USA, July 1997.
-
(1997)
-
-
Broder, A.Z.1
Glassman, S.C.2
Manasse, M.S.3
Zweig, G.4
-
6
-
-
0013206133
-
Collection statistics for fast duplicate document detection
-
April
-
A. Chowdhury, O. Frieder, D. Grossman, and M. C. McCabe. Collection statistics for fast duplicate document detection. ACM Transections on Information Systems, 20(2):171-191, April 2002.
-
(2002)
ACM Transections on Information Systems
, vol.20
, Issue.2
, pp. 171-191
-
-
Chowdhury, A.1
Frieder, O.2
Grossman, D.3
McCabe, M.C.4
-
7
-
-
37549003336
-
Mapreduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat. Mapreduce: Simplified data processing on large clusters. Communications of the ACM, 51(1):107-113, 2008.
-
(2008)
Communications of the ACM
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
8
-
-
84885639910
-
Detecting phrase-level duplication on the world wide web
-
Salvador, Brazil, August
-
D. Fetterly, M. Manasse, and M. Najork. Detecting phrase-level duplication on the world wide web. In Proceedings of the ACM SIGIR '05, Salvador, Brazil, August 2005.
-
(2005)
Proceedings of the ACM SIGIR '05
-
-
Fetterly, D.1
Manasse, M.2
Najork, M.3
-
9
-
-
16644399456
-
Signature extraction for overlap detection in documents
-
Melbourne, Australia, January
-
R. A. Finkel, A. Zaslavsky. K. Monostori, and H. Schmidt. Signature extraction for overlap detection in documents. In Twenty-Fifth Australasian Computer Science Conference (ACSC2002), pages 59-64, Melbourne, Australia, January 2002.
-
(2002)
Twenty-Fifth Australasian Computer Science Conference (ACSC2002)
, pp. 59-64
-
-
Finkel, R.A.1
Zaslavsky, A.2
Monostori, K.3
Schmidt, H.4
-
10
-
-
0345120055
-
Inter-linker consistency in the manual construction of hypertext documents
-
J. Furrier, D. Ellis, and P. Willett. Inter-linker consistency in the manual construction of hypertext documents, ACM Comput. Surv., page 18, 1999.
-
(1999)
ACM Comput. Surv
, pp. 18
-
-
Furrier, J.1
Ellis, D.2
Willett, P.3
-
11
-
-
0001553729
-
From Ukkonen to McCreight and Weiner: A unifying view of linear-time suffix tree construction
-
R. Giegerich and S. Kurtz. From Ukkonen to McCreight and Weiner: A unifying view of linear-time suffix tree construction. Algorithrnica, 19(3):331-353, 1997.
-
(1997)
Algorithrnica
, vol.19
, Issue.3
, pp. 331-353
-
-
Giegerich, R.1
Kurtz, S.2
-
12
-
-
0031622479
-
Citeseer: An automatic citation indexing system
-
New York, New York, USA, ACM
-
C. L. Giles, K. D. Bollacker, and S. Lawrence. Citeseer: an automatic citation indexing system. In DL '98: Proceedings of the third A CM conference on Digital libraries, pages 89-98, New York, New York, USA, 1998. ACM.
-
(1998)
DL '98: Proceedings of the third A CM conference on Digital libraries
, pp. 89-98
-
-
Giles, C.L.1
Bollacker, K.D.2
Lawrence, S.3
-
14
-
-
0016942292
-
A space-economical suffix tree construction algorithm
-
April
-
E. M. McCreight. A space-economical suffix tree construction algorithm. Journal of the ACM, 23(2):262-272, April 1976.
-
(1976)
Journal of the ACM
, vol.23
, Issue.2
, pp. 262-272
-
-
McCreight, E.M.1
-
15
-
-
0033650834
-
Document overlap detection system for distributed digital lbraries
-
San Antonio, Texas, USA, June
-
K. Monostori, A. Zaslavsky, and H. Schmidt. Document overlap detection system for distributed digital lbraries. In Proceedings of the ACM Digital Libraries 2000 (DL00), pages 226-227, San Antonio, Texas, USA, June 2000.
-
(2000)
Proceedings of the ACM Digital Libraries 2000 (DL00)
, pp. 226-227
-
-
Monostori, K.1
Zaslavsky, A.2
Schmidt, H.3
-
16
-
-
85015564457
-
Xanalogical structure, needed now more than ever: Parallel documents, deep links to content, deep versioning, and deep re-use
-
December
-
T. H. Nelson. Xanalogical structure, needed now more than ever: Parallel documents, deep links to content, deep versioning, and deep re-use. ACM Computing Surveys, 31(4), December 1999.
-
(1999)
ACM Computing Surveys
, vol.31
, Issue.4
-
-
Nelson, T.H.1
-
19
-
-
79959990623
-
Plagiarism detection in arXiv
-
Hong Kong, December
-
D. Sorokina, J. Gehrke, S. Warner, and P. Ginsparg. Plagiarism detection in arXiv. In Proceedings of the Sixth International Conference on Data Mining (ICDM '06), Hong Kong, December 2006.
-
(2006)
Proceedings of the Sixth International Conference on Data Mining (ICDM '06)
-
-
Sorokina, D.1
Gehrke, J.2
Warner, S.3
Ginsparg, P.4
-
20
-
-
0031354094
-
Duplicate document detection
-
San Jose, California, USA
-
A. L. Spitz. Duplicate document detection. In Proceedings of the SPIE - International Society for Optical Engineering, Document Recognition IV, pages 88-94, San Jose, California, USA, 1997.
-
(1997)
Proceedings of the SPIE - International Society for Optical Engineering, Document Recognition IV
, pp. 88-94
-
-
Spitz, A.L.1
-
21
-
-
57349087444
-
Near similarity search and plagiarism analysis
-
Springer, April
-
B. Stein and S. M. zu Eissen. Near similarity search and plagiarism analysis. In From Data and Information Analysis to Knowledge Engineering, Studies in Classification, Data Analysis, and Knowledge Organization, pages 430-437. Springer, April 2006.
-
(2006)
From Data and Information Analysis to Knowledge Engineering, Studies in Classification, Data Analysis, and Knowledge Organization
, pp. 430-437
-
-
Stein, B.1
zu Eissen, S.M.2
-
22
-
-
0001704377
-
On-line construction of suffix trees
-
E. Ukkonen. On-line construction of suffix trees. Algorithrmica, 14(3):249-260, 1995.
-
(1995)
Algorithrmica
, vol.14
, Issue.3
, pp. 249-260
-
-
Ukkonen, E.1
|