-
1
-
-
79951773735
-
Toward a federal benchmarking standard for evaluating information retrieval products used in e-discovery
-
J. R. Baron. Toward a federal benchmarking standard for evaluating information retrieval products used in e-discovery. 6th Sedona Conference Journal, pages 237-246, 2005.
-
(2005)
6th Sedona Conference Journal
, pp. 237-246
-
-
Baron, J.R.1
-
2
-
-
0442289065
-
Survey of clustering data mining techniques
-
Pavel Berkhin. Survey of clustering data mining techniques. 2002.
-
(2002)
-
-
Berkhin, P.1
-
4
-
-
0010362121
-
Syntactic clustering of the web
-
A. Z. Broder, S. C. Glassman, M. S. Manasse, and G. Zweig. Syntactic clustering of the web. Computer Networks, 29(8-13):1157-1166, 1997.
-
(1997)
Computer Networks
, vol.29
, Issue.8-13
, pp. 1157-1166
-
-
Broder, A.Z.1
Glassman, S.C.2
Manasse, M.S.3
Zweig, G.4
-
6
-
-
0013206133
-
Collection statistics for fast duplicate document detection
-
A. Chowdhury., O. Frieder, D. Grossman, and M. McCabe. Collection statistics for fast duplicate document detection. ACM Transactions on Information Systems (TOIS), 20:171-191, 2002.
-
(2002)
ACM Transactions on Information Systems (TOIS)
, vol.20
, pp. 171-191
-
-
Chowdhury, A.1
Frieder, O.2
Grossman, D.3
McCabe, M.4
-
11
-
-
33645492588
-
Federal Information Processing Standards
-
FIPS. Secure hash standard. Technical Report FIPS PUB 180-1Publication
-
FIPS. Secure hash standard. Technical Report FIPS PUB 180-1, Federal Information Processing Standards Publication, 1995.
-
(1995)
-
-
-
13
-
-
33750296887
-
Finding near-duplicate web pages: A large-scale evaluation of algorithms
-
August
-
Monika Henzinger. Finding near-duplicate web pages: A large-scale evaluation of algorithms. Proceedings of SIGIR, pages 284-291, August 2006.
-
(2006)
Proceedings of SIGIR
, pp. 284-291
-
-
Henzinger, M.1
-
17
-
-
0004141898
-
Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering
-
Andrew Kachites McCallum. Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering. http://www.cs.cmu.edu/mccallum/bow, 1996.
-
(1996)
-
-
McCallum, A.K.1
-
19
-
-
52649144196
-
An algebraic approach to rule-based information extraction
-
Frederick Reiss, Sriram Raghavan, Rajasekar Krishnamurthy, Huaiyu Zhu, and Shivakumar Vaithyanathan. An algebraic approach to rule-based information extraction. Proceedings of ICDE, pages 933-942, 2008.
-
(2008)
Proceedings of ICDE
, pp. 933-942
-
-
Reiss, F.1
Raghavan, S.2
Krishnamurthy, R.3
Zhu, H.4
Vaithyanathan, S.5
-
20
-
-
84863763759
-
-
RFC 2822 Internet Message Format.
-
RFC 2822 Internet Message Format., 2001. http://www.faqs.org/rfcs/rfc2822.html.
-
(2001)
-
-
-
21
-
-
84863763760
-
The 2006 Scoha-Gelbmann electronic discovery survey report
-
George Scoha and Tom Gelbmann. The 2006 Scoha-Gelbmann electronic discovery survey report. Socha Consulting, 2007.
-
(2007)
Socha Consulting
-
-
Scoha, G.1
Gelbmann, T.2
-
22
-
-
57349131623
-
Spotsigs: robust and efficient near duplicate detection in large web collections
-
Martin Theobald, Jonathan Siddharth, and Andreas Paepcke. Spotsigs: robust and efficient near duplicate detection in large web collections. Proceedings of SIGIR, pages 563-570, 2008.
-
(2008)
Proceedings of SIGIR
, pp. 563-570
-
-
Theobald, M.1
Siddharth, J.2
Paepcke, A.3
-
23
-
-
36448988354
-
Indexing emails and email threads for retrieval
-
Y. Wu and D. Oard. Indexing emails and email threads for retrieval. Proceedings of SIGIR, pages 665 - 666, 2005.
-
(2005)
Proceedings of SIGIR
, pp. 665-666
-
-
Wu, Y.1
Oard, D.2
-
24
-
-
84904789014
-
Email thread reassembly using similarity matching
-
Jen Yuan Yeh and Aaron Harnly. Email thread reassembly using similarity matching. Proceedings of CEAS, 2006.
-
(2006)
Proceedings of CEAS
-
-
Yeh, J.Y.1
Harnly, A.2
|