-
1
-
-
42949138243
-
Finding high quality content in social media, with an application to community-based question answering
-
Stanford, CA, USA, February, ACM Press
-
E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne. Finding high quality content in social media, with an application to community-based question answering. In Proceedings of ACM WSDM, pages 183-194, Stanford, CA, USA, February 2008. ACM Press.
-
(2008)
Proceedings of ACM WSDM
, pp. 183-194
-
-
Agichtein, E.1
Castillo, C.2
Donato, D.3
Gionis, A.4
Mishne, G.5
-
3
-
-
70350648616
-
-
W. M. Barczynski, F. Brauer, A. Loeser, and A. Mocan. Algebraic information extraction of enterprise data: Methodology and operators. In IK-KR Workshop at IJCAI 2009 (to be published), 2009.
-
W. M. Barczynski, F. Brauer, A. Loeser, and A. Mocan. Algebraic information extraction of enterprise data: Methodology and operators. In IK-KR Workshop at IJCAI 2009 (to be published), 2009.
-
-
-
-
5
-
-
0036040277
-
Similarity estimation techniques from rounding algorithms
-
New York, NY, USA, ACM
-
M. S. Charikar. Similarity estimation techniques from rounding algorithms. In STOC 02: Proceedings of the thiry-fourth annual ACM symposium on Theory of computing, pages 380-388, New York, NY, USA, 2002. ACM.
-
(2002)
STOC 02: Proceedings of the thiry-fourth annual ACM symposium on Theory of computing
, pp. 380-388
-
-
Charikar, M.S.1
-
6
-
-
0013206133
-
Collection statistics for fast duplicate document detection
-
A. Chowdhury, O. Frieder, D. Grossman, and M. C. McCabe. Collection statistics for fast duplicate document detection. ACM Trans. Inf. Syst., 20(2):171-191, 2002.
-
(2002)
ACM Trans. Inf. Syst
, vol.20
, Issue.2
, pp. 171-191
-
-
Chowdhury, A.1
Frieder, O.2
Grossman, D.3
McCabe, M.C.4
-
7
-
-
52649090203
-
-
X. Z. Fern and W. Lin. Cluster ensemble selection. In SDM, pages 787-797. SIAM, 2008.
-
X. Z. Fern and W. Lin. Cluster ensemble selection. In SDM, pages 787-797. SIAM, 2008.
-
-
-
-
8
-
-
0001944742
-
Similarity search in high dimensions via hashing
-
San Francisco, CA, USA, Morgan Kaufmann Publishers Inc
-
A. Gionis, P. Indyk, and R. Motwani. Similarity search in high dimensions via hashing. In VLDB '99: Proceedings of the 25th International Conference on Very Large Data Bases, pages 518-529, San Francisco, CA, USA, 1999. Morgan Kaufmann Publishers Inc.
-
(1999)
VLDB '99: Proceedings of the 25th International Conference on Very Large Data Bases
, pp. 518-529
-
-
Gionis, A.1
Indyk, P.2
Motwani, R.3
-
9
-
-
33750296887
-
Finding near-duplicate web pages: A large-scale evaluation of algorithms
-
New York, NY, USA, ACM
-
M. Henzinger. Finding near-duplicate web pages: a large-scale evaluation of algorithms. In SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 284-291, New York, NY, USA, 2006. ACM.
-
(2006)
SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
, pp. 284-291
-
-
Henzinger, M.1
-
10
-
-
85043988965
-
Finding similar files in a large file system
-
San Fransisco, CA, USA, JanuaryJuly-FebruaryJanuary
-
U. Manber. Finding similar files in a large file system. In Proceedings of the USENIX Winter 1994 Technical Conference, pages 1-10, San Fransisco, CA, USA, JanuaryJuly-FebruaryJanuary 1994.
-
(1994)
Proceedings of the USENIX Winter 1994 Technical Conference
, pp. 1-10
-
-
Manber, U.1
-
12
-
-
57349131623
-
Spotsigs: Robust and efficient near duplicate detection in large web collections
-
New York, NY, USA, ACM
-
M. Theobald, J. Siddharth, and A. Paepcke. Spotsigs: robust and efficient near duplicate detection in large web collections. In SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, pages 563-570, New York, NY, USA, 2008. ACM.
-
(2008)
SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
, pp. 563-570
-
-
Theobald, M.1
Siddharth, J.2
Paepcke, A.3
-
13
-
-
34250633663
-
Avatar information extraction system
-
May
-
T.S.Jayram, R. Krishnamurthy, S. Raghavan, S. Vaithyanathan, and H. Zhu. Avatar information extraction system. In IEEE Data Engineering Bulletin, May 2006.
-
(2006)
IEEE Data Engineering Bulletin
-
-
Jayram, T.S.1
Krishnamurthy, R.2
Raghavan, S.3
Vaithyanathan, S.4
Zhu, H.5
-
14
-
-
84885588967
-
Simfusion: Measuring similarity using unified relationship matrix
-
New York, NY, USA, ACM
-
W. Xi, E. A. Fox, W. Fan, B. Zhang, Z. Chen, J. Yan, and D. Zhuang. Simfusion: measuring similarity using unified relationship matrix. In SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pages 130-137, New York, NY, USA, 2005. ACM.
-
(2005)
SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
, pp. 130-137
-
-
Xi, W.1
Fox, E.A.2
Fan, W.3
Zhang, B.4
Chen, Z.5
Yan, J.6
Zhuang, D.7
-
15
-
-
35048904039
-
A query-dependent duplicate detection approach for large scale search engines
-
S. Ye, R. Song, J.-R. Wen, and W.-Y. Ma. A query-dependent duplicate detection approach for large scale search engines. In APWeb, pages 48-58, 2004.
-
(2004)
APWeb
, pp. 48-58
-
-
Ye, S.1
Song, R.2
Wen, J.-R.3
Ma, W.-Y.4
|