-
1
-
-
74549172900
-
URL normalization for de-duplication of web pages
-
A. Agarwal, H. S. Koppula, K. P. Leela, K. P. Chitrapura, S. Garg, P. K. GM, C. Haty, A. Roy, and A. Sasturkar. URL normalization for de-duplication of web pages. In CIKM, pages 1987-1990, 2009.
-
(2009)
CIKM
, pp. 1987-1990
-
-
Agarwal, A.1
Koppula, H.S.2
Leela, K.P.3
Chitrapura, K.P.4
Garg, S.5
Gm, P.K.6
Haty, C.7
Roy, A.8
Sasturkar, A.9
-
2
-
-
84865658375
-
Purely URL-based topic classification
-
ACM
-
E. Baykan, M. R. Henzinger, L. Marian, and I. Weber. Purely URL-based topic classification. In WWW, pages 1109-1110. ACM, 2009.
-
(2009)
WWW
, pp. 1109-1110
-
-
Baykan, E.1
Henzinger, M.R.2
Marian, L.3
Weber, I.4
-
3
-
-
78149404935
-
Web page language identification based on URLs
-
E. Baykan, M. R. Henzinger, and I. Weber. Web page language identification based on URLs. PVLDB, 1(1):176-187, 2008.
-
(2008)
PVLDB
, vol.1
, Issue.1
, pp. 176-187
-
-
Baykan, E.1
Henzinger, M.R.2
Weber, I.3
-
4
-
-
77952649961
-
Linked data - The story so far
-
C. Bizer, T. Heath, and T. Berners-Lee. Linked data - the story so far. Int. J. Semantic Web Inf. Syst., 5(3):1-22, 2009.
-
(2009)
Int. J. Semantic Web Inf. Syst.
, vol.5
, Issue.3
, pp. 1-22
-
-
Bizer, C.1
Heath, T.2
Berners-Lee, T.3
-
5
-
-
52149122472
-
Entity name system: The back-bone of an open and scalable web of data
-
P. Bouquet, H. Stoermer, C. Niederée, and A. Mana. Entity name system: The back-bone of an open and scalable web of data. In ICSC, pages 554-561, 2008.
-
(2008)
ICSC
, pp. 554-561
-
-
Bouquet, P.1
Stoermer, H.2
Niederée, C.3
Mana, A.4
-
6
-
-
11144240583
-
A comparison of string distance metrics for name-matching tasks
-
W. W. Cohen, P. D. Ravikumar, and S. E. Fienberg. A comparison of string distance metrics for name-matching tasks. In IIWeb, pages 73-78, 2003.
-
(2003)
IIWeb
, pp. 73-78
-
-
Cohen, W.W.1
Ravikumar, P.D.2
Fienberg, S.E.3
-
7
-
-
33845667955
-
Duplicate record detection: A survey
-
DOI 10.1109/TKDE.2007.250581
-
A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios. Duplicate record detection: A survey. IEEE Trans. Knowl. Data Eng., 19(1):1-16, 2007. (Pubitemid 44955773)
-
(2007)
IEEE Transactions on Knowledge and Data Engineering
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
9
-
-
74549160353
-
A framework for semantic link discovery over relational data
-
O. Hassanzadeh, A. Kementsietsidis, L. Lim, R. J. Miller, and M. Wang. A framework for semantic link discovery over relational data. In CIKM, pages 1027-1036, 2009.
-
(2009)
CIKM
, pp. 1027-1036
-
-
Hassanzadeh, O.1
Kementsietsidis, A.2
Lim, L.3
Miller, R.J.4
Wang, M.5
-
10
-
-
35248813379
-
Architecture of the World Wide Web, volume one
-
December
-
I. Jacobs and N. Walsh. Architecture of the World Wide Web, volume one. W3C Recommendation, December 2004.
-
(2004)
W3C Recommendation
-
-
Jacobs, I.1
Walsh, N.2
-
11
-
-
84950419860
-
Advances in record-linkage methodology as applied to matching the 1985 census of Tampa, Florida
-
June
-
M. A. Jaro. Advances in record-linkage methodology as applied to matching the 1985 census of Tampa, Florida. Journal of the American Statistical Association, 84(406):414-420, June 1989.
-
(1989)
Journal of The American Statistical Association
, vol.84
, Issue.406
, pp. 414-420
-
-
Jaro, M.A.1
-
12
-
-
77950949494
-
Learning URL patterns for webpage de-duplication
-
H. S. Koppula, K. P. Leela, A. Agarwal, K. P. Chitrapura, S. Garg, and A. Sasturkar. Learning URL patterns for webpage de-duplication. In WSDM, pages 381-390, 2010.
-
(2010)
WSDM
, pp. 381-390
-
-
Koppula, H.S.1
Leela, K.P.2
Agarwal, A.3
Chitrapura, K.P.4
Garg, S.5
Sasturkar, A.6
-
13
-
-
24944580931
-
On URL normalization
-
Computational Science and Its Applications - ICCSA 2005: International Conference, Proceedings
-
S. H. Lee, S. J. Kim, and S.-H. Hong. On URL normalization. In ICCSA (2), pages 1076-1085, 2005. (Pubitemid 41313790)
-
(2005)
Lecture Notes in Computer Science
, vol.3481
, Issue.2
, pp. 1076-1085
-
-
Lee, S.H.1
Kim, S.J.2
Hong, S.H.3
-
15
-
-
77954589296
-
A pattern tree-based approach to learning URL normalization rules
-
T. Lei, R. Cai, J.-M. Yang, Y. Ke, X. Fan, and L. Zhang. A pattern tree-based approach to learning URL normalization rules. In WWW, pages 611-620, 2010.
-
(2010)
WWW
, pp. 611-620
-
-
Lei, T.1
Cai, R.2
Yang, J.-M.3
Ke, Y.4
Fan, X.5
Zhang, L.6
-
16
-
-
84885010922
-
Microsearch: An interface for semantic search
-
P. Mika. Microsearch: An interface for semantic search. In SemSearch, pages 79-88, 2008.
-
(2008)
SemSearch
, pp. 79-88
-
-
Mika, P.1
-
17
-
-
84872322700
-
-
Equivalence mining and matching frameworks
-
SWEO Community Project: Linking Open Data on the Semantic Web. Equivalence mining and matching frameworks. http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/EquivalenceMining.
-
SWEO Community Project: Linking Open Data on The Semantic Web
-
-
-
18
-
-
78650453294
-
Discovering and maintaining links on the web of data
-
Volume 5823 of Lecture Notes in Computer Science, Springer
-
J. Volz, C. Bizer, M. Gaedke, and G. Kobilarov. Discovering and maintaining links on the web of data. In ISWC 2009, volume 5823 of Lecture Notes in Computer Science, pages 650-665. Springer, 2009.
-
(2009)
ISWC 2009
, pp. 650-665
-
-
Volz, J.1
Bizer, C.2
Gaedke, M.3
Kobilarov, G.4