-
1
-
-
36849071950
-
Xproj: A framework for projected structural clustering of xml documents
-
C. C. Aggarwal, N. Ta, J. Wang, J. Feng, and M. J. Zaki. Xproj: a framework for projected structural clustering of xml documents. In Proc. KDD Conf., pages 46-55, 2007.
-
(2007)
Proc. KDD Conf
, pp. 46-55
-
-
Aggarwal, C.C.1
Ta, N.2
Wang, J.3
Feng, J.4
Zaki, M.J.5
-
2
-
-
52649127789
-
Approximate joins for data-centric xml
-
N. Augsten, M. H. Böhlen, C. E. Dyreson, and J. Gamper. Approximate joins for data-centric xml. In Proc. ICDE Conf., pages 814-823, 2008.
-
(2008)
Proc. ICDE Conf
, pp. 814-823
-
-
Augsten, N.1
Böhlen, M.H.2
Dyreson, C.E.3
Gamper, J.4
-
3
-
-
33745628382
-
Approximate matching of hierarchical data using pq-grams
-
N. Augsten, M. H. Böhlen, and J. Gamper. Approximate matching of hierarchical data using pq-grams. In Proc. VLDB Conf., pages 301-312, 2005.
-
(2005)
Proc. VLDB Conf
, pp. 301-312
-
-
Augsten, N.1
Böhlen, M.H.2
Gamper, J.3
-
4
-
-
35348849154
-
Scaling up all pairs similarity search
-
R. J. Bayardo, Y. Ma, and R. Srikant. Scaling up all pairs similarity search. In Proc. WWW Conf., pages 131-140, 2007.
-
(2007)
, pp. 131-140
-
-
Bayardo, R.J.1
Ma, Y.2
Srikant, R.3
-
6
-
-
12744279236
-
A short survey of document structure similarity algorithms
-
D. Buttler. A short survey of document structure similarity algorithms. In Proc. Intl. Conf. on Internet Computing, pages 3-9, 2004.
-
(2004)
Proc. Intl. Conf. on Internet Computing
, pp. 3-9
-
-
Buttler, D.1
-
7
-
-
1542377549
-
Searching xml documents via xml fragments
-
D. Carmel, Y. S. Maarek, M. Mandelbrod, Y. Mass, and A. Soffer. Searching xml documents via xml fragments. In Proc. SIGIR Conf., pages 151-158, 2003.
-
(2003)
Proc. SIGIR Conf
, pp. 151-158
-
-
Carmel, D.1
Maarek, Y.S.2
Mandelbrod, M.3
Mass, Y.4
Soffer, A.5
-
8
-
-
35448984015
-
Benchmarking declarative approximate selection predicates
-
A. Chandel, O. Hassanzadeh, N. Koudas, M. Sadoghi, and D. Srivastava. Benchmarking declarative approximate selection predicates. In Proc. SIGMOD Conf., pages 353-364, 2007.
-
(2007)
Proc. SIGMOD Conf
, pp. 353-364
-
-
Chandel, A.1
Hassanzadeh, O.2
Koudas, N.3
Sadoghi, M.4
Srivastava, D.5
-
9
-
-
33749597967
-
A primitive operator for similarity joins in data cleaning
-
S. Chaudhuri, V. Ganti, and R. Kaushik. A primitive operator for similarity joins in data cleaning. In Proc. ICDE Conf., page 5, 2006.
-
(2006)
Proc. ICDE Conf
, pp. 5
-
-
Chaudhuri, S.1
Ganti, V.2
Kaushik, R.3
-
10
-
-
0002818648
-
Combining approaches to information retrieval
-
W. B. Croft. Combining approaches to information retrieval. Advances in information retrieval, 7:1-36, 2000.
-
(2000)
Advances in information retrieval
, vol.7
, pp. 1-36
-
-
Croft, W.B.1
-
11
-
-
70350639186
-
Overview of the inex 2008 xml mining track
-
S. Geva, J. Kamps, and A. Trotman, editors, Proc. INEX 2008
-
L. Denoyer and P. Gallinari. Overview of the inex 2008 xml mining track. In S. Geva, J. Kamps, and A. Trotman, editors, Proc. INEX 2008, LNCS, 2009.
-
(2009)
LNCS
-
-
Denoyer, L.1
Gallinari, P.2
-
12
-
-
14644439871
-
Fast detection of xml structural similarity
-
S. Flesca, G. Manco, E. Masciari, L. Pontieri, and A. Pugliese. Fast detection of xml structural similarity. TKDE, 17(2):160-175, 2005.
-
(2005)
TKDE
, vol.17
, Issue.2
, pp. 160-175
-
-
Flesca, S.1
Manco, G.2
Masciari, E.3
Pontieri, L.4
Pugliese, A.5
-
13
-
-
33745218927
-
Integrating xml data sources using approximate joins
-
S. Guha, H. V. Jagadish, N. Koudas, D. Srivastava, and T. Yu. Integrating xml data sources using approximate joins. TODS, 31(1):161-207, 2006.
-
(2006)
TODS
, vol.31
, Issue.1
, pp. 161-207
-
-
Guha, S.1
Jagadish, H.V.2
Koudas, N.3
Srivastava, D.4
Yu, T.5
-
14
-
-
52649145249
-
Fast indexes and algorithms for set similarity selection queries
-
M. Hadjieleftheriou, A. Chandel, N. Koudas, and D. Srivastava. Fast indexes and algorithms for set similarity selection queries. In Proc. ICDE Conf., pages 267-276, 2008.
-
(2008)
Proc. ICDE Conf
, pp. 267-276
-
-
Hadjieleftheriou, M.1
Chandel, A.2
Koudas, N.3
Srivastava, D.4
-
15
-
-
47949084700
-
Comparison of complete and elementless native storage of xml documents
-
T. Härder, C. Mathis, and K. Schmidt. Comparison of complete and elementless native storage of xml documents. In Proc. IDEAS Conf., pages 102-113, 2007.
-
(2007)
Proc. IDEAS Conf
, pp. 102-113
-
-
Härder, T.1
Mathis, C.2
Schmidt, K.3
-
16
-
-
34147139436
-
An efficient infrastructure for native transactional xml processing
-
M. P. Haustein and T. Härder. An efficient infrastructure for native transactional xml processing. DKE, 61(3):500-523, 2007.
-
(2007)
DKE
, vol.61
, Issue.3
, pp. 500-523
-
-
Haustein, M.P.1
Härder, T.2
-
17
-
-
84912150847
-
Measuring the structural similarity of semistructured documents using entropy
-
S. Helmer. Measuring the structural similarity of semistructured documents using entropy. In Proc. VLDB Conf., pages 1022-1032, 2007.
-
(2007)
Proc. VLDB Conf
, pp. 1022-1032
-
-
Helmer, S.1
-
19
-
-
2442561063
-
A bag of paths model for measuring structural similarity in web documents
-
S. Joshi, N. Agrawal, R. Krishnapuram, and S. Negi. A bag of paths model for measuring structural similarity in web documents. In Proc. KDD Conf., pages 577-582, 2003.
-
(2003)
Proc. KDD Conf
, pp. 577-582
-
-
Joshi, S.1
Agrawal, N.2
Krishnapuram, R.3
Negi, S.4
-
20
-
-
33845514928
-
Articulating information needs in xml query languages
-
J. Kamps, M. Marx, M. de Rijke, and B. Sigurbjörnsson. Articulating information needs in xml query languages. TOIS, 24(4):407-436, 2006.
-
(2006)
TOIS
, vol.24
, Issue.4
, pp. 407-436
-
-
Kamps, J.1
Marx, M.2
de Rijke, M.3
Sigurbjörnsson, B.4
-
21
-
-
34250670467
-
Record linkage: Similarity measures and algorithms
-
N. Koudas, S. Sarawagi, and D. Srivastava. Record linkage: similarity measures and algorithms. In Proc. SIGMOD Conf., pages 802-803, 2006.
-
(2006)
Proc. SIGMOD Conf
, pp. 802-803
-
-
Koudas, N.1
Sarawagi, S.2
Srivastava, D.3
-
22
-
-
0037481024
-
Xclust: Clustering xml schemas for effective integration
-
M.-L. Lee, L. H. Yang, W. Hsu, and X. Yang. Xclust: clustering xml schemas for effective integration. In Proc. CIKM Conf., pages 292-299, 2002.
-
(2002)
Proc. CIKM Conf
, pp. 292-299
-
-
Lee, M.-L.1
Yang, L.H.2
Hsu, W.3
Yang, X.4
-
23
-
-
67649647573
-
A decade of xml data management: An industrial experience report from oracle
-
Z. H. Liu and R. Murthy. A decade of xml data management: An industrial experience report from oracle. In Proc. ICDE Conf., pages 1351-1362, 2009.
-
(2009)
Proc. ICDE Conf
, pp. 1351-1362
-
-
Liu, Z.H.1
Murthy, R.2
-
24
-
-
0001906874
-
Index structures for path expressions
-
T. Milo and D. Suciu. Index structures for path expressions. In Proc. ICDT Conf., pages 277-295, 1999.
-
(1999)
Proc. ICDT Conf
, pp. 277-295
-
-
Milo, T.1
Suciu, D.2
-
25
-
-
14644393851
-
Evaluating structural similarity in xml documents
-
A. Nierman and H. V. Jagadish. Evaluating structural similarity in xml documents. In Proc. WebDB Workshop, pages 61-66, 2002.
-
(2002)
Proc. WebDB Workshop
, pp. 61-66
-
-
Nierman, A.1
Jagadish, H.V.2
-
26
-
-
1542287497
-
Combining document representations for known-item search
-
P. Ogilvie and J. P. Callan. Combining document representations for known-item search. In Proc. SIGIR Conf., pages 143-150, 2003.
-
(2003)
Proc. SIGIR Conf
, pp. 143-150
-
-
Ogilvie, P.1
Callan, J.P.2
-
27
-
-
70349138843
-
Clustering the tagged web
-
D. Ramage, P. Heymann, C. D. Manning, and H. Garcia-Molina. Clustering the tagged web. In Proc. WSDM Conf., pages 54-63, 2009.
-
(2009)
Proc. WSDM Conf
, pp. 54-63
-
-
Ramage, D.1
Heymann, P.2
Manning, C.D.3
Garcia-Molina, H.4
-
28
-
-
54249160182
-
Evaluating performance and quality of xml-based similarity joins
-
L. A. Ribeiro and T. Härder. Evaluating performance and quality of xml-based similarity joins. In Proc. ADBIS Conf., pages 246-261, 2008.
-
(2008)
Proc. ADBIS Conf
, pp. 246-261
-
-
Ribeiro, L.A.1
Härder, T.2
-
31
-
-
3142777876
-
Efficient set joins on similarity predicates
-
S. Sarawagi and A. Kirpal. Efficient set joins on similarity predicates. In Proc. SIGMOD Conf., pages 743-754, 2004.
-
(2004)
Proc. SIGMOD Conf
, pp. 743-754
-
-
Sarawagi, S.1
Kirpal, A.2
-
32
-
-
0002442796
-
Machine learning in automated text categorization
-
F. Sebastiani. Machine learning in automated text categorization. CSUR, 34(1):1-47, 2002.
-
(2002)
CSUR
, vol.34
, Issue.1
, pp. 1-47
-
-
Sebastiani, F.1
-
33
-
-
0018491659
-
The tree-to-tree correction problem
-
K.-C. Tai. The tree-to-tree correction problem. JACM, 26(3):422-433, 1979.
-
(1979)
JACM
, vol.26
, Issue.3
, pp. 422-433
-
-
Tai, K.-C.1
-
34
-
-
0001467848
-
Query evaluation: Strategies and optimizations
-
H. R. Turtle and J. Flood. Query evaluation: Strategies and optimizations. Information Processing Management, 31(6):831-850, 1995.
-
(1995)
Information Processing Management
, vol.31
, Issue.6
, pp. 831-850
-
-
Turtle, H.R.1
Flood, J.2
-
35
-
-
29844441371
-
Dogmatix tracks down duplicates in xml
-
M. Weis and F. Naumann. Dogmatix tracks down duplicates in xml. In Proc. SIGMOD Conf., pages 431-442, 2005.
-
(2005)
Proc. SIGMOD Conf
, pp. 431-442
-
-
Weis, M.1
Naumann, F.2
-
36
-
-
70350645042
-
-
W. Winkler. Overview of record linkage and current research directions. Technical report, Statistical Research Division, U.S. Bureau of the Census, 2006.
-
W. Winkler. Overview of record linkage and current research directions. Technical report, Statistical Research Division, U.S. Bureau of the Census, 2006.
-
-
-
-
37
-
-
0000307499
-
On the editing distance between unordered labeled trees
-
K. Zhang, R. Statman, and D. Shasha. On the editing distance between unordered labeled trees. Information Processing Letters (IPL), 42(3):133 139, 1992.
-
(1992)
Information Processing Letters (IPL)
, vol.42
, Issue.3
, pp. 133-139
-
-
Zhang, K.1
Statman, R.2
Shasha, D.3
|