-
2
-
-
84976669911
-
Algorithms for string matching: A survey
-
BAEZA-YATES, R. 1989. Algorithms for string matching: A survey. ACM SIGIR Forum 23, 3-4, 34-58.
-
(1989)
ACM SIGIR Forum
, vol.23
, Issue.3-4
, pp. 34-58
-
-
BAEZA-YATES, R.1
-
3
-
-
0002652285
-
A maximum entropy approach to natural language processing
-
BERGER, A. L., DELLA-PIETRA, S. A., AND DELLA-PIETRA, V. J. 1996. A maximum entropy approach to natural language processing. Comput. Linguist. 22, 1, 39-71.
-
(1996)
Comput. Linguist
, vol.22
, Issue.1
, pp. 39-71
-
-
BERGER, A.L.1
DELLA-PIETRA, S.A.2
DELLA-PIETRA, V.J.3
-
4
-
-
85060117263
-
-
BERGMAN, M. K. 2001. The deep Web: Surfacing hidden value. White paper, BrightPlanet Corporation. http://www.brightplanet.com/resources/details/deepweb.html.
-
-
-
-
6
-
-
85060120169
-
-
BORTHWICK, A. 1999. A maximum entropy approach to named entity recognition. Ph.D. thesis, Computer Science Department, New York University.
-
-
-
-
9
-
-
5444262639
-
Structured databases on the Web: Observations and implications
-
CHANG, K. C.-C., HE, B., LI, C., PATEL, M., AND ZHANG, Z. 2004. Structured databases on the Web: Observations and implications. SIGMOD Rec. 33, 3, 61-70.
-
(2004)
SIGMOD Rec
, vol.33
, Issue.3
, pp. 61-70
-
-
CHANG, K.C.-C.1
HE, B.2
LI, C.3
PATEL, M.4
ZHANG, Z.5
-
10
-
-
4444313943
-
Automatic composite wrapper generation for semi-structured biological data based on table structure identification
-
CHEN, L., JAMIL, H. M., AND WANG, N. 2004. Automatic composite wrapper generation for semi-structured biological data based on table structure identification. SIGMOD Rec. 33, 2, 58-64.
-
(2004)
SIGMOD Rec
, vol.33
, Issue.2
, pp. 58-64
-
-
CHEN, L.1
JAMIL, H.M.2
WANG, N.3
-
11
-
-
85060117804
-
-
Factored values are those values that appear once for a group of records (e.g., year value 2008 appears once for the group of 2008 cars, year value 2007 appears once for the group of 2007 cars, etc.).
-
-
-
-
14
-
-
0033225222
-
Conceptual-model-based data extraction from multiple-record Web pages
-
EMBLEY, D. W., CAMPBELL, D. M., JIANG, Y. S., LIDDLE, S. W., LONSDALE, D. W., Ng, Y.-K., AND SMITH, R. D. 1999. Conceptual-model-based data extraction from multiple-record Web pages. IEEE Trans. Data Knowl. Engin. 31, 3, 227-251.
-
(1999)
IEEE Trans. Data Knowl. Engin
, vol.31
, Issue.3
, pp. 227-251
-
-
EMBLEY, D.W.1
CAMPBELL, D.M.2
JIANG, Y.S.3
LIDDLE, S.W.4
LONSDALE, D.W.5
Ng, Y.-K.6
SMITH, R.D.7
-
15
-
-
0037745099
-
KBFS: K-best-first search
-
FELNER, A., KRAUS, S., AND KORF, R. E. 2003. KBFS: K-best-first search. Ann. Math. Artif. Intell. 39, 1-2, 19-39.
-
(2003)
Ann. Math. Artif. Intell
, vol.39
, Issue.1-2
, pp. 19-39
-
-
FELNER, A.1
KRAUS, S.2
KORF, R.E.3
-
16
-
-
0344127987
-
QProber: A system for automatic classification of hidden Web databases
-
GRAVANO, L., IPEIROTIS, P. G., AND SAHAMI, M. 2003. QProber: A system for automatic classification of hidden Web databases. ACM Trans. Inform. Syst. 21, 1, 1-41.
-
(2003)
ACM Trans. Inform. Syst
, vol.21
, Issue.1
, pp. 1-41
-
-
GRAVANO, L.1
IPEIROTIS, P.G.2
SAHAMI, M.3
-
17
-
-
85060117265
-
-
GUSFIELD, D. 1997. Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press, Cambridge, UK.
-
-
-
-
18
-
-
33745213976
-
Automatic complex schema matching across Web query interfaces: A correlation mining approach
-
HE, B. AND CHANG, K. C.-C. 2006. Automatic complex schema matching across Web query interfaces: A correlation mining approach. ACM Trans. Datab. Syst. 31, 1, 346-396.
-
(2006)
ACM Trans. Datab. Syst
, vol.31
, Issue.1
, pp. 346-396
-
-
HE, B.1
CHANG, K.C.-C.2
-
19
-
-
0032309862
-
Generating finite-state transducers for semi-structured data extraction from the Web
-
HSU, C.-N. AND DUNG, M.-T. 1998. Generating finite-state transducers for semi-structured data extraction from the Web. Inform. Syst. 23, 8, 521-538.
-
(1998)
Inform. Syst
, vol.23
, Issue.8
, pp. 521-538
-
-
HSU, C.-N.1
DUNG, M.-T.2
-
20
-
-
0034172374
-
Wrapper induction: Efficiency and expressiveness
-
KUSHMERICK, N. 2000. Wrapper induction: Efficiency and expressiveness. Artif. Intell. 118, 1-2, 15-68.
-
(2000)
Artif. Intell
, vol.118
, Issue.1-2
, pp. 15-68
-
-
KUSHMERICK, N.1
-
21
-
-
3142742483
-
Using the structure of Web sites for automatic segmentation of tables
-
LERMAN, K., GETOOR, L., MINTON, S., AND KNOBLOCK, C. 2004. Using the structure of Web sites for automatic segmentation of tables. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 119-130.
-
(2004)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 119-130
-
-
LERMAN, K.1
GETOOR, L.2
MINTON, S.3
KNOBLOCK, C.4
-
23
-
-
34548707913
-
Annotating structured data of the deep Web
-
LU, Y., HE, H., ZHAO, H., MENG, W., AND YU, C. 2007. Annotating structured data of the deep Web. In Proceedings of the 23rd IEEE International Conference on Data Engineering. 376-385.
-
(2007)
Proceedings of the 23rd IEEE International Conference on Data Engineering
, pp. 376-385
-
-
LU, Y.1
HE, H.2
ZHAO, H.3
MENG, W.4
YU, C.5
-
24
-
-
85060117266
-
-
MINKA, T. 2003. A comparison of numerical optimizers for logistic regression. Tech. rep., Department of Statistics, Carnegie Mellon University.
-
-
-
-
27
-
-
33845269075
-
Ontobuilder: Fully automatic extraction and consolidation of ontologies from Web sources using sequence semantics
-
ROITMAN, H. AND GAL, A. 2006. Ontobuilder: Fully automatic extraction and consolidation of ontologies from Web sources using sequence semantics. In Proceedings of the EDBT Workshops. 573-576.
-
(2006)
Proceedings of the EDBT Workshops
, pp. 573-576
-
-
ROITMAN, H.1
GAL, A.2
-
31
-
-
85060119329
-
-
SU, W., WANG, J., LOCHOVSKY, F. H., AND LIU, Y. 2009. PADE: Pair-wise alignment-based data extraction. Tech. rep. HKUST-CS09-01, Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong.
-
-
-
-
32
-
-
67349276460
-
Automatic hidden-Web table interpretation, conceptualization, and semantic annotation
-
TAO, C. AND EMBLEY, D. W. 2009. Automatic hidden-Web table interpretation, conceptualization, and semantic annotation. Data Knowl. Engin. 68, 7, 683-703.
-
(2009)
Data Knowl. Engin
, vol.68
, Issue.7
, pp. 683-703
-
-
TAO, C.1
EMBLEY, D.W.2
-
33
-
-
38349008590
-
Automatic hidden-Web table interpretation by sibling page comparison
-
Conceptual Modeling, ER'07, Springer Berlin
-
TAO, C. AND EMBLEY, D. W. 2007. Automatic hidden-Web table interpretation by sibling page comparison. In Conceptual Modeling - ER'07. Lecture Notes in Computer Science, vol. 4801 Springer Berlin, 566-581.
-
(2007)
Lecture Notes in Computer Science
, vol.4801
, pp. 566-581
-
-
TAO, C.1
EMBLEY, D.W.2
-
34
-
-
27844536405
-
Towards ontology generation from tables
-
TIJERINO, Y. A., EMBLEY, D. W., LONSDALE, D. W., DING, Y., AND NAGY, G. 2005. Towards ontology generation from tables. World Wide Web 8, 3, 261-285.
-
(2005)
World Wide Web
, vol.8
, Issue.3
, pp. 261-285
-
-
TIJERINO, Y.A.1
EMBLEY, D.W.2
LONSDALE, D.W.3
DING, Y.4
NAGY, G.5
-
37
-
-
79952787982
-
Instance-based schema matching for Web databases by domain-specific query probing
-
WANG, J., WEN, J., LOCHOVSKY, F. H., AND MA, W. Y. 2004. Instance-based schema matching for Web databases by domain-specific query probing. In Proceedings of the 30th International Conference on Very Large Data Bases. 408-419.
-
(2004)
Proceedings of the 30th International Conference on Very Large Data Bases
, pp. 408-419
-
-
WANG, J.1
WEN, J.2
LOCHOVSKY, F.H.3
MA, W.Y.4
-
38
-
-
68549114493
-
-
WORLD WIDE WEB CONSORTIUM. 1999. HTML 4.01 specification. http://www.w3.org/TR/REC-html40/.
-
-
-
-
39
-
-
33745380092
-
Bootstrapping domain ontology for semantic Web services from source Web sites
-
WU, W., DOAN, A., YU, C., AND MENG, W. 2005. Bootstrapping domain ontology for semantic Web services from source Web sites. In Proceedings of the 6th VLDB Workshop on Technologies for E-Services. 11-12.
-
(2005)
Proceedings of the 6th VLDB Workshop on Technologies for E-Services
, pp. 11-12
-
-
WU, W.1
DOAN, A.2
YU, C.3
MENG, W.4
-
40
-
-
33750797710
-
Structured data extraction from the Web based on partial tree alignment
-
ZHAI, Y. AND LIU, B. 2006. Structured data extraction from the Web based on partial tree alignment. IEEE Trans. Knowledge Data Engin. 18, 12, 1614-1628.
-
(2006)
IEEE Trans. Knowledge Data Engin
, vol.18
, Issue.12
, pp. 1614-1628
-
-
ZHAI, Y.1
LIU, B.2
-
41
-
-
0001868572
-
Text categorization based on regularized linear classification methods
-
ZHANG, T. AND OLES, F. J. 2001. Text categorization based on regularized linear classification methods. Inform. Retriev. 4, 1, 5-31.
-
(2001)
Inform. Retriev
, vol.4
, Issue.1
, pp. 5-31
-
-
ZHANG, T.1
OLES, F.J.2
-
42
-
-
33744899132
-
Fully automatic wrapper generation for search engines
-
ZHAO, H., MENG, W., WU, Z., RAGHAVAN, V., AND YU, C. 2005. Fully automatic wrapper generation for search engines. In Proceedings of the 14th World Wide Web Conference. 66-75.
-
(2005)
Proceedings of the 14th World Wide Web Conference
, pp. 66-75
-
-
ZHAO, H.1
MENG, W.2
WU, Z.3
RAGHAVAN, V.4
YU, C.5