-
3
-
-
30544447615
-
Thresher: Automating the unwrapping of semantic content from the World Wide Web
-
Chiba, ACM, Japan
-
Andrew Hogue, David Karger, Thresher: automating the unwrapping of semantic content from the World Wide Web, in: Proceedings of the 14th International Conference on World Wide Web Chiba, ACM, Japan, 2005.
-
(2005)
Proceedings of the 14th International Conference on World Wide Web
-
-
Hogue, A.1
Karger, D.2
-
4
-
-
0023586274
-
The longest common subsequence problem revisited
-
Apostolico A., and Guerra C. The longest common subsequence problem revisited. Algorithmica 2 (1987)
-
(1987)
Algorithmica
, vol.2
-
-
Apostolico, A.1
Guerra, C.2
-
5
-
-
0343725648
-
Building intelligent web applications using lightweight wrappers
-
Sahuguet A., and Azavant F. Building intelligent web applications using lightweight wrappers. Data Knowl. Eng. 36 (2001) 283-316
-
(2001)
Data Knowl. Eng.
, vol.36
, pp. 283-316
-
-
Sahuguet, A.1
Azavant, F.2
-
7
-
-
77952333945
-
Mining data records in web pages
-
ACM, Washington, D.C.
-
Liu B., Grossman R., and Zhai Y. Mining data records in web pages. Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2003), ACM, Washington, D.C.
-
(2003)
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
-
-
Liu, B.1
Grossman, R.2
Zhai, Y.3
-
8
-
-
33744786780
-
NET - a system for extracting web data from flat and nested data records
-
WISE
-
Bing Liu, Yanhong Zhai, NET - a system for extracting web data from flat and nested data records, in: Web Information Systems Engineering - WISE 2005, 2005, pp. 487-495.
-
(2005)
Web Information Systems Engineering
, pp. 487-495
-
-
Liu, B.1
Zhai, Y.2
-
9
-
-
0032092761
-
NoDoSE{minus 45 degree rule}-a tool for semi-automatically extracting structured and semistructured data from text documents
-
ACM, Washington, United States
-
Brad Adelberg, NoDoSE{minus 45 degree rule}-a tool for semi-automatically extracting structured and semistructured data from text documents, in: Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data Seattle, ACM, Washington, United States, 1998.
-
(1998)
Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data Seattle
-
-
Adelberg, B.1
-
12
-
-
10944234975
-
Olera: semisupervised Web-data extraction with visual support
-
Chang C.-H., and Kuo S.-C. Olera: semisupervised Web-data extraction with visual support. IEEE Intell. Syst. 19 (2004) 56-64
-
(2004)
IEEE Intell. Syst.
, vol.19
, pp. 56-64
-
-
Chang, C.-H.1
Kuo, S.-C.2
-
13
-
-
0032309862
-
Generating finite-state transducers for semi-structured data extraction from the Web
-
Hsu C.-N., and Dung M.-T. Generating finite-state transducers for semi-structured data extraction from the Web. Inf. Syst. 23 (1998) 521-538
-
(1998)
Inf. Syst.
, vol.23
, pp. 521-538
-
-
Hsu, C.-N.1
Dung, M.-T.2
-
14
-
-
67349276460
-
Automatic hidden-web table interpretation, conceptualization, and semantic annotation
-
Tao C., and Embley D.W. Automatic hidden-web table interpretation, conceptualization, and semantic annotation. Data Knowl. Eng. 68 (2009) 683-703
-
(2009)
Data Knowl. Eng.
, vol.68
, pp. 683-703
-
-
Tao, C.1
Embley, D.W.2
-
17
-
-
0033225222
-
-
Data Knowl. Eng
-
D.W. Embley, D.M. Campbell, Y.S. Jiang, S.W. Liddle, D.W. Lonsdale, Conceptual-model-based data extraction from multiple-record Web pages, Data Knowl. Eng. (1999).
-
(1999)
Conceptual-model-based data extraction from multiple-record Web
-
-
Embley, D.W.1
Campbell, D.M.2
Jiang, Y.S.3
Liddle, S.W.4
Lonsdale, D.W.5
-
21
-
-
72649094815
-
Episode matching
-
Gautam Das, Rudolf Fleischer, Leszek Gasieniec, Dimitris Gunopulos, Juha Karkkainen, Episode matching, in: Proceedings of the Eighth Annual Symposium on Combinatorial Pattern Matching, 1997.
-
(1997)
Proceedings of the Eighth Annual Symposium on Combinatorial Pattern Matching
-
-
Das, G.1
Fleischer, R.2
Gasieniec, L.3
Gunopulos, D.4
Karkkainen, J.5
-
22
-
-
84865659127
-
Extracting data records from the web using tag path clustering
-
Spain, Madrid
-
Gengxin Miao, Junichi Tatemura, Wang-Pin Hsiung, Arsany Sawires, Louise E. Moser, Extracting data records from the web using tag path clustering, in: Proceedings of the 18th International Conference on World Wide Web, Spain, Madrid, 2009.
-
(2009)
Proceedings of the 18th International Conference on World Wide Web
-
-
Miao, G.1
Tatemura, J.2
Hsiung, W.-P.3
Sawires, A.4
Moser, L.E.5
-
23
-
-
0345566149
-
A guided tour to approximate string matching
-
Navarro G. A guided tour to approximate string matching. ACM Comput. Surv. 33 (2001) 31-88
-
(2001)
ACM Comput. Surv.
, vol.33
, pp. 31-88
-
-
Navarro, G.1
-
24
-
-
0031649136
-
WebOQL: Restructuring documents, databases and webs
-
Gustavo O. Arocena, Alberto O. Mendelzon, WebOQL: restructuring documents, databases and webs, in: Proceedings of the 14th International Conference on Data Engineering, 1998, pp. 24-33.
-
(1998)
Proceedings of the 14th International Conference on Data Engineering
, pp. 24-33
-
-
Arocena, G.O.1
Mendelzon, A.O.2
-
25
-
-
85044217577
-
Automatic extraction of dynamic record sections from search engine result
-
VLDB Endowment, Seoul, Korea
-
Hongkun Zhao, Weiyi Meng, Clement Yu, Automatic extraction of dynamic record sections from search engine result pages, in: Proceedings of the 32nd International Conference on Very Large Data Bases, VLDB Endowment, Seoul, Korea, 2006.
-
(2006)
Proceedings of the 32nd International Conference on Very Large Data Bases
-
-
Zhao, H.1
Meng, W.2
Yu, C.3
-
26
-
-
36849073188
-
Mining templates from search result records of search engines
-
ACM, San Jose, California, USA
-
Zhao H., Meng W., and Yu C. Mining templates from search result records of search engines. Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2007), ACM, San Jose, California, USA
-
(2007)
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
-
-
Zhao, H.1
Meng, W.2
Yu, C.3
-
27
-
-
33744899132
-
Fully automatic wrapper generation for search engines
-
ACM, Japan
-
Zhao H., Meng W., Wu Z., Raghavan V., and Yu C. Fully automatic wrapper generation for search engines. Proceedings of the 14th International Conference on World Wide Web Chiba (2005), ACM, Japan
-
(2005)
Proceedings of the 14th International Conference on World Wide Web Chiba
-
-
Zhao, H.1
Meng, W.2
Wu, Z.3
Raghavan, V.4
Yu, C.5
-
28
-
-
0742268832
-
Mining web informative structures and contents based on entropy analysis
-
Kao H.-Y., Lin S.-H., Ho J.-M., and Chen M.-S. Mining web informative structures and contents based on entropy analysis. IEEE T. Knowl. Data Eng. 16 (2004) 41-55
-
(2004)
IEEE T. Knowl. Data Eng.
, vol.16
, pp. 41-55
-
-
Kao, H.-Y.1
Lin, S.-H.2
Ho, J.-M.3
Chen, M.-S.4
-
29
-
-
0032684968
-
A hierarchical approach to wrapper induction
-
ACM, Washington, United States
-
Muslea I., Minton S., and Knoblock C. A hierarchical approach to wrapper induction. Proceedings of the Third Annual Conference on Autonomous Agents Seattle (1999), ACM, Washington, United States
-
(1999)
Proceedings of the Third Annual Conference on Autonomous Agents Seattle
-
-
Muslea, I.1
Minton, S.2
Knoblock, C.3
-
31
-
-
0008762950
-
Semistructured data: The TSIMMIS experience
-
Joachim Hammer, Jason McHugh, Hector Garcia-Molina, Semistructured data: the TSIMMIS experience, in: Proc. First East-European Symposium Advances in Databases and Information Systems (ADBIS), 1997, pp. 1-8.
-
(1997)
Proc. First East-European Symposium Advances in Databases and Information Systems (ADBIS)
, pp. 1-8
-
-
Hammer, J.1
McHugh, J.2
Garcia-Molina, H.3
-
32
-
-
71049182378
-
Can we learn a template-independent wrapper for news article extraction from a single training site?
-
ACM, France
-
Wang J., Chen C., Wang C., Pei J., Bu J., Guan Z., and Zhang W.V. Can we learn a template-independent wrapper for news article extraction from a single training site?. Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Paris (2009), ACM, France
-
(2009)
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Paris
-
-
Wang, J.1
Chen, C.2
Wang, C.3
Pei, J.4
Bu, J.5
Guan, Z.6
Zhang, W.V.7
-
34
-
-
35448958926
-
AllInOneNews: development and evaluation of a large-scale news metasearch engine
-
ACM, China
-
Liu K.-L., Meng W., Qiu J., Yu C., Raghavan V., Wu Z., Lu Y., He H., and Zhao H. AllInOneNews: development and evaluation of a large-scale news metasearch engine. Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data Beijing (2007), ACM, China
-
(2007)
Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data Beijing
-
-
Liu, K.-L.1
Meng, W.2
Qiu, J.3
Yu, C.4
Raghavan, V.5
Wu, Z.6
Lu, Y.7
He, H.8
Zhao, H.9
-
35
-
-
0018491659
-
The tree-to-tree correction problem
-
Tai K.-C. The tree-to-tree correction problem. J. ACM 26 (1979) 422-433
-
(1979)
J. ACM
, vol.26
, pp. 422-433
-
-
Tai, K.-C.1
-
36
-
-
0033893885
-
XWRAP: An XML-enabled wrapper construction system for web information sources
-
Ling Liu, Calton Pu, Wei Han, XWRAP: an XML-enabled wrapper construction system for web information sources, in: Proceedings of the 16th International Conference on Data Engineering, 2000, pp. 611-621.
-
(2000)
Proceedings of the 16th International Conference on Data Engineering
, pp. 611-621
-
-
Liu, L.1
Pu, C.2
Han, W.3
-
37
-
-
47949107901
-
-
Longzhuang Li, Yonghuai Liu, Abel Obregon, Matt Weatherston, Visual segmentation-based data record extraction from web documents, in: IEEE International Conference on Information Reuse and Integration, IRI 2007, 2007, pp. 502-507.
-
Longzhuang Li, Yonghuai Liu, Abel Obregon, Matt Weatherston, Visual segmentation-based data record extraction from web documents, in: IEEE International Conference on Information Reuse and Integration, IRI 2007, 2007, pp. 502-507.
-
-
-
-
38
-
-
37349086786
-
Extracting lists of data records from semi-structured web pages
-
lvarez M., Pan A., Raposo J., Bellas F., and Cacheda F. Extracting lists of data records from semi-structured web pages. Data Knowl. Eng. 64 (2008) 491-509
-
(2008)
Data Knowl. Eng.
, vol.64
, pp. 491-509
-
-
lvarez, M.1
Pan, A.2
Raposo, J.3
Bellas, F.4
Cacheda, F.5
-
40
-
-
34250196851
-
Integration of association rules and ontologies for semantic query expansion
-
Song M., Song I.-Y., Hu X., and Allen R.B. Integration of association rules and ontologies for semantic query expansion. Data Knowl. Eng. 63 (2007) 63-75
-
(2007)
Data Knowl. Eng.
, vol.63
, pp. 63-75
-
-
Song, M.1
Song, I.-Y.2
Hu, X.3
Allen, R.B.4
-
42
-
-
84976669911
-
Algorithms for string searching
-
Baeza-Yates R.A. Algorithms for string searching. SIGIR Forum 23 (1989) 34-58
-
(1989)
SIGIR Forum
, vol.23
, pp. 34-58
-
-
Baeza-Yates, R.A.1
-
43
-
-
0014757386
-
A general method applicable to the search for similarities in the amino acid sequences of two proteins
-
Needleman S.B., and Wünsch C.D. A general method applicable to the search for similarities in the amino acid sequences of two proteins. J. Mol. Biol. (1970)
-
(1970)
J. Mol. Biol.
-
-
Needleman, S.B.1
Wünsch, C.D.2
-
45
-
-
0032307936
-
Grammars have exceptions
-
Crescenzi V., and Mecca G. Grammars have exceptions. Inf. Syst. 23 (1998) 539-565
-
(1998)
Inf. Syst.
, vol.23
, pp. 539-565
-
-
Crescenzi, V.1
Mecca, G.2
-
47
-
-
0001116877
-
Binary codes capable of correcting deletions, insertions, and reversals
-
Levenshtein V.I. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady 10 (1966) 707
-
(1966)
Soviet Physics Doklady
, vol.10
, pp. 707
-
-
Levenshtein, V.I.1
-
49
-
-
72649105417
-
-
Wei Liu, Xiaofeng Meng, Weiyi Meng, ViDE: a vision-based approach for deep web data extraction, IEEE T. Knowl. Data Eng. (2009).
-
Wei Liu, Xiaofeng Meng, Weiyi Meng, ViDE: a vision-based approach for deep web data extraction, IEEE T. Knowl. Data Eng. (2009).
-
-
-
-
51
-
-
0026185673
-
Identifying syntactic differences between two programs
-
Yang W. Identifying syntactic differences between two programs. Softw. Pract. Exper. 21 (1991) 739-755
-
(1991)
Softw. Pract. Exper.
, vol.21
, pp. 739-755
-
-
Yang, W.1
-
53
-
-
33750797710
-
Structured data extraction from the web based on partial tree alignment
-
Zhai Y., and Liu B. Structured data extraction from the web based on partial tree alignment. IEEE T. Knowl. Data Eng. 18 (2006) 1614-1628
-
(2006)
IEEE T. Knowl. Data Eng.
, vol.18
, pp. 1614-1628
-
-
Zhai, Y.1
Liu, B.2
-
54
-
-
34247869740
-
Extracting web data using instance-based learning
-
Zhai Y., and Liu B. Extracting web data using instance-based learning. World Wide Web 10 (2007) 113-132
-
(2007)
World Wide Web
, vol.10
, pp. 113-132
-
-
Zhai, Y.1
Liu, B.2
-
55
-
-
52649084677
-
Extracting loosely structured data records through mining strict patterns
-
Yipu Wu, Jing Chen, Qing Li, Extracting loosely structured data records through mining strict patterns, in: ICDE 2008, IEEE 24th International Conference on Data Engineering, 2008, pp. 1322-1324.
-
(2008)
ICDE 2008, IEEE 24th International Conference on Data Engineering
, pp. 1322-1324
-
-
Wu, Y.1
Chen, J.2
Li, Q.3
-
56
-
-
72649098560
-
-
Yuan Kui Shen, Automatic record extraction for the World Wide Web, in: Department of Electrical Engineering and Computer Science, Master of Science in Computer Science and Engineering, MIT, 2006.
-
Yuan Kui Shen, Automatic record extraction for the World Wide Web, in: Department of Electrical Engineering and Computer Science, Master of Science in Computer Science and Engineering, MIT, 2006.
-
-
-
-
58
-
-
72649102832
-
-
http://en.wikipedia.org/wiki/Web_template.
-
-
-
|