-
1
-
-
0031649136
-
WebOQL: Restructuring documents, databases, and webs
-
G.O. Arocena and A.O. Mendelzon, "WebOQL: Restructuring Documents, Databases, and Webs," Proc. Int'l Conf. Data Eng. (ICDE), pp. 24-33, 1998.
-
(1998)
Proc. Int'l Conf. Data Eng. (ICDE)
, pp. 24-33
-
-
Arocena, G.O.1
Mendelzon, A.O.2
-
2
-
-
0035000412
-
A fully automated object extraction system for the world wide web
-
D. Buttler, L. Liu, and C. Pu, "A Fully Automated Object Extraction System for the World Wide Web," Proc. Int'l Conf. Distributed Computing Systems (ICDCS), pp. 361-370, 2001.
-
(2001)
Proc. Int'l Conf. Distributed Computing Systems (ICDCS)
, pp. 361-370
-
-
Buttler, D.1
Liu, L.2
Pu, C.3
-
3
-
-
8644241107
-
Block-level link analysis
-
D. Cai, X. He, J.-R. Wen, and W.-Y. Ma, "Block-Level Link Analysis," Proc. SIGIR, pp. 440-447, 2004.
-
(2004)
Proc. SIGIR
, pp. 440-447
-
-
Cai, D.1
He, X.2
Wen, J.-R.3
Ma, W.-Y.4
-
4
-
-
21144444733
-
Extracting content structure for web pages based on visual representation
-
D. Cai, S. Yu, J. Wen, and W. Ma, "Extracting Content Structure for Web Pages Based on Visual Representation," Proc. Asia Pacific Web Conf. (APWeb), pp. 406-417, 2003.
-
(2003)
Proc. Asia Pacific Web Conf. (APWeb)
, pp. 406-417
-
-
Cai, D.1
Yu, S.2
Wen, J.3
Ma, W.4
-
5
-
-
33748336500
-
A survey of web information extraction systems
-
Oct.
-
C.-H. Chang, M. Kayed, M.R. Girgis, and K.F. Shaalan, "A Survey of Web Information Extraction Systems," IEEE Trans. Knowledge and Data Eng., vol.18, no.10, pp. 1411-1428, Oct. 2006.
-
(2006)
IEEE Trans. Knowledge and Data Eng.
, vol.18
, Issue.10
, pp. 1411-1428
-
-
Chang, C.-H.1
Kayed, M.2
Girgis, M.R.3
Shaalan, K.F.4
-
6
-
-
0037375290
-
Automatic information extraction from semi-structured web pages by pattern discovery
-
C.-H. Chang, C.-N. Hsu, and S.-C. Lui, "Automatic Information Extraction from Semi-Structured Web Pages by Pattern Discovery," Decision Support Systems, vol.35, no.1, pp. 129-147, 2003.
-
(2003)
Decision Support Systems
, vol.35
, Issue.1
, pp. 129-147
-
-
Chang, C.-H.1
Hsu, C.-N.2
Lui, S.-C.3
-
7
-
-
0032307936
-
Grammars have exceptions
-
V. Crescenzi and G. Mecca, "Grammars Have Exceptions," Information Systems, vol.23, no.8, pp. 539-565, 1998.
-
(1998)
Information Systems
, vol.23
, Issue.8
, pp. 539-565
-
-
Crescenzi, V.1
Mecca, G.2
-
8
-
-
84944327150
-
RoadRunner: Towards automatic data extraction from large web sites
-
V. Crescenzi, G. Mecca, and P. Merialdo, "RoadRunner: Towards Automatic Data Extraction from Large Web Sites," Proc. Int'l Conf. Very Large Data Bases (VLDB), pp. 109-118, 2001.
-
(2001)
Proc. Int'l Conf. Very Large Data Bases (VLDB)
, pp. 109-118
-
-
Crescenzi, V.1
Mecca, G.2
Merialdo, P.3
-
9
-
-
0346501095
-
Record-boundary discovery in web documents
-
D.W. Embley, Y.S. Jiang, and Y.-K. Ng, "Record-Boundary Discovery in Web Documents," Proc. ACM SIGMOD, pp. 467-478, 1999.
-
(1999)
Proc. ACM SIGMOD
, pp. 467-478
-
-
Embley, D.W.1
Jiang, Y.S.2
Ng, Y.-K.3
-
10
-
-
35348900845
-
Towards domain independent information extraction from web tables
-
W. Gatterbauer, P. Bohunsky, M. Herzog, B. Krpl, and B. Pollak, "Towards Domain Independent Information Extraction from Web Tables," Proc. Int'l World Wide Web Conf. (WWW), pp. 71-80, 2007.
-
(2007)
Proc. Int'l World Wide Web Conf. (WWW)
, pp. 71-80
-
-
Gatterbauer, W.1
Bohunsky, P.2
Herzog, M.3
Krpl, B.4
Pollak, B.5
-
11
-
-
0008762950
-
Semistructured data: The TSIMMIS experience
-
J. Hammer, J. McHugh, and H. Garcia-Molina, "Semistructured Data: The TSIMMIS Experience," Proc. East-European Workshop Advances in Databases and Information Systems (ADBIS), pp. 1-8, 1997.
-
(1997)
Proc. East-European Workshop Advances in Databases and Information Systems (ADBIS)
, pp. 1-8
-
-
Hammer, J.1
McHugh, J.2
Garcia-Molina, H.3
-
12
-
-
0032309862
-
Generating finite-state transducers for semi-structured data extraction from the web
-
C.-N. Hsu and M.-T. Dung, "Generating Finite-State Transducers for Semi-Structured Data Extraction from the Web," Information Systems, vol.23, no.8, pp. 521-538, 1998.
-
(1998)
Information Systems
, vol.23
, Issue.8
, pp. 521-538
-
-
Hsu, C.-N.1
Dung, M.-T.2
-
13
-
-
76749083388
-
-
http://daisen.cc.kyushu-u.ac.jp/TBDW/, 2009.
-
(2009)
-
-
-
14
-
-
76749125886
-
-
http://www.w3.org/html/wg/html5/, 2009.
-
(2009)
-
-
-
15
-
-
0034172374
-
Wrapper induction: Efficiency and expressiveness
-
N. Kushmerick, "Wrapper Induction: Efficiency and Expressiveness," Artificial Intelligence, vol.118, nos. 1/2, pp. 15-68, 2000.
-
(2000)
Artificial Intelligence
, vol.118
, Issue.1-2
, pp. 15-68
-
-
Kushmerick, N.1
-
16
-
-
0037806547
-
A brief survey of web data extraction tools
-
A. Laender, B. Ribeiro-Neto, A. da Silva, and J. Teixeira, "A Brief Survey of Web Data Extraction Tools," SIGMOD Record, vol.31, no.2, pp. 84-93, 2002.
-
(2002)
SIGMOD Record
, vol.31
, Issue.2
, pp. 84-93
-
-
Laender, A.1
Ribeiro-Neto, B.2
Da Silva, A.3
Teixeira, J.4
-
17
-
-
77952333945
-
Mining data records in web pages
-
B. Liu, R.L. Grossman, and Y. Zhai, "Mining Data Records in Web Pages," Proc. Int'l Conf. Knowledge Discovery and Data Mining (KDD), pp. 601-606, 2003.
-
(2003)
Proc. Int'l Conf. Knowledge Discovery and Data Mining (KDD)
, pp. 601-606
-
-
Liu, B.1
Grossman, R.L.2
Zhai, Y.3
-
18
-
-
43949093039
-
Vision-based web data records extraction
-
June
-
W. Liu, X. Meng, and W. Meng, "Vision-Based Web Data Records Extraction," Proc. Int'l Workshop Web and Databases (WebDB '06), pp. 20-25, June 2006.
-
(2006)
Proc. Int'l Workshop Web and Databases (WebDB '06)
, pp. 20-25
-
-
Liu, W.1
Meng, X.2
Meng, W.3
-
19
-
-
0033893885
-
XWRAP: An XML-enabled wrapper construction system for web information sources
-
L. Liu, C. Pu, and W. Han, "XWRAP: An XML-Enabled Wrapper Construction System for Web Information Sources," Proc. Int'l Conf. Data Eng. (ICDE), pp. 611-621, 2000.
-
(2000)
Proc. Int'l Conf. Data Eng. (ICDE)
, pp. 611-621
-
-
Liu, L.1
Pu, C.2
Han, W.3
-
20
-
-
34548707913
-
Annotating structured data of the deep Web
-
DOI 10.1109/ICDE.2007.367883, 4221686, 23rd International Conference on Data Engineering, ICDE 2007
-
Y. Lu, H. He, H. Zhao, W. Meng, and C.T. Yu, "Annotating Structured Data of the Deep Web," Proc. Int'l Conf. Data Eng. (ICDE), pp. 376-385, 2007. (Pubitemid 47422041)
-
(2007)
Proceedings - International Conference on Data Engineering
, pp. 376-385
-
-
Lu, Y.1
He, H.2
Zhao, H.3
Meng, W.4
Yu, C.5
-
21
-
-
84858669792
-
Web-scale data integration: You can only afford to pay as you go
-
J. Madhavan, S.R. Jeffery, S. Cohen, X.L. Dong, D. Ko, C. Yu, and A. Halevy, "Web-Scale Data Integration: You Can Only Afford to Pay As You Go," Proc. Conf. Innovative Data Systems Research (CIDR), pp. 342-350, 2007.
-
(2007)
Proc. Conf. Innovative Data Systems Research (CIDR)
, pp. 342-350
-
-
Madhavan, J.1
Jeffery, S.R.2
Cohen, S.3
Dong, X.L.4
Ko, D.5
Yu, C.6
Halevy, A.7
-
22
-
-
0035587215
-
Hierarchical wrapper induction for semi-structured information sources
-
I. Muslea, S. Minton, and C.A. Knoblock, "Hierarchical Wrapper Induction for Semi-Structured Information Sources," Autonomous Agents and Multi-Agent Systems, vol.4, nos. 1/2, pp. 93-114, 2001.
-
(2001)
Autonomous Agents and Multi-Agent Systems
, vol.4
, Issue.1-2
, pp. 93-114
-
-
Muslea, I.1
Minton, S.2
Knoblock, C.A.3
-
23
-
-
38549134414
-
Object-level vertical search
-
Z. Nie, J.-R. Wen, and W.-Y. Ma, "Object-Level Vertical Search," Proc. Conf. Innovative Data Systems Research (CIDR), pp. 235-246, 2007.
-
(2007)
Proc. Conf. Innovative Data Systems Research (CIDR)
, pp. 235-246
-
-
Nie, Z.1
Wen, J.-R.2
Ma, W.-Y.3
-
24
-
-
0343725648
-
Building intelligent web applications using lightweight wrappers
-
A. Sahuguet and F. Azavant, "Building Intelligent Web Applications Using Lightweight Wrappers," Data and Knowledge Eng., vol.36, no.3, pp. 283-316, 2001.
-
(2001)
Data and Knowledge Eng.
, vol.36
, Issue.3
, pp. 283-316
-
-
Sahuguet, A.1
Azavant, F.2
-
26
-
-
18744381159
-
Learning block importance models for web pages
-
R. Song, H. Liu, J.-R. Wen, and W.-Y. Ma, "Learning Block Importance Models for Web Pages," Proc. Int'l World Wide Web Conf. (WWW), pp. 203-211, 2004.
-
(2004)
Proc. Int'l World Wide Web Conf. (WWW)
, pp. 203-211
-
-
Song, R.1
Liu, H.2
Wen, J.-R.3
Ma, W.-Y.4
-
28
-
-
32044447684
-
Efficient browsing of web search results on mobile devices based on block importance model
-
X. Xie, G. Miao, R. Song, J.-R. Wen, and W.-Y. Ma, "Efficient Browsing of Web Search Results on Mobile Devices Based on Block Importance Model," Proc. IEEE Int'l Conf. Pervasive Computing and Comm. (PerCom), pp. 17-26, 2005.
-
(2005)
Proc. IEEE Int'l Conf. Pervasive Computing and Comm. (PerCom)
, pp. 17-26
-
-
Xie, X.1
Miao, G.2
Song, R.3
Wen, J.-R.4
Ma, W.-Y.5
-
30
-
-
33744899132
-
Fully automatic wrapper generation for search engines
-
H. Zhao, W. Meng, Z. Wu, V. Raghavan, and C.T. Yu, "Fully Automatic Wrapper Generation for Search Engines," Proc. Int'l World Wide Web Conf. (WWW), pp. 66-75, 2005.
-
(2005)
Proc. Int'l World Wide Web Conf. (WWW)
, pp. 66-75
-
-
Zhao, H.1
Meng, W.2
Wu, Z.3
Raghavan, V.4
Yu, C.T.5
-
31
-
-
85044217577
-
Automatic extraction of dynamic record sections from search engine result pages
-
H. Zhao, W. Meng, and C.T. Yu, "Automatic Extraction of Dynamic Record Sections from Search Engine Result Pages," Proc. Int'l Conf. Very Large Data Bases (VLDB), pp. 989-1000, 2006.
-
(2006)
Proc. Int'l Conf. Very Large Data Bases (VLDB)
, pp. 989-1000
-
-
Zhao, H.1
Meng, W.2
Yu, C.T.3
-
32
-
-
33749623896
-
Simultaneous record detection and attribute labeling in web data extraction
-
J. Zhu, Z. Nie, J. Wen, B. Zhang, and W. Ma, "Simultaneous Record Detection and Attribute Labeling in Web Data Extraction," Proc. Int'l Conf. Knowledge Discovery and Data Mining (KDD), pp. 494-503, 2006.
-
(2006)
Proc. Int'l Conf. Knowledge Discovery and Data Mining (KDD)
, pp. 494-503
-
-
Zhu, J.1
Nie, Z.2
Wen, J.3
Zhang, B.4
Ma, W.5
|