[1] M. Álvarez, J. Raposo, F. Cacheda, A. Pan, A Task-specific approach for crawling the deep web, Journal Engineering Letters. Special Issue: Advances in Information Engineering 13 (2) (2006) 204-215.
[2] V. Anupam, J. Freire, B. Kumar, D. Lieuwen, Automating Web Navigation with WebVCR, in: Proceedings of the 9th International World Wide Web Conference, 2000, pp. 503-517.
[3] A. Arasu, H. García-Molina, Extracting structured data from web pages, in: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003, pp. 337-348.
[4] L. Barbosa, J. Freire, Searching for hidden-web databases, in: Proceedings of the ACM WebDB Workshop, 2005, pp. 1-6.
[5] A. Bergholz, B. Chidlovskii, Crawling for domain-specific hidden web resources, in: Proceedings of the 4th International Conference on Web Information Systems Engineering (WISE), 2003, pp. 125-133.
[6] K.C.-C. Chang, B. He, Z. Zhang, Toward large scale integration: building a metaquerier over databases on the web, in: Proceedings of the Second Conference on Innovative Data Systems Research (CIDR), 2005, pp. 44-55.
[7] W. Cohen, P. Ravikumar, S. Fienberg, A comparison of string distance metrics for name-matching tasks, in: Proceedings of IJCAI-03 Workshop on Information Integration on the Web (IIWeb), 2003, pp. 73-78.
[8] V. Crescenzi, G. Mecca, P. Merialdo, RoadRunner: towards automatic data extraction from large web sites, in: Proceedings of the 27th International Conference on Very Large Databases (VLDB), 2001, pp. 109-118.
[9] E. Dragut, W. Wu, P. Sistla, C. Yu, W. Meng, Merging source query interfaces on web databases, in: Proceedings of the 22nd International Conference on Data Engineering (ICDE), 2006, pp. 679-690.
[10] U. Irmak, T. Suel, Interactive wrapper generation with minimal user effort, in: Proceedings of the World Wide Web Conference (WWW), 2006, pp. 553-563.
[11] C.A. Knoblock, K. Lerman, S. Minton, I. Muslea, Accurately and reliably extracting data from the web: a machine learning approach, IEEE Data Engineering Bulletin 23 (4) (2000) 33-41.
[12] N. Kushmerick, Regression testing for wrapper maintenance, in: Proceedings of the Sixteenth National Conference on Artificial Intelligence and Innovative Applications of Artificial Intelligence, 1999, pp. 74-79.
[13] N. Kushmerick, Wrapper induction: efficiency and expressiveness, Artificial Intelligence 118 (2000) 15-68.
[14] N. Kushmerick, Learning to invoke Web forms, in: Proceedings of Ontologies, Databases and Applications of Semantics (ODBASE) International Conference, 2003, pp. 997-1013.
[15] A.H.F. Laender, B.A. Ribeiro-Neto, A. Soares da Silva, J.S. Teixeira, A brief survey of web data extraction tools, ACM SIGMOD Record 31 (2) (2002) 84-93.
[17] S.W. Liddle, D.W. Embley, D.T. Scott, S.H. Yau, Extracting data behind web forms, in: Proceedings of the 28th International Conference on Very Large Databases (VLDB), 2002, pp. 402-413.
[18] X. Meng, D. Hu, C. Li, Schema-guided wrapper maintenance for web data extraction, in: Proceedings of the ACM Fifth International Workshop on Web Information and Data Management (WIDM), 2003, pp. 1-8.
[19] R. Mohapatra, K. Rajaraman, S. Sam Yuan, Efficient wrapper reinduction from dynamic web sources, in: Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, 2004, pp. 391-397.
[20] A. Pan, J. Raposo, M. Álvarez, J. Hidalgo, A. Viña, Semi-automatic wrapper generation for commercial web sources, in: Proceedings of IFIP WG8.1 Working Conference on Engineering Information Systems in the Internet Context (EISIC), 2002, pp. 265-283.
[21] A. Pan, J. Raposo, M. Álvarez, P. Montoto, J. Losada, J. Hidalgo, ITPilot: A toolkit for industrial-strength web data extraction, in: Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI), 2005, pp. 798-801.
[22] S. Raghavan, H. García-Molina, Crawling the hidden Web, in: Proceedings of the 27th Conference on Very Large Databases (VLDB), 2001, pp. 129-138.
[23] J. Raposo, A. Pan, M. Álvarez, J. Hidalgo, Automatically maintaining wrappers for web sources, Data & Knowledge Engineering 61 (2) (2007) 331-358.
[24] G. Wiederhold, Mediators in the architecture of future information systems, Computer 25 (3) (1992) 38-49.
[25] Y. Zhai, B. Liu, Structured data extraction from the web based on partial tree alignment, IEEE Transactions on Knowledge and Data Engineering 18 (12) (2006) 1614-1628.
[26] Z. Zhang, B. He, K.C.-C. Chang, Understanding web query interfaces: best-effort parsing with hidden syntax, in: Proceedings of the 2004 ACM SIGMOD Conference, 2004, pp. 107-118.