-
1
-
-
84872661420
-
-
http://www.iopus.com/iMacros
-
-
-
-
2
-
-
84872670215
-
-
http://www.newprosoft.com/web-content-extractor.htm
-
-
-
-
3
-
-
84872657819
-
-
http://www.visualwebripper.com
-
-
-
-
4
-
-
84872650110
-
-
http://www.web-harvest.sourceforge.net
-
-
-
-
5
-
-
84872662003
-
-
http://www.w3.org/TR/CSS2/selector.html
-
-
-
-
6
-
-
63349103097
-
Accessing the deep web: When good ideas go bad
-
Alba, A., Bhagwan, V., Grandison, T.: Accessing the deep web: when good ideas go bad. In: OOPSLA (2008)
-
(2008)
OOPSLA
-
-
Alba, A.1
Bhagwan, V.2
Grandison, T.3
-
7
-
-
70350652136
-
XPath-wrapper induction by generalizing tree traversal patterns
-
Anton, T.:XPath-wrapper induction by generalizing tree traversal patterns. In: LWA (2005)
-
(2005)
LWA
-
-
Anton, T.1
-
8
-
-
84872658571
-
Automating web navigation with the webvcr
-
Anupam, V., Freire, J., Kumar, B., Lieuwen, D.: Automating web navigation with the webvcr. In: WWW (2000)
-
(2000)
WWW
-
-
Anupam, V.1
Freire, J.2
Kumar, B.3
Lieuwen, D.4
-
9
-
-
0031649136
-
Weboql: Restructuring documents, databases, and webs
-
Arocena, G.O., Mendelzon, A.O.: Weboql: Restructuring documents, databases, and webs. In: ICDE (1998)
-
(1998)
ICDE
-
-
Arocena, G.O.1
Mendelzon, A.O.2
-
10
-
-
33947304810
-
L-wrappers: Concepts, properties and construction: A declarative approach to data extraction from web sources
-
Badica, C., Badica, A., Popescu, E., Abraham, A.: L-wrappers: concepts, properties and construction: A declarative approach to data extraction from web sources. Soft Comput. 11(8), 753-772 (2007)
-
(2007)
Soft Comput
, vol.11
, Issue.8
, pp. 753-772
-
-
Badica, C.1
Badica, A.2
Popescu, E.3
Abraham, A.4
-
11
-
-
84880902141
-
Open information extraction from the Web
-
Banko, M., Cafarella, M.J., Soderland, S., Broadhead, M., Etzioni, O.: Open information extraction from the Web. In: IJCAI (2007)
-
(2007)
IJCAI
-
-
Banko, M.1
Cafarella, M.J.2
Soderland, S.3
Broadhead, M.4
Etzioni, O.5
-
13
-
-
69149106280
-
Xpath leashed
-
Benedikt, M., Koch, C.: Xpath leashed. CSUR 41, 3:1-3:54 (2009)
-
(2009)
CSUR
, vol.41
, Issue.3
-
-
Benedikt, M.1
Koch, C.2
-
14
-
-
0003259187
-
The deep web: Surfacing hidden value
-
Bergman, M.K.:The deep web: Surfacing hidden value. J. Electron. Publ. 7(1), 1-17 (2001)
-
(2001)
J. Electron. Publ
, vol.7
, Issue.1
, pp. 1-17
-
-
Bergman, M.K.1
-
15
-
-
57349150271
-
Transcendence: Enabling a personal viewof the deep web
-
Bigham, J.P., Cavender, A.C., Kaminsky, R.S., Prince, C.M., Obison T.S.: Transcendence: enabling a personal viewof the deep web. In: IUI (2008)
-
(2008)
IUI
-
-
Bigham, J.P.1
Cavender, A.C.2
Kaminsky, R.S.3
Prince, C.M.4
Obison, T.S.5
-
16
-
-
3042680184
-
Ubicrawler: A scalable fully distributed web crawler
-
Boldi, P., Codenotti, B., Santini, M., Vigna, S.: Ubicrawler: a scalable fully distributed web crawler. Softw. Practice Experience 34, 711-726 (2004)
-
(2004)
Softw. Practice Experience
, vol.34
, pp. 711-726
-
-
Boldi, P.1
Codenotti, B.2
Santini, M.3
Vigna, S.4
-
17
-
-
33749570363
-
Automation and customization of rendered web pages
-
Bolin, M., Webber, M., Rha, P., Wilson, T., Miller, R.C.:. Automation and customization of rendered web pages. In: UIST (2005)
-
(2005)
UIST
-
-
Bolin, M.1
Webber, M.2
Rha, P.3
Wilson, T.4
Miller, R.C.5
-
18
-
-
0038589165
-
The anatomy of a large-scale hypertextual web search engine
-
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30(1-7), 107-117 (1998)
-
(1998)
Comput. Netw. ISDN Syst
, vol.30
, Issue.1-7
, pp. 107-117
-
-
Brin, S.1
Page, L.2
-
19
-
-
84859197607
-
WebTables: Exploring the power of tables on the web
-
Cafarella, M.J., Halevy, A.Y., Wang, D.Z., Wy, E., Zhang, Y.: WebTables: exploring the power of tables on the web. PVLDB 1(1), 538-549 (2008)
-
(2008)
PVLDB
, vol.1
, Issue.1
, pp. 538-549
-
-
Cafarella, M.J.1
Halevy, A.Y.2
Wang, D.Z.3
Wy, E.4
Zhang, Y.5
-
20
-
-
7444256770
-
Intelligent automated navigation through the deep web
-
Centeno, V.L., Kloos, C.D., Fernández, L.S.: García, N.F.: Intelligent automated navigation through the deep web. In: Advances in Web Intelligence (2004)
-
(2004)
Advances in Web Intelligence
-
-
Centeno, V.L.1
Kloos, C.D.2
Fernández, L.S.3
García, N.F.4
-
21
-
-
33748336500
-
A survey of web information extraction systems
-
Chang, C.-H., Kayed, M., Girgis, M.R., Shaalan, K.F.: A survey of web information extraction systems. TKDE 18(10), 1411-1428 (2006)
-
(2006)
TKDE
, vol.18
, Issue.10
, pp. 1411-1428
-
-
Chang, C.-H.1
Kayed, M.2
Girgis, M.R.3
Shaalan, K.F.4
-
22
-
-
0036373394
-
Roadrunner: Automatic data extraction from data-intensive web sites
-
Crescenzi, V., Mecca, G., Merialdo, P.: Roadrunner: automatic data extraction from data-intensive web sites. In: SIGMOD (2002)
-
(2002)
SIGMOD
-
-
Crescenzi, V.1
Mecca, G.2
Merialdo, P.3
-
23
-
-
17644423946
-
Unsupervised named-entity extrac-tion from the Web: An experimental study
-
Cafarella, M.J., Downey, D., Popescu, A.-M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Unsupervised named-entity extrac-tion from the Web: an experimental study. Artif. Intell. 165(1), 91-134 (2005)
-
(2005)
Artif. Intell
, vol.165
, Issue.1
, pp. 91-134
-
-
Cafarella, M.J.1
Downey, D.2
Popescu, A.-M.3
Shaked, T.4
Soderland, S.5
Weld, D.S.6
Yates, A.7
-
24
-
-
84861044865
-
DIADEM: Domain-centric, intelligent, automated data extraction methodology
-
Furche, T., Gottlob, G., Grasso, G., Gunes, O., Guo, X., Kravchenko, A., Orsi, G., Schallhart, C., Sellers, A., Wang, C.: DIADEM: Domain-centric, intelligent, automated data extraction methodology. In: WWW (2012)
-
(2012)
WWW
-
-
Furche, T.1
Gottlob, G.2
Grasso, G.3
Gunes, O.4
Guo, X.5
Kravchenko, A.6
Orsi, G.7
Schallhart, C.8
Sellers, A.9
Wang, C.10
-
25
-
-
84861058689
-
Oxpath: A language for scalable, memory-efficient data extraction from web applications
-
Furche, T., Gottlob, G., Grasso, G., Schallhart, C., Sellers, A.: Oxpath: A language for scalable, memory-efficient data extraction from web applications. PVLDB 4(11), 1016-1027 (2011)
-
(2011)
PVLDB
, vol.4
, Issue.11
, pp. 1016-1027
-
-
Furche, T.1
Gottlob, G.2
Grasso, G.3
Schallhart, C.4
Sellers, A.5
-
26
-
-
23944498592
-
Efficient algorithms for processing XPath queries
-
Gottlob, G., Koch, C., Pichler, R.: Efficient algorithms for processing XPath queries. In: TODS (2005)
-
(2005)
TODS
-
-
Gottlob, G.1
Koch, C.2
Pichler, R.3
-
27
-
-
1842580895
-
How to build a webfountain: An architecture for very large-scale text analytics
-
Gruhl, D., Chavet, L., Gibson, D., Meyer, J., Pattanayak, P., Tomkins, A., Zien, J.: How to build a webfountain: an architecture for very large-scale text analytics. IBM Syst. J. 43, 64-77 (2004)
-
(2004)
IBM Syst. J
, vol.43
, pp. 64-77
-
-
Gruhl, D.1
Chavet, L.2
Gibson, D.3
Meyer, J.4
Pattanayak, P.5
Tomkins, A.6
Zien, J.7
-
28
-
-
34248146053
-
Accessing the deep web
-
He, B., Patel, M., Zhang, Z., Chang, K.C.-C.: Accessing the deep web. Commun. ACM 50(5), 94-101 (2007)
-
(2007)
Commun. ACM
, vol.50
, Issue.5
, pp. 94-101
-
-
He, B.1
Patel, M.2
Zhang, Z.3
Chang, K.C.-C.4
-
29
-
-
79951675059
-
Mercator: A scalable, extensible web crawler
-
Heydon, A., Najork, M.: Mercator: a scalable, extensible web crawler. World Wide Web 2(4), 219-229 (1999)
-
(1999)
World Wide Web
, vol.2
, Issue.4
, pp. 219-229
-
-
Heydon, A.1
Najork, M.2
-
30
-
-
84861014929
-
Spotting the tracks on the oxpath
-
Kranzdorf, J., Sellers, A., Grasso, G., Schallhart, C., Furche, T: Spotting the tracks on the oxpath. In: WWW (2012)
-
(2012)
WWW
-
-
Kranzdorf, J.1
Sellers, A.2
Grasso, G.3
Schallhart, C.4
Furche, T.5
-
31
-
-
56349100780
-
Coscripter: Automating & sharing how-to knowledge in the enterprise
-
Leshed, G., Haber, E.M., Matthews, T., Lau, T.: Coscripter: automating & sharing how-to knowledge in the enterprise. In: CHI (2008)
-
(2008)
CHI
-
-
Leshed, G.1
Haber, E.M.2
Matthews, T.3
Lau, T.4
-
32
-
-
77953872609
-
End-user programming of mashups with vegemite
-
Lin, J., Wong, J., Nichols, J., Cypher, A., Lau, T.A.: End-user programming of mashups with vegemite. In: IUI (2009)
-
(2009)
IUI
-
-
Lin, J.1
Wong, J.2
Nichols, J.3
Cypher, A.4
Lau, T.A.5
-
33
-
-
0033893885
-
Xwrap: An xml-enabled wrapper construction system for web information sources
-
Liu, L., Pu, C., Han, W.: Xwrap: an xml-enabled wrapper construction system for web information sources. In: ICDE (2000)
-
(2000)
ICDE
-
-
Liu, L.1
Pu, C.2
Han, W.3
-
34
-
-
84963903946
-
A rule-based query language for html
-
Liu, M., Ling, T.W.: A rule-based query language for html. In: DASFAA (2001)
-
(2001)
DASFAA
-
-
Liu, M.1
Ling, T.W.2
-
35
-
-
33745196199
-
Conditional XPath
-
Marx, M.: Conditional XPath. ACM Trans. Database Syst. 30(4), 929-959 (2005)
-
(2005)
ACM Trans. Database Syst
, vol.30
, Issue.4
, pp. 929-959
-
-
Marx, M.1
-
36
-
-
24344444895
-
Semantic characterizations of navigational XPath
-
Marx, M., de Rijke, M.: Semantic characterizations of navigational XPath. ACM SIGMOD Rec. 34(2), 41-46 (2005)
-
(2005)
ACM SIGMOD Rec
, vol.34
, Issue.2
, pp. 41-46
-
-
Marx, M.1
de Rijke, M.2
-
37
-
-
84867493321
-
Querying the world wide web
-
Mendelzon, A.O., Mihaila, G.A., Milo, T.: Querying the world wide web. Int. J. Digit. Libr. 1(1), 54-67 (1997)
-
(1997)
Int. J. Digit. Libr
, vol.1
, Issue.1
, pp. 54-67
-
-
Mendelzon, A.O.1
Mihaila, G.A.2
Milo, T.3
-
38
-
-
84873896990
-
Web-prospector-an automatic, sitewide wrapper induction approach for scientific deep-web databases
-
Mir, S., Staab, S., Rojas, I.: Web-prospector-an automatic, sitewide wrapper induction approach for scientific deep-web databases. In: BTW (2009)
-
(2009)
BTW
-
-
Mir, S.1
Staab, S.2
Rojas, I.3
-
39
-
-
79955148837
-
Automating navigation sequences in ajax websites
-
Montoto, P., Pan, A., Raposo, J., Bellas, F., López, J: Automating navigation sequences in ajax websites. In: ICWE (2009)
-
(2009)
ICWE
-
-
Montoto, P.1
Pan, A.2
Raposo, J.3
Bellas, F.4
López, J.5
-
40
-
-
0037025796
-
Effective web data extraction with standard xml technologies
-
Myllymaki, J.: Effective web data extraction with standard xml technologies. Comput. Netw. 39(5), 635-644 (2002)
-
(2002)
Comput. Netw
, vol.39
, Issue.5
, pp. 635-644
-
-
Myllymaki, J.1
-
41
-
-
84885228765
-
XPath: Looking Forward
-
In:, LNCS 2490
-
Olteanu, D., Meuss, H., Furche, T., Bry, F.: XPath: looking Forward. In: EDBT-XML-Based Data Management, LNCS 2490 (2002)
-
(2002)
EDBT-XML-Based Data Management
-
-
Olteanu, D.1
Meuss, H.2
Furche, T.3
Bry, F.4
-
42
-
-
33750819991
-
The wargo system: Semi-automatic wrapper generation in presence of complex data access modes
-
Raposo, J., Pan, A., Álvarez, M., Hidalgo, J., Viña., A.: The wargo system: semi-automatic wrapper generation in presence of complex data access modes. In: DEXA (2002)
-
(2002)
DEXA
-
-
Raposo, J.1
Pan, A.2
Álvarez, M.3
Hidalgo, J.4
Viña, A.5
-
43
-
-
41149085958
-
Web macros by example: Users managing the www of applications
-
In:,. ACM
-
Safonov, A.: Web macros by example: users managing the www of applications. In: CHI, pp. 71-72. ACM (1999)
-
(1999)
CHI
, pp. 71-72
-
-
Safonov, A.1
-
44
-
-
0002763572
-
Building light-weight wrappers for legacy web data-sources usingw4f
-
Sahuguet, A., Azavant, F.: Building light-weight wrappers for legacy web data-sources usingw4f. In: VLDB, pp. 738-741 (1999)
-
(1999)
VLDB
, pp. 738-741
-
-
Sahuguet, A.1
Azavant, F.2
-
45
-
-
57849149062
-
Wraplet: Wrapping your web contents with a lightweight language
-
Sawa, N., Morishima, A., Sugimoto, S., Kitagawa, H.: Wraplet: Wrapping your web contents with a lightweight language. In: SITIS, pp. 387-394 (2007)
-
(2007)
SITIS
, pp. 387-394
-
-
Sawa, N.1
Morishima, A.2
Sugimoto, S.3
Kitagawa, H.4
-
46
-
-
77951560898
-
Declarative information extraction using datalogwith embedded extraction predicates
-
Shen, W., Doan, A., Naughton, J.F., Ramakrishnan, R: Declarative information extraction using datalogwith embedded extraction predicates. In: VLDB (2007)
-
(2007)
VLDB
-
-
Shen, W.1
Doan, A.2
Naughton, J.F.3
Ramakrishnan, R.4
-
47
-
-
77952357091
-
On design of browseroriented data extraction system and plug-ins
-
Su, J.-Y., Sun, D.-J., Wu, I.-C., Chen, L.-P.: On design of browseroriented data extraction system and plug-ins. J. Mar. Sci. Technol. 18(2), 189-200 (2010)
-
(2010)
J. Mar. Sci. Technol
, vol.18
, Issue.2
, pp. 189-200
-
-
Su, J.-Y.1
Sun, D.-J.2
Wu, I.-C.3
Chen, L.-P.4
|