-
1
-
-
0040079611
-
On the design of a learning crawler for topical resource discovery
-
AGGARWAL, C. C., AL-GARAWI, P., AND Yu, P. S. On the design of a learning crawler for topical resource discovery. ACM Transactions on Information Systems 19, 3 (2001), 286-309.
-
(2001)
ACM Transactions on Information Systems
, vol.19
, Issue.3
, pp. 286-309
-
-
Aggarwal, C.C.1
Al-Garawi, P.2
Yu, P.S.3
-
3
-
-
1142303684
-
Extracting structured data from web pages
-
San Diego, CA, USA
-
ARASU, A., AND GARCIA-MOLINA, H. Extracting structured data from web pages. In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data (San Diego, CA, USA, 2003), pp. 337-348.
-
(2003)
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data
, pp. 337-348
-
-
Arasu, A.1
Garcia-Molina, H.2
-
4
-
-
0038494628
-
Web-dl: An experience in building digital libraries from the web
-
McLean, VA, USA
-
CALADO, P., ET AL. Web-dl: an experience in building digital libraries from the web. In Proceedings of the ACM International Conference on Information and Knowledge Management (McLean, VA, USA, 2002), pp. 675-677.
-
(2002)
Proceedings of the ACM International Conference on Information and Knowledge Management
, pp. 675-677
-
-
Calado, P.1
-
5
-
-
77953064623
-
Accelerated focused crawling through online relevance feedback
-
Honolulu, HI, USA
-
CHAKRABARTI, S., PUNERA, K.. AND SUBRAMANYAM, M. Accelerated focused crawling through online relevance feedback. In Proceedings of the 11th International World Wide Web Conference (Honolulu, HI, USA, 2002), pp. 148-159.
-
(2002)
Proceedings of the 11th International World Wide Web Conference
, pp. 148-159
-
-
Chakrabarti, S.1
Punera, K.2
Subramanyam, M.3
-
6
-
-
0033294474
-
Focused crawling: A new approach to topic-specific web resource discovery
-
CHAKRABARTI, S., VAN DEN BERG, M., AND DOM, B. Focused crawling: A new approach to topic-specific web resource discovery. Computer Networks 31, 11-16 (1999), 1623-1640.
-
(1999)
Computer Networks
, vol.31
, Issue.11-16
, pp. 1623-1640
-
-
Chakrabarti, S.1
Van Den Berg, M.2
Dom, B.3
-
8
-
-
14544282729
-
The use of web structure and content to identify subjectively interesting web usage patterns
-
COOLEY, R. The use of web structure and content to identify subjectively interesting web usage patterns. ACM Transactions on Internet Technology 3, 2 (2003), 93-116.
-
(2003)
ACM Transactions on Internet Technology
, vol.3
, Issue.2
, pp. 93-116
-
-
Cooley, R.1
-
9
-
-
18844436436
-
Clustering web pages based on their structure
-
CRESCENZI, V., MERIALDO, P., AND MISSIER, P. Clustering web pages based on their structure. Data and Knowledge Engineering 54, 3 (2004), 277-393.
-
(2004)
Data and Knowledge Engineering
, vol.54
, Issue.3
, pp. 277-393
-
-
Crescenzi, V.1
Merialdo, P.2
Missier, P.3
-
10
-
-
0345870306
-
A layered architecture for querying dynamic web content
-
Philadelphia, PY, USA
-
DAVULCU, H., ET AL. A layered architecture for querying dynamic web content. In Proceedings of the ACM SIGMOD International Conference on Management of Data (Philadelphia, PY, USA, 1999), pp. 491-502.
-
(1999)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 491-502
-
-
Davulcu, H.1
-
12
-
-
4644340823
-
Automatic web news extraction using tree edit distance
-
New York, NY, USA
-
DE CASTRO REIS, D., ET AL. Automatic web news extraction using tree edit distance. In Proceedings of the 13th international conference on World Wide Web (New York, NY, USA, 2004), pp. 502-511.
-
(2004)
Proceedings of the 13th International Conference on World Wide Web
, pp. 502-511
-
-
De Castro Reis, D.1
-
13
-
-
1842832183
-
Automatic generation of agents for collecting hidden web pages for data extraction
-
LAGE. J. P., ET AL. Automatic generation of agents for collecting hidden web pages for data extraction. Data an Knowledge Engineering 49, 2 (2004), 177-196.
-
(2004)
Data An Knowledge Engineering
, vol.49
, Issue.2
, pp. 177-196
-
-
Lage, J.P.1
-
14
-
-
18744408155
-
Probabilistic models for focused web crawling
-
Washington, DC, USA
-
LIU, H., MILIOS, E. E., AND JANSSEN, J. Probabilistic models for focused web crawling. In Procendings of the ACM CIKM International Workshop on Web Information and Data Management (Washington, DC, USA, 2004), pp. 16-22.
-
(2004)
Procendings of the ACM CIKM International Workshop on Web Information and Data Management
, pp. 16-22
-
-
Liu, H.1
Milios, E.E.2
Janssen, J.3
-
15
-
-
85006710010
-
Breadth-first crawling yields high-quality pages
-
Hong Kong, China
-
NAJORK, M., AND WIENER, J. L. Breadth-first crawling yields high-quality pages. In Proceedings of the 10th International World Wide Web Conference (Hong Kong, China, 2001), pp. 114-118.
-
(2001)
Proceedings of the 10th International World Wide Web Conference
, pp. 114-118
-
-
Najork, M.1
Wiener, J.L.2
-
16
-
-
4944246916
-
Building domain-specific web collections for scientific digital libraries: A meta-search enhanced focused crawling method
-
Tuscon, AZ, USA
-
QIN, J., ZHOU, Y., AND CHAU, M. Building domain-specific web collections for scientific digital libraries: a meta-search enhanced focused crawling method. In Joint Conference on Digital Libraries (Tuscon, AZ, USA, 2004), pp. 135-141.
-
(2004)
Joint Conference on Digital Libraries
, pp. 135-141
-
-
Qin, J.1
Zhou, Y.2
Chau, M.3
-
17
-
-
0001122858
-
The tree-to-tree editing problem
-
Dec.
-
SELKOW, S. M. The tree-to-tree editing problem. Information Processing Letters 6 (Dec. 1977), 184-186.
-
(1977)
Information Processing Letters
, vol.6
, pp. 184-186
-
-
Selkow, S.M.1
-
18
-
-
0026185673
-
Identifying syntactic differences between two programs
-
July
-
YANG, W. Identifying syntactic differences between two programs. Software - Practice And Experience 21, 7 (July 1991), 739-755.
-
(1991)
Software - Practice and Experience
, vol.21
, Issue.7
, pp. 739-755
-
-
Yang, W.1
-
19
-
-
33744821948
-
Web data extraction based on partial tree alignment
-
Chiba, Japan
-
ZHAI, Y., AND Liu, B. Web data extraction based on partial tree alignment. In Proceedings of the 14th international conference on World Wide Web (Chiba, Japan, 2005), pp. 76-85.
-
(2005)
Proceedings of the 14th International Conference on World Wide Web
, pp. 76-85
-
-
Zhai, Y.1
Liu, B.2
|