-
1
-
-
0005540823
-
-
Addison-Wesley, Reading, MA, USA, May
-
R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. Addison-Wesley, Reading, MA, USA, May 1999.
-
(1999)
Modern Information Retrieval
-
-
Baeza-Yates, R.1
Ribeiro-Neto, B.2
-
2
-
-
84944318551
-
Visual Web information extraction with Lixto
-
Roma, Italy
-
R. Baumgartner, S. Flesca, and G. Gottlob. Visual Web information extraction with Lixto. In Proceedings of the 27th International Conference on Very Large Databases, pages 119-128, Roma, Italy, 2001.
-
(2001)
Proceedings of the 27th International Conference on Very Large Databases
, pp. 119-128
-
-
Baumgartner, R.1
Flesca, S.2
Gottlob, G.3
-
3
-
-
8644267730
-
Block-based Web search
-
Sheffield, UK, ACM Press
-
D. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma. Block-based Web search. In Proceedings of the 27th Annual International ACM SIGiR Conference on Research and Development in Information Retrieval, pages 456-463, Sheffield, UK, 2004. ACM Press.
-
(2004)
Proceedings of the 27th Annual International ACM SIGiR Conference on Research and Development in Information Retrieval
, pp. 456-463
-
-
Cai, D.1
Yu, S.2
Wen, J.-R.3
Ma, W.-Y.4
-
4
-
-
85111040505
-
GATE: An architecture for development of robust HLT applications. In ACL
-
Philadelphia, PA, USA, Association for Computational Linguistics
-
H. Cunningham, D. Maynard, K. Bontcheva, and V. Tablan. GATE: An architecture for development of robust HLT applications. In ACL '02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pages 168-175, Philadelphia, PA, USA, 2001. Association for Computational Linguistics.
-
(2001)
02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
, pp. 168-175
-
-
Cunningham, H.1
Maynard, D.2
Bontcheva, K.3
Tablan, V.4
-
5
-
-
4644340823
-
Automatic Web news extraction using tree edit distance
-
New York, NY, USA, ACM Press
-
D. de Castro Reis, P. Golgher, A. Silva, and A. Laender. Automatic Web news extraction using tree edit distance. In Proceedings of the 13th International Conference on World Wide Web, pages 502-511, New York, NY, USA, 2004. ACM Press.
-
(2004)
Proceedings of the 13th International Conference on World Wide Web
, pp. 502-511
-
-
de Castro Reis, D.1
Golgher, P.2
Silva, A.3
Laender, A.4
-
6
-
-
31344433523
-
Blogpulse: Automated trend discovery for weblogs
-
New York, NY, USA
-
N. Glance, M. Hurst, and T. Tomokiyo. Blogpulse: Automated trend discovery for weblogs. In Proceedings of the WWW 2004 Workshop on the Weblogging Ecosystem, New York, NY, USA, 2004.
-
(2004)
Proceedings of the WWW 2004 Workshop on the Weblogging Ecosystem
-
-
Glance, N.1
Hurst, M.2
Tomokiyo, T.3
-
8
-
-
14844363192
-
Automating content extraction of HTML documents
-
S. Gupta, G. Kaiser, P. Grimm, M. Chiang, and J. Starren. Automating content extraction of HTML documents. World Wide Web, 8(2):179-224, 2005.
-
(2005)
World Wide Web
, vol.8
, Issue.2
, pp. 179-224
-
-
Gupta, S.1
Kaiser, G.2
Grimm, P.3
Chiang, M.4
Starren, J.5
-
9
-
-
84880498138
-
DOM-based content extraction of HTML documents
-
Budapest, Hungary, ACM Press
-
S. Gupta, G. Kaiser, D. Neistadt, and P. Grimm. DOM-based content extraction of HTML documents. In Proceedings of the 12th International Conference on World Wide Web, pages 207-214, Budapest, Hungary, 2003. ACM Press.
-
(2003)
Proceedings of the 12th International Conference on World Wide Web
, pp. 207-214
-
-
Gupta, S.1
Kaiser, G.2
Neistadt, D.3
Grimm, P.4
-
10
-
-
0029535737
-
Particle swarm optimization
-
Piscataway, NJ, USA, IEEE Computer Society
-
J. Kennedy and R. Eberhart. Particle swarm optimization. In Proceedings of IEEE International Conference on Neural Networks, pages 1942-1948, Piscataway, NJ, USA, 1995. IEEE Computer Society.
-
(1995)
Proceedings of IEEE International Conference on Neural Networks
, pp. 1942-1948
-
-
Kennedy, J.1
Eberhart, R.2
-
11
-
-
0001776223
-
Wrapper induction for information extraction
-
Nagoya, Japan, Morgan Kaufmann
-
N. Kushmerick, D. Weld, and R. Doorenbos. Wrapper induction for information extraction. In International Joint Conference on Artificial Intelligence, pages 729-737, Nagoya, Japan, 1997. Morgan Kaufmann.
-
(1997)
International Joint Conference on Artificial Intelligence
, pp. 729-737
-
-
Kushmerick, N.1
Weld, D.2
Doorenbos, R.3
-
12
-
-
0242456776
-
Discovering informative content blocks from Web documents
-
Edmonton, Alberta, Canada, ACM Press
-
S.-H. Lin and J.-M. Ho. Discovering informative content blocks from Web documents. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 588-593, Edmonton, Alberta, Canada, 2002. ACM Press.
-
(2002)
Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 588-593
-
-
Lin, S.-H.1
Ho, J.-M.2
-
13
-
-
34250653315
-
Detecting spam Web pages through content analysis
-
Edinburgh, Scotland, ACM Press
-
A. Ntoulas, M. Najork, M. Manasse, and D. Fetterly. Detecting spam Web pages through content analysis. In Proceedings of the 15th International Conference on World Wide Web, pages 83-92, Edinburgh, Scotland, 2006. ACM Press.
-
(2006)
Proceedings of the 15th International Conference on World Wide Web
, pp. 83-92
-
-
Ntoulas, A.1
Najork, M.2
Manasse, M.3
Fetterly, D.4
-
15
-
-
33745794109
-
VIPER: Augmenting automatic information extraction with visual perceptions
-
Bremen, Germany, November, ACM Press
-
K. Simon and G. Lausen. VIPER: Augmenting automatic information extraction with visual perceptions. In Proceedings of the 2005 ACM CIKM Conference on Information and Knowledge Management, pages 381-388, Bremen, Germany, November 2005. ACM Press.
-
(2005)
Proceedings of the 2005 ACM CIKM Conference on Information and Knowledge Management
, pp. 381-388
-
-
Simon, K.1
Lausen, G.2
-
16
-
-
18744381159
-
Learning block importance models for Web
-
New York, NY, USA, ACM Press
-
R. Song, H. Liu, J.-R. Wen, and W.-Y. Ma. Learning block importance models for Web pages. In Proceedings of the 13th International Conference on World Wide Web, pages 203-211, New York, NY, USA, 2004. ACM Press.
-
(2004)
Proceedings of the 13th International Conference on World Wide Web
, pp. 203-211
-
-
Song, R.1
Liu, H.2
Wen, J.-R.3
Ma, W.-Y.4
-
17
-
-
18744385246
-
Web unit mining: Finding and classifying subgraphs of Web
-
New Orleans, LA, USA, ACM Press
-
A. Sun and E.-P. Lim. Web unit mining: Finding and classifying subgraphs of Web pages. In Proceedings of the 12th International Conference on Information and Knowledge Management, pages 108-115, New Orleans, LA, USA, 2003. ACM Press.
-
(2003)
Proceedings of the 12th International Conference on Information and Knowledge Management
, pp. 108-115
-
-
Sun, A.1
Lim, E.-P.2
-
18
-
-
42549131779
-
The mining and extraction of primary informative blocks and data objects from systematic Web
-
Hong Kong, China, IEEE Computer Society
-
Y.-F. Tseng and H.-Y Kao. The mining and extraction of primary informative blocks and data objects from systematic Web pages. In Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence, pages 370-373, Hong Kong, China, 2006. IEEE Computer Society.
-
(2006)
Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
, pp. 370-373
-
-
Tseng, Y.-F.1
Kao, H.-Y.2
-
19
-
-
70549092342
-
HTML page analysis based on visual cues
-
Washington, DC, USA, IEEE Computer Society
-
Y. Yang and H.-J. Zhang. HTML page analysis based on visual cues. In Proceedings of the Sixth International Conference on Document Analysis and Recognition, pages 859-864, Washington, DC, USA, 2001. IEEE Computer Society.
-
(2001)
Proceedings of the Sixth International Conference on Document Analysis and Recognition
, pp. 859-864
-
-
Yang, Y.1
Zhang, H.-J.2
-
21
-
-
42549150992
-
Towards automated reputation and brand monitoring on the web
-
Hong Kong, China, December, IEEE Computer Society Press
-
C.-N. Ziegler and M. Skubacz. Towards automated reputation and brand monitoring on the web. In Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence, pages 1066-1070, Hong Kong, China, December 2006. IEEE Computer Society Press.
-
(2006)
Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
, pp. 1066-1070
-
-
Ziegler, C.-N.1
Skubacz, M.2
|