-
1
-
-
0029757055
-
Information extraction
-
J. Cowie and W. Lehnert, "Information Extraction," Comm. ACM, vol.39, no.1, pp. 80-91, 1996.
-
(1996)
Comm. ACM
, vol.39
, Issue.1
, pp. 80-91
-
-
Cowie, J.1
Lehnert, W.2
-
2
-
-
0031360033
-
Empirical methods in information extraction
-
C. Cardie, "Empirical Methods in Information Extraction," AI Magazine, vol.18, no.4, pp. 65-80, 1997.
-
(1997)
AI Magazine
, vol.18
, Issue.4
, pp. 65-80
-
-
Cardie, C.1
-
3
-
-
84944318551
-
Visual web information extraction with lixto
-
R. Baumgartner, S. Flesca, and G. Gottlob, "Visual Web Information Extraction with Lixto," Proc. Conf. Very Large Data Bases (VLDB), pp. 119-128, 2001.
-
(2001)
Proc. Conf. Very Large Data Bases (VLDB)
, pp. 119-128
-
-
Baumgartner, R.1
Flesca, S.2
Gottlob, G.3
-
4
-
-
1142303684
-
Extracting structured data from web pages
-
A. Arasu and H. Garcia-Molina, "Extracting Structured Data from Web Pages," Proc. ACM SIGMOD, pp. 337-348, 2003.
-
(2003)
Proc. ACM SIGMOD
, pp. 337-348
-
-
Arasu, A.1
Garcia-Molina, H.2
-
5
-
-
0346501095
-
Record-boundary discovery in web documents
-
D.W. Embley, Y.S. Jiang, and Y.-K. Ng, "Record-Boundary Discovery in Web Documents," Proc. ACM SIGMOD, pp. 467- 478, 1999.
-
(1999)
Proc. ACM SIGMOD
, pp. 467-478
-
-
Embley, D.W.1
Jiang, Y.S.2
Ng, Y.-K.3
-
6
-
-
0034172374
-
Wrapper induction: Efficiency and expressiveness
-
N. Kushmerick, "Wrapper Induction: Efficiency and Expressiveness," Artificial Intelligence, vol.118, nos. 1/2, pp. 15-68, 2000.
-
(2000)
Artificial Intelligence
, vol.118
, Issue.1-2
, pp. 15-68
-
-
Kushmerick, N.1
-
7
-
-
1542385454
-
Wrapper maintenance: A machine learning approach
-
K. Lerman, S. Minton, and C.A. Knoblock, "Wrapper Maintenance: A Machine Learning Approach," J. Artificial Intelligence Research (JAIR), vol.18, pp. 149-181, 2003.
-
(2003)
J. Artificial Intelligence Research (JAIR)
, vol.18
, pp. 149-181
-
-
Lerman, K.1
Minton, S.2
Knoblock, C.A.3
-
8
-
-
0035587215
-
Hierarchical wrapper induction for semistructured information sources
-
I. Muslea, S. Minton, and C.A. Knoblock, "Hierarchical Wrapper Induction for Semistructured Information Sources," Autonomous Agents and Multi-Agent Systems, vol.4, nos. 1/2, pp. 93-114, 2001.
-
(2001)
Autonomous Agents and Multi-Agent Systems
, vol.4
, Issue.1-2
, pp. 93-114
-
-
Muslea, I.1
Minton, S.2
Knoblock, C.A.3
-
9
-
-
33749623896
-
Simultaneous record detection and attribute labeling in web data extraction
-
J. Zhu, Z. Nie, J.-R. Wen, B. Zhang, and W.-Y. Ma, "Simultaneous Record Detection and Attribute Labeling in Web Data Extraction," Proc. Int'l Conf. Knowledge Discovery and Data Mining (KDD), pp. 494-503, 2006.
-
(2006)
Proc. Int'l Conf. Knowledge Discovery and Data Mining (KDD)
, pp. 494-503
-
-
Zhu, J.1
Nie, Z.2
Wen, J.-R.3
Zhang, B.4
Ma, W.-Y.5
-
10
-
-
35348878636
-
Web object retrieval
-
Z. Nie, Y. Ma, S. Shi, J.-R. Wen, and W.-Y. Ma, "Web Object Retrieval," Proc. Conf. World Wide Web (WWW), pp. 81-90, 2007.
-
(2007)
Proc. Conf. World Wide Web (WWW)
, pp. 81-90
-
-
Nie, Z.1
Ma, Y.2
Shi, S.3
Wen, J.-R.4
Ma, W.-Y.5
-
11
-
-
36849066312
-
Webpage understanding: An integrated approach
-
J. Zhu, B. Zhang, Z. Nie, J.-R. Wen, and H.-W. Hon, "Webpage Understanding: An Integrated Approach," Proc. Int'l Conf. Knowledge Discovery and Data Mining (KDD), pp. 903-912, 2007.
-
(2007)
Proc. Int'l Conf. Knowledge Discovery and Data Mining (KDD)
, pp. 903-912
-
-
Zhu, J.1
Zhang, B.2
Nie, Z.3
Wen, J.-R.4
Hon, H.-W.5
-
16
-
-
8644267730
-
Block-based web search
-
D. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma, "Block-Based Web Search," Proc. ACM SIGIR, pp. 456-463, 2004.
-
(2004)
Proc. ACM SIGIR
, pp. 456-463
-
-
Cai, D.1
Yu, S.2
Wen, J.-R.3
Ma, W.-Y.4
-
17
-
-
85042021254
-
Iepad: Information extraction based on pattern discovery
-
C.-H. Chang and S.-C. Lui, "Iepad: Information Extraction Based on Pattern Discovery," Proc. Conf. World Wide Web (WWW), pp. 681-688, 2001.
-
(2001)
Proc. Conf. World Wide Web (WWW)
, pp. 681-688
-
-
Chang, C.-H.1
Lui, S.-C.2
-
18
-
-
84944327150
-
Roadrunner: Towards automatic data extraction from large web sites
-
V. Crescenzi, G. Mecca, and P. Merialdo, "Roadrunner: Towards Automatic Data Extraction from Large Web Sites," Proc. Conf. Very Large Data Bases (VLDB), pp. 109-118, 2001.
-
(2001)
Proc. Conf. Very Large Data Bases (VLDB)
, pp. 109-118
-
-
Crescenzi, V.1
Mecca, G.2
Merialdo, P.3
-
19
-
-
33744899132
-
Fully automatic wrapper generation for search engines
-
H. Zhao, W. Meng, Z. Wu, V. Raghavan, and C.T. Yu, "Fully Automatic Wrapper Generation for Search Engines," Proc. Conf. World Wide Web (WWW), pp. 66-75, 2005.
-
(2005)
Proc. Conf. World Wide Web (WWW)
, pp. 66-75
-
-
Zhao, H.1
Meng, W.2
Wu, Z.3
Raghavan, V.4
Yu, C.T.5
-
20
-
-
3142742483
-
Using the structure of web sites for automatic segmentation of tables
-
K. Lerman, L. Getoor, S. Minton, and C.A. Knoblock, "Using the Structure of Web Sites for Automatic Segmentation of Tables," Proc. ACM SIGMOD, pp. 119-130, 2004.
-
(2004)
Proc. ACM SIGMOD
, pp. 119-130
-
-
Lerman, K.1
Getoor, L.2
Minton, S.3
Knoblock, C.A.4
-
21
-
-
33744821948
-
Web data extraction based on partial tree alignment
-
Y. Zhai and B. Liu, "Web Data Extraction Based on Partial Tree Alignment," Proc. Conf. World Wide Web (WWW), pp. 76-85, 2005.
-
(2005)
Proc. Conf. World Wide Web (WWW)
, pp. 76-85
-
-
Zhai, Y.1
Liu, B.2
-
22
-
-
33750797710
-
Structured data extraction from the web based on partial tree alignment
-
Dec.
-
Y. Zhai and B. Liu, "Structured Data Extraction from the Web Based on Partial Tree Alignment," IEEE Trans. Knowledge and Data Eng., vol.18, no.12, pp. 1614-1628, Dec. 2006.
-
(2006)
IEEE Trans. Knowledge and Data Eng.
, vol.18
, Issue.12
, pp. 1614-1628
-
-
Zhai, Y.1
Liu, B.2
-
23
-
-
18744381159
-
Learning block importance models for web pages
-
R. Song, H. Liu, J.-R. Wen, and W.-Y. Ma, "Learning Block Importance Models for Web Pages," Proc. Conf. World Wide Web (WWW), pp. 203-211, 2004.
-
(2004)
Proc. Conf. World Wide Web (WWW)
, pp. 203-211
-
-
Song, R.1
Liu, H.2
Wen, J.-R.3
Ma, W.-Y.4
-
24
-
-
0142192295
-
Conditional random fields: Probabilistic models for segmenting and labeling sequence data
-
J.D. Lafferty, A. McCallum, and F.C.N. Pereira, "Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data," Proc. Int'l Conf. Machine Learning (ICML), pp. 282- 289, 2001.
-
(2001)
Proc. Int'l Conf. Machine Learning (ICML)
, pp. 282-289
-
-
Lafferty, J.D.1
McCallum, A.2
Pereira, F.C.N.3
-
25
-
-
85075430216
-
Chinese named entity recognition with conditional probabilistic models
-
A. Chen, F. Peng, R. Shan, and G. Sun, "Chinese Named Entity Recognition with Conditional Probabilistic Models," Proc. Fifth SIGHAN Workshop Chinese Language Processing, pp. 173-176, 2006.
-
(2006)
Proc. Fifth SIGHAN Workshop Chinese Language Processing
, pp. 173-176
-
-
Chen, A.1
Peng, F.2
Shan, R.3
Sun, G.4
-
29
-
-
33646887390
-
On the limited memory bfgs method for large scale optimization
-
D.C. Liu and J. Nocedal, "On the Limited Memory bfgs Method for Large Scale Optimization," Math. Programming, vol.45, no.3, pp. 503-528, 1989.
-
(1989)
Math. Programming
, vol.45
, Issue.3
, pp. 503-528
-
-
Liu, D.C.1
Nocedal, J.2
-
30
-
-
17644423946
-
Unsupervised named- entity extraction from the web: An experimental study
-
O. Etzioni, M.J. Cafarella, D. Downey, A.-M. Popescu, T. Shaked, S. Soderland, D.S. Weld, and A. Yates, "Unsupervised Named- Entity Extraction from the Web: An Experimental Study," Artificial Intelligence, vol.165, no.1, pp. 91-134, 2005.
-
(2005)
Artificial Intelligence
, vol.165
, Issue.1
, pp. 91-134
-
-
Etzioni, O.1
Cafarella, M.J.2
Downey, D.3
Popescu, A.-M.4
Shaked, T.5
Soderland, S.6
Weld, D.S.7
Yates, A.8
-
31
-
-
84880862059
-
Locating complex named entities in web text
-
D. Downey, M. Broadhead, and O. Etzioni, "Locating Complex Named Entities in Web Text," Proc. Int'l Joint Conf. Artificial Intelligence (IJCAI), pp. 2733-2739, 2007.
-
(2007)
Proc. Int'l Joint Conf. Artificial Intelligence (IJCAI)
, pp. 2733-2739
-
-
Downey, D.1
Broadhead, M.2
Etzioni, O.3
|