-
1
-
-
12244298488
-
-
E. Agichtein, V. Ganti, Mining reference tables for automatic text segmentation, in: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2004, pp. 20-29.
-
E. Agichtein, V. Ganti, Mining reference tables for automatic text segmentation, in: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2004, pp. 20-29.
-
-
-
-
2
-
-
84880902141
-
-
M. Banko, M. Cafarella, S. Soderland, M. Broadhead, O. Etzioni, Open information extraction from the web, in: Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI), 2007, pp. 2670-2676.
-
M. Banko, M. Cafarella, S. Soderland, M. Broadhead, O. Etzioni, Open information extraction from the web, in: Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI), 2007, pp. 2670-2676.
-
-
-
-
3
-
-
56249130480
-
-
R. Bunescu, R. Mooney, Collective information extraction with relational markov networks, in: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004, pp. 439-446.
-
R. Bunescu, R. Mooney, Collective information extraction with relational markov networks, in: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004, pp. 439-446.
-
-
-
-
4
-
-
85042021254
-
-
C. Chang, S.C. Lui, IEPAD: information extraction based on pattern discovery, in: Proceedings of the 10th International Conference on World Wide Web (WWW), 2001, pp. 681-688.
-
C. Chang, S.C. Lui, IEPAD: information extraction based on pattern discovery, in: Proceedings of the 10th International Conference on World Wide Web (WWW), 2001, pp. 681-688.
-
-
-
-
5
-
-
33748336500
-
A survey of web information extraction systems
-
Chang C.-H., Kayed M., Girgis M.R., and Shaalan K.F. A survey of web information extraction systems. IEEE Transactions on Knowledge and Data Engineering 18 10 (2006) 1411-1428
-
(2006)
IEEE Transactions on Knowledge and Data Engineering
, vol.18
, Issue.10
, pp. 1411-1428
-
-
Chang, C.-H.1
Kayed, M.2
Girgis, M.R.3
Shaalan, K.F.4
-
6
-
-
8644243246
-
-
S. Chapman, A. Dingli, F. Ciravegna, Armadillo: harvesting information for the semantic web, in: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2004, p. 598.
-
S. Chapman, A. Dingli, F. Ciravegna, Armadillo: harvesting information for the semantic web, in: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2004, p. 598.
-
-
-
-
7
-
-
85011016482
-
-
S.-L. Chuang, K. Chang, C. Zhai, Context-aware wrapping: synchronized data extraction, in: Proceedings of the 33rd Very Large Databases Conference (VLDB), 2007, pp. 699-710.
-
S.-L. Chuang, K. Chang, C. Zhai, Context-aware wrapping: synchronized data extraction, in: Proceedings of the 33rd Very Large Databases Conference (VLDB), 2007, pp. 699-710.
-
-
-
-
8
-
-
84880859303
-
-
2 an adaptive algorithm for information extraction from web-related texts, in: Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI), 2001, pp. 1251-1256.
-
2 an adaptive algorithm for information extraction from web-related texts, in: Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI), 2001, pp. 1251-1256.
-
-
-
-
9
-
-
77953046656
-
-
W. Cohen, M. Hurst, L. Jensen, A flexible learning system for wrapping tables and lists in HTML documents, in: Proceedings of the 11th International World Wide Web Conference (WWW), 2002, pp. 232-241.
-
W. Cohen, M. Hurst, L. Jensen, A flexible learning system for wrapping tables and lists in HTML documents, in: Proceedings of the 11th International World Wide Web Conference (WWW), 2002, pp. 232-241.
-
-
-
-
10
-
-
12344333240
-
Automatic information extraction from large websites
-
Crescenzi V., and Mecca G. Automatic information extraction from large websites. Journal of the ACM 51 5 (2004) 731-779
-
(2004)
Journal of the ACM
, vol.51
, Issue.5
, pp. 731-779
-
-
Crescenzi, V.1
Mecca, G.2
-
11
-
-
84944327150
-
-
V. Crescenzi, G. Mecca, P. Merialdo, ROADRUNNER: towards automatic data extraction from large web sites, in: Proceedings of the 27th Very Large Databases Conference (VLDB), 2001, pp. 109-118.
-
V. Crescenzi, G. Mecca, P. Merialdo, ROADRUNNER: towards automatic data extraction from large web sites, in: Proceedings of the 27th Very Large Databases Conference (VLDB), 2001, pp. 109-118.
-
-
-
-
13
-
-
84858373635
-
-
A. Culotta, A. McCallum, J. Betz, Integrating probabilistic extraction models and data mining to discover relations and patterns in text, in: Proceedings of the Main Conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006, pp. 296-303.
-
A. Culotta, A. McCallum, J. Betz, Integrating probabilistic extraction models and data mining to discover relations and patterns in text, in: Proceedings of the Main Conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006, pp. 296-303.
-
-
-
-
14
-
-
0033225222
-
Conceptual-model-based data extraction from multiple-record Web pages
-
Embley D., Campbell D., Jiang Y., Liddle S., Lonsdale D., Ng Y.-K., and Smith R. Conceptual-model-based data extraction from multiple-record Web pages. Data and Knowledge Engineering 33 3 (1999) 227-251
-
(1999)
Data and Knowledge Engineering
, vol.33
, Issue.3
, pp. 227-251
-
-
Embley, D.1
Campbell, D.2
Jiang, Y.3
Liddle, S.4
Lonsdale, D.5
Ng, Y.-K.6
Smith, R.7
-
15
-
-
17644423946
-
Unsupservised named-entity extraction from the web: an experimental study
-
Etzioni O., Cafarella M., Kok S., Popescu A., Shaked T., Soderland S., Weld D., and Yates A. Unsupservised named-entity extraction from the web: an experimental study. Artificial Intelligence 165 1 (2005) 91-134
-
(2005)
Artificial Intelligence
, vol.165
, Issue.1
, pp. 91-134
-
-
Etzioni, O.1
Cafarella, M.2
Kok, S.3
Popescu, A.4
Shaked, T.5
Soderland, S.6
Weld, D.7
Yates, A.8
-
16
-
-
14544291427
-
Mining interesting knowledge from weblogs: a survey
-
Facca F., and Lanzi P. Mining interesting knowledge from weblogs: a survey. Data and Knowledge Engineering 53 3 (2005) 225-241
-
(2005)
Data and Knowledge Engineering
, vol.53
, Issue.3
, pp. 225-241
-
-
Facca, F.1
Lanzi, P.2
-
17
-
-
33846024370
-
Exploiting structural similarity for effective web information extraction
-
Flesca S., Manco G., Masciari E., Pontieri L., and Pugliese A. Exploiting structural similarity for effective web information extraction. Data and Knowledge Engineering 60 1 (2007) 222-234
-
(2007)
Data and Knowledge Engineering
, vol.60
, Issue.1
, pp. 222-234
-
-
Flesca, S.1
Manco, G.2
Masciari, E.3
Pontieri, L.4
Pugliese, A.5
-
18
-
-
56249115624
-
-
D. Freitag, A. McCallum, Information extraction with HMM structures learned by stochastic optimization, in: Proceedings of the 17th National Conference on Artificial Intelligence (AAAI), 2000, pp. 584-589.
-
D. Freitag, A. McCallum, Information extraction with HMM structures learned by stochastic optimization, in: Proceedings of the 17th National Conference on Artificial Intelligence (AAAI), 2000, pp. 584-589.
-
-
-
-
19
-
-
32344439113
-
-
R. Ghani, Price prediction and insurance for online auctions, in: Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2005, pp. 411-418.
-
R. Ghani, Price prediction and insurance for online auctions, in: Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2005, pp. 411-418.
-
-
-
-
20
-
-
56249108129
-
-
R. Ghani, H. Simmons, Predicting the end-price of online auctions, in: Proceedings of the International Workshop on Data Mining and Adaptive Modeling Methods for Economics and Management, 2004.
-
R. Ghani, H. Simmons, Predicting the end-price of online auctions, in: Proceedings of the International Workshop on Data Mining and Adaptive Modeling Methods for Economics and Management, 2004.
-
-
-
-
21
-
-
84859881704
-
-
T. Grenager, D. Klein, C. Manning, Unsupervised learning of field segmentation models for information extraction, in: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, 2005, pp. 371-378.
-
T. Grenager, D. Klein, C. Manning, Unsupervised learning of field segmentation models for information extraction, in: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, 2005, pp. 371-378.
-
-
-
-
22
-
-
33748195920
-
Sampling, information extraction and summarisation of hidden web databases
-
Hedley Y.-L., Younas M., James A., and Sanderson M. Sampling, information extraction and summarisation of hidden web databases. Data and Knowledge Engineering 59 2 (2006) 213-230
-
(2006)
Data and Knowledge Engineering
, vol.59
, Issue.2
, pp. 213-230
-
-
Hedley, Y.-L.1
Younas, M.2
James, A.3
Sanderson, M.4
-
23
-
-
12244305149
-
-
M. Hu, B. Liu, Mining and summarizing customer reviews, in: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2004, pp. 168-177.
-
M. Hu, B. Liu, Mining and summarizing customer reviews, in: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2004, pp. 168-177.
-
-
-
-
25
-
-
33747058044
-
Information extraction from structured documents using k-testable tree automaton inference
-
Kosala R., Blockeel H., Bruynooghe M., and Van den Bussche J. Information extraction from structured documents using k-testable tree automaton inference. Data and Knowledge Engineering 58 2 (2006) 129-158
-
(2006)
Data and Knowledge Engineering
, vol.58
, Issue.2
, pp. 129-158
-
-
Kosala, R.1
Blockeel, H.2
Bruynooghe, M.3
Van den Bussche, J.4
-
27
-
-
56249093535
-
-
N. Kushmerick, B. Grace, The wrapper induction environment, in: Proceedings of the Workshop on Software Tools for Developing Agents (AAAI), 1998, pp. 131-132.
-
N. Kushmerick, B. Grace, The wrapper induction environment, in: Proceedings of the Workshop on Software Tools for Developing Agents (AAAI), 1998, pp. 131-132.
-
-
-
-
28
-
-
23144437876
-
-
N. Kushmerick, B. Thomas, Adaptive information extraction: core technologies for information agents, in: Intelligent Information Agents R&D in Europe: An AgentLink Perspective, 2002, pp. 79-103.
-
N. Kushmerick, B. Thomas, Adaptive information extraction: core technologies for information agents, in: Intelligent Information Agents R&D in Europe: An AgentLink Perspective, 2002, pp. 79-103.
-
-
-
-
29
-
-
56249087289
-
-
J. Lafferty, A. McCallum, F. Pereira, Conditional random fields: probabilistic models for segmenting and labeling sequence data, in: Proceedings of 18th International Conference on Machine Learning (ICML), 2001, pp. 282-289.
-
J. Lafferty, A. McCallum, F. Pereira, Conditional random fields: probabilistic models for segmenting and labeling sequence data, in: Proceedings of 18th International Conference on Machine Learning (ICML), 2001, pp. 282-289.
-
-
-
-
30
-
-
77952333945
-
-
B. Liu, R. Grossman, Y. Zhai, Mining data records in web pages, in: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2003, pp. 601-606.
-
B. Liu, R. Grossman, Y. Zhai, Mining data records in web pages, in: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2003, pp. 601-606.
-
-
-
-
31
-
-
56249088348
-
-
B. Liu, M. Hu, J. Cheng, Opinion observer: analyzing and comparing opinions on the web, in: Proceedings of the 11th International World Wide Web Conference (WWW), 2005, pp. 342-351.
-
B. Liu, M. Hu, J. Cheng, Opinion observer: analyzing and comparing opinions on the web, in: Proceedings of the 11th International World Wide Web Conference (WWW), 2005, pp. 342-351.
-
-
-
-
32
-
-
56249136619
-
-
A. McCallum, D. Jensen, A note on the unification of information extraction and data mining using conditional-probability, relational models, in: Proceedings of the IJCAI Workshop on Learning Statistical Models from Relational Data, 2003.
-
A. McCallum, D. Jensen, A note on the unification of information extraction and data mining using conditional-probability, relational models, in: Proceedings of the IJCAI Workshop on Learning Statistical Models from Relational Data, 2003.
-
-
-
-
33
-
-
56249089022
-
-
A. McCallum, B. Wellner, Toward conditional models of identity uncertainty with application to proper noun coreference, in: Proceedings of the IJCAI Workshop on Information Integration on the Web, 2003.
-
A. McCallum, B. Wellner, Toward conditional models of identity uncertainty with application to proper noun coreference, in: Proceedings of the IJCAI Workshop on Information Integration on the Web, 2003.
-
-
-
-
34
-
-
0242540451
-
-
S. Morinaga, K. Yamanishi, K. Tateishi, T. Fukushima, Mining product reputation on the Web, in: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2002, pp. 341-349.
-
S. Morinaga, K. Yamanishi, K. Tateishi, T. Fukushima, Mining product reputation on the Web, in: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD), 2002, pp. 341-349.
-
-
-
-
35
-
-
56249092847
-
-
K. Murphy, Y. Weiss, M. Jordan, Loopy belief propagation for approximate inference: an empirical study, in: Proceedings of the 15th Annual Conference on Uncertainty in Artificial Intelligence (UAI), 1999, pp. 467-475.
-
K. Murphy, Y. Weiss, M. Jordan, Loopy belief propagation for approximate inference: an empirical study, in: Proceedings of the 15th Annual Conference on Uncertainty in Artificial Intelligence (UAI), 1999, pp. 467-475.
-
-
-
-
37
-
-
80053270803
-
-
A. Popescu, O. Etzioni, Extracting product features and opinions from reviews, in: Proceedings of the Human Language Technology Conference on Empirical Methods in Natural Language Processing, 2005, pp. 339-346.
-
A. Popescu, O. Etzioni, Extracting product features and opinions from reviews, in: Proceedings of the Human Language Technology Conference on Empirical Methods in Natural Language Processing, 2005, pp. 339-346.
-
-
-
-
38
-
-
84880915291
-
-
K. Probst, M.K.R. Ghai, A. Fano, Y. Liu, Semi-supervised learning of attribute-value pairs from product descriptions, in: Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI), 2007, pp. 2838-2843.
-
K. Probst, M.K.R. Ghai, A. Fano, Y. Liu, Semi-supervised learning of attribute-value pairs from product descriptions, in: Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI), 2007, pp. 2838-2843.
-
-
-
-
39
-
-
56249091145
-
-
F. Sha, F. Pereira, Shallow parsing with conditional random fields, in: Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), 2003, pp. 213-220.
-
F. Sha, F. Pereira, Shallow parsing with conditional random fields, in: Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), 2003, pp. 213-220.
-
-
-
-
41
-
-
84939181118
-
-
S. Tatikonda, Convergence of the sum-product algorithm, in: Proceedings of the 2003 IEEE Information Theory Workshop, 2003, pp. 222-225.
-
S. Tatikonda, Convergence of the sum-product algorithm, in: Proceedings of the 2003 IEEE Information Theory Workshop, 2003, pp. 222-225.
-
-
-
-
42
-
-
84885677547
-
-
P. Viola, M. Narasimhan, Learning to extract information from semi-structured text using a discriminative context free grammar, in: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2005, pp. 330-337.
-
P. Viola, M. Narasimhan, Learning to extract information from semi-structured text using a discriminative context free grammar, in: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2005, pp. 330-337.
-
-
-
-
43
-
-
56249146748
-
-
B. Wellner, A. McCallum, F. Peng, M. Hay, An integrated, conditional model of information extraction and coreference with application to citation matching, in: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence (UAI), 2004, pp. 593-601.
-
B. Wellner, A. McCallum, F. Peng, M. Hay, An integrated, conditional model of information extraction and coreference with application to citation matching, in: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence (UAI), 2004, pp. 593-601.
-
-
-
-
44
-
-
19544378318
-
-
T.L. Wong, W. Lam, A probabilistic approach for adapting information extraction wrappers and discovering new attributes, in: Proceedings of the 2004 IEEE International Conference on Data Mining (ICDM), 2004, pp. 257-264.
-
T.L. Wong, W. Lam, A probabilistic approach for adapting information extraction wrappers and discovering new attributes, in: Proceedings of the 2004 IEEE International Conference on Data Mining (ICDM), 2004, pp. 257-264.
-
-
-
-
45
-
-
2942587187
-
-
T.L. Wong, W. Lam, Text mining from site invariant and dependent features for information extraction knowledge adaptation, in: Proceedings of the 2004 SIAM International Conference on Data Mining (SDM), 2004, pp. 45-56.
-
T.L. Wong, W. Lam, Text mining from site invariant and dependent features for information extraction knowledge adaptation, in: Proceedings of the 2004 SIAM International Conference on Data Mining (SDM), 2004, pp. 45-56.
-
-
-
-
46
-
-
33745441264
-
-
T.L. Wong, W. Lam, Hot item mining and summarization from multiple auction web sites, in: Proceedings of the Fifth IEEE International Conference on Data Mining (ICDM), 2005, pp. 797-800.
-
T.L. Wong, W. Lam, Hot item mining and summarization from multiple auction web sites, in: Proceedings of the Fifth IEEE International Conference on Data Mining (ICDM), 2005, pp. 797-800.
-
-
-
-
47
-
-
33745454259
-
-
T.L. Wong, W. Lam, S.K. Chan, Collaborative information extraction and mining from multiple web documents, in: Proceedings of the 2006 SIAM International Conference on Data Mining (SDM), 2006, pp. 440-450.
-
T.L. Wong, W. Lam, S.K. Chan, Collaborative information extraction and mining from multiple web documents, in: Proceedings of the 2006 SIAM International Conference on Data Mining (SDM), 2006, pp. 440-450.
-
-
-
-
48
-
-
33745773725
-
-
T.L. Wong, W. Lam, S.K. Chan, Extracting and summarizing hot item features across different auction web sites, in: Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2006, pp. 334-345.
-
T.L. Wong, W. Lam, S.K. Chan, Extracting and summarizing hot item features across different auction web sites, in: Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2006, pp. 334-345.
-
-
-
-
49
-
-
56249097304
-
-
World Wide Web Consortium (W3C), Semantic web, 2001. .
-
World Wide Web Consortium (W3C), Semantic web, 2001. .
-
-
-
|