-
1
-
-
77954734166
-
-
Open Provenance Model. http://twiki.ipaw.info/bin/view/Challenge/OPM, 2009.
-
(2009)
Open Provenance Model
-
-
-
2
-
-
0033652943
-
Snowball: Extracting relations from large plain-text collections
-
E. Agichtein and L. Gravano. Snowball: Extracting relations from large plain-text collections. In DL, 2000.
-
(2000)
DL
-
-
Agichtein, E.1
Gravano, L.2
-
4
-
-
0003146263
-
Extracting patterns and relations from the world wide web
-
S. Brin. Extracting patterns and relations from the world wide web. In WebDB, 1998.
-
(1998)
WebDB
-
-
Brin, S.1
-
6
-
-
35348882429
-
Structured querying of web text: A technical challenge
-
M. J. Cafarella, C. Re, D. Suciu, O. Etzioni, and M. Banko. Structured querying of web text: A technical challenge. In Proceedings of CIDR-07, 2007.
-
Proceedings of CIDR-07, 2007
-
-
Cafarella, M.J.1
Re, C.2
Suciu, D.3
Etzioni, O.4
Banko, M.5
-
7
-
-
0013066045
-
Relational learning of pattern-match rules for information extraction
-
M. E. Califf and R. J. Mooney. Relational learning of pattern-match rules for information extraction. In IAAI, 1999.
-
(1999)
IAAI
-
-
Califf, M.E.1
Mooney, R.J.2
-
9
-
-
52649111581
-
Efficient information extraction over evolving text data
-
F. Chen, A. Doan, J. Yang, and R. Ramakrishnan. Efficient information extraction over evolving text data. In ICDE, 2008.
-
(2008)
ICDE
-
-
Chen, F.1
Doan, A.2
Yang, J.3
Ramakrishnan, R.4
-
11
-
-
0004116989
-
-
MIT Press and McGraw-Hill, 2nd edition
-
T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein. Introduction to Algorithms. MIT Press and McGraw-Hill, 2nd edition, 2001.
-
(2001)
Introduction to Algorithms
-
-
Cormen, T.H.1
Leiserson, C.E.2
Rivest, R.L.3
Stein, C.4
-
12
-
-
0038546767
-
Lineage tracing for general data warehouse transformations
-
Y. Cui and J. Widom. Lineage tracing for general data warehouse transformations. VLDB Journal, 12(1), 2003.
-
(2003)
VLDB Journal
, vol.12
, Issue.1
-
-
Cui, Y.1
Widom, J.2
-
14
-
-
17644423946
-
Unsupervised named-entity extraction from the web: An experimental study
-
O. Etzioni, M. Cafarella, D. Downey, A.-M. Popescu, T. Shaked, S. Soderland, D. S. Weld, and A. Yates. Unsupervised named-entity extraction from the web: an experimental study. Artif. Intell., 165(1):91-134, 2005.
-
(2005)
Artif. Intell.
, vol.165
, Issue.1
, pp. 91-134
-
-
Etzioni, O.1
Cafarella, M.2
Downey, D.3
Popescu, A.-M.4
Shaked, T.5
Soderland, S.6
Weld, D.S.7
Yates, A.8
-
15
-
-
32144454462
-
Web-scale information extraction in KnowItAll (preliminary results)
-
O. Etzioni, M. J. Cafarella, D. Downey, S. Kok, A.-M. Popescu, T. Shaked, S. Soderland, D. S. Weld, and A. Yates. Web-scale information extraction in KnowItAll (preliminary results). In Proceedings of WWW-04, 2004.
-
Proceedings of WWW-04, 2004
-
-
Etzioni, O.1
Cafarella, M.J.2
Downey, D.3
Kok, S.4
Popescu, A.-M.5
Shaked, T.6
Soderland, S.7
Weld, D.S.8
Yates, A.9
-
18
-
-
85027780074
-
Curating probabilistic databases from information extraction models
-
R. Gupta and S. Sarawagi. Curating probabilistic databases from information extraction models. In VLDB, 2006.
-
(2006)
VLDB
-
-
Gupta, R.1
Sarawagi, S.2
-
19
-
-
0012990385
-
Automatic acquisition of hyponyms from large text corpora
-
Association for Computational Linguistics
-
M. A. Hearst. Automatic acquisition of hyponyms from large text corpora. In Proceedings of COLING-92. Association for Computational Linguistics, 1992.
-
(1992)
Proceedings of COLING-92
-
-
Hearst, M.A.1
-
21
-
-
63749110502
-
On the provenance of non-answers to queries over extracted data
-
J. Huang, T. Chen, A. Doan, and J. F. Naughton. On the provenance of non-answers to queries over extracted data. PVLDB, 1(1), 2008.
-
(2008)
PVLDB
, vol.1
, Issue.1
-
-
Huang, J.1
Chen, T.2
Doan, A.3
Naughton, J.F.4
-
23
-
-
36949038529
-
Towards a query optimizer for text-centric tasks
-
Dec.
-
P. G. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano. Towards a query optimizer for text-centric tasks. ACM Transactions on Database Systems, 32(4), Dec. 2007.
-
(2007)
ACM Transactions on Database Systems
, vol.32
, Issue.4
-
-
Ipeirotis, P.G.1
Agichtein, E.2
Jain, P.3
Gravano, L.4
-
24
-
-
52649126147
-
Optimizing SQL queries over text databases
-
A. Jain, A. Doan, and L. Gravano. Optimizing SQL queries over text databases. In ICDE, 2008.
-
(2008)
ICDE
-
-
Jain, A.1
Doan, A.2
Gravano, L.3
-
25
-
-
77954713025
-
-
Technical Report CeDER-08-04, New York University
-
A. Jain, P. G. Ipeirotis, A. Doan, and L. Gravano. Join optimization of information extraction output: Quality matters! Technical Report CeDER-08-04, New York University, 2008.
-
(2008)
Join Optimization of Information Extraction Output: Quality Matters!
-
-
Jain, A.1
Ipeirotis, P.G.2
Doan, A.3
Gravano, L.4
-
26
-
-
67649663909
-
Exploring a few good tuples from text databases
-
A. Jain and D. Srivastava. Exploring a few good tuples from text databases. In ICDE, 2009.
-
(2009)
ICDE
-
-
Jain, A.1
Srivastava, D.2
-
27
-
-
33749629820
-
A system for integrating unstructured data into relational databases
-
I. Mansuri and S. Sarawagi. A system for integrating unstructured data into relational databases. In ICDE, 2006.
-
(2006)
ICDE
-
-
Mansuri, I.1
Sarawagi, S.2
-
28
-
-
70350607224
-
Names and similarities on the web: Fact extraction in the fast lane
-
M. Paşca, D. Lin, J. Bigham, A. Lifchits, and A. Jain. Names and similarities on the web: Fact extraction in the fast lane. In Proceedings of ACL06, July 2006.
-
Proceedings of ACL06, July 2006
-
-
Paşca, M.1
Lin, D.2
Bigham, J.3
Lifchits, A.4
Jain, A.5
-
29
-
-
33750687858
-
Organizing and searching the world wide web of facts - Step one: The one-million fact extraction challenge
-
M. Paşca, D. Lin, J. Bigham, A. Lifchits, and A. Jain. Organizing and searching the world wide web of facts - step one: The one-million fact extraction challenge. In Proceedings of AAAI-06, 2006.
-
Proceedings of AAAI-06, 2006
-
-
Paşca, M.1
Lin, D.2
Bigham, J.3
Lifchits, A.4
Jain, A.5
-
30
-
-
84860502098
-
Espresso: Leveraging generic patterns for automatically harvesting semantic relations
-
P. Pantel and M. Pennacchiotti. Espresso: leveraging generic patterns for automatically harvesting semantic relations. In Proc. of ACL, 2006.
-
Proc. of ACL, 2006
-
-
Pantel, P.1
Pennacchiotti, M.2
-
31
-
-
79952758877
-
Approximate lineage for probabilistic databases
-
C. Re and D. Suciu. Approximate lineage for probabilistic databases. In Proc. of VLDB, 2008.
-
Proc. of VLDB, 2008
-
-
Re, C.1
Suciu, D.2
-
32
-
-
0032596539
-
Learning dictionaries for information extraction by multi-level bootstrapping
-
E. Riloff and R. Jones. Learning dictionaries for information extraction by multi-level bootstrapping. In Proceedings of AAAI-99, 1999.
-
Proceedings of AAAI-99, 1999
-
-
Riloff, E.1
Jones, R.2
-
33
-
-
42449132132
-
Provenance in Databases: Past, Current, and Future
-
W.-C. Tan. Provenance in Databases: Past, Current, and Future. IEEE Data Engineering Bulletin, 2008.
-
(2008)
IEEE Data Engineering Bulletin
-
-
Tan, W.-C.1
-
34
-
-
0030682362
-
Supporting fine-grained data lineage in a database visualization environment
-
A. Woodruff and M. Stonebraker. Supporting fine-grained data lineage in a database visualization environment. In Proc. of ICDE, pages 91-102, 1997.
-
(1997)
Proc. of ICDE
, pp. 91-102
-
-
Woodruff, A.1
Stonebraker, M.2
|