-
2
-
-
77952284211
-
Techniques for efficiently querying scientific workflow provenance graphs
-
M. K. Anand, S. Bowers, and B. Ludäscher. Techniques for efficiently querying scientific workflow provenance graphs. In EDBT, 2010.
-
(2010)
EDBT
-
-
Anand, M.K.1
Bowers, S.2
Ludäscher, B.3
-
3
-
-
80053556022
-
-
Apache. Hadoop. http://hadoop.apache.org/.
-
-
-
-
4
-
-
84872769431
-
-
Apache. Hadoop cluster setup. http://hadoop.apache.org/common/docs/r0.21. 0/cluster setup.html.
-
Hadoop Cluster Setup
-
-
-
5
-
-
84894204419
-
-
Apache. Mapreduce tutorial. http://hadoop.apache.org/mapreduce/docs/r0. 21.0/mapred tutorial.html.
-
Mapreduce Tutorial
-
-
-
6
-
-
77952788465
-
-
Apache. Pigmix benchmarks. http://wiki.apache.org/pig/PigMix.
-
Pigmix Benchmarks
-
-
-
7
-
-
52649110562
-
Querying and managing provenance through user views in scientific workflows
-
O. Biton, S. Cohen-Boulakia, S. B. Davidson, and C. S. Hara. Querying and managing provenance through user views in scientific workflows. In ICDE, 2008.
-
(2008)
ICDE
-
-
Biton, O.1
Cohen-Boulakia, S.2
Davidson, S.B.3
Hara, C.S.4
-
8
-
-
24344453002
-
Lineage retrieval for scientific data processing: A survey
-
R. Bose and J. Frew. Lineage retrieval for scientific data processing: a survey. ACM Comput. Surv., 37(1), 2005.
-
(2005)
ACM Comput. Surv.
, vol.37
, Issue.1
-
-
Bose, R.1
Frew, J.2
-
9
-
-
77954727236
-
FlumeJava: Easy, efficient data-parallel pipelines
-
C. Chambers, A. Raniwala, F. Perry, S. Adams, R. R. Henry, R. Bradshaw, and N. Weizenbaum. FlumeJava: Easy, efficient data-parallel pipelines. In PLDI, 2010.
-
(2010)
PLDI
-
-
Chambers, C.1
Raniwala, A.2
Perry, F.3
Adams, S.4
Henry, R.R.5
Bradshaw, R.6
Weizenbaum, N.7
-
11
-
-
77951896306
-
Provenance in databases: Why, how, and where
-
J. Cheney, L. Chiticariu, and W.-C. Tan. Provenance in databases: Why, how, and where. Foundations and Trends in Databases, 1(4), 2009.
-
(2009)
Foundations and Trends in Databases
, vol.1
, Issue.4
-
-
Cheney, J.1
Chiticariu, L.2
Tan, W.-C.3
-
12
-
-
0038546767
-
Lineage tracing for general data warehouse transformations
-
Y. Cui and J. Widom. Lineage tracing for general data warehouse transformations. The VLDB Journal, 12(1), 2003.
-
(2003)
The VLDB Journal
, vol.12
, Issue.1
-
-
Cui, Y.1
Widom, J.2
-
13
-
-
0000278897
-
Tracing the lineage of view data in a warehousing environment
-
Y. Cui, J. Widom, and J. L. Wiener. Tracing the lineage of view data in a warehousing environment. ACM TODS, 25(2), 2000.
-
(2000)
ACM TODS
, vol.25
, Issue.2
-
-
Cui, Y.1
Widom, J.2
Wiener, J.L.3
-
14
-
-
57149126952
-
Provenance and scientific workflows: Challenges and opportunities
-
S. B. Davidson and J. Freire. Provenance and scientific workflows: challenges and opportunities. In SIGMOD, 2008.
-
(2008)
SIGMOD
-
-
Davidson, S.B.1
Freire, J.2
-
15
-
-
85030321143
-
MapReduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. In OSDI, 2004.
-
(2004)
OSDI
-
-
Dean, J.1
Ghemawat, S.2
-
17
-
-
57149123932
-
Efficient lineage tracking for scientific workflows
-
T. Heinis and G. Alonso. Efficient lineage tracking for scientific workflows. In SIGMOD, 2008.
-
(2008)
SIGMOD
-
-
Heinis, T.1
Alonso, G.2
-
19
-
-
77957912417
-
Data lineage model for Taverna workflows with lightweight annotation requirements
-
P. Missier, K. Belhajjame, J. Zhao, M. Roos, and C. Goble. Data lineage model for Taverna workflows with lightweight annotation requirements. In IPAW, 2008.
-
(2008)
IPAW
-
-
Missier, P.1
Belhajjame, K.2
Zhao, J.3
Roos, M.4
Goble, C.5
-
20
-
-
55349148888
-
Pig Latin: A not-so-foreign language for data processing
-
C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins. Pig Latin: A not-so-foreign language for data processing. In SIGMOD, 2008.
-
(2008)
SIGMOD
-
-
Olston, C.1
Reed, B.2
Srivastava, U.3
Kumar, R.4
Tomkins, A.5
-
22
-
-
84868288681
-
Information extraction
-
March
-
S. Sarawagi. Information extraction. Found. Trends databases, 1:261-377, March 2008.
-
(2008)
Found. Trends Databases
, vol.1
, pp. 261-377
-
-
Sarawagi, S.1
-
23
-
-
31444456909
-
A survey of data provenance in e-science
-
Y. L. Simmhan, B. Plale, and D. Gannon. A survey of data provenance in e-science. SIGMOD Rec., 34(3), 2005.
-
(2005)
SIGMOD Rec.
, vol.34
, Issue.3
-
-
Simmhan, Y.L.1
Plale, B.2
Gannon, D.3
-
24
-
-
84868325513
-
Hive: A warehousing solution over a map-reduce framework
-
A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wyckoff, and R. Murthy. Hive: A warehousing solution over a map-reduce framework. In VLDB, 2009.
-
(2009)
VLDB
-
-
Thusoo, A.1
Sarma, J.S.2
Jain, N.3
Shao, Z.4
Chakka, P.5
Anthony, S.6
Liu, H.7
Wyckoff, P.8
Murthy, R.9
-
25
-
-
80053500566
-
-
K. Weil. hadoop-lzo. http://www.github.com/kevinweil/hadoop-lzo/.
-
-
-
Weil, K.1
-
26
-
-
0030682362
-
Supporting fine-grained data lineage in a database visualization environment
-
A. Woodruff and M. Stonebraker. Supporting fine-grained data lineage in a database visualization environment. In ICDE, 1997.
-
(1997)
ICDE
-
-
Woodruff, A.1
Stonebraker, M.2
-
27
-
-
35448944021
-
Map-reduce-merge: Simplified relational data processing on large clusters
-
H.-C. Yang, A. Dasdan, R.-L. Hsiao, and D. S. Parker. Map-Reduce-Merge: Simplified relational data processing on large clusters. In SIGMOD, 2007.
-
(2007)
SIGMOD
-
-
Yang, H.-C.1
Dasdan, A.2
Hsiao, R.-L.3
Parker, D.S.4
|