-
1
-
-
0002433547
-
From Data Mining to Knowledge Discovery: An Overview
-
AAAI Press, Menlo Park
-
Fayyad, U.M., Gregory, P.S., Padhraic, S.: From Data Mining to Knowledge Discovery: an Overview. In: Advances in Knowledge Discovery and Data Mining, pp. 1-36. AAAI Press, Menlo Park (1996)
-
(1996)
Advances in Knowledge Discovery and Data Mining
, pp. 1-36
-
-
Fayyad, U.M.1
Gregory, P.S.2
Padhraic, S.3
-
4
-
-
84860443491
-
From Databases to big data
-
Madden, S.: From Databases to big data. IEEE Internet Computing 16(3), 4-6 (2012)
-
(2012)
IEEE Internet Computing
, vol.16
, Issue.3
, pp. 4-6
-
-
Madden, S.1
-
6
-
-
21644437974
-
The Google File System
-
Ghemawat, S., Gobioff, H., Leung, S.T.: The Google File System. In: 19th ACM Symposium on Operating Systems Principles, Bolton Landing, New York, pp. 29-43 (2003)
-
(2003)
19th ACM Symposium on Operating Systems Principles, Bolton Landing, New York
, pp. 29-43
-
-
Ghemawat, S.1
Gobioff, H.2
Leung, S.T.3
-
7
-
-
73649114265
-
MapReduce: A Flexible Data Processing Tool
-
Dean, J., Ghemawat, S.: MapReduce: a Flexible Data Processing Tool. Communication of the ACM 53(1), 72-77 (2010)
-
(2010)
Communication of the ACM
, vol.53
, Issue.1
, pp. 72-77
-
-
Dean, J.1
Ghemawat, S.2
-
8
-
-
85071319367
-
Bigtable: A Distributed Storage System for Structured Data
-
USENIX Association Berkeley, CA
-
Chang, F., Dean, J., Ghemawat, S., et al.: Bigtable: A Distributed Storage System for Structured Data. In: 7th Symposium on Operating Systems Design and Implementation, vol. 7, pp. 205-218. USENIX Association Berkeley, CA (2006)
-
(2006)
7th Symposium on Operating Systems Design and Implementation
, vol.7
, pp. 205-218
-
-
Chang, F.1
Dean, J.2
Ghemawat, S.3
-
9
-
-
79956105412
-
Jampani, et al: Dynamo: Amazon's Highly Available Key-Value Store
-
Stevenson, Washington
-
DeCandia, G., Hastorun, D.: Jampani, et al: Dynamo: Amazon's Highly Available Key-Value Store. In: 21st ACM SIGOPS Symposium on Operating Systems Principles, pp. 14-17. Stevenson, Washington (2007)
-
(2007)
21st ACM SIGOPS Symposium on Operating Systems Principles
, pp. 14-17
-
-
DeCandia, G.1
Hastorun, D.2
-
10
-
-
34547706772
-
-
2nd edn. Wiley & Sons, Hoboken
-
Shmueli, G., Patel, N.R., Bruce, P.C.: Data Mining for Business Intelligence: Concepts, Techniques, and Applications in Microsoft Office Excel with XLMiner, 2nd edn. Wiley & Sons, Hoboken (2010)
-
(2010)
Data Mining for Business Intelligence: Concepts, Techniques, and Applications in Microsoft Office Excel with XLMiner
-
-
Shmueli, G.1
Patel, N.R.2
Bruce, P.C.3
-
11
-
-
80052651384
-
NIMBLE: A Toolkit for the Implementation of Parallel Data Mining and Machine Learning Algorithms on MapReduce
-
Ghoting, A., Kambadur, P., Pednault, E., Kannan, R.: NIMBLE: a Toolkit for the Implementation of Parallel Data Mining and Machine Learning Algorithms on MapReduce. In: 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, California, USA, pp. 334-342 (2011)
-
(2011)
17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, California, USA
, pp. 334-342
-
-
Ghoting, A.1
Kambadur, P.2
Pednault, E.3
Kannan, R.4
-
12
-
-
84892921296
-
-
Mahout, http://lucene.apache.org/mahout/
-
Mahout
-
-
-
13
-
-
84866010099
-
BC-PDM: Data Mining, Social Network Analysis and Text Mining System Based on Cloud Computing
-
Yu, L., Zheng, J., Shen, W.C., et al.: BC-PDM: Data Mining, Social Network Analysis and Text Mining System Based on Cloud Computing. In: 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1496-1499 (2012)
-
(2012)
18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 1496-1499
-
-
Yu, L.1
Zheng, J.2
Shen, W.C.3
-
14
-
-
77951152705
-
PEGASUS: A Peta-Scale Graph Mining System Implementation and Observations
-
Kang, U., Tsourakakis, C.E., Faloutsos, C.: PEGASUS: A Peta-Scale Graph Mining System Implementation and Observations. In: 9th IEEE International Conference on Data Mining, pp. 229-238 (2009)
-
(2009)
9th IEEE International Conference on Data Mining
, pp. 229-238
-
-
Kang, U.1
Tsourakakis, C.E.2
Faloutsos, C.3
-
16
-
-
84863735533
-
Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud
-
Low, Y., Bickson, D., Gonzalez, J., Guestrin, C., Kyrola, A., Hellerstein, J.M.: Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud. VLDB Endowment 5(8), 71-727 (2012)
-
(2012)
VLDB Endowment
, vol.5
, Issue.8
, pp. 71-727
-
-
Low, Y.1
Bickson, D.2
Gonzalez, J.3
Guestrin, C.4
Kyrola, A.5
Hellerstein, J.M.6
-
18
-
-
24344486420
-
FastBit: An Efficient Indexing Technology for Accelerating Data-intensive Science
-
Wu, K.: FastBit: An Efficient Indexing Technology for Accelerating Data-intensive Science. Journal of Physics, Conference Series 16, 550-560 (2005)
-
(2005)
Journal of Physics, Conference Series
, vol.16
, pp. 550-560
-
-
Wu, K.1
-
19
-
-
84885958023
-
Big data Platforms: What's Next?
-
Borkar, V.R., Carey, M.J., Li, C.: big data Platforms: What's Next? ACM Crossroads 19(1), 44-49 (2012)
-
(2012)
ACM Crossroads
, vol.19
, Issue.1
, pp. 44-49
-
-
Borkar, V.R.1
Carey, M.J.2
Li, C.3
-
20
-
-
84873201254
-
Mining Knowledge from Interconnected Data: A Heterogeneous Information Network Analysis Approach
-
Sun, Y., Han, J., Yan, X., Yu, P.S.: Mining Knowledge from Interconnected Data: A Heterogeneous Information Network Analysis Approach. VLDB Endowment 5(12), 2022-2023 (2012)
-
(2012)
VLDB Endowment
, vol.5
, Issue.12
, pp. 2022-2023
-
-
Sun, Y.1
Han, J.2
Yan, X.3
Yu, P.S.4
-
21
-
-
84878947403
-
-
Technical Report, Center for Information Science and Technology Temple University, ch. 1
-
Obradovic, Z., Vucetic, S.: Challenges in Scientific Data Mining: Heterogeneous, Biased, and Large Samples. Technical Report, Center for Information Science and Technology Temple University, ch. 1, pp. 1-24 (2004)
-
(2004)
Challenges in Scientific Data Mining: Heterogeneous, Biased, and Large Samples
, pp. 1-24
-
-
Obradovic, Z.1
Vucetic, S.2
-
22
-
-
0043099440
-
Discovering Homogeneous Regions in Spatial Data through Competition
-
Vucetic, S., Obradovic, Z.: Discovering Homogeneous Regions in Spatial Data through Competition. In: 17th International Conference of Machine Learning, Stanford, CA, pp. 1095-1102 (2000)
-
(2000)
17th International Conference of Machine Learning, Stanford, CA
, pp. 1095-1102
-
-
Vucetic, S.1
Obradovic, Z.2
-
23
-
-
79961187232
-
Bethel, et al: FastBit: Interactively Searching Massive Data
-
Wu, K., Ahern, S.: Bethel, et al: FastBit: Interactively Searching Massive Data. Sci-DAC 180 (2009)
-
(2009)
Sci-DAC
, pp. 180
-
-
Wu, K.1
Ahern, S.2
-
24
-
-
76649126078
-
Mining Hidden Communities in Heterogeneous Social Network
-
Cai, D., Shao, Z., He, X., Yan, X., Han, J.: Mining Hidden Communities in Heterogeneous Social Network. In: 3rd International Workshop Link Discovery (LinkKDD), pp. 58-65 (2005)
-
(2005)
3rd International Workshop Link Discovery (LinkKDD)
, pp. 58-65
-
-
Cai, D.1
Shao, Z.2
He, X.3
Yan, X.4
Han, J.5
-
25
-
-
84871119172
-
-
Apache Hive, http://hive.apache.org/
-
Apache Hive
-
-
-
27
-
-
84880533620
-
Shark: SQL and Rich Analytics at Scale
-
accepted
-
Xin, R.S., Rosen, J., Zaharia, M., Franklin, M., Shenker, S., Stoica, I.: Shark: SQL and Rich Analytics at Scale. In: ACM SIGMOD Conference (accepted, 2013)
-
(2013)
ACM SIGMOD Conference
-
-
Xin, R.S.1
Rosen, J.2
Zaharia, M.3
Franklin, M.4
Shenker, S.5
Stoica, I.6
-
28
-
-
84873103791
-
-
Agrawal, D., Bernstein, P., Bertino, E., et al.: Challenges and Opportunities With big data - A Community White Paper Developed by Leading Researchers Across the United States (2012), http://cra.org/ccc/docs/init/ bigdatawhitepaper.pdf
-
(2012)
Challenges and Opportunities with Big Data - A Community White Paper Developed by Leading Researchers Across the United States
-
-
Agrawal, D.1
Bernstein, P.2
Bertino, E.3
-
30
-
-
74049128689
-
An Efficient Multi-dimensional Index for Cloud Data Management
-
ACM Press, Hong Kong
-
Zhang, X., Ai, J., Wang, Z., Lu, J., Meng, X.: An Efficient Multi-dimensional Index for Cloud Data Management. In: 1st International Workshop on Cloud Data Management, pp. 17-24. ACM Press, Hong Kong (2009)
-
(2009)
1st International Workshop on Cloud Data Management
, pp. 17-24
-
-
Zhang, X.1
Ai, J.2
Wang, Z.3
Lu, J.4
Meng, X.5
-
31
-
-
36849093958
-
Truth Discovery with Multiple Conflicting Information Providers on the Web
-
Yin, X., Han, J., Yu, P.S.: Truth Discovery with Multiple Conflicting Information Providers on the Web. In: 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Jose, California, pp. 1048-1052 (2007)
-
(2007)
13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Jose, California
, pp. 1048-1052
-
-
Yin, X.1
Han, J.2
Yu, P.S.3
-
32
-
-
77954322933
-
Integrating Conflicting Data: The Role of Source Dependence
-
Dong, X.L., Berti-Equille, L., Srivastava, D.: Integrating Conflicting Data: The Role of Source Dependence. VLDB Endowment 2(1), 550-561 (2009)
-
(2009)
VLDB Endowment
, vol.2
, Issue.1
, pp. 550-561
-
-
Dong, X.L.1
Berti-Equille, L.2
Srivastava, D.3
-
33
-
-
84873482520
-
Semi-Supervised Truth Discovery
-
Yin, X., Tan, W.: Semi-Supervised Truth Discovery. In: 20th International Conference on World Wide Web, Hyderabad, India, pp. 217-226 (2011)
-
(2011)
20th International Conference on World Wide Web, Hyderabad, India
, pp. 217-226
-
-
Yin, X.1
Tan, W.2
-
34
-
-
84866107712
-
Privacy in the Age of big data: A Time for Big Decisions
-
Tene, O., Polonetsky, J.: Privacy in the Age of big data: A Time for Big Decisions. Stanford Law Review Online 64, 63-69 (2012)
-
(2012)
Stanford Law Review Online
, vol.64
, pp. 63-69
-
-
Tene, O.1
Polonetsky, J.2
-
35
-
-
84892930900
-
Big data Mining, Fairness and Privacy - A Vision Statement Towards an Interdisciplinary Roadmap of Research
-
Pedreschi, D., Calders, T., Custers, B., et al.: big data Mining, Fairness and Privacy - A Vision Statement Towards an Interdisciplinary Roadmap of Research. Data Mining and Analytics Software, KDnuggets Review Online 11(26) (2011)
-
(2011)
Data Mining and Analytics Software, KDnuggets Review Online
, vol.11
, Issue.26
-
-
Pedreschi, D.1
Calders, T.2
Custers, B.3
-
37
-
-
84870759190
-
A Metadata Catalog for Organization and Systemization of Fusion Simulation Data
-
Greenwald, M., Fredian, T., Schissel, D., Stillerman, J.: A Metadata Catalog for Organization and Systemization of Fusion Simulation Data. Fusion Engineering & Design 87(12), 2205-2208 (2012)
-
(2012)
Fusion Engineering & Design
, vol.87
, Issue.12
, pp. 2205-2208
-
-
Greenwald, M.1
Fredian, T.2
Schissel, D.3
Stillerman, J.4
|