-
1
-
-
84891087050
-
MillWheel: Fault-tolerant stream processing at internet scale
-
T. Akidau, A. Balikov, K. Bekiroǧlu, S. Chernyak, J. Haberman, R. Lax, S. McVeety, D. Mills, P. Nordstrom, and S. Whittle. MillWheel: Fault-tolerant stream processing at internet scale. VLDB, 2013.
-
(2013)
VLDB
-
-
Akidau, T.1
Balikov, A.2
Bekiroǧlu, K.3
Chernyak, S.4
Haberman, J.5
Lax, R.6
McVeety, S.7
Mills, D.8
Nordstrom, P.9
Whittle, S.10
-
2
-
-
84880518659
-
Photon: Fault-tolerant and scalable joining of continuous data streams
-
R. Ananthanarayanan, V. Basker, S. Das, A. Gupta, H. Jiang, T. Qiu, A. Reznichenko, D. Ryabkov, M. Singh, and S. Venkataraman. Photon: Fault-tolerant and scalable joining of continuous data streams. SIGMOD, 2013.
-
(2013)
SIGMOD
-
-
Ananthanarayanan, R.1
Basker, V.2
Das, S.3
Gupta, A.4
Jiang, H.5
Qiu, T.6
Reznichenko, A.7
Ryabkov, D.8
Singh, M.9
Venkataraman, S.10
-
3
-
-
82155187187
-
Incoop: MapReduce for incremental computations
-
P. Bhatotia, A. Wieder, R. Rodrigues, U. A. Acar, and R. Pasquini. Incoop: MapReduce for incremental computations. SoCC, 2011.
-
(2011)
SoCC
-
-
Bhatotia, P.1
Wieder, A.2
Rodrigues, R.3
Acar, U.A.4
Pasquini, R.5
-
4
-
-
0014814325
-
Space/time trade-offs in hash coding with allowable errors
-
B. Bloom. Space/time trade-offs in hash coding with allowable errors. Communications of the ACM, 13(7):422-426, 1970.
-
(1970)
Communications of the ACM, vol. 13, issue 7, pp. 422-426
-
-
Bloom, B.1
-
5
-
-
84904136037
-
Large-scale machine learning with stochastic gradient descent
-
L. Bottou. Large-scale machine learning with stochastic gradient descent. COMPSTAT, 2010.
-
(2010)
COMPSTAT
-
-
Bottou, L.1
-
6
-
-
0031346696
-
On the resemblance and containment of documents
-
A. Z. Broder. On the resemblance and containment of documents. SEQUENCES, 1997.
-
(1997)
SEQUENCES
-
-
Broder, A.Z.1
-
7
-
-
0037960295
-
Monitoring streams - a new class of data management applications
-
D. Carney, U. Çetintemel, M. Cherniack, C. Convey, S. Lee, G. Seidman, M. Stonebraker, N. Tatbul, and S. Zdonik. Monitoring streams - a new class of data management applications. VLDB, 2002.
-
(2002)
VLDB
-
-
Carney, D.1
Çetintemel, U.2
Cherniack, M.3
Convey, C.4
Lee, S.5
Seidman, G.6
Stonebraker, M.7
Tatbul, N.8
Zdonik, S.9
-
8
-
-
85076771850
-
MapReduce online
-
T. Condie, N. Conway, P. Alvaro, J. M. Hellerstein, K. Elmeleegy, and R. Sears. MapReduce online. NSDI, 2010.
-
(2010)
NSDI
-
-
Condie, T.1
Conway, N.2
Alvaro, P.3
Hellerstein, J.M.4
Elmeleegy, K.5
Sears, R.6
-
9
-
-
14844367057
-
An improved data stream summary: The count-min sketch and its applications
-
G. Cormode and S. Muthukrishnan. An improved data stream summary: The count-min sketch and its applications. Journal of Algorithms, 55(1):58-75, 2005.
-
(2005)
Journal of Algorithms, vol. 55, issue 1, pp. 58-75
-
-
Cormode, G.1
Muthukrishnan, S.2
-
11
-
-
55649105542
-
SPADE: The System S declarative stream processing engine
-
B. Gedik, H. Andrade, K.-L. Wu, P. Yu, and M. Doo. SPADE: The System S declarative stream processing engine. SIGMOD, 2008.
-
(2008)
SIGMOD
-
-
Gedik, B.1
Andrade, H.2
Wu, K.-L.3
Yu, P.4
Doo, M.5
-
14
-
-
84893318699
-
Hourglass: A library for incremental processing on Hadoop
-
M. Hayes and S. Shah. Hourglass: A library for incremental processing on Hadoop. Big Data, 2013.
-
(2013)
Big Data
-
-
Hayes, M.1
Shah, S.2
-
15
-
-
84887748650
-
HyperLogLog in practice: Algorithmic engineering of a state of the art cardinality estimation algorithm
-
S. Heule, M. Nunkesser, and A. Hall. HyperLogLog in practice: Algorithmic engineering of a state of the art cardinality estimation algorithm. EDBT, 2013.
-
(2013)
EDBT
-
-
Heule, S.1
Nunkesser, M.2
Hall, A.3
-
16
-
-
84893305113
-
Mesos: A platform for fine-grained resource sharing in the data center
-
B. Hindman, A. Konwinski, M. Zaharia, A. Ghodsi, A. D. Joseph, R. Katz, S. Shenker, and I. Stoica. Mesos: A platform for fine-grained resource sharing in the data center. NSDI, 2011.
-
(2011)
NSDI
-
-
Hindman, B.1
Konwinski, A.2
Zaharia, M.3
Ghodsi, A.4
Joseph, A.D.5
Katz, R.6
Shenker, S.7
Stoica, I.8
-
17
-
-
84897489756
-
Algebraic classifiers: A generic approach to fast cross-validation, online training, and parallel training
-
M. Izbicki. Algebraic classifiers: A generic approach to fast cross-validation, online training, and parallel training. ICML, 2013.
-
(2013)
ICML
-
-
Izbicki, M.1
-
18
-
-
84873171258
-
Kafka: A distributed messaging system for log processing
-
J. Kreps, N. Narkhede, and J. Rao. Kafka: A distributed messaging system for log processing. NetDB, 2011.
-
(2011)
NetDB
-
-
Kreps, J.1
Narkhede, N.2
Rao, J.3
-
19
-
-
77954715436
-
Continuous analytics over discontinuous streams
-
S. Krishnamurthy, M. Franklin, J. Davis, D. Farina, P. Golovko, A. Li, and N. Thombre. Continuous analytics over discontinuous streams. SIGMOD, 2010.
-
(2010)
SIGMOD
-
-
Krishnamurthy, S.1
Franklin, M.2
Davis, J.3
Farina, D.4
Golovko, P.5
Li, A.6
Thombre, N.7
-
20
-
-
77952270709
-
DEDUCE: At the intersection of MapReduce and stream processing
-
V. Kumar, H. Andrade, B. Gedik, and K.-L. Wu. DEDUCE: At the intersection of MapReduce and stream processing. EDBT, 2010.
-
(2010)
EDBT
-
-
Kumar, V.1
Andrade, H.2
Gedik, B.3
Wu, K.-L.4
-
21
-
-
84872409772
-
Muppet: MapReduce-style processing of fast data
-
W. Lam, L. Liu, S. Prasad, A. Rajaraman, Z. Vacheri, and A. Doan. Muppet: MapReduce-style processing of fast data. VLDB, 2012.
-
(2012)
VLDB
-
-
Lam, W.1
Liu, L.2
Prasad, S.3
Rajaraman, A.4
Vacheri, Z.5
Doan, A.6
-
22
-
-
84984039280
-
Hints for computer system design
-
B. Lampson. Hints for computer system design. SOSP, 1983.
-
(1983)
SOSP
-
-
Lampson, B.1
-
23
-
-
84873206169
-
The unified logging infrastructure for data analytics at Twitter
-
G. Lee, J. Lin, C. Liu, A. Lorek, and D. Ryaboy. The unified logging infrastructure for data analytics at Twitter. VLDB, 2012.
-
(2012)
VLDB
-
-
Lee, G.1
Lin, J.2
Liu, C.3
Lorek, A.4
Ryaboy, D.5
-
24
-
-
84862684679
-
Large-scale machine learning at Twitter
-
J. Lin and A. Kolcz. Large-scale machine learning at Twitter. SIGMOD, 2012.
-
(2012)
SIGMOD
-
-
Lin, J.1
Kolcz, A.2
-
25
-
-
84893267530
-
Scaling big data mining infrastructure: The Twitter experience
-
J. Lin and D. Ryaboy. Scaling big data mining infrastructure: The Twitter experience. SIGKDD Explorations, 14(2):6-19, 2012.
-
(2012)
SIGKDD Explorations, vol. 14, issue 2, pp. 6-19
-
-
Lin, J.1
Ryaboy, D.2
-
26
-
-
80051931078
-
Stateful bulk processing for incremental analytics
-
D. Logothetis, C. Olston, B. Reed, K. C. Webb, and K. Yocum. Stateful bulk processing for incremental analytics. SoCC, 2010.
-
(2010)
SoCC
-
-
Logothetis, D.1
Olston, C.2
Reed, B.3
Webb, K.C.4
Yocum, K.5
-
27
-
-
79953647392
-
A co-relational model of data for large shared data banks
-
E. Meijer and G. Bierman. A co-relational model of data for large shared data banks. Communications of the ACM, 54(4):49-58, 2011.
-
(2011)
Communications of the ACM, vol. 54, issue 4, pp. 49-58
-
-
Meijer, E.1
Bierman, G.2
-
28
-
-
84880559221
-
Fast data in the era of big data: Twitter's real-time related query suggestion architecture
-
G. Mishne, J. Dalton, Z. Li, A. Sharma, and J. Lin. Fast data in the era of big data: Twitter's real-time related query suggestion architecture. SIGMOD, 2013.
-
(2013)
SIGMOD
-
-
Mishne, G.1
Dalton, J.2
Li, Z.3
Sharma, A.4
Lin, J.5
-
29
-
-
84990965478
-
Information network or social network? The structure of the Twitter follow graph
-
S. A. Myers, A. Sharma, P. Gupta, and J. Lin. Information network or social network? The structure of the Twitter follow graph. WWW Companion, 2014.
-
(2014)
WWW Companion
-
-
Myers, S.A.1
Sharma, A.2
Gupta, P.3
Lin, J.4
-
30
-
-
84879520704
-
Incremental stream processing using computational conflict-free replicated data types
-
D. Navalho, S. Duarte, N. Preguiça, and M. Shapiro. Incremental stream processing using computational conflict-free replicated data types. CloudDP, 2013.
-
(2013)
CloudDP
-
-
Navalho, D.1
Duarte, S.2
Preguiça, N.3
Shapiro, M.4
-
32
-
-
55349148888
-
Pig Latin: A not-so-foreign language for data processing
-
C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins. Pig Latin: A not-so-foreign language for data processing. SIGMOD, 2008.
-
(2008)
SIGMOD
-
-
Olston, C.1
Reed, B.2
Srivastava, U.3
Kumar, R.4
Tomkins, A.5
-
33
-
-
30344452311
-
Interpreting the data: Parallel analysis with Sawzall
-
R. Pike, S. Dorward, R. Griesemer, and S. Quinlan. Interpreting the data: Parallel analysis with Sawzall. Scientific Programming Journal, 13(4):277-298, 2005.
-
(2005)
Scientific Programming Journal, vol. 13, issue 4, pp. 277-298
-
-
Pike, R.1
Dorward, S.2
Griesemer, R.3
Quinlan, S.4
-
34
-
-
84877690081
-
TimeStream: Reliable stream computation in the cloud
-
Z. Qian, Y. He, C. Su, Z. Wu, H. Zhu, T. Zhang, L. Zhou, Y. Yu, and Z. Zhang. TimeStream: Reliable stream computation in the cloud. EuroSys, 2013.
-
(2013)
EuroSys
-
-
Qian, Z.1
He, Y.2
Su, C.3
Wu, Z.4
Zhu, H.5
Zhang, T.6
Zhou, L.7
Yu, Y.8
Zhang, Z.9
-
35
-
-
80054708353
-
A comprehensive study of Convergent and Commutative Replicated Data Types
-
M. Shapiro, N. Preguiça, C. Baquero, and M. Zawirski. A comprehensive study of Convergent and Commutative Replicated Data Types. Technical report, INRIA, 2011.
-
(2011)
Technical report, INRIA
-
-
Shapiro, M.1
Preguiça, N.2
Baquero, C.3
Zawirski, M.4
-
36
-
-
85002655794
-
Stream-monitoring with BlockMon: Convergence of network measurements and data analytics platforms
-
D. Simoncelli, M. Dusi, F. Gringoli, and S. Niccolini. Stream-monitoring with BlockMon: Convergence of network measurements and data analytics platforms. ACM SIGCOMM Computer Communication Review, 43(2):30-35, 2013.
-
(2013)
ACM SIGCOMM Computer Communication Review, vol. 43, issue 2, pp. 30-35
-
-
Simoncelli, D.1
Dusi, M.2
Gringoli, F.3
Niccolini, S.4
-
37
-
-
84905838951
-
Storm @Twitter
-
A. Toshniwal, S. Taneja, A. Shukla, K. Ramasamy, J. M. Patel, S. Kulkarni, J. Jackson, K. Gade, M. Fu, J. Donham, N. Bhagat, S. Mittal, and D. Ryaboy. Storm @Twitter. SIGMOD, 2014.
-
(2014)
SIGMOD
-
-
Toshniwal, A.1
Taneja, S.2
Shukla, A.3
Ramasamy, K.4
Patel, J.M.5
Kulkarni, S.6
Jackson, J.7
Gade, K.8
Fu, M.9
Donham, J.10
Bhagat, N.11
Mittal, S.12
Ryaboy, D.13
-
38
-
-
70350591395
-
DryadLINQ: A system for general-purpose distributed data-parallel computing using a high-level language
-
Y. Yu, M. Isard, D. Fetterly, M. Budiu, Ú. Erlingsson, P. K. Gunda, and J. Currey. DryadLINQ: A system for general-purpose distributed data-parallel computing using a high-level language. OSDI, 2008.
-
(2008)
OSDI
-
-
Yu, Y.1
Isard, M.2
Fetterly, D.3
Budiu, M.4
Erlingsson, Ú.5
Gunda, P.K.6
Currey, J.7
-
39
-
-
85040175609
-
Resilient Distributed Datasets: A fault-tolerant abstraction for in-memory cluster computing
-
M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley, M. J. Franklin, S. Shenker, and I. Stoica. Resilient Distributed Datasets: A fault-tolerant abstraction for in-memory cluster computing. NSDI, 2012.
-
(2012)
NSDI
-
-
Zaharia, M.1
Chowdhury, M.2
Das, T.3
Dave, A.4
Ma, J.5
McCauley, M.6
Franklin, M.J.7
Shenker, S.8
Stoica, I.9
-
40
-
-
84962598306
-
Discretized streams: An efficient and fault-tolerant model for stream processing on large clusters
-
M. Zaharia, T. Das, H. Li, S. Shenker, and I. Stoica. Discretized streams: An efficient and fault-tolerant model for stream processing on large clusters. HotCloud, 2012.
-
(2012)
HotCloud
-
-
Zaharia, M.1
Das, T.2
Li, H.3
Shenker, S.4
Stoica, I.5