-
1
-
-
79957809015
-
HadoopDB: An architectural hybrid of MapReduce and DBMS technologies for analytical workloads
-
Lyon, France, 59
-
Azza Abouzeid, Kamil Bajda-Pawlikowski, Daniel Abadi, Avi Silberschatz, and Alexander Rasin. HadoopDB: An architectural hybrid of MapReduce and DBMS technologies for analytical workloads. In Proceedings of the 35th International Conference on Very Large Data Base (VLDB 2009), pages 922-933, Lyon, France, 2009. 59
-
(2009)
Proceedings of the 35th International Conference On Very Large Data Base (VLDB 2009)
, pp. 922-933
-
-
Abouzeid, A.1
Bajda-Pawlikowski, K.2
Abadi, D.3
Silberschatz, A.4
Rasin, A.5
-
2
-
-
0036013593
-
Statistical mechanics of complex networks
-
92
-
Réka Albert and Albert-László Barabási. Statistical mechanics of complex networks. Reviews of Modern Physics, 74:47-97, 2002. DOI: 10.1103/RevModPhys.74.47 92
-
(2002)
Reviews of Modern Physics
, vol.74
, pp. 47-97
-
-
Albert, R.1
Barabási, A.2
-
3
-
-
0029719644
-
The space complexity of approximating the frequencymoments
-
Philadelphia, Pennsylvania, 145
-
Noga Alon, Yossi Matias, and Mario Szegedy. The space complexity of approximating the frequencymoments. In Proceedings of the 28th Annual ACMSymposium onTheory of Computing (STOC '96), pages 20-29, Philadelphia, Pennsylvania, 1996. DOI: 10.1145/237814.237823 145
-
(1996)
Proceedings of the 28th Annual ACMSymposium OnTheory of Computing (STOC '96)
, pp. 20-29
-
-
Alon, N.1
Matias, Y.2
Szegedy, M.3
-
4
-
-
77952783397
-
BOOM: Data-centric programming in the datacenter
-
University of California at Berkeley, 32
-
Peter Alvaro, Tyson Condie, Neil Conway, Khaled Elmeleegy, Joseph M. Hellerstein, and Russell C. Sears. BOOM: Data-centric programming in the datacenter. Technical Report UCB/EECS-2009-98, Electrical Engineering and Computer Sciences, University of California at Berkeley, 2009. 32
-
(2009)
Technical Report UCB/EECS-2009-98, Electrical Engineering and Computer Sciences
-
-
Alvaro, P.1
Condie, T.2
Conway, N.3
Elmeleegy, K.4
Hellerstein, J.M.5
Sears, R.C.6
-
5
-
-
85060036181
-
Validity of the single processor approach to achieving large-scale computing capabilities
-
17
-
Gene Amdahl. Validity of the single processor approach to achieving large-scale computing capabilities. In Proceedings of the AFIPS Spring Joint Computer Conference, pages 483-485, 1967. DOI: 10.1145/1465482.1465560 17
-
(1967)
Proceedings of the AFIPS Spring Joint Computer Conference
, pp. 483-485
-
-
Amdahl, G.1
-
6
-
-
85068224811
-
Cloud analytics: Do we really need to reinvent the storage stack?
-
San Diego, California, 29
-
Rajagopal Ananthanarayanan, Karan Gupta, Prashant Pandey, Himabindu Pucha, Prasenjit Sarkar, Mansi Shah, and Renu Tewari. Cloud analytics: Do we really need to reinvent the storage stack? In Proceedings of the 2009Workshop onHotTopics in Cloud Computing (HotCloud 09), San Diego, California, 2009. 29
-
(2009)
Proceedings of the 2009Workshop OnHotTopics in Cloud Computing (HotCloud 09)
-
-
Ananthanarayanan, R.1
Gupta, K.2
Pandey, P.3
Pucha, H.4
Sarkar, P.5
Shah, M.6
Tewari, R.7
-
7
-
-
84883505875
-
Serverless network file systems
-
Copper Mountain Resort, Colorado, 29
-
Thomas Anderson, Michael Dahlin, Jeanna Neefe, David Patterson,Drew Roselli, and RandolphWang. Serverless network file systems. In Proceedings of the 15th ACM Symposium on Operating Systems Principles (SOSP 1995), pages 109-126, Copper Mountain Resort, Colorado, 1995. DOI: 10.1145/224056.224066 29
-
(1995)
Proceedings of the 15th ACM Symposium On Operating Systems Principles (SOSP 1995)
, pp. 109-126
-
-
Anderson, T.1
Dahlin, M.2
Neefe, J.3
Patterson, D.4
Roselli, D.5
Wang, R.6
-
8
-
-
22044441103
-
Inverted index compression using word-aligned binary codes
-
76
-
Vo Ngoc Anh and Alistair Moffat. Inverted index compression using word-aligned binary codes. Information Retrieval, 8(1):151-166, 2005. DOI: 10.1023/B:INRT.0000048490.99518.5c 76
-
(2005)
Information Retrieval
, vol.8
, Issue.1
, pp. 151-166
-
-
Anh, V.N.1
Moffat, A.2
-
9
-
-
68249129760
-
Above the clouds:ABerkeley viewof cloud computing
-
University of California at Berkeley, 6
-
Michael Armbrust,ArmandoFox,Rean Griffith,AnthonyD. Joseph,Randy H.Katz,Andrew Konwinski, Gunho Lee, David A. Patterson, Ariel Rabkin, Ion Stoica, and Matei Zaharia. Above the clouds:ABerkeley viewof cloud computing.Technical Report UCB/EECS-2009-28, Electrical Engineering and Computer Sciences, University of California at Berkeley, 2009. 6
-
(2009)
Technical Report UCB/EECS-2009-28, Electrical Engineering and Computer Sciences
-
-
Armbrust, M.1
Fox, A.2
Griffith, R.3
Joseph, A.D.4
Katz, R.H.5
Konwinski, A.6
Lee, G.7
Patterson, D.A.8
Rabkin, A.9
Stoica, I.10
Zaharia, M.11
-
10
-
-
84858773678
-
Asynchronous distributed learning of topic models
-
Vancouver, British Columbia, Canada, 144
-
Arthur Asuncion, Padhraic Smyth, and MaxWelling. Asynchronous distributed learning of topic models. In Advances in Neural Information Processing Systems 21 (NIPS 2008), pages 81-88, Vancouver, British Columbia, Canada, 2008. 144
-
(2008)
Advances in Neural Information Processing Systems 21 (NIPS 2008)
, pp. 81-88
-
-
Asuncion, A.1
Smyth, P.2
Welling, M.3
-
11
-
-
34548710710
-
Challenges on distributed web retrieval
-
Istanbul, Turkey, 83
-
Ricardo Baeza-Yates, Carlos Castillo, Flavio Junqueira, Vassilis Plachouras, and Fabrizio Silvestri. Challenges on distributed web retrieval. In Proceedings of the IEEE 23rd International Conference on Data Engineering (ICDE 2007), pages 6-20, Istanbul, Turkey, 2007. DOI: 10.1109/ICDE.2007.367846 83
-
(2007)
Proceedings of the IEEE 23rd International Conference On Data Engineering (ICDE 2007)
, pp. 6-20
-
-
Baeza-Yates, R.1
Castillo, C.2
Junqueira, F.3
Plachouras, V.4
Silvestri, F.5
-
12
-
-
84876906870
-
PageRank increase under different collusion topologies
-
Chiba, Japan, 96
-
Ricardo Baeza-Yates, Carlos Castillo, and Vicente López. PageRank increase under different collusion topologies. In Proceedings of the First International Workshop on Adversarial Information Retrieval on theWeb (AIRWeb 2005), pages 17-24, Chiba, Japan, 2005. 96
-
(2005)
Proceedings of the First International Workshop On Adversarial Information Retrieval On TheWeb (AIRWeb 2005)
, pp. 17-24
-
-
Baeza-Yates, R.1
Castillo, C.2
López, V.3
-
13
-
-
36448931586
-
The impact of caching on search engines
-
Amsterdam, The Netherlands, 82
-
Ricardo Baeza-Yates, Aristides Gionis, Flavio Junqueira, Vanessa Murdock, Vassilis Plachouras, and Fabrizio Silvestri. The impact of caching on search engines. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2007), pages 183-190, Amsterdam, The Netherlands, 2007. DOI: 10.1145/1277741.1277775 82
-
(2007)
Proceedings of the 30th Annual International ACM SIGIR Conference On Research and Development in Information Retrieval (SIGIR 2007)
, pp. 183-190
-
-
Baeza-Yates, R.1
Gionis, A.2
Junqueira, F.3
Murdock, V.4
Plachouras, V.5
Silvestri, F.6
-
15
-
-
21644433634
-
Xen and the art of virtualization
-
Bolton Landing, New York, 6
-
Paul Barham, Boris Dragovic, Keir Fraser, Steven Hand,Tim Harris, Alex Ho, Rolf Neugebauer, Ian Pratt, and AndrewWarfield. Xen and the art of virtualization. In Proceedings of the 19th ACM Symposium on Operating Systems Principles (SOSP 2003), pages 164-177, Bolton Landing, New York, 2003. DOI: 10.1145/945445.945462 6
-
(2003)
Proceedings of the 19th ACM Symposium On Operating Systems Principles (SOSP 2003)
, pp. 164-177
-
-
Barham, P.1
Dragovic, B.2
Fraser, K.3
Hand, S.4
Harris, T.5
Ho, A.6
Neugebauer, R.7
Pratt, I.8
Warfield, A.9
-
16
-
-
0037619265
-
Web search for a planet: The Google cluster architecture
-
82
-
Luiz André Barroso, Jeffrey Dean, and Urs Hölzle. Web search for a planet: The Google cluster architecture. IEEE Micro, 23(2):22-28, 2003. DOI: 10.1109/MM.2003.1196112 82
-
(2003)
IEEE Micro
, vol.23
, Issue.2
, pp. 22-28
-
-
Barroso, L.A.1
Dean, J.2
Hölzle, U.3
-
17
-
-
47249127725
-
The case for energy-proportional computing
-
9
-
Luiz André Barroso and Urs Hölzle. The case for energy-proportional computing. Computer, 40(12):33-37, 2007. DOI: 10.1109/MC.2007.443 9
-
(2007)
Computer
, vol.40
, Issue.12
, pp. 33-37
-
-
Barroso, L.A.1
Hölzle, U.2
-
18
-
-
67649170859
-
-
Morgan & Claypool Publishers, 8, 9, 10, 14
-
Luiz André Barroso and Urs Hölzle. The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines. Morgan & Claypool Publishers, 2009. DOI: 10.2200/S00193ED1V01Y200905CAC006 8, 9, 10, 14
-
(2009)
The Datacenter As a Computer: An Introduction to the Design of Warehouse-Scale Machines
-
-
Barroso, L.A.1
Hölzle, U.2
-
19
-
-
85048736874
-
-
SLAC Publications SLAC-PUB-12292, Stanford Linear Accelerator Center, May 2006. 2
-
Jacek Becla, Andrew Hanushevsky, Sergei Nikolaev, Ghaleb Abdulla, Alex Szalay, Maria Nieto-Santisteban, Ani Thakar, and Jim Gray. Designing a multi-petabyte database for LSST. SLAC Publications SLAC-PUB-12292, Stanford Linear Accelerator Center, May 2006. 2
-
Designing a Multi-petabyte Database for LSST
-
-
Becla, J.1
Hanushevsky, A.2
Nikolaev, S.3
Abdulla, G.4
Szalay, A.5
Nieto-Santisteban, M.6
Thakar, A.7
Gray, J.8
-
21
-
-
62149144693
-
Beyond the data deluge
-
Gordon Bell,Tony Hey, and Alex Szalay. Beyond the data deluge. Science, 323(5919):1297-1298, 2009. 2
-
(2009)
Science
, vol.323
, Issue.5919
, pp. 1297-1298
-
-
Bell, G.1
Hey, T.2
Szalay, A.3
-
22
-
-
16244392071
-
Inside page rank
-
98, 100
-
Monica Bianchini, Marco Gori, and Franco Scarselli. Inside PageRank. ACMTransactions on Internet Technology, 5(1):92-128, 2005. DOI: 10.1145/1052934.1052938 98, 100
-
(2005)
ACMTransactions On Internet Technology
, vol.5
, Issue.1
, pp. 92-128
-
-
Bianchini, M.1
Gori, M.2
Scarselli, F.3
-
24
-
-
33947180792
-
Stochastic learning
-
In Olivier Bousquet and Ulrike von Luxburg, editors, Springer Verlag, Berlin, 144
-
Léon Bottou. Stochastic learning. In Olivier Bousquet and Ulrike von Luxburg, editors, Advanced Lectures on Machine Learning, Lecture Notes in Artificial Intelligence, LNAI 3176, pages 146-168. Springer Verlag, Berlin, 2004. DOI: 10.1007/b100712 144
-
(2004)
Advanced Lectures On Machine Learning, Lecture Notes in Artificial Intelligence, LNAI 3176
, pp. 146-168
-
-
Bottou, L.1
-
25
-
-
80053375619
-
Large language models in machine translation
-
Prague, Czech Republic, 4, 5, 133
-
Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, and Jeffrey Dean. Large language models in machine translation. In Proceedings of the 2007 Joint Conference on Empirical Methods inNatural Language Processing and ComputationalNatural Language Learning, pages 858-867, Prague, Czech Republic, 2007. 4, 5, 133
-
(2007)
Proceedings of the 2007 Joint Conference On Empirical Methods InNatural Language Processing and ComputationalNatural Language Learning
, pp. 858-867
-
-
Brants, T.1
Popat, A.C.2
Xu, P.3
Och, F.J.4
Dean, J.5
-
27
-
-
0038105095
-
Data-intensive question answering
-
Gaithersburg, Maryland, 4
-
Eric Brill, Jimmy Lin, Michele Banko, Susan Dumais, and Andrew Ng. Data-intensive question answering. In Proceedings of the Tenth Text REtrieval Conference (TREC 2001), pages 393-400, Gaithersburg, Maryland, 2001. 4
-
(2001)
Proceedings of the Tenth Text REtrieval Conference (TREC 2001)
, pp. 393-400
-
-
Brill, E.1
Lin, J.2
Banko, M.3
Dumais, S.4
Ng, A.5
-
29
-
-
85044611587
-
The mathematics of statistical machine translation: Parameter estimation
-
130
-
Peter F. Brown, Vincent J. Della Pietra, Stephen A. Della Pietra, and Robert L. Mercer. The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics, 19(2):263-311, 1993. 130
-
(1993)
Computational Linguistics
, vol.19
, Issue.2
, pp. 263-311
-
-
Brown, P.F.1
Della Pietra, V.J.2
Della Pietra, S.A.3
Mercer, R.L.4
-
30
-
-
78449292738
-
-
MIT Press, Cambridge, Massachusetts, 78, 83
-
Stefan Büttcher, Charles L. A. Clarke, and Gordon V. Cormack. Information Retrieval: Implementing and Evaluating Search Engines. MIT Press, Cambridge, Massachusetts, 2010. 78, 83
-
(2010)
Information Retrieval: Implementing and Evaluating Search Engines.
-
-
Büttcher, S.1
Clarke, C.L.A.2
Cormack, G.V.3
-
31
-
-
63649117166
-
Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility
-
6
-
Rajkumar Buyya, Chee Shin Yeo, Srikumar Venugopal, James Broberg, and Ivona Brandic. Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility. Future Generation Computer Systems, 25(6):599-616, 2009. DOI: 10.1016/j.future.2008.12.001 6
-
(2009)
Future Generation Computer Systems
, vol.25
, Issue.6
, pp. 599-616
-
-
Buyya, R.1
Yeo, C.S.2
Venugopal, S.3
Broberg, J.4
Brandic, I.5
-
32
-
-
0026224590
-
Swift:Using distributed disk striping to provide high I/O data rates
-
29
-
Luis-Felipe Cabrera and DarrellD.E.Long. Swift:Using distributed disk striping to provide high I/O data rates. Computer Systems, 4(4):405-436, 1991. 29
-
(1991)
Computer Systems
, vol.4
, Issue.4
, pp. 405-436
-
-
Cabrera, L.1
Long, D.D.E.2
-
33
-
-
84926181224
-
Findings of the 2009 workshop on statistical machine translation
-
Athens, Greece, 136
-
Chris Callison-Burch, Philipp Koehn, Christof Monz, and Josh Schroeder. Findings of the 2009 workshop on statistical machine translation. In Proceedings of the Fourth Workshop on Statistical Machine Translation (StatMT '09), pages 1-28, Athens, Greece, 2009. 136
-
(2009)
Proceedings of the Fourth Workshop On Statistical Machine Translation (StatMT '09)
, pp. 1-28
-
-
Callison-Burch, C.1
Koehn, P.2
Monz, C.3
Schroeder, J.4
-
34
-
-
85071319367
-
Bigtable:A distributed storage system for structured data
-
Seattle,Washington, 24, 145
-
Fay Chang, Jeffrey Dean, Sanjay Ghemawat,WilsonC. Hsieh, Deborah A.Wallach, Michael Burrows,Tushar Chandra, Andrew Fikes, and RobertGruber. Bigtable:A distributed storage system for structured data. In Proceedings of the 7th Symposium on Operating System Design and Implementation (OSDI 2006), pages 205-218, Seattle,Washington, 2006. 24, 145
-
(2006)
Proceedings of the 7th Symposium On Operating System Design and Implementation (OSDI 2006)
, pp. 205-218
-
-
Chang, F.1
Dean, J.2
Ghemawat, S.3
Hsieh, W.C.4
Wallach, D.A.5
Burrows, M.6
Chandra, T.7
Fikes, A.8
Gruber, R.9
-
35
-
-
85024115120
-
An empirical study of smoothing techniques for language modeling
-
Santa Cruz, California, 5, 133
-
Stanley F. Chen and Joshua Goodman. An empirical study of smoothing techniques for language modeling. In Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL 1996), pages 310-318, Santa Cruz, California, 1996. DOI: 10.3115/981863.981904 5, 133
-
(1996)
Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL 1996)
, pp. 310-318
-
-
Chen, S.F.1
Goodman, J.2
-
36
-
-
35448944021
-
Map-Reduce-Merge: Simplified relational data processing on large clusters
-
146 Beijing, China
-
Hung chih Yang, Ali Dasdan, Ruey-Lung Hsiao, and D. Stott Parker. Map-Reduce-Merge: Simplified relational data processing on large clusters. In Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, pages 1029-1040, Beijing, China, 2007. DOI: 10.1145/1247480.1247602 146
-
(2007)
Proceedings of the 2007 ACM SIGMOD International Conference On Management of Data
, pp. 1029-1040
-
-
Yang, H.C.1
Dasdan, A.2
Hsiao, R.3
Parker, D.S.4
-
37
-
-
56049109090
-
Map-Reduce for machine learning on multicore
-
Vancouver, British Columbia, Canada, 141
-
Cheng-Tao Chu, Sang Kyun Kim,Yi - An Lin,YuanYuan Yu, Gary Bradski, Andrew Ng, and Kunle Olukotun. Map-Reduce for machine learning on multicore. In Advances in Neural Information Processing Systems 19 (NIPS 2006), pages 281-288,Vancouver, British Columbia, Canada, 2006. 141
-
(2006)
Advances in Neural Information Processing Systems 19 (NIPS 2006)
, pp. 281-288
-
-
Chu, C.1
Kim, S.K.2
Lin, Y.3
Yu, Y.Y.4
Bradski, G.5
Ng, A.6
Olukotun, K.7
-
38
-
-
85048736401
-
Church and patrick hanks
-
48
-
KennethW. Church and Patrick Hanks. Word association norms, mutual information, and lexicography. Computational Linguistics, 16(1):22-29, 1990. 48
-
(1990)
Word Association Norms, Mutual Information, and Lexicography. Computational Linguistics
, vol.16
, Issue.1
, pp. 22-29
-
-
Kenneth, W.1
-
39
-
-
67651111624
-
Graph twiddling in a MapReduce world
-
103
-
Jonathan Cohen. Graph twiddling in a MapReduce world. Computing in Science and Engineering, 11(4):29-41, 2009. DOI: 10.1109/MCSE.2009.120 103
-
(2009)
Computing in Science and Engineering
, vol.11
, Issue.4
, pp. 29-41
-
-
Cohen, J.1
-
40
-
-
77954889082
-
Benchmarking cloud serving systems with YCSB
-
Indianapolis, Indiana, 145
-
Brian F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan, and Russell Sears. Benchmarking cloud serving systems with YCSB. In Proceedings of the First ACMSymposium on Cloud Computing (ACM SOCC 2010), Indianapolis, Indiana, 2010. 145
-
(2010)
Proceedings of the First ACMSymposium On Cloud Computing (ACM SOCC 2010)
-
-
Cooper, B.F.1
Silberstein, A.2
Tam, E.3
Ramakrishnan, R.4
Sears, R.5
-
41
-
-
0004116989
-
-
MIT Press, Cambridge, Massachusetts, 88
-
Thomas H. Cormen, Charles E. Leiserson, and Ronald L. Rivest. Introduction to Algorithms. MIT Press, Cambridge, Massachusetts, 1990. 88
-
(1990)
Introduction to Algorithms.
-
-
Cormen, T.H.1
Leiserson, C.E.2
Rivest, R.L.3
-
42
-
-
62549107194
-
-
Addison-Wesley, Reading, Massachusetts, 83
-
W. Bruce Croft, Donald Meztler, andTrevor Strohman. Search Engines: Information Retrieval in Practice. Addison-Wesley, Reading, Massachusetts, 2009. 83
-
(2009)
Search Engines: Information Retrieval in Practice.
-
-
Croft, W.B.1
Meztler, D.2
Strohman, T.3
-
43
-
-
0009346826
-
LogP:Towards a realistic model of parallel computation
-
15
-
David Culler,RichardKarp,DavidPatterson, Abhijit Sahay,Klaus Erik Schauser,Eunice Santos, Ramesh Subramonian, and Thorsten von Eicken. LogP:Towards a realistic model of parallel computation. ACMSIGPLANNotices, 28(7):1-12, 1993.DOI: 10.1145/173284.155333 15
-
(1993)
ACMSIGPLANNotices
, vol.28
, Issue.7
, pp. 1-12
-
-
Culler, D.1
Karp, R.2
Patterson, D.3
Sahay, A.4
Schauser, K.E.5
Santos, E.6
Subramonian, R.7
Von Eicken, T.8
-
44
-
-
0002499328
-
A practical part-of-speech tagger
-
Trento, Italy, 114
-
Doug Cutting, Julian Kupiec, Jan Pedersen, and Penelope Sibun. A practical part-of-speech tagger. In Proceedings of the Third Conference on Applied Natural Language Processing, pages 133-140,Trento, Italy, 1992. DOI: 10.3115/974499.974523 114
-
(1992)
Proceedings of the Third Conference On Applied Natural Language Processing
, pp. 133-140
-
-
Cutting, D.1
Kupiec, J.2
Pedersen, J.3
Sibun, P.4
-
45
-
-
85030321143
-
MapReduce:Simplified data processing on large clusters
-
San Francisco, California, 1, 24, 25
-
Jeffrey Dean and Sanjay Ghemawat. MapReduce:Simplified data processing on large clusters. In Proceedings of the 6th Symposium on Operating System Design and Implementation (OSDI 2004), pages 137-150, San Francisco, California, 2004. 1, 24, 25
-
(2004)
Proceedings of the 6th Symposium On Operating System Design and Implementation (OSDI 2004)
, pp. 137-150
-
-
Dean, J.1
Ghemawat, S.2
-
46
-
-
37549003336
-
MapReduce: Simplified data processing on large clusters
-
2
-
Jeffrey Dean and Sanjay Ghemawat. MapReduce:Simplified data processing on large clusters. Communications of the ACM, 51(1):107-113, 2008. DOI: 10.1145/1327452.1327492 2
-
(2008)
Communications of the ACM
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
47
-
-
73649114265
-
MapReduce: A flexible data processing tool
-
59
-
Jeffrey Dean and Sanjay Ghemawat. MapReduce: A flexible data processing tool. Communications of the ACM, 53(1):72-77, 2010. DOI: 10.1145/1629175.1629198 59
-
(2010)
Communications of the ACM
, vol.53
, Issue.1
, pp. 72-77
-
-
Dean, J.1
Ghemawat, S.2
-
48
-
-
41149092147
-
Dynamo: Amazon's highly available key-value store
-
Stevenson,Washington, 145
-
Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swami Sivasubramanian, Peter Vosshall, and Werner Vogels. Dynamo: Amazon's highly available key-value store. In Proceedings of the 21st ACM Symposium on Operating Systems Principles (SOSP 2007), pages 205-220, Stevenson,Washington, 2007. 145
-
(2007)
Proceedings of the 21st ACM Symposium On Operating Systems Principles (SOSP 2007)
, pp. 205-220
-
-
DeCandia, G.1
Hastorun, D.2
Jampani, M.3
Kakulapati, G.4
Lakshman, A.5
Pilchin, A.6
Sivasubramanian, S.7
Vosshall, P.8
Vogels, W.9
-
49
-
-
0002629270
-
Maximum likelihood from incomplete data via theEMalgorithm
-
108
-
Arthur P. Dempster,Nan M.Laird, and Donald B.Rubin. Maximum likelihood from incomplete data via theEMalgorithm. Journal of the Royal Statistical Society. SeriesB(Methodological), 39(1):1-38, 1977. 108
-
(1977)
Journal of the Royal Statistical Society. SeriesB(Methodological)
, vol.39
, Issue.1
, pp. 1-38
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
50
-
-
0026870271
-
Parallel database systems: The future of high performance database systems
-
12
-
David J. DeWitt and Jim Gray. Parallel database systems: The future of high performance database systems. Communications of the ACM, 35(6):85-98, 1992. DOI: 10.1145/129888.129894 12
-
(1992)
Communications of the ACM
, vol.35
, Issue.6
, pp. 85-98
-
-
DeWitt, D.J.1
Gray, J.2
-
51
-
-
0021587237
-
Implementation techniques for main memory database systems
-
63
-
David J.DeWitt,Randy H.Katz,FrankOlken,LeonardD. Shapiro, Michael R. Stonebraker, and David Wood. Implementation techniques for main memory database systems. ACM SIGMOD Record, 14(2):1-8, 1984. DOI: 10.1145/971697.602261 63
-
(1984)
ACM SIGMOD Record
, vol.14
, Issue.2
, pp. 1-8
-
-
DeWitt, D.J.1
Katz, R.H.2
Olken, F.3
Shapiro, L.D.4
Stonebraker, M.R.5
Wood, D.6
-
52
-
-
78650226247
-
Multi-domain learning by confidence-weighted parameter combination
-
144
-
Mark Dredze, Alex Kulesza, and Koby Crammer. Multi-domain learning by confidence-weighted parameter combination. Machine Learning, 79:123-149, 2010. DOI: 10.1007/s10994-009-5148-0 144
-
(2010)
Machine Learning
, vol.79
, pp. 123-149
-
-
Dredze, M.1
Kulesza, A.2
Crammer, K.3
-
53
-
-
0036989598
-
Web question answering: Is more always better?
-
4 Tampere, Finland
-
Susan Dumais, Michele Banko, Eric Brill, Jimmy Lin, and Andrew Ng. Web question answering: Is more always better? In Proceedings of the 25th Annual International ACMSIGIR Conference on Research and Development in Information Retrieval (SIGIR 2002), pages 291-298,Tampere, Finland, 2002. DOI: 10.1145/564376.564428 4
-
(2002)
Proceedings of the 25th Annual International ACMSIGIR Conference On Research and Development in Information Retrieval (SIGIR 2002)
, pp. 291-298
-
-
Dumais, S.1
Banko, M.2
Brill, E.3
Lin, J.4
Ng, A.5
-
54
-
-
84957855798
-
Fast, easy, and cheap: Construction of statistical machine translation models with MapReduce
-
Columbus, Ohio, 47, 135, 136
-
Chris Dyer, Aaron Cordova, Alex Mont, and Jimmy Lin. Fast, easy, and cheap: Construction of statistical machine translation models with MapReduce. In Proceedings of theThirdWorkshop on Statistical Machine Translation at ACL 2008, pages 199-207, Columbus, Ohio, 2008. 47, 135, 136
-
(2008)
Proceedings of TheThirdWorkshop On Statistical Machine Translation at ACL 2008
, pp. 199-207
-
-
Dyer, C.1
Cordova, A.2
Mont, A.3
Lin, J.4
-
56
-
-
85044980262
-
Training phrase-based machine translation models on the cloud: Open source machine translation toolkit Chaski
-
138
-
Qin Gao and Stephan Vogel. Training phrase-based machine translation models on the cloud: Open source machine translation toolkit Chaski. The Prague Bulletin of Mathematical Linguistics, 93:37-46, 2010. DOI: 10.2478/v10108-010-0004-8 138
-
(2010)
The Prague Bulletin of Mathematical Linguistics
, vol.93
, pp. 37-46
-
-
Gao, Q.1
Vogel, S.2
-
57
-
-
21644437974
-
The google file system
-
Bolton Landing, New York, 29
-
Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. The Google File System. In Proceedings of the 19th ACM Symposium on Operating Systems Principles (SOSP 2003), pages 29-43, Bolton Landing, New York, 2003. DOI: 10.1145/945445.945450 29
-
(2003)
Proceedings of the 19th ACM Symposium On Operating Systems Principles (SOSP 2003)
, pp. 29-43
-
-
Ghemawat, S.1
Gobioff, H.2
Leung, S.3
-
58
-
-
1542640153
-
Brewer's Conjecture and the feasibility of consistent, available, partition-tolerant web services
-
32
-
Seth Gilbert and Nancy Lynch. Brewer's Conjecture and the feasibility of consistent, available, partition-tolerant web services. ACM SIGACT News, 33(2):51-59, 2002. DOI: 10.1145/564585.564601 32
-
(2002)
ACM SIGACT News
, vol.33
, Issue.2
, pp. 51-59
-
-
Gilbert, S.1
Lynch, N.2
-
59
-
-
0037062448
-
Community structure in social and biological networks
-
85
-
Michelle Girvan and Mark E. J. Newman. Community structure in social and biological networks. Proceedings of the National Academy of Science, 99(12):7821-7826, 2002. DOI: 10.1073/pnas.122653799 85
-
(2002)
Proceedings of the National Academy of Science
, vol.99
, Issue.12
, pp. 7821-7826
-
-
Girvan, M.1
Newman, M.E.J.2
-
60
-
-
0003901150
-
-
Addison-Wesley, Reading, Massachusetts, 14, 103
-
Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar. Introduction to Parallel Computing. Addison-Wesley, Reading, Massachusetts, 2003. 14, 103
-
(2003)
Introduction to Parallel Computing.
-
-
Grama, A.1
Gupta, A.2
Karypis, G.3
Kumar, V.4
-
61
-
-
34247960076
-
The strength of weak ties
-
92
-
Mark S.Granovetter.The strength of weak ties. The American Journal of Sociology, 78(6):1360-1380, 1973. DOI: 10.1086/225469 92
-
(1973)
The American Journal of Sociology
, vol.78
, Issue.6
, pp. 1360-1380
-
-
Granovetter, M.S.1
-
62
-
-
0000917844
-
The strength of weak ties: A network theory revisited
-
92
-
Mark S. Granovetter. The strength of weak ties: A network theory revisited. Sociological Theory, 1:201-233, 1983. DOI: 10.2307/202051 92
-
(1983)
Sociological Theory
, vol.1
, pp. 201-233
-
-
Granovetter, M.S.1
-
64
-
-
0003405967
-
-
Cambridge University Press, Cambridge, England, 86
-
Per Hage and Frank Harary. Island Networks: Communication, Kinship, and Classification Structures in Oceania. Cambridge University Press, Cambridge, England, 1996. 86
-
(1996)
Island Networks: Communication, Kinship, and Classification Structures in Oceania.
-
-
Hage, P.1
Harary, F.2
-
65
-
-
70849126253
-
The unreasonable effectiveness of data
-
5
-
Alon Halevy, Peter Norvig, and Fernando Pereira. The unreasonable effectiveness of data. Communications of the ACM, 24(2):8-12, 2009. DOI: 10.1109/MIS.2009.36 5
-
(2009)
Communications of the ACM
, vol.24
, Issue.2
, pp. 8-12
-
-
Halevy, A.1
Norvig, P.2
Pereira, F.3
-
67
-
-
84858671115
-
Cooperative Expendable Micro-Slice Servers (CEMS): Low cost, low power servers for Internet-scale services
-
Asilomar, California, 9, 10
-
James Hamilton. Cooperative Expendable Micro-Slice Servers (CEMS): Low cost, low power servers for Internet-scale services. In Proceedings of the Fourth Biennial Conference on Innovative Data Systems Research (CIDR 2009), Asilomar, California, 2009. 9, 10
-
(2009)
Proceedings of the Fourth Biennial Conference On Innovative Data Systems Research (CIDR 2009)
-
-
Hamilton, J.1
-
68
-
-
85006719793
-
Information platforms and the rise of the data scientist
-
InToby Segaran and JeffHammerbacher, editors, O'Reilly,Sebastopol, California, 6, 59, 146
-
Jeff Hammerbacher. Information platforms and the rise of the data scientist. InToby Segaran and JeffHammerbacher, editors, Beautiful Data,pages 73-84.O'Reilly,Sebastopol, California, 2009. 6, 59, 146
-
(2009)
Beautiful Data
, pp. 73-84
-
-
Hammerbacher, J.1
-
71
-
-
63549097654
-
Mars: A MapReduce framework on graphics processors
-
Toronto, Ontario, Canada, 20
-
Bingsheng He,Wenbin Fang, Qiong Luo, Naga K. Govindaraju, and TuyongWang. Mars: A MapReduce framework on graphics processors. In Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques (PACT 2008), pages 260-269, Toronto, Ontario, Canada, 2008. DOI: 10.1145/1454115.1454152 20
-
(2008)
Proceedings of the 17th International Conference On Parallel Architectures and Compilation Techniques (PACT 2008)
, pp. 260-269
-
-
He, B.1
Fang, W.2
Luo, Q.3
Govindaraju, N.K.4
Wang, T.5
-
72
-
-
77953207828
-
-
Microsoft Research, Redmond,Washington, 3
-
Tony Hey, Stewart Tansley, and Kristin Tolle. The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, Redmond,Washington, 2009. 3
-
(2009)
The Fourth Paradigm: Data-Intensive Scientific Discovery.
-
-
Hey, T.1
Tansley, S.2
Tolle, K.3
-
73
-
-
77954202405
-
Jim Gray on eScience:A transformed scientific method
-
In Tony Hey, Stewart Tansley, and Kristin Tolle, editors, Microsoft Research, Redmond,Washington, 3
-
Tony Hey, StewartTansley, and KristinTolle. Jim Gray on eScience:A transformed scientific method. In Tony Hey, Stewart Tansley, and Kristin Tolle, editors, The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, Redmond,Washington, 2009. 3
-
(2009)
The Fourth Paradigm: Data-Intensive Scientific Discovery.
-
-
Hey, T.1
Tansley, S.2
Tolle, K.3
-
74
-
-
0023964787
-
Scale and performance in a distributed file system
-
29
-
John Howard, Michael Kazar, Sherri Menees, David Nichols, Mahadev Satyanarayanan, Robert Sidebotham, and MichaelWest. Scale and performance in a distributed file system. ACMTransactions on Computer Systems, 6(1):51-81, 1988. DOI: 10.1145/35037.35059 29
-
(1988)
ACMTransactions On Computer Systems
, vol.6
, Issue.1
, pp. 51-81
-
-
Howard, J.1
Kazar, M.2
Menees, S.3
Nichols, D.4
Satyanarayanan, M.5
Sidebotham, R.6
West, M.7
-
75
-
-
34548041192
-
Dryad: Distributed data-parallel programs from sequential building blocks
-
Lisbon, Portugal, 145
-
Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, and Dennis Fetterly. Dryad: Distributed data-parallel programs from sequential building blocks. In Proceedings of the ACM SIGOPS/EuroSys European Conference on Computer Systems 2007 (EuroSys 2007), pages 59-72, Lisbon, Portugal, 2007. DOI: 10.1145/1272998.1273005 145
-
(2007)
Proceedings of the ACM SIGOPS/EuroSys European Conference On Computer Systems 2007 (EuroSys 2007)
, pp. 59-72
-
-
Isard, M.1
Budiu, M.2
Yu, Y.3
Birrell, A.4
Fetterly, D.5
-
76
-
-
78651533629
-
The pathologies of big data
-
11
-
Adam Jacobs. The pathologies of big data. ACMQueue, 7(6), 2009. DOI: 10.1145/1563821.1563874 11
-
(2009)
ACMQueue
, vol.7
, Issue.6
-
-
Jacobs, A.1
-
78
-
-
0003786003
-
-
MIT Press, Cambridge, Massachusetts, 112, 114, 120
-
Frederick Jelinek. Statistical methods for speech recognition. MIT Press, Cambridge, Massachusetts, 1997. 112, 114, 120
-
(1997)
Statistical Methods for Speech Recognition.
-
-
Jelinek, F.1
-
80
-
-
77949807306
-
HADI: Fast diameter estimation and mining in massive graphs with Hadoop
-
Carnegie Mellon University, 103
-
U Kang, CharalamposTsourakakis, Ana Paula Appel, Christos Faloutsos, and Jure Leskovec. HADI: Fast diameter estimation and mining in massive graphs with Hadoop. Technical Report CMU-ML-08-117, School of Computer Science, Carnegie Mellon University, 2008. 103
-
(2008)
Technical Report CMU-ML-08-117, School of Computer Science
-
-
Kang, U.1
Tsourakakis, C.2
Appel, A.P.3
Faloutsos, C.4
Leskovec, J.5
-
81
-
-
77951152705
-
PEGASUS: A peta-scale graph mining system - Implementation and observations
-
Miami, Floria, 103
-
U Kang, Charalampos E. Tsourakakis, and Christos Faloutsos. PEGASUS: A peta-scale graph mining system - implementation and observations. In Proceedings of the 2009 Ninth IEEE International Conference on Data Mining (ICDM 2009), pages 229-238, Miami, Floria, 2009. DOI: 10.1109/ICDM.2009.14 103
-
(2009)
Proceedings of the 2009 Ninth IEEE International Conference On Data Mining (ICDM 2009)
, pp. 229-238
-
-
Kang, U.1
Tsourakakis, C.E.2
Faloutsos, C.3
-
82
-
-
77951678492
-
A model of computation for Map-Reduce
-
Austin,Texas, 15
-
Howard Karloff, Siddharth Suri, and Sergei Vassilvitskii. A model of computation for Map-Reduce. In Proceedings of the 21st Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2010), Austin,Texas, 2010. 15
-
(2010)
Proceedings of the 21st Annual ACM-SIAM Symposium On Discrete Algorithms (SODA 2010)
-
-
Karloff, H.1
Suri, S.2
Vassilvitskii, S.3
-
83
-
-
57449108906
-
Cluster computing for Web-scale data processing
-
Portland, Oregon, 71
-
Aaron Kimball, Sierra Michels-Slettvet, and Christophe Bisciglia. Cluster computing for Web-scale data processing. In Proceedings of the 39th ACM Technical Symposium on Computer Science Education (SIGCSE 2008), pages 116-120, Portland, Oregon, 2008. DOI: 10.1145/1352135.1352177 71
-
(2008)
Proceedings of the 39th ACM Technical Symposium On Computer Science Education (SIGCSE 2008)
, pp. 116-120
-
-
Kimball, A.1
Michels-Slettvet, S.2
Bisciglia, C.3
-
84
-
-
4243148480
-
Authoritative sources in a hyperlinked environment
-
65, 95
-
Jon M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604-632, 1999. DOI: 10.1145/324133.324140 65, 95
-
(1999)
Journal of the ACM
, vol.46
, Issue.5
, pp. 604-632
-
-
Kleinberg, J.M.1
-
85
-
-
84928706421
-
-
Cambridge University Press, Cambridge, England, 130, 133
-
Philipp Koehn. Statistical Machine Translation. Cambridge University Press, Cambridge, England, 2010. 130, 133
-
(2010)
Statistical Machine Translation.
-
-
Koehn, P.1
-
86
-
-
85118138826
-
Statistical phrase-based translation
-
Edmonton, Alberta, Canada, 131
-
Philipp Koehn, Franz J. Och, and Daniel Marcu. Statistical phrase-based translation. In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT/NAACL 2003), pages 48-54, Edmonton, Alberta, Canada, 2003. DOI: 10.3115/1073445.1073462 131
-
(2003)
Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT/NAACL 2003)
, pp. 48-54
-
-
Koehn, P.1
Och, F.J.2
Marcu, D.3
-
87
-
-
0142192295
-
Conditional random fields:Probabilistic models for segmenting and labeling sequence data
-
San Francisco, California, 138
-
JohnD.Lafferty,AndrewMcCallum,andFernandoPereira. Conditional random fields:Probabilistic models for segmenting and labeling sequence data. In Proceedings of the Eighteenth International Conference on Machine Learning (ICML '01), pages 282-289, San Francisco, California, 2001. 138
-
(2001)
Proceedings of the Eighteenth International Conference On Machine Learning (ICML '01)
, pp. 282-289
-
-
Lafferty, J.D.1
McCallum, A.2
Pereira, F.3
-
88
-
-
0001313149
-
SALSA: The stochastic approach for link-structure analysis
-
65, 95
-
Ronny Lempel and Shlomo Moran. SALSA: The Stochastic Approach for Link-Structure Analysis. ACM Transactions on Information Systems, 19(2):131-160, 2001. DOI: 10.1145/382979.383041 65, 95
-
(2001)
ACM Transactions On Information Systems
, vol.19
, Issue.2
, pp. 131-160
-
-
Lempel, R.1
Moran, S.2
-
89
-
-
80053248035
-
Stream-based translation models for statistical machine translation
-
Los Angeles, California, 145
-
Abby Levenberg, Chris Callison-Burch, and Miles Osborne. Stream-based translation models for statistical machine translation. In Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2010), Los Angeles, California, 2010. 145
-
(2010)
Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2010)
-
-
Levenberg, A.1
Callison-Burch, C.2
Osborne, M.3
-
91
-
-
78651593686
-
Triple-parity RAID and beyond
-
2
-
Adam Leventhal. Triple-parity RAID and beyond. ACM Queue, 7(11), 2009. DOI: 10.1145/1661785.1670144 2
-
(2009)
ACM Queue
, vol.7
, Issue.11
-
-
Leventhal, A.1
-
92
-
-
34247846152
-
An exploration of the principles underlying redundancy-based factoid question answering
-
4
-
Jimmy Lin. An exploration of the principles underlying redundancy-based factoid question answering. ACM Transactions on Information Systems, 27(2):1-55, 2007. DOI: 10.1145/1229179.1229180 4
-
(2007)
ACM Transactions On Information Systems
, vol.27
, Issue.2
, pp. 1-55
-
-
Lin, J.1
-
94
-
-
70849110766
-
Scalable language processing algorithms for the masses:Acase study in computing word co-occurrence matrices with MapReduce
-
Honolulu, Hawaii, 47, 51
-
Jimmy Lin. Scalable language processing algorithms for the masses:Acase study in computing word co-occurrence matrices with MapReduce. In Proceedings of the 2008 Conference on Empirical Methods inNatural Language Processing (EMNLP 2008), pages 419-428, Honolulu, Hawaii, 2008. 47, 51
-
(2008)
Proceedings of the 2008 Conference On Empirical Methods InNatural Language Processing (EMNLP 2008)
, pp. 419-428
-
-
Lin, J.1
-
95
-
-
77954614330
-
Low-latency, highthroughput access to static global resources within the Hadoop framework
-
University of Maryland, College Park, Maryland, January 63
-
Jimmy Lin, Anand Bahety, Shravya Konda, and Samantha Mahindrakar. Low-latency, highthroughput access to static global resources within the Hadoop framework. Technical Report HCIL-2009-01, University of Maryland, College Park, Maryland, January 2009. 63
-
(2009)
Technical Report HCIL-2009-01
-
-
Lin, J.1
Bahety, A.2
Konda, S.3
Mahindrakar, S.4
-
96
-
-
33646887390
-
On the limited memory BFGS method for large scale optimization
-
139
-
Dong C. Liu, Jorge Nocedal, Dong C. Liu, and Jorge Nocedal. On the limited memory BFGS method for large scale optimization. Mathematical Programming B, 45(3):503-528, 1989. DOI: 10.1007/BF01589116 139
-
(1989)
Mathematical Programming B
, vol.45
, Issue.3
, pp. 503-528
-
-
Liu, D.C.1
Nocedal, J.2
Liu, D.C.3
Nocedal, J.4
-
97
-
-
49449119085
-
Statistical machine translation
-
130
-
Adam Lopez. Statistical machine translation. ACM Computing Surveys, 40(3):1-49, 2008. DOI: 10.1145/1380584.1380586 130
-
(2008)
ACM Computing Surveys
, vol.40
, Issue.3
, pp. 1-49
-
-
Lopez, A.1
-
98
-
-
70449672854
-
Pregel: A system for large-scale graph processing
-
Calgary, Alberta, Canada, 86, 145
-
Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski. Pregel: A system for large-scale graph processing. In Proceedings of the 28th ACMSymposium on Principles of Distributed Computing (PODC 2009), page 6, Calgary, Alberta, Canada, 2009. DOI: 10.1145/1583991.1584010 86, 145
-
(2009)
Proceedings of the 28th ACMSymposium On Principles of Distributed Computing (PODC 2009)
, pp. 6
-
-
Malewicz, G.1
Austern, M.H.2
Bik, A.J.C.3
Dehnert, J.C.4
Horn, I.5
Leiser, N.6
Czajkowski, G.7
-
99
-
-
77954723629
-
Pregel: A system for large-scale graph processing
-
86, 145 Indianapolis, Indiana
-
Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski. Pregel: A system for large-scale graph processing. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, Indianapolis, Indiana, 2010. DOI: 10.1145/1582716.1582723 86, 145
-
(2010)
Proceedings of the 2010 ACM SIGMOD International Conference On Management of Data
-
-
Malewicz, G.1
Austern, M.H.2
Bik, A.J.C.3
Dehnert, J.C.4
Horn, I.5
Leiser, N.6
Czajkowski, G.7
-
101
-
-
34548080780
-
-
Cambridge University Press, Cambridge, England, 42, 83
-
Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. An Introduction to Information Retrieval. Cambridge University Press, Cambridge, England, 2008. 42, 83
-
(2008)
An Introduction to Information Retrieval.
-
-
Manning, C.D.1
Raghavan, P.2
Schütze, H.3
-
102
-
-
0003612818
-
-
MIT Press, Cambridge, Massachusetts, 5, 133
-
Christopher D. Manning and Hinrich Schütze. Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, Massachusetts, 1999. 5, 133
-
(1999)
Foundations of Statistical Natural Language Processing.
-
-
Manning, C.D.1
Schütze, H.2
-
103
-
-
39649117755
-
The impact of next-generation sequencing technology on genetics
-
2
-
Elaine R. Mardis. The impact of next-generation sequencing technology on genetics. Trends in Genetics, 24(3):133-141, 2008. DOI: 10.1016/j.tig.2007.12.007 2
-
(2008)
Trends in Genetics
, vol.24
, Issue.3
, pp. 133-141
-
-
Mardis, E.R.1
-
104
-
-
62349092536
-
Scalable programming models for massively multicore processors
-
13, 145
-
Michael D. McCool. Scalable programming models for massively multicore processors. Proceedings of the IEEE, 96(5):816-831, 2008. DOI: 10.1109/JPROC.2008.917731 13, 145
-
(2008)
Proceedings of the IEEE
, vol.96
, Issue.5
, pp. 816-831
-
-
McCool, M.D.1
-
105
-
-
77954914834
-
GFS: Evolution on fast-forward
-
32
-
Marshall K. McKusick and Sean Quinlan. GFS: Evolution on fast-forward. ACM Queue, 7(7), 2009. DOI: 10.1145/1594204.1594206 32
-
(2009)
ACM Queue
, vol.7
, Issue.7
-
-
McKusick, M.K.1
Quinlan, S.2
-
106
-
-
1542601822
-
Improving memory hierarchy performance for irregular applications using data and computation reorderings
-
101
-
John Mellor-Crummey, David Whalley, and Ken Kennedy. Improving memory hierarchy performance for irregular applications using data and computation reorderings. International Journal of Parallel Programming, 29(3):217-247, 2001.DOI: 10.1023/A:1011119519789 101
-
(2001)
International Journal of Parallel Programming
, vol.29
, Issue.3
, pp. 217-247
-
-
Mellor-Crummey, J.1
Whalley, D.2
Kennedy, K.3
-
107
-
-
72449204684
-
Building enriched document representations using aggregated anchor text
-
68, 88
-
Donald Metzler, Jasmine Novak, Hang Cui, and Srihari Reddy. Building enriched document representations using aggregated anchor text. In Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2009), pages 219-226, 2009. DOI: 10.1145/1571941.1571981 68, 88
-
(2009)
Proceedings of the 32nd Annual International ACM SIGIR Conference On Research and Development in Information Retrieval (SIGIR 2009)
, pp. 219-226
-
-
Metzler, D.1
Novak, J.2
Cui, H.3
Reddy, S.4
-
108
-
-
85009259903
-
A hidden Markov model information retrieval system
-
Berkeley, California, 114
-
David R. H. Miller,Tim Leek, and RichardM. Schwartz. A hidden Markov model information retrieval system. In Proceedings of the 22nd Annual International ACMSIGIR Conference on Research and Development in Information Retrieval (SIGIR 1999), pages 214-221, Berkeley, California, 1999. DOI: 10.1145/312624.312680 114
-
(1999)
Proceedings of the 22nd Annual International ACMSIGIR Conference On Research and Development in Information Retrieval (SIGIR 1999)
, pp. 214-221
-
-
Miller, D.R.H.1
Leek, T.2
Schwartz, R.M.3
-
109
-
-
33750351831
-
Load balancing for term-distributed parallel retrieval
-
Seattle, Washington, 82
-
Alistair Moffat, William Webber, and Justin Zobel. Load balancing for term-distributed parallel retrieval. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2006), pages 348-355, Seattle, Washington, 2006. DOI: 10.1145/1148170.1148232 82
-
(2006)
Proceedings of the 29th Annual International ACM SIGIR Conference On Research and Development in Information Retrieval (SIGIR 2006)
, pp. 348-355
-
-
Moffat, A.1
Webber, W.2
Zobel, J.3
-
110
-
-
0007771055
-
Using maximum entropy for text classification
-
Stockholm, Sweden, 138
-
Kamal Nigam, John Lafferty, and Andrew McCallum. Using maximum entropy for text classification. In Proceedings of the IJCAI-99 Workshop on Machine Learning for Information Filtering, pages 61-67, Stockholm, Sweden, 1999. 138
-
(1999)
Proceedings of the IJCAI-99 Workshop On Machine Learning for Information Filtering
, pp. 61-67
-
-
Nigam, K.1
Lafferty, J.2
McCallum, A.3
-
111
-
-
70349750047
-
The Eucalyptus open-source cloud-computing system
-
Washington, D.C., 7
-
Daniel Nurmi, Rich Wolski, Chris Grzegorczyk, Graziano Obertelli, Sunil Soman, Lamia Youseff, and Dmitrii Zagorodnov. The Eucalyptus open-source cloud-computing system. In Proceedings of the 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, pages 124-131,Washington, D.C., 2009. DOI: 10.1109/CCGRID.2009.93 7
-
(2009)
Proceedings of the 9th IEEE/ACM International Symposium On Cluster Computing and the Grid
, pp. 124-131
-
-
Nurmi, D.1
Wolski, R.2
Grzegorczyk, C.3
Obertelli, G.4
Soman, S.5
Youseff, L.6
Zagorodnov, D.7
-
112
-
-
0042879653
-
A systematic comparison of various statistical alignment models
-
135
-
Franz J. Och and Hermann Ney. A systematic comparison of various statistical alignment models. Computational Linguistics, 29(1):19-51, 2003.DOI: 10.1162/089120103321337421 135
-
(2003)
Computational Linguistics
, vol.29
, Issue.1
, pp. 19-51
-
-
Och, F.J.1
Ney, H.2
-
114
-
-
55349148888
-
Pig Latin: A not-so-foreign language for data processing
-
Vancouver, British Columbia, Canada, 59, 146
-
ChristopherOlston, Benjamin Reed, Utkarsh Srivastava,Ravi Kumar, and AndrewTomkins. Pig Latin: A not-so-foreign language for data processing. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pages 1099-1110, Vancouver, British Columbia, Canada, 2008. DOI: 10.1145/1376616.1376726 59, 146
-
(2008)
Proceedings of the 2008 ACM SIGMOD International Conference On Management of Data
, pp. 1099-1110
-
-
Olston, C.1
Reed, B.2
Srivastava, U.3
Kumar, R.4
Tomkins, A.5
-
115
-
-
38549121575
-
The future of microprocessors
-
13
-
KunleOlukotun and Lance Hammond. The future of microprocessors. ACMQueue, 3(7):27-34, 2005. DOI: 10.1145/1095408.1095418 13
-
(2005)
ACMQueue
, vol.3
, Issue.7
, pp. 27-34
-
-
Olukotun, K.1
Hammond, L.2
-
116
-
-
79960530372
-
-
Manning Publications Co., Greenwich, Connecticut, 141
-
Sean Owen and Robin Anil. Mahout in Action. Manning Publications Co., Greenwich, Connecticut, 2010. 141
-
(2010)
Mahout in Action.
-
-
Owen, S.1
Anil, R.2
-
117
-
-
0003780986
-
The PageRank citation ranking: Bringing order to the Web
-
Stanford University, 65, 95, 100
-
Lawrence Page, Sergey Brin, Rajeev Motwani, and TerryWinograd. The PageRank citation ranking: Bringing order to the Web. Stanford Digital Library Working Paper SIDL-WP-1999-0120, Stanford University, 1999. 65, 95, 100
-
(1999)
Stanford Digital Library Working Paper SIDL-WP-1999-0120
-
-
Page, L.1
Brin, S.2
Motwani, R.3
Winograd, T.4
-
119
-
-
37549031376
-
The data center is the computer
-
14
-
David A. Patterson. The data center is the computer. Communications of the ACM, 52(1):105, 2008. 14
-
(2008)
Communications of the ACM
, vol.52
, Issue.1
, pp. 105
-
-
Patterson, D.A.1
-
120
-
-
70350512695
-
A comparison of approaches to large-scale data analysis
-
Providence, Rhode Island, 59
-
Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel J. Abadi, David J. DeWitt, Samuel Madden, and Michael Stonebraker. A comparison of approaches to large-scale data analysis. In Proceedings of the 35th ACM SIGMOD International Conference on Management of Data, pages 165-178, Providence, Rhode Island, 2009. DOI: 10.1145/1559845.1559865 59
-
(2009)
Proceedings of the 35th ACM SIGMOD International Conference On Management of Data
, pp. 165-178
-
-
Pavlo, A.1
Paulson, E.2
Rasin, A.3
Abadi, D.J.4
DeWitt, D.J.5
Madden, S.6
Stonebraker, M.7
-
121
-
-
80053272732
-
Streaming first story detection with application to Twitter
-
Los Angeles, California, 145
-
Sasa Petrovic, Miles Osborne, and Victor Lavrenko. Streaming first story detection with application to Twitter. In Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2010), Los Angeles, California, 2010. 145
-
(2010)
Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2010)
-
-
Petrovic, S.1
Osborne, M.2
Lavrenko, V.3
-
122
-
-
30344452311
-
Interpreting the data:Parallel analysis with Sawzall
-
146
-
Rob Pike,Sean Dorward,RobertGriesemer,and SeanQuinlan. Interpreting the data:Parallel analysis with Sawzall. Scientific Programming Journal, 13(4):277-298, 2005. 146
-
(2005)
Scientific Programming Journal
, vol.13
, Issue.4
, pp. 277-298
-
-
Pike, R.1
Dorward, S.2
Griesemer, R.3
Quinlan, S.4
-
123
-
-
84947200665
-
Failure trends in a large disk drive population
-
San Jose, California, 10, 26
-
Eduardo Pinheiro,Wolf-DietrichWeber, and Luiz André Barroso. Failure trends in a large disk drive population. In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST 2007), San Jose, California, 2008. 10, 26
-
(2008)
Proceedings of the 5th USENIX Conference On File and Storage Technologies (FAST 2007)
-
-
Pinheiro, E.1
Weber, W.2
Barroso, L.A.3
-
124
-
-
61949425675
-
Davison.Web page classification: Features and algorithms
-
82
-
Xiaoguang Qi and BrianD. Davison.Web page classification: Features and algorithms. ACM Computing Surveys, 41(2), 2009. DOI: 10.1145/1459352.1459357 82
-
(2009)
ACM Computing Surveys
, vol.41
, Issue.2
-
-
Qi, X.1
Brian, D.2
-
125
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
Morgan Kaufmann Publishers, San Francisco, California, 115, 120
-
Lawrence R. Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. In Readings in Speech Recognition, pages 267-296. Morgan Kaufmann Publishers, San Francisco, California, 1990. DOI: 10.1109/5.18626 115, 120
-
(1990)
Readings in Speech Recognition
, pp. 267-296
-
-
Rabiner, L.R.1
-
126
-
-
70450168340
-
Supporting MapReduce on large-scale asymmetric multi-core clusters
-
20
-
M.Mustafa Rafique, Benjamin Rose, Ali R. Butt, and Dimitrios S.Nikolopoulos. Supporting MapReduce on large-scale asymmetric multi-core clusters. ACM Operating Systems Review, 43(2):25-34, 2009. DOI: 10.1145/1531793.1531800 20
-
(2009)
ACM Operating Systems Review
, vol.43
, Issue.2
, pp. 25-34
-
-
Rafique, M.M.1
Rose, B.2
Butt, A.R.3
Nikolopoulos, D.S.4
-
127
-
-
34547679939
-
Evaluating MapReduce for multi-core and multiprocessor systems
-
Phoenix, Arizona, 20
-
Colby Ranger, Ramanan Raghuraman, Arun Penmetsa, Gary Bradski, and Christos Kozyrakis. Evaluating MapReduce for multi-core and multiprocessor systems. In Proceedings of the 13th International Symposium onHigh-Performance Computer Architecture (HPCA 2007), pages 205-218, Phoenix, Arizona, 2007. DOI: 10.1109/HPCA.2007.346181 20
-
(2007)
Proceedings of the 13th International Symposium OnHigh-Performance Computer Architecture (HPCA 2007)
, pp. 205-218
-
-
Ranger, C.1
Raghuraman, R.2
Penmetsa, A.3
Bradski, G.4
Kozyrakis, C.5
-
129
-
-
1842528329
-
The utility business model and the future of computing services
-
6
-
Michael A. Rappa. The utility business model and the future of computing services. IBM Systems Journal, 34(1):32-42, 2004. DOI: 10.1147/sj.431.0032 6
-
(2004)
IBM Systems Journal
, vol.34
, Issue.1
, pp. 32-42
-
-
Rappa, M.A.1
-
134
-
-
84976736061
-
A performance evaluation of four parallel join algorithms in a shared-nothing multiprocessor environment
-
Portland, Oregon, 60
-
Donovan A. Schneider and David J. DeWitt. A performance evaluation of four parallel join algorithms in a shared-nothing multiprocessor environment. In Proceedings of the 1989 ACM SIGMOD International Conference on Management of Data, pages 110-121,Portland, Oregon, 1989. DOI: 10.1145/67544.66937 60
-
(1989)
Proceedings of the 1989 ACM SIGMOD International Conference On Management of Data
, pp. 110-121
-
-
Schneider, D.A.1
DeWitt, D.J.2
-
135
-
-
70449657893
-
DRAM errors in the wild: A large-scale field study
-
Seattle, Washington, 10, 26
-
Bianca Schroeder, Eduardo Pinheiro, andWolf-DietrichWeber. DRAM errors in the wild: A large-scale field study. In Proceedings of the Eleventh International Joint Conference on Measurement and Modeling of Computer Systems (SIGMETRICS '09), pages 193-204, Seattle, Washington, 2009. DOI: 10.1145/1555349.1555372 10, 26
-
(2009)
Proceedings of the Eleventh International Joint Conference On Measurement and Modeling of Computer Systems (SIGMETRICS '09)
, pp. 193-204
-
-
Schroeder, B.1
Pinheiro, E.2
Weber, W.3
-
136
-
-
0347596961
-
Automatic word sense discrimination
-
48
-
Hinrich Schütze. Automatic word sense discrimination. Computational Linguistics, 24(1):97-123, 1998. 48
-
(1998)
Computational Linguistics
, vol.24
, Issue.1
, pp. 97-123
-
-
Schütze, H.1
-
137
-
-
0031139653
-
A cooccurrence-based thesaurus and two applications to information retrieval
-
48
-
Hinrich Schütze and Jan O. Pedersen. A cooccurrence-based thesaurus and two applications to information retrieval. Information Processing and Management, 33(3):307-318, 1998. DOI: 10.1016/S0306-4573(96)00068-4 48
-
(1998)
Information Processing and Management
, vol.33
, Issue.3
, pp. 307-318
-
-
Schütze, H.1
Pedersen, J.O.2
-
138
-
-
85050328105
-
-
John Benjamins, Amsterdam, The Netherlands, 3
-
Satoshi Sekine and Elisabete Ranchhod. Named Entities: Recognition, Classification and Use. John Benjamins, Amsterdam, The Netherlands, 2009. 3
-
(2009)
Named Entities: Recognition, Classification and Use.
-
-
Sekine, S.1
Ranchhod, E.2
-
139
-
-
0037826642
-
Learning hidden Markov model structure for information extraction
-
Orlando, Florida, 114
-
Kristie Seymore, Andrew Mccallum, and Ronald Rosenfeld. Learning hidden Markov model structure for information extraction. In Proceedings of the AAAI-99 Workshop on Machine Learning for Information Extraction, pages 37-42, Orlando, Florida, 1999. 114
-
(1999)
Proceedings of the AAAI-99 Workshop On Machine Learning for Information Extraction
, pp. 37-42
-
-
Seymore, K.1
Mccallum, A.2
Rosenfeld, R.3
-
140
-
-
85043116988
-
Shallow parsing with conditional random fields
-
Edmonton, Alberta, Canada, 140
-
Fei Sha and Fernando Pereira. Shallow parsing with conditional random fields. In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT/NAACL 2003), pages 134-141, Edmonton, Alberta, Canada, 2003. DOI: 10.3115/1073445.1073473 140
-
(2003)
Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT/NAACL 2003)
, pp. 134-141
-
-
Sha, F.1
Pereira, F.2
-
141
-
-
84906925404
-
-
140
-
Noah Smith. Log-linear models. http://www.cs.cmu.edu/~nasmith/papers/smith. tut04.pdf, 2004. 140
-
(2004)
Log-linear Models
-
-
Smith, N.1
-
142
-
-
84872544168
-
Beyond the tsunami: Developing the infrastructure to deal with life sciences data
-
In Tony Hey, Stewart Tansley, and Kristin Tolle, editors, Microsoft Research, Redmond, Washington, 2
-
Christopher Southan and Graham Cameron. Beyond the tsunami: Developing the infrastructure to deal with life sciences data. In Tony Hey, Stewart Tansley, and Kristin Tolle, editors, The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, Redmond, Washington, 2009. 2
-
(2009)
The Fourth Paradigm: Data-Intensive Scientific Discovery
-
-
Southan, C.1
Cameron, G.2
-
143
-
-
2942527473
-
Gene prediction with a hidden Markov model and a new intron submodel
-
114 October
-
Mario Stanke and Stephan Waack. Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics, 19 Suppl 2:ii215-225, October 2003. DOI: 10.1093/bioinformatics/btg1080 114
-
(2003)
Bioinformatics
, vol.19
, pp. ii215-225
-
-
Stanke, M.1
Waack, S.2
-
144
-
-
73649141347
-
MapReduce and parallel DBMSs: Friends or foes?
-
59
-
Michael Stonebraker, Daniel Abadi, David J. DeWitt, Sam Madden, Erik Paulson, Andrew Pavlo, and Alexander Rasin. MapReduce and parallel DBMSs: Friends or foes? Communications of the ACM, 53(1):64-71, 2010. DOI: 10.1145/1629175.1629197 59
-
(2010)
Communications of the ACM
, vol.53
, Issue.1
, pp. 64-71
-
-
Stonebraker, M.1
Abadi, D.2
DeWitt, D.J.3
Madden, S.4
Paulson, E.5
Pavlo, A.6
Rasin, A.7
-
145
-
-
0003873676
-
Designing and mining multi-terabyte astronomy archives: The Sloan Digital Sky Survey
-
2
-
Alexander S. Szalay,Peter Z.Kunszt, AniThakar, Jim Gray,Don Slutz, and Robert J.Brunner. Designing and mining multi-terabyte astronomy archives: The Sloan Digital Sky Survey. SIGMOD Record, 29(2):451-462, 2000. DOI: 10.1145/335191.335439 2
-
(2000)
SIGMOD Record
, vol.29
, Issue.2
, pp. 451-462
-
-
Szalay, A.S.1
Kunszt, P.Z.2
Thakar, A.3
Gray, J.4
Slutz, D.5
Brunner, R.J.6
-
146
-
-
72049091603
-
-
Carnegie Mellon University, 29
-
Wittawat Tantisiriroj, Swapnil Patil, and Garth Gibson. Data-intensive file systems for Internet services:Arose by any other name⋯.Technical Report CMU-PDL-08-114,Parallel Data Laboratory, Carnegie Mellon University, 2008. 29
-
(2008)
Data-intensive File Systems for Internet Services:Arose by Any Other Name⋯.Technical Report CMU-PDL-08-114,Parallel Data Laboratory
-
-
Tantisiriroj, W.1
Patil, S.2
Gibson, G.3
-
147
-
-
0031540744
-
Frangipani: A scalable distributed file system
-
Saint-Malo, France, 29
-
Chandramohan A. Thekkath, Timothy Mann, and Edward K. Lee. Frangipani: A scalable distributed file system. In Proceedings of the 16th ACMSymposium on Operating Systems Principles (SOSP 1997), pages 224-237, Saint-Malo, France, 1997. DOI: 10.1145/268998.266694 29
-
(1997)
Proceedings of the 16th ACMSymposium On Operating Systems Principles (SOSP 1997)
, pp. 224-237
-
-
Thekkath, C.A.1
Mann, T.2
Lee, E.K.3
-
148
-
-
0025467711
-
A bridging model for parallel computation
-
13, 14, 15, 86, 145
-
Leslie G. Valiant. A bridging model for parallel computation. Communications of the ACM, 33(8):103-111, 1990. DOI: 10.1145/79173.79181 13, 14, 15, 86, 145
-
(1990)
Communications of the ACM
, vol.33
, Issue.8
, pp. 103-111
-
-
Valiant, L.G.1
-
149
-
-
68649100902
-
A break in the clouds: Towards a cloud definition
-
6
-
Luis M. Vaquero, Luis Rodero-Merino, Juan Caceres, and Maik Lindner. A break in the clouds: Towards a cloud definition. ACM SIGCOMM Computer Communication Review, 39(1):50-55, 2009. DOI: 10.1145/1496091.1496100 6
-
(2009)
ACM SIGCOMM Computer Communication Review
, vol.39
, Issue.1
, pp. 50-55
-
-
Vaquero, L.M.1
Rodero-Merino, L.2
Caceres, J.3
Lindner, M.4
-
150
-
-
0004339720
-
HMM-based word alignment in statistical translation
-
Copenhagen, Denmark, 114, 135
-
Stephan Vogel, Hermann Ney, and Christoph Tillmann. HMM-based word alignment in statistical translation. In Proceedings of the 16th International Conference on Computational Linguistics (COLING 1996), pages 836-841, Copenhagen, Denmark, 1996. DOI: 10.3115/993268.993313 114, 135
-
(1996)
Proceedings of the 16th International Conference On Computational Linguistics (COLING 1996)
, pp. 836-841
-
-
Vogel, S.1
Ney, H.2
Tillmann, C.3
-
151
-
-
70350637398
-
PLDA:Parallel latent Dirichlet allocation for large-scale applications
-
San Francisco, California, 141
-
YiWang,Hongjie Bai, Matt Stanton,Wen-Yen Chen, and EdwardY.Chang. PLDA:Parallel latent Dirichlet allocation for large-scale applications. In Proceedings of the Fifth International Conference on Algorithmic Aspects in Information and Management (AAIM 2009), pages 301-314, San Francisco, California, 2009. DOI: 10.1007/978-3-642-02158-9-26 141
-
(2009)
Proceedings of the Fifth International Conference On Algorithmic Aspects in Information and Management (AAIM 2009)
, pp. 301-314
-
-
Wang, Y.1
Bai, H.2
Stanton, M.3
Chen, W.4
Chang, E.Y.5
-
152
-
-
0032482432
-
Collective dynamics of 'small-world' networks
-
92
-
Duncan J. Watts and Steven H. Strogatz. Collective dynamics of 'small-world' networks. Nature, 393:440-442, 1998. DOI: 10.1038/30918 92
-
(1998)
Nature
, vol.393
, pp. 440-442
-
-
Watts, D.J.1
Strogatz, S.H.2
-
153
-
-
56749182827
-
FPGA-based prototype of a PRAM-On-Chip processor
-
Ischia, Italy, 14
-
Xingzhi Wen and Uzi Vishkin. FPGA-based prototype of a PRAM-On-Chip processor. In Proceedings of the 5th Conference on Computing Frontiers, pages 55-66, Ischia, Italy, 2008. DOI: 10.1145/1366230.1366240 14
-
(2008)
Proceedings of the 5th Conference On Computing Frontiers
, pp. 55-66
-
-
Wen, X.1
Vishkin, U.2
-
155
-
-
84980078034
-
The unreasonable effectiveness of mathematics in the natural sciences
-
5
-
Eugene Wigner. The unreasonable effectiveness of mathematics in the natural sciences. Communications in Pure and Applied Mathematics, 13(1):1-14, 1960. DOI: 10.1002/cpa.3160130102 5
-
(1960)
Communications in Pure and Applied Mathematics
, vol.13
, Issue.1
, pp. 1-14
-
-
Wigner, E.1
-
156
-
-
0003756969
-
-
Morgan Kaufmann Publishing, San Francisco, California, 69, 77, 78, 83
-
Ian H.Witten, Alistair Moffat, and Timothy C. Bell. Managing Gigabytes: Compressing and Indexing Documents and Images. Morgan Kaufmann Publishing, San Francisco, California, 1999. DOI: 10.1023/A:1011472308196 69, 77, 78, 83
-
(1999)
Managing Gigabytes: Compressing and Indexing Documents and Images
-
-
Witten, I.H.1
Moffat, A.2
Bell, T.C.3
-
157
-
-
0031599183
-
Corpus-based stemming using cooccurrence of word variants
-
48
-
Jinxi Xu andW. Bruce Croft. Corpus-based stemming using cooccurrence of word variants. ACMTransactions on Information Systems, 16(1):61-81, 1998.DOI: 10.1145/267954.267957 48
-
(1998)
ACMTransactions On Information Systems
, vol.16
, Issue.1
, pp. 61-81
-
-
Xu, J.1
Croft, W.B.2
-
158
-
-
16444383160
-
Survey of clustering algorithms
-
86
-
Rui Xu and DonaldWunsch II. Survey of clustering algorithms. IEEETransactions onNeural Networks, 16(3):645-678, 2005. DOI: 10.1109/TNN.2005.845141 86
-
(2005)
IEEETransactions OnNeural Networks
, vol.16
, Issue.3
, pp. 645-678
-
-
Xu, R.1
Wunsch, D.2
-
159
-
-
85076882757
-
DryadLINQ:A system for general-purpose distributed data-parallel computing using a high-level language
-
San Diego, California, 145
-
Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Budiu, Úlfar Erlingsson, Pradeep Kumar Gunda, and Jon Currey. DryadLINQ:A system for general-purpose distributed data-parallel computing using a high-level language. In Proceedings of the 8th Symposium on Operating System Design and Implementation (OSDI 2008), pages 1-14, San Diego, California, 2008. 145
-
(2008)
Proceedings of the 8th Symposium On Operating System Design and Implementation (OSDI 2008)
, pp. 1-14
-
-
Yu, Y.1
Isard, M.2
Fetterly, D.3
Budiu, M.4
Erlingsson, Ú.5
Gunda, P.K.6
Currey, J.7
-
160
-
-
77951466017
-
Job scheduling for multi-user MapReduce clusters
-
University of California at Berkeley, 25
-
Matei Zaharia, Dhruba Borthakur, Joydeep Sen Sarma, Khaled Elmeleegy, Scott Shenker, and Ion Stoica. Job scheduling for multi-user MapReduce clusters. Technical Report UCB/EECS-2009-55, Electrical Engineering and Computer Sciences, University of California at Berkeley, 2009. 25
-
(2009)
Technical Report UCB/EECS-2009-55, Electrical Engineering and Computer Sciences
-
-
Zaharia, M.1
Borthakur, D.2
Sarma, J.S.3
Elmeleegy, K.4
Shenker, S.5
Stoica, I.6
-
161
-
-
85076883048
-
Improving MapReduce performance in heterogeneous environments
-
San Diego, California, 25
-
Matei Zaharia,Andy Konwinski, AnthonyD. Joseph,RandyKatz, and Ion Stoica. Improving MapReduce performance in heterogeneous environments. In Proceedings of the 8th Symposium on Operating System Design and Implementation (OSDI 2008), pages 29-42, San Diego, California, 2008. 25
-
(2008)
Proceedings of the 8th Symposium On Operating System Design and Implementation (OSDI 2008)
, pp. 29-42
-
-
Zaharia, M.1
Konwinski, A.2
Joseph, A.D.3
Katz, R.4
Stoica, I.5
-
162
-
-
33747729581
-
Inverted files for text search engines
-
78, 83
-
Justin Zobel and Alistair Moffat. Inverted files for text search engines. ACM Computing Surveys, 38(6):1-56, 2006. 78, 83
-
(2006)
ACM Computing Surveys
, vol.38
, Issue.6
, pp. 1-56
-
-
Zobel, J.1
Moffat, A.2
|