-
1
-
-
0033721503
-
Graph structure in the web
-
A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, and J. Weiner, "Graph structure in the web," Computer Networks 33: 1-6, pp. 309-320, 2000.
-
(2000)
Computer Networks
, vol.33
, Issue.1-6
, pp. 309-320
-
-
Broder, A.1
Kumar, R.2
Maghoul, F.3
Raghavan, P.4
Rajagopalan, S.5
Stata, R.6
Tomkins, A.7
Weiner, J.8
-
3
-
-
0013084629
-
-
Second Edition, Prentice-Hall, Upper Saddle River, NJ
-
H. Garcia-Molina, J.D. Ullman, and J. Widom, Database Systems: The Complete Book Second Edition, Prentice-Hall, Upper Saddle River, NJ, 2009.
-
(2009)
Database Systems: The Complete Book
-
-
Garcia-Molina, H.1
Ullman, J.D.2
Widom, J.3
-
4
-
-
0000811402
-
-
Sorting and Searching), Second Edition, Addison-Wesley, Upper Saddle River, NJ
-
D.E. Knuth, The Art of Computer Programming Vol. 3 (Sorting and Searching), Second Edition, Addison-Wesley, Upper Saddle River, NJ, 1998.
-
(1998)
The Art of Computer Programming
, vol.3
-
-
Knuth, D.E.1
-
6
-
-
0014403118
-
The Matthew effect in science
-
Jan. 5
-
R.K. Merton, "The Matthew effect in science," Science 159: 3810, pp. 56-63, Jan. 5, 1968.
-
(1968)
Science
, vol.159
, Issue.3810
, pp. 56-63
-
-
Merton, R.K.1
-
7
-
-
25144439604
-
-
Addison-Wesley, Upper Saddle River, NJ
-
P.-N. Tan, M. Steinbach, and V. Kumar, Introduction to Data Mining, Addison-Wesley, Upper Saddle River, NJ, 2005.
-
(2005)
Introduction to Data Mining
-
-
Tan, P.-N.1
Steinbach, M.2
Kumar, V.3
-
8
-
-
84924109574
-
Cluster computing, recursion, and Datalog
-
to appear, Elsevier
-
F.N. Afrati, V. Borkar, M. Carey, A. Polyzotis, and J.D. Ullman, "Cluster computing, recursion, and Datalog," to appear in Proc. Datalog 2.0 Workshop, Elsevier, 2011.
-
(2011)
Proc. Datalog 2.0 Workshop
-
-
Afrati, F.N.1
Borkar, V.2
Carey, M.3
Polyzotis, A.4
Ullman, J.D.5
-
9
-
-
85032604190
-
Upper and lower bounds on the cost of a MapReduce computation
-
to appear, Also available as CoRR
-
F.N. Afrati, A. Das Sarma, S. Salihoglu, and J.D. Ullman, "Upper and lower bounds on the cost of a MapReduce computation." to appear in Proc. Intl. Conf. on Very Large Databases, 2013. Also available as CoRR, abs/1206.4377.
-
(2013)
Proc. Intl. Conf. on Very Large Databases
-
-
Afrati, F.N.1
Das Sarma, A.2
Salihoglu, S.3
Ullman, J.D.4
-
12
-
-
79960751096
-
HaLoop: Efficient iterative data processing on large clusters
-
Y. Bu, B. Howe, M. Balazinska, and M. Ernst, "HaLoop: efficient iterative data processing on large clusters," Proc. Intl. Conf. on Very Large Databases, 2010.
-
(2010)
Proc. Intl. Conf. on Very Large Databases
-
-
Bu, Y.1
Howe, B.2
Balazinska, M.3
Ernst, M.4
-
13
-
-
47749140025
-
Bigtable: A distributed storage system for structured data
-
F. Chang, J. Dean, S. Ghemawat, W.C. Hsieh, D.A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R.E. Gruber, "Bigtable: a distributed storage system for structured data," ACM Transactions on Computer Systems 26: 2, pp. 1-26, 2008.
-
(2008)
ACM Transactions on Computer Systems
, vol.26
, Issue.2
, pp. 1-26
-
-
Chang, F.1
Dean, J.2
Ghemawat, S.3
Hsieh, W.C.4
Wallach, D.A.5
Burrows, M.6
Chandra, T.7
Fikes, A.8
Gruber, R.E.9
-
14
-
-
84867112010
-
Pnuts: Yahoo!‘s hosted data serving platform
-
B.F. Cooper, R. Ramakrishnan, U. Srivastava, A. Silberstein, P. Bohannon, H.-A. Jacobsen, N. Puz, D. Weaver, and R. Yerneni, "Pnuts: Yahoo!‘s hosted data serving platform," PVLDB 1: 2, pp. 1277-1288, 2008.
-
(2008)
PVLDB
, vol.1
, Issue.2
, pp. 1277-1288
-
-
Cooper, B.F.1
Ramakrishnan, R.2
Srivastava, U.3
Silberstein, A.4
Bohannon, P.5
Jacobsen, H.-A.6
Puz, N.7
Weaver, D.8
Yerneni, R.9
-
15
-
-
37549003336
-
Mapreduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat, "Mapreduce: simplified data processing on large clusters," Comm. ACM 51: 1, pp. 107-113, 2008.
-
(2008)
Comm. ACM
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
16
-
-
84859198862
-
Clustera: An integrated computation and data management system
-
D.J. DeWitt, E. Paulson, E. Robinson, J.F. Naughton, J. Royalty, S. Shankar, and A. Krioukov, "Clustera: an integrated computation and data management system," PVLDB 1: 1, pp. 28-41, 2008.
-
(2008)
PVLDB
, vol.1
, Issue.1
, pp. 28-41
-
-
DeWitt, D.J.1
Paulson, E.2
Robinson, E.3
Naughton, J.F.4
Royalty, J.5
Shankar, S.6
Krioukov, A.7
-
18
-
-
85032604675
-
-
hadoop.apache.org, Apache Foundation.
-
-
-
-
19
-
-
85032599359
-
-
hadoop.apache.org/hive, Apache Foundation.
-
-
-
-
20
-
-
34548041192
-
Dryad: Distributed data-parallel programs from sequential building blocks
-
ACM
-
M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly. "Dryad: distributed data-parallel programs from sequential building blocks," Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems, pp. 59-72, ACM, 2007.
-
(2007)
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems
, pp. 59-72
-
-
Isard, M.1
Budiu, M.2
Yu, Y.3
Birrell, A.4
Fetterly, D.5
-
21
-
-
77954723629
-
Pregel: A system for large-scale graph processing
-
G. Malewicz, M.N. Austern, A.J.C. Sik, J.C. Denhert, H. Horn, N. Leiser, and G. Czajkowski, "Pregel: a system for large-scale graph processing," Proc. ACM SIGMOD Conference, 2010.
-
(2010)
Proc. ACM SIGMOD Conference
-
-
Malewicz, G.1
Austern, M.N.2
Sik, A.J.C.3
Denhert, J.C.4
Horn, H.5
Leiser, N.6
Czajkowski, G.7
-
22
-
-
55349148888
-
Pig latin: A not-so-foreign language for data processing
-
C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins, "Pig latin: a not-so-foreign language for data processing," Proc. ACM SIGMOD Conference, pp. 1099-1110, 2008.
-
(2008)
Proc. ACM SIGMOD Conference
, pp. 1099-1110
-
-
Olston, C.1
Reed, B.2
Srivastava, U.3
Kumar, R.4
Tomkins, A.5
-
23
-
-
0003584325
-
-
Third Edition, Prentice-Hall, Upper Saddle River, NJ
-
J.D. Ullman and J. Widom, A First Course in Database Systems, Third Edition, Prentice-Hall, Upper Saddle River, NJ, 2008.
-
(2008)
A First Course in Database Systems
-
-
Ullman, J.D.1
Widom, J.2
-
24
-
-
85076882757
-
DryadLINQ: A system for general-purpose distributed dataparallel computing using a high-level language
-
USENIX Association
-
Y. Yu, M. Isard, D. Fetterly, M. Budiu, I. Erlingsson, P.K. Gunda, and J. Currey, "DryadLINQ: a system for general-purpose distributed dataparallel computing using a high-level language," OSDI, pp. 1-14, USENIX Association, 2008.
-
(2008)
OSDI
, pp. 1-14
-
-
Yu, Y.1
Isard, M.2
Fetterly, D.3
Budiu, M.4
Erlingsson, I.5
Gunda, P.K.6
Currey, J.7
-
25
-
-
37549058056
-
Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions
-
A. Andoni and P. Indyk, "Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions," Comm. ACM 51: 1, pp. 117-122, 2008.
-
(2008)
Comm. ACM
, vol.51
, Issue.1
, pp. 117-122
-
-
Andoni, A.1
Indyk, P.2
-
26
-
-
0031346696
-
On the resemblance and containment of documents
-
Positano Italy
-
A.Z. Broder, "On the resemblance and containment of documents," Proc. Compression and Complexity of Sequences, pp. 21-29, Positano Italy, 1997.
-
(1997)
Proc. Compression and Complexity of Sequences
, pp. 21-29
-
-
Broder, A.Z.1
-
27
-
-
0031620041
-
Min-wise independent permutations
-
A.Z. Broder, M. Charikar, A.M. Frieze, and M. Mitzenmacher, "Min-wise independent permutations," ACM Symposium on Theory of Computing, pp. 327-336, 1998.
-
(1998)
ACM Symposium on Theory of Computing
, pp. 327-336
-
-
Broder, A.Z.1
Charikar, M.2
Frieze, A.M.3
Mitzenmacher, M.4
-
28
-
-
0036040277
-
Similarity estimation techniques from rounding algorithms
-
M.S. Charikar, "Similarity estimation techniques from rounding algorithms," ACM Symposium on Theory of Computing, pp. 380-388, 2002.
-
(2002)
ACM Symposium on Theory of Computing
, pp. 380-388
-
-
Charikar, M.S.1
-
30
-
-
4544259509
-
Locality-sensitive hashing scheme based on p-stable distributions
-
M. Datar, N. Immorlica, P. Indyk, and V.S. Mirrokni, "Locality-sensitive hashing scheme based on p-stable distributions," Symposium on Computational Geometry pp. 253-262, 2004.
-
(2004)
Symposium on Computational Geometry
, pp. 253-262
-
-
Datar, M.1
Immorlica, N.2
Indyk, P.3
Mirrokni, V.S.4
-
31
-
-
0001944742
-
Similarity search in high dimensions via hashing
-
A. Gionis, P. Indyk, and R. Motwani, "Similarity search in high dimensions via hashing," Proc. Intl. Conf. on Very Large Databases, pp. 518-529, 1999.
-
(1999)
Proc. Intl. Conf. on Very Large Databases
, pp. 518-529
-
-
Gionis, A.1
Indyk, P.2
Motwani, R.3
-
32
-
-
33750296887
-
Finding near-duplicate web pages: A large-scale evaluation of algorithms
-
M. Henzinger, "Finding near-duplicate web pages: a large-scale evaluation of algorithms," Proc. 29th SIGIR Conf., pp. 284-291, 2006.
-
(2006)
Proc. 29th SIGIR Conf
, pp. 284-291
-
-
Henzinger, M.1
-
33
-
-
0031644241
-
Approximate nearest neighbor: Towards removing the curse of dimensionality
-
P. Indyk and R. Motwani. "Approximate nearest neighbor: towards removing the curse of dimensionality," ACM Symposium on Theory of Computing, pp. 604-613, 1998.
-
(1998)
ACM Symposium on Theory of Computing
, pp. 604-613
-
-
Indyk, P.1
Motwani, R.2
-
34
-
-
85043988965
-
Finding similar files in a large file system
-
U. Manber, "Finding similar files in a large file system," Proc. USENIX Conference, pp. 1-10, 1994.
-
(1994)
Proc. USENIX Conference
, pp. 1-10
-
-
Manber, U.1
-
35
-
-
57349131623
-
SpotSigs: Robust and efficient near duplicate detection in large web collections
-
July, Singapore
-
M. Theobald, J. Siddharth, and A. Paepcke, "SpotSigs: robust and efficient near duplicate detection in large web collections," 31st Annual ACM SIGIR Conference, July, 2008, Singapore.
-
(2008)
31st Annual ACM SIGIR Conference
-
-
Theobald, M.1
Siddharth, J.2
Paepcke, A.3
-
36
-
-
57349141410
-
Efficient similarity joins for near duplicate detection
-
C. Xiao, W. Wang, X. Lin, and J.X. Yu, "Efficient similarity joins for near duplicate detection," Proc. WWW Conference, pp. 131-140, 2008.
-
(2008)
Proc. WWW Conference
, pp. 131-140
-
-
Xiao, C.1
Wang, W.2
Lin, X.3
Yu, J.X.4
-
37
-
-
0029719644
-
The space complexity of approximat ing frequency moments
-
N. Alon, Y. Matias, and M. Szegedy, "The space complexity of approximat ing frequency moments," 28th ACM Symposium on Theory of Computing, pp. 20-29, 1996.
-
(1996)
28th ACM Symposium on Theory of Computing
, pp. 20-29
-
-
Alon, N.1
Matias, Y.2
Szegedy, M.3
-
38
-
-
0036042175
-
Models and issues in data stream systems
-
B. Babcock, S. Babu, M. Datar, R. Motwani, and J. Widom, "Models and issues in data stream systems," Symposium on Principles of Database Systems, pp. 1-16, 2002.
-
(2002)
Symposium on Principles of Database Systems
, pp. 1-16
-
-
Babcock, B.1
Babu, S.2
Datar, M.3
Motwani, R.4
Widom, J.5
-
39
-
-
0014814325
-
Space/time trade-offs in hash coding with allowable errors
-
B.H. Bloom, "Space/time trade-offs in hash coding with allowable errors," Comm. ACM 13: 7, pp. 422-426, 1970.
-
(1970)
Comm. ACM
, vol.13
, Issue.7
, pp. 422-426
-
-
Bloom, B.H.1
-
40
-
-
0036767270
-
Maintaining stream statistics over sliding windows
-
M. Datar, A. Gionis, P. Indyk, and R. Motwani, "Maintaining stream statistics over sliding windows," SIAM J. Computing 31, pp. 1794-1813, 2002.
-
(2002)
SIAM J. Computing
, vol.31
, pp. 1794-1813
-
-
Datar, M.1
Gionis, A.2
Indyk, P.3
Motwani, R.4
-
43
-
-
84944323337
-
Distinct sampling for highly-accurate answers to distinct values queries and event reports
-
P.B. Gibbons, "Distinct sampling for highly-accurate answers to distinct values queries and event reports," Intl. Conf. on Very Large Databases, pp. 541-550, 2001.
-
(2001)
Intl. Conf. on Very Large Databases
, pp. 541-550
-
-
Gibbons, P.B.1
-
44
-
-
0029195607
-
View maintenance issues for the chronicle data model
-
H.V. Jagadish, I.S. Mumick, and A. Silberschatz, "View maintenance issues for the chronicle data model," Proc. ACM Symp. on Principles of Database Systems, pp. 113-124, 1995.
-
(1995)
Proc. ACM Symp. on Principles of Database Systems
, pp. 113-124
-
-
Jagadish, H.V.1
Mumick, I.S.2
Silberschatz, A.3
-
48
-
-
0033721503
-
Graph structure in the web
-
A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, and J. Weiner, "Graph structure in the web," Computer Networks 33: 1-6, pp. 309-320, 2000.
-
(2000)
Computer Networks
, vol.33
, Issue.1-6
, pp. 309-320
-
-
Broder, A.1
Kumar, R.2
Maghoul, F.3
Raghavan, P.4
Rajagopalan, S.5
Stata, R.6
Tomkins, A.7
Weiner, J.8
-
49
-
-
34548764345
-
Link spam detection based on mass estimation
-
Z. Gyöngi, P. Berkhin, H. Garcia-Molina, and J. Pedersen, "Link spam detection based on mass estimation," Proc. 32nd Intl. Conf. on Very Large Databases, pp. 439-450, 2006.
-
(2006)
Proc. 32nd Intl. Conf. on Very Large Databases
, pp. 439-450
-
-
Gyöngi, Z.1
Berkhin, P.2
Garcia-Molina, H.3
Pedersen, J.4
-
50
-
-
85131818719
-
Combating link spam with trustrank
-
Z. Gyöngi, H. Garcia-Molina, and J. Pedersen, "Combating link spam with trustrank," Proc. 30th Intl. Conf. on Very Large Databases, pp. 576-587, 2004.
-
(2004)
Proc. 30th Intl. Conf. on Very Large Databases
, pp. 576-587
-
-
Gyöngi, Z.1
Garcia-Molina, H.2
Pedersen, J.3
-
51
-
-
85032603151
-
Efficient computation of PageRank," Stanford Univ. Dept. of Computer Science technical report, Sept., 1999. Available as http://infolab.stanford.edu/~taherh/papers/efficient-pr.pdf [6] T.H. Haveliwala, "Topic-sensitive PageRank
-
T.H. Haveliwala, "Efficient computation of PageRank," Stanford Univ. Dept. of Computer Science technical report, Sept., 1999. Available as http://infolab.stanford.edu/~taherh/papers/efficient-pr.pdf [6] T.H. Haveliwala, "Topic-sensitive PageRank," Proc. 11th Intl. World-Wide-Web Conference, pp. 517-526, 2002
-
(2002)
Proc. 11th Intl. World-Wide-Web Conference
, pp. 517-526
-
-
Haveliwala, T.H.1
-
52
-
-
4243148480
-
Authoritative sources in a hyperlinked environment
-
J.M. Kleinberg, "Authoritative sources in a hyperlinked environment," J. ACM 46: 5, pp. 604-632, 1999.
-
(1999)
J. ACM
, vol.46
, Issue.5
, pp. 604-632
-
-
Kleinberg, J.M.1
-
53
-
-
0027621699
-
Mining associations between sets of items in massive databases
-
R. Agrawal, T. Imielinski, and A. Swami, "Mining associations between sets of items in massive databases," Proc. ACM SIGMOD Intl. Conf. on Management of Data, pp. 207-216, 1993.
-
(1993)
Proc. ACM SIGMOD Intl. Conf. on Management of Data
, pp. 207-216
-
-
Agrawal, R.1
Imielinski, T.2
Swami, A.3
-
55
-
-
0002586462
-
Computing iceberg queries efficiently
-
M. Fang, N. Shivakumar, H. Garcia-Molina, R. Motwani, and J.D. Ullman, "Computing iceberg queries efficiently," Intl. Conf. on Very Large Databases, pp. 299-310, 1998.
-
(1998)
Intl. Conf. on Very Large Databases
, pp. 299-310
-
-
Fang, M.1
Shivakumar, N.2
Garcia-Molina, H.3
Motwani, R.4
Ullman, J.D.5
-
56
-
-
84976830511
-
An effective hash-based algorithm for mining association rules
-
J.S. Park, M.-S. Chen, and P.S. Yu, "An effective hash-based algorithm for mining association rules," Proc. ACM SIGMOD Intl. Conf. on Management of Data, pp. 175-186, 1995.
-
(1995)
Proc. ACM SIGMOD Intl. Conf. on Management of Data
, pp. 175-186
-
-
Park, J.S.1
Chen, M.-S.2
Yu, P.S.3
-
57
-
-
0002082857
-
An efficient algorithm for mining association rules in large databases
-
A. Savasere, E. Omiecinski, and S.B. Navathe, "An efficient algorithm for mining association rules in large databases," Intl. Conf. on Very Large Databases, pp. 432-444, 1995.
-
(1995)
Intl. Conf. on Very Large Databases
, pp. 432-444
-
-
Savasere, A.1
Omiecinski, E.2
Navathe, S.B.3
-
58
-
-
0002663969
-
Sampling large databases for association rules
-
H. Toivonen, "Sampling large databases for association rules," Intl. Conf. on Very Large Databases, pp. 134-145, 1996.
-
(1996)
Intl. Conf. on Very Large Databases
, pp. 134-145
-
-
Toivonen, H.1
-
59
-
-
1142293244
-
Maintaining variance and k-medians over data stream windows
-
B. Babcock, M. Datar, R. Motwani, and L. O’Callaghan, "Maintaining variance and k-medians over data stream windows," Proc. ACM Symp. on Principles of Database Systems, pp. 234-243, 2003.
-
(2003)
Proc. ACM Symp. on Principles of Database Systems
, pp. 234-243
-
-
Babcock, B.1
Datar, M.2
Motwani, R.3
O’Callaghan, L.4
-
60
-
-
84953806973
-
Scaling clustering algorithms to large databases
-
P.S. Bradley, U.M. Fayyad, and C. Reina, "Scaling clustering algorithms to large databases," Proc. Knowledge Discovery and Data Mining, pp. 9-15, 1998.
-
(1998)
Proc. Knowledge Discovery and Data Mining
, pp. 9-15
-
-
Bradley, P.S.1
Fayyad, U.M.2
Reina, C.3
-
61
-
-
0032665257
-
Clustering large datasets in arbitrary metric spaces
-
V. Ganti, R. Ramakrishnan, J. Gehrke, A.L. Powell, and J.C. French:, "Clustering large datasets in arbitrary metric spaces," Proc. Intl. Conf. on Data Engineering, pp. 502-511, 1999.
-
(1999)
Proc. Intl. Conf. on Data Engineering
, pp. 502-511
-
-
Ganti, V.1
Ramakrishnan, R.2
Gehrke, J.3
Powell, A.L.4
French, J.C.5
-
62
-
-
0013084629
-
-
Second Edition, Prentice-Hall, Upper Saddle River, NJ
-
H. Garcia-Molina, J.D. Ullman, and J. Widom, Database Systems: The Complete Book Second Edition, Prentice-Hall, Upper Saddle River, NJ, 2009.
-
(2009)
Database Systems: The Complete Book
-
-
Garcia-Molina, H.1
Ullman, J.D.2
Widom, J.3
-
63
-
-
0032091595
-
CURE: An efficient clustering algorithm for large databases
-
S. Guha, R. Rastogi, and K. Shim, "CURE: An efficient clustering algorithm for large databases," Proc. ACM SIGMOD Intl. Conf. on Management of Data, pp. 73-84, 1998.
-
(1998)
Proc. ACM SIGMOD Intl. Conf. on Management of Data
, pp. 73-84
-
-
Guha, S.1
Rastogi, R.2
Shim, K.3
-
64
-
-
0030157145
-
BIRCH: An efficient data clustering method for very large databases
-
T. Zhang, R. Ramakrishnan, and M. Livny, "BIRCH: an efficient data clustering method for very large databases," Proc. ACM SIGMOD Intl. Conf. on Management of Data, pp. 103-114, 1996.
-
(1996)
Proc. ACM SIGMOD Intl. Conf. on Management of Data
, pp. 103-114
-
-
Zhang, T.1
Ramakrishnan, R.2
Livny, M.3
-
65
-
-
42549140738
-
An experimental comparison of click-position bias models
-
N. Craswell, O. Zoeter, M. Taylor, and W. Ramsey, "An experimental comparison of click-position bias models," Proc. Intl. Conf. on Web Search and Web Data Mining pp. 87-94, 2008.
-
(2008)
Proc. Intl. Conf. on Web Search and Web Data Mining
, pp. 87-94
-
-
Craswell, N.1
Zoeter, O.2
Taylor, M.3
Ramsey, W.4
-
66
-
-
0347998609
-
An optimal deterministic algorithm for b-matching
-
B. Kalyanasundaram and K.R. Pruhs, "An optimal deterministic algorithm for b-matching," Theoretical Computer Science 233: 1-2, pp. 319-325, 2000.
-
(2000)
Theoretical Computer Science
, vol.233
, Issue.1-2
, pp. 319-325
-
-
Kalyanasundaram, B.1
Pruhs, K.R.2
-
67
-
-
33748591085
-
Adwords and generalized on-line matching
-
A Mehta, A. Saberi, U. Vazirani, and V. Vazirani, "Adwords and generalized on-line matching," IEEE Symp. on Foundations of Computer Science, pp. 264-273, 2005.
-
(2005)
IEEE Symp. on Foundations of Computer Science
, pp. 264-273
-
-
Mehta, A.1
Saberi, A.2
Vazirani, U.3
Vazirani, V.4
-
68
-
-
20844435854
-
Towards the next generation of recommender systems: A survey of the state-of-the-art and possible extensions
-
G. Adomavicius and A. Tuzhilin, "Towards the next generation of recommender systems: a survey of the state-of-the-art and possible extensions," IEEE Trans. on Data and Knowledge Engineering 17: 6, pp. 734-749, 2005.
-
(2005)
IEEE Trans. on Data and Knowledge Engineering
, vol.17
, Issue.6
, pp. 734-749
-
-
Adomavicius, G.1
Tuzhilin, A.2
-
69
-
-
85032602188
-
-
C. Anderson, http://www.wired.com/wired/archive/12.10/tail.html 2004.
-
(2004)
-
-
Anderson, C.1
-
72
-
-
0037252945
-
Amazon.com recommendations: Itemto- item collaborative filtering
-
G. Linden, B. Smith, and J. York, "Amazon.com recommendations: itemto- item collaborative filtering," Internet Computing 7: 1, pp. 76-80, 2003.
-
(2003)
Internet Computing
, vol.7
, Issue.1
, pp. 76-80
-
-
Linden, G.1
Smith, B.2
York, J.3
-
75
-
-
34247540250
-
Games with a purpose
-
June
-
L. von Ahn, "Games with a purpose," IEEE Computer Magazine, pp. 96-98, June 2006.
-
(2006)
IEEE Computer Magazine
, pp. 96-98
-
-
von Ahn, L.1
-
77
-
-
84876806831
-
Transitive closure and recursive Datalog implemented on clusters
-
F.N. Afrati and J.D. Ullman, "Transitive closure and recursive Datalog implemented on clusters," in Proc. EDBT (2012).
-
(2012)
Proc. EDBT
-
-
Afrati, F.N.1
Ullman, J.D.2
-
79
-
-
80054072531
-
HyperANF: Approximating the neighbourhood function of very large graphs on a budget
-
P. Boldi, M. Rosa, and S. Vigna, "HyperANF: approximating the neighbourhood function of very large graphs on a budget," Proc. WWW Conference (2011), pp. 625-634.
-
(2011)
Proc. WWW Conference
, pp. 625-634
-
-
Boldi, P.1
Rosa, M.2
Vigna, S.3
-
80
-
-
74049087026
-
Community detection in graphs
-
S. Fortunato, "Community detection in graphs," Physics Reports 486: 3-5 (2010), pp. 75-174.
-
(2010)
Physics Reports
, vol.486
, Issue.3-5
, pp. 75-174
-
-
Fortunato, S.1
-
81
-
-
0037062448
-
Community structure in social and biological networks
-
M. Girvan and M.E.J. Newman, "Community structure in social and biological networks," Proc. Natl. Acad. Sci. 99 (2002), pp. 7821-7826.
-
(2002)
Proc. Natl. Acad. Sci
, vol.99
, pp. 7821-7826
-
-
Girvan, M.1
Newman, M.E.J.2
-
84
-
-
0033297068
-
Trawling the Web for emerging cyber-communities
-
May
-
R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins, "Trawling the Web for emerging cyber-communities, Computer Networks 31: 11-16 (May, 1999), pp. 1481-1493.
-
(1999)
Computer Networks
, vol.31
-
-
Kumar, R.1
Raghavan, P.2
Rajagopalan, S.3
Tomkins, A.4
-
86
-
-
0036205389
-
Similarity flooding: A versatile graph matching algorithm and its application to schema matching
-
S. Melnik, H. Garcia-Molina, and E. Rahm, "Similarity flooding: a versatile graph matching algorithm and its application to schema matching, Proc. Intl. Conf. on Data Engineering (2002), pp. 117-128.
-
(2002)
Proc. Intl. Conf. on Data Engineering
, pp. 117-128
-
-
Melnik, S.1
Garcia-Molina, H.2
Rahm, E.3
-
89
-
-
85032594309
-
-
Stanford Network Analysis Platform, http://snap.stanford.edu.
-
-
-
-
91
-
-
34748827346
-
Fast random walk with restart and its applications
-
H. Tong, C. Faloutsos, and J.-Y. Pan, "Fast random walk with restart and its applications," ICDM 2006, pp. 613-622.
-
(2006)
ICDM
, pp. 613-622
-
-
Tong, H.1
Faloutsos, C.2
Pan, J.-Y.3
-
93
-
-
26444432537
-
Evaluation of recursive queries using join indices
-
P. Valduriez and H. Boral, "Evaluation of recursive queries using join indices," Expert Database Conf. (1986), pp. 271-293.
-
(1986)
Expert Database Conf
, pp. 271-293
-
-
Valduriez, P.1
Boral, H.2
-
94
-
-
34548583274
-
A tutorial on spectral clustering
-
2007
-
U. von Luxburg, "A tutorial on spectral clustering," Statistics and Computing 17: 4 (2007), 2007, pp. 395-416.
-
(2007)
Statistics and Computing
, vol.17
, Issue.4
, pp. 395-416
-
-
von Luxburg, U.1
-
95
-
-
84874243514
-
Overlapping community detection at scale: A nonnegative matrix factorization approach
-
J. Yang and J. Leskovec, "Overlapping community detection at scale: a nonnegative matrix factorization approach," ACM Intl. Conf. on Web Search and Data Mining, 2013.
-
(2013)
ACM Intl. Conf. on Web Search and Data Mining
-
-
Yang, J.1
Leskovec, J.2
-
98
-
-
84989525001
-
Indexing by latent semantic analysis
-
S. Deerwester, S.T. Dumais, G.W. Furnas, T.K. Landauer, and R. Harshman, "Indexing by latent semantic analysis," J. American Society for Information Science 41: 6 (1990).
-
(1990)
J. American Society for Information Science
, vol.41
, Issue.6
-
-
Deerwester, S.1
Dumais, S.T.2
Furnas, G.W.3
Landauer, T.K.4
Harshman, R.5
-
99
-
-
33751097630
-
Fast Monte Carlo algorithms for matrices III: Computing a compressed approximate matrix decomposition
-
P. Drineas, R. Kannan, and M.W. Mahoney, "Fast Monte Carlo algorithms for matrices III: Computing a compressed approximate matrix decomposition," SIAM J. Computing 36: 1 (2006), pp. 184-206.
-
(2006)
SIAM J. Computing
, vol.36
, Issue.1
, pp. 184-206
-
-
Drineas, P.1
Kannan, R.2
Mahoney, M.W.3
-
100
-
-
0000288016
-
Calculating the singular values and pseudoinverse of a matrix
-
G.H. Golub and W. Kahan, "Calculating the singular values and pseudoinverse of a matrix," J. SIAM Series B 2: 2 (1965), pp. 205-224.
-
(1965)
J. SIAM Series B
, vol.2
, Issue.2
, pp. 205-224
-
-
Golub, G.H.1
Kahan, W.2
-
102
-
-
33749575020
-
Tensor-CUR decompositions for tensor-based data
-
M.W. Mahoney, M. Maggioni, and P. Drineas, Tensor-CUR decompositions for tensor-based data, SIGKDD, pp. 327-336, 2006.
-
(2006)
SIGKDD
, pp. 327-336
-
-
Mahoney, M.W.1
Maggioni, M.2
Drineas, P.3
-
103
-
-
0000325341
-
On lines and planes of closest fit to systems of points in space
-
K. Pearson, "On lines and planes of closest fit to systems of points in space," Philosophical Magazine 2: 11 (1901), pp. 559-572.
-
(1901)
Philosophical Magazine
, vol.2
, Issue.11
, pp. 559-572
-
-
Pearson, K.1
-
104
-
-
49749100928
-
Less is more: Compact matrix decomposition for large sparse graphs
-
J. Sun, Y. Xie, H. Zhang, and C. Faloutsos, "Less is more: compact matrix decomposition for large sparse graphs," Proc. SIAM Intl. Conf. on Data Mining, 2007.
-
(2007)
Proc. SIAM Intl. Conf. on Data Mining
-
-
Sun, J.1
Xie, Y.2
Zhang, H.3
Faloutsos, C.4
-
105
-
-
2542430932
-
Singular value decomposition and principal component analysis
-
D.P. Berrar, W. Dubitzky, and M. Granzow, eds, Kluwer, Norwell, MA
-
M.E. Wall, A. Reichtsteiner and L.M. Rocha, "Singular value decomposition and principal component analysis," in A Practical Approach to Microarray Data Analysis (D.P. Berrar, W. Dubitzky, and M. Granzow, eds.), pp. 91-109, Kluwer, Norwell, MA, 2003.
-
(2003)
A Practical Approach to Microarray Data Analysis
, pp. 91-109
-
-
Wall, M.E.1
Reichtsteiner, A.2
Rocha, L.M.3
-
106
-
-
0030819669
-
Empirical support for winnow and weighted-majority algorithms: Results on a calendar scheduling domain
-
A. Blum, "Empirical support for winnow and weighted-majority algorithms: results on a calendar scheduling domain," Machine Learning 26 (1997), pp. 5-23.
-
(1997)
Machine Learning
, vol.26
, pp. 5-23
-
-
Blum, A.1
-
107
-
-
84904136037
-
Large-scale machine learning with stochastic gradient descent
-
Springer
-
L. Bottou, "Large-scale machine learning with stochastic gradient descent," Proc. 19th Intl. Conf. on Computational Statistics (2010), pp. 177-187, Springer.
-
(2010)
Proc. 19th Intl. Conf. on Computational Statistics
, pp. 177-187
-
-
Bottou, L.1
-
108
-
-
84921478833
-
Stochastic gradient tricks, neural networks
-
edited by G. Montavon, G.B. Orr and K.-R. Mueller, Lecture Notes in Computer Science (LNCS 7700), Springer
-
L. Bottou, "Stochastic gradient tricks, neural networks," in Tricks of the Trade, Reloaded, pp. 430-445, edited by G. Montavon, G.B. Orr and K.-R. Mueller, Lecture Notes in Computer Science (LNCS 7700), Springer, 2012.
-
(2012)
Tricks of the Trade, Reloaded
, pp. 430-445
-
-
Bottou, L.1
-
109
-
-
27144489164
-
A tutorial on support vector machines for pattern recognition
-
C.J.C. Burges, "A tutorial on support vector machines for pattern recognition," Data Mining and Knowledge Discovery 2 (1998), pp. 121-167.
-
(1998)
Data Mining and Knowledge Discovery
, vol.2
, pp. 121-167
-
-
Burges, C.J.C.1
-
111
-
-
34249753618
-
Support-vector networks
-
C. Cortes and V.N. Vapnik, "Support-vector networks," Machine Learning 20 (1995), pp. 273-297.
-
(1995)
Machine Learning
, vol.20
, pp. 273-297
-
-
Cortes, C.1
Vapnik, V.N.2
-
112
-
-
0033281425
-
Large margin classification using the perceptron algorithm
-
Y. Freund and R.E. Schapire, "Large margin classification using the perceptron algorithm," Machine Learning 37 (1999), pp. 277-296.
-
(1999)
Machine Learning
, vol.37
, pp. 277-296
-
-
Freund, Y.1
Schapire, R.E.2
-
113
-
-
33749563073
-
Training linear SVMs in linear time
-
T. Joachims, "Training linear SVMs in linear time." Proc. 12th ACM SIGKDD (2006), pp. 217-226.
-
(2006)
Proc. 12th ACM SIGKDD
, pp. 217-226
-
-
Joachims, T.1
-
114
-
-
34250091945
-
Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm
-
N. Littlestone, "Learning quickly when irrelevant attributes abound: a new linear-threshold algorithm," Machine Learning 2 (1988), pp. 285-318.
-
(1988)
Machine Learning
, vol.2
, pp. 285-318
-
-
Littlestone, N.1
-
116
-
-
11144273669
-
The perceptron: A probabilistic model for information storage and organization in the brain
-
F. Rosenblatt, "The perceptron: a probabilistic model for information storage and organization in the brain," Psychological Review 65: 6 (1958), pp. 386-408.
-
(1958)
Psychological Review
, vol.65
, Issue.6
, pp. 386-408
-
-
Rosenblatt, F.1
|