-
1
-
-
84862688930
-
-
arXiv:1110.4198
-
Agarwal, Alekh, Chapelle, Olivier, Dudik, Miroslav, and Lang ford, John. A reliable effective terascale linear learning system. Technical report, arXiv:1110.4198, 2011.
-
(2011)
A Reliable Effective Terascale Linear Learning System. Technical Report
-
-
Agarwal, A.1
Chapelle, O.2
Dudik, M.3
Langford, J.4
-
3
-
-
35348849154
-
Scaling up all pairs similarity search
-
Bayardo, Roberto J., Ma, Yiming, and Srikant, Ramakrishnan. Scaling up all pairs similarity search. In WWW, pp. 131-140, 2007.
-
(2007)
WWW
, pp. 131-140
-
-
Bayardo Roberto, J.1
Yiming, M.2
Ramakrishnan, S.3
-
4
-
-
84919824364
-
-
Bottou, Leon, http://leon.bottou.org/projects/sgd.
-
-
-
Bottou, L.1
-
5
-
-
0031346696
-
On the resemblance and containment of docu ments
-
Positano, Italy
-
Broder, Andrei Z. On the resemblance and containment of docu ments. In the Compression and Complexity of Sequences, pp. 21-29, Positano, Italy, 1997.
-
(1997)
The Compression and Complexity of Sequences
, pp. 21-29
-
-
Broder, A.Z.1
-
6
-
-
0010362121
-
Syntactic clustering of the web
-
Santa Clara, CA
-
Broder, Andrei Z., Glassman, Steven C, Manasse, Mark S., and Zweig, Geoffrey. Syntactic clustering of the web. In WWW, pp. 1157-1166, Santa Clara, CA, 1997.
-
(1997)
WWW
, pp. 1157-1166
-
-
Broder, A.Z.1
Glassman, S.C.2
Manasse, M.S.3
Zweig, G.4
-
7
-
-
0031620041
-
Min-wise independent permutations
-
Dallas, TX
-
Broder, Andrei Z., Charikar, Moses, Frieze, Alan M., and Mitzenmacher, Michael. Min-wise independent permutations. In STOC, pp. 327-336, Dallas, TX, 1998.
-
(1998)
STOC
, pp. 327-336
-
-
Broder, A.Z.1
Charikar, M.2
Frieze, A.M.3
Mitzenmacher, M.4
-
8
-
-
84963650728
-
Universal classes of hash functions
-
Carter, J. Lawrence and Wegman, Mark N. Universal classes of hash functions. In STOC, pp. 106-112, 1977.
-
(1977)
STOC
, pp. 106-112
-
-
Carter, J.L.1
Wegman, M.N.2
-
9
-
-
80053284143
-
-
Chandra, Tushar, Ie, Eugene, Goldman, Kenneth, Llinares, Tomas Lloret, McFadden, Jim, Pereira, Fernando, Redstone, Joshua, Shaked, Tal, and Singer, Yoram. Sibyl: a system for large scale machine learning.
-
Sibyl: A System for Large Scale Machine Learning
-
-
Chandra, T.1
Ie, E.2
Goldman, K.3
Llinares, T.L.4
McFadden, J.5
Pereira, F.6
Redstone, J.7
Shaked, T.8
Singer, Y.9
-
10
-
-
0032594951
-
Sup port vector machines for histogram-based image classification
-
Chapelle, Olivier, Haffner, Patrick, and Vapnik, Vladimir N. Sup port vector machines for histogram-based image classification. IEEE Trans. Neural Networks, 10(5):1055-1064, 1999.
-
(1999)
IEEE Trans. Neural Networks
, vol.10
, Issue.5
, pp. 1055-1064
-
-
Chapelle, O.1
Haffner, P.2
Vapnik, V.N.3
-
11
-
-
33646695921
-
Semantic similarity between search engine queries using temporal correlation
-
Chien, Steve and Immorlica, Nicole. Semantic similarity between search engine queries using temporal correlation. In WWW, pp. 2-11, 2005.
-
(2005)
WWW
, pp. 2-11
-
-
Chien, S.1
Immorlica, N.2
-
12
-
-
80052688924
-
Fast locality-sensitive hashing
-
Dasgupta, Anirban, Kumar, Ravi, and Sarlos, Tamas. Fast locality-sensitive hashing. In KDD, pp. 1073-1081, 2011.
-
(2011)
KDD
, pp. 1073-1081
-
-
Dasgupta, A.1
Kumar, R.2
Sarlos, T.3
-
13
-
-
50949133669
-
Liblinear: A library for large linear classification
-
Fan, Rong-En, Chang, Kai-Wei, Hsieh, Cho-Jui, Wang, Xiang-Rui, and Lin, Chih-Jen. Liblinear: A library for large linear classification. Journal of Machine Learning Research, 9:1871-1874, 2008.
-
(2008)
Journal of Machine Learning Research
, vol.9
, pp. 1871-1874
-
-
Fan, R.-E.1
Chang, K.-W.2
Hsieh, C.-J.3
Wang, X.-R.4
Lin, C.-J.5
-
14
-
-
84880492977
-
A large-scale study of the evolution of web pages
-
Budapest, Hungary
-
Fetterly, Dennis, Manasse, Mark, Najork, Marc, and Wiener, Janet L. A large-scale study of the evolution of web pages. In WWW, pp. 669-678, Budapest, Hungary, 2003.
-
(2003)
WWW
, pp. 669-678
-
-
Fetterly, D.1
Manasse, M.2
Najork, M.3
Wiener, J.L.4
-
15
-
-
0016567534
-
An algorithm for finding nearest neighbors
-
Friedman, Jerome H., Baskett, F, and Shustek, L. An algorithm for finding nearest neighbors. IEEE Transactions on Comput ers, 24:1000-1006, 1975.
-
(1975)
IEEE Transactions on Comput Ers
, vol.24
, pp. 1000-1006
-
-
Friedman Jerome, H.1
Baskett, F.2
Shustek, L.3
-
16
-
-
33750296887
-
Finding near-duplicate web pages: A large-scale evaluation of algorithms
-
Henzinger, Monika Rauch. Finding near-duplicate web pages: a large-scale evaluation of algorithms. In SIGIR, pp. 284-291, 2006.
-
(2006)
SIGIR
, pp. 284-291
-
-
Henzinger, M.R.1
-
17
-
-
0031644241
-
Approximate nearest neigh bors: Towards removing the curse of dimensionality
-
Dallas, TX
-
Indyk, Piotr and Motwani, Rajeev. Approximate nearest neigh bors: Towards removing the curse of dimensionality. In STOC, pp. 604-613, Dallas, TX, 1998.
-
(1998)
STOC
, pp. 604-613
-
-
Indyk, P.1
Motwani, R.2
-
18
-
-
33749563073
-
Training linear svms in linear time
-
Pittsburgh, PA
-
Joachims, Thorsten. Training linear svms in linear time. In KDD, pp. 217-226, Pittsburgh, PA, 2006.
-
(2006)
KDD
, pp. 217-226
-
-
Joachims, T.1
-
19
-
-
33745590892
-
Approximate joins: Con cepts and techniques
-
Koudas, Nick and Srivastava, Divesh. Approximate joins: Con cepts and techniques. In VLDB, pp. 1363, 2005.
-
(2005)
VLDB
, pp. 1363
-
-
Koudas, N.1
Srivastava, D.2
-
21
-
-
77954568754
-
B-bit minwise hashing
-
Raleigh, NC
-
Li, Ping and König, Arnd Christian, b-bit minwise hashing. In Proceedings of the 19th International Conference on World Wide Web, pp. 671-680, Raleigh, NC, 2010.
-
(2010)
Proceedings of the 19th International Conference on World Wide Web
, pp. 671-680
-
-
Li, P.1
König, A.C.2
-
22
-
-
84864064770
-
Conditional random sampling: A sketch-based sampling technique for sparse data
-
Vancouver, BC, Canada
-
Li, Ping, Church, Kenneth W., and Hastie, Trevor J. Conditional random sampling: A sketch-based sampling technique for sparse data. In NIPS, pp. 873-880, Vancouver, BC, Canada, 2006.
-
(2006)
NIPS
, pp. 873-880
-
-
Li, P.1
Church, K.W.2
Hastie, T.J.3
-
23
-
-
85162008856
-
B-bit minwise hashing for estimating three-way similarities
-
Vancouver, BC
-
Li, Ping, König, Arnd Christian, and Gui, Wenhao. b-bit minwise hashing for estimating three-way similarities. In Advances in Neural Information Processing Systems, Vancouver, BC, 2010.
-
(2010)
Advances in Neural Information Processing Systems
-
-
Li, P.1
König, A.C.2
Gui, W.3
-
24
-
-
85162551376
-
Hashing algorithms for large-scale learning
-
Granada, Spain
-
Li, Ping, Shrivastava, Anshumali, Moore, Joshua, and König, Arnd Christian. Hashing algorithms for large-scale learning. In NIPS, Granada, Spain, 2011.
-
(2011)
NIPS
-
-
Li, P.1
Shrivastava, A.2
Moore, J.3
König, A.C.4
-
25
-
-
84877737308
-
One permutation hashing
-
Lake Tahoe, NV
-
Li, Ping, Owen, Art B, and Zhang, Cun-Hui. One permutation hashing. In NIPS, Lake Tahoe, NV, 2012.
-
(2012)
NIPS
-
-
Li, P.1
Owen, A.B.2
Zhang, C.-H.3
-
26
-
-
84919824362
-
Bbit minwise hashing in practice
-
Li, Ping, Shrivastava, Anshumali, and König, Arnd Christian, bbit minwise hashing in practice. In Internetware, Changsha, China, 2013.
-
(2013)
Internetware, Changsha, China
-
-
Ping, L.1
Anshumali, S.2
Arnd Christian, K.3
-
27
-
-
35348911985
-
De tecting near-duplicates for web-crawling
-
Banff, Alberta, Canada
-
Manku, Gurmeet Singh, Jain, Arvind, and Sarma, Anish Das. De tecting Near-Duplicates for Web-Crawling. In WWW, Banff, Alberta, Canada, 2007.
-
(2007)
WWW
-
-
Manku, G.S.1
Jain, A.2
Sarma, A.D.3
-
28
-
-
49049116151
-
Why simple hash functions work: Exploiting the entropy in a data stream
-
Mitzenmacher, Michael and Vadhan, Salil. Why simple hash functions work: exploiting the entropy in a data stream. In SODA, 2008.
-
(2008)
SODA
-
-
Mitzenmacher, M.1
Vadhan, S.2
-
30
-
-
34547964973
-
Pegasos: Primal estimated sub-gradient solver for svm
-
Corvalis, Oregon
-
Shalev-Shwartz, Shai, Singer, Yoram, and Srebro, Nathan. Pegasos: Primal estimated sub-gradient solver for svm. In ICML, pp. 807-814, Corvalis, Oregon, 2007.
-
(2007)
ICML
, pp. 807-814
-
-
Shalev-Shwartz, S.1
Singer, Y.2
Srebro, N.3
-
31
-
-
84919824361
-
Fast near neighbor search in high-dimensional binary data
-
Shrivastava, Anshumali and Li, Ping. Fast near neighbor search in high-dimensional binary data. In ECML, 2012.
-
(2012)
ECML
-
-
Anshumali, S.1
Ping, L.2
-
32
-
-
84898997631
-
Beyond pairwise: Provably fast algorithms for approximate k-way similarity search
-
Lake Tahoe, NY
-
Shrivastava, Anshumali and Li, Ping. Beyond pairwise: Provably fast algorithms for approximate k-way similarity search. In NIPS, Lake Tahoe, NY 2013.
-
(2013)
NIPS
-
-
Anshumali, S.1
Ping, L.2
-
34
-
-
71149087699
-
Feature hashing for large scale mul titask learning
-
Weinberger, Kilian, Dasgupta, Anirban, Langford, John, Smola, Alex, and Attenberg, Josh. Feature hashing for large scale mul titask learning. In ICML, pp. 1113-1120, 2009.
-
(2009)
ICML
, pp. 1113-1120
-
-
Weinberger, K.1
Dasgupta, A.2
Langford, J.3
Smola, A.4
Attenberg, J.5
|