메뉴 건너뛰기




Volumn , Issue , 2014, Pages 1-458

Mining of massive datasets: Second edition

Author keywords

[No Author keywords available]

Indexed keywords

DIMENSIONALITY REDUCTION; INTERNET COMMERCES; LARGE DATASETS; LOCALITY SENSITIVE HASHING; MASSIVE DATA SETS; PARALLELIZING; STREAM PROCESSING; WEB TECHNOLOGIES;

EID: 84952760942     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1017/CBO9781139924801     Document Type: Book
Times cited : (1167)

References (116)
  • 4
    • 0000811402 scopus 로고    scopus 로고
    • Sorting and Searching), Second Edition, Addison-Wesley, Upper Saddle River, NJ
    • D.E. Knuth, The Art of Computer Programming Vol. 3 (Sorting and Searching), Second Edition, Addison-Wesley, Upper Saddle River, NJ, 1998.
    • (1998) The Art of Computer Programming , vol.3
    • Knuth, D.E.1
  • 6
    • 0014403118 scopus 로고
    • The Matthew effect in science
    • Jan. 5
    • R.K. Merton, "The Matthew effect in science," Science 159: 3810, pp. 56-63, Jan. 5, 1968.
    • (1968) Science , vol.159 , Issue.3810 , pp. 56-63
    • Merton, R.K.1
  • 15
    • 37549003336 scopus 로고    scopus 로고
    • Mapreduce: Simplified data processing on large clusters
    • J. Dean and S. Ghemawat, "Mapreduce: simplified data processing on large clusters," Comm. ACM 51: 1, pp. 107-113, 2008.
    • (2008) Comm. ACM , vol.51 , Issue.1 , pp. 107-113
    • Dean, J.1    Ghemawat, S.2
  • 18
    • 85032604675 scopus 로고    scopus 로고
    • hadoop.apache.org, Apache Foundation.
  • 19
    • 85032599359 scopus 로고    scopus 로고
    • hadoop.apache.org/hive, Apache Foundation.
  • 24
    • 85076882757 scopus 로고    scopus 로고
    • DryadLINQ: A system for general-purpose distributed dataparallel computing using a high-level language
    • USENIX Association
    • Y. Yu, M. Isard, D. Fetterly, M. Budiu, I. Erlingsson, P.K. Gunda, and J. Currey, "DryadLINQ: a system for general-purpose distributed dataparallel computing using a high-level language," OSDI, pp. 1-14, USENIX Association, 2008.
    • (2008) OSDI , pp. 1-14
    • Yu, Y.1    Isard, M.2    Fetterly, D.3    Budiu, M.4    Erlingsson, I.5    Gunda, P.K.6    Currey, J.7
  • 25
    • 37549058056 scopus 로고    scopus 로고
    • Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions
    • A. Andoni and P. Indyk, "Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions," Comm. ACM 51: 1, pp. 117-122, 2008.
    • (2008) Comm. ACM , vol.51 , Issue.1 , pp. 117-122
    • Andoni, A.1    Indyk, P.2
  • 26
    • 0031346696 scopus 로고    scopus 로고
    • On the resemblance and containment of documents
    • Positano Italy
    • A.Z. Broder, "On the resemblance and containment of documents," Proc. Compression and Complexity of Sequences, pp. 21-29, Positano Italy, 1997.
    • (1997) Proc. Compression and Complexity of Sequences , pp. 21-29
    • Broder, A.Z.1
  • 28
    • 0036040277 scopus 로고    scopus 로고
    • Similarity estimation techniques from rounding algorithms
    • M.S. Charikar, "Similarity estimation techniques from rounding algorithms," ACM Symposium on Theory of Computing, pp. 380-388, 2002.
    • (2002) ACM Symposium on Theory of Computing , pp. 380-388
    • Charikar, M.S.1
  • 32
    • 33750296887 scopus 로고    scopus 로고
    • Finding near-duplicate web pages: A large-scale evaluation of algorithms
    • M. Henzinger, "Finding near-duplicate web pages: a large-scale evaluation of algorithms," Proc. 29th SIGIR Conf., pp. 284-291, 2006.
    • (2006) Proc. 29th SIGIR Conf , pp. 284-291
    • Henzinger, M.1
  • 33
    • 0031644241 scopus 로고    scopus 로고
    • Approximate nearest neighbor: Towards removing the curse of dimensionality
    • P. Indyk and R. Motwani. "Approximate nearest neighbor: towards removing the curse of dimensionality," ACM Symposium on Theory of Computing, pp. 604-613, 1998.
    • (1998) ACM Symposium on Theory of Computing , pp. 604-613
    • Indyk, P.1    Motwani, R.2
  • 34
    • 85043988965 scopus 로고
    • Finding similar files in a large file system
    • U. Manber, "Finding similar files in a large file system," Proc. USENIX Conference, pp. 1-10, 1994.
    • (1994) Proc. USENIX Conference , pp. 1-10
    • Manber, U.1
  • 35
    • 57349131623 scopus 로고    scopus 로고
    • SpotSigs: Robust and efficient near duplicate detection in large web collections
    • July, Singapore
    • M. Theobald, J. Siddharth, and A. Paepcke, "SpotSigs: robust and efficient near duplicate detection in large web collections," 31st Annual ACM SIGIR Conference, July, 2008, Singapore.
    • (2008) 31st Annual ACM SIGIR Conference
    • Theobald, M.1    Siddharth, J.2    Paepcke, A.3
  • 36
    • 57349141410 scopus 로고    scopus 로고
    • Efficient similarity joins for near duplicate detection
    • C. Xiao, W. Wang, X. Lin, and J.X. Yu, "Efficient similarity joins for near duplicate detection," Proc. WWW Conference, pp. 131-140, 2008.
    • (2008) Proc. WWW Conference , pp. 131-140
    • Xiao, C.1    Wang, W.2    Lin, X.3    Yu, J.X.4
  • 39
    • 0014814325 scopus 로고
    • Space/time trade-offs in hash coding with allowable errors
    • B.H. Bloom, "Space/time trade-offs in hash coding with allowable errors," Comm. ACM 13: 7, pp. 422-426, 1970.
    • (1970) Comm. ACM , vol.13 , Issue.7 , pp. 422-426
    • Bloom, B.H.1
  • 40
    • 0036767270 scopus 로고    scopus 로고
    • Maintaining stream statistics over sliding windows
    • M. Datar, A. Gionis, P. Indyk, and R. Motwani, "Maintaining stream statistics over sliding windows," SIAM J. Computing 31, pp. 1794-1813, 2002.
    • (2002) SIAM J. Computing , vol.31 , pp. 1794-1813
    • Datar, M.1    Gionis, A.2    Indyk, P.3    Motwani, R.4
  • 43
    • 84944323337 scopus 로고    scopus 로고
    • Distinct sampling for highly-accurate answers to distinct values queries and event reports
    • P.B. Gibbons, "Distinct sampling for highly-accurate answers to distinct values queries and event reports," Intl. Conf. on Very Large Databases, pp. 541-550, 2001.
    • (2001) Intl. Conf. on Very Large Databases , pp. 541-550
    • Gibbons, P.B.1
  • 47
  • 51
    • 85032603151 scopus 로고    scopus 로고
    • Efficient computation of PageRank," Stanford Univ. Dept. of Computer Science technical report, Sept., 1999. Available as http://infolab.stanford.edu/~taherh/papers/efficient-pr.pdf [6] T.H. Haveliwala, "Topic-sensitive PageRank
    • T.H. Haveliwala, "Efficient computation of PageRank," Stanford Univ. Dept. of Computer Science technical report, Sept., 1999. Available as http://infolab.stanford.edu/~taherh/papers/efficient-pr.pdf [6] T.H. Haveliwala, "Topic-sensitive PageRank," Proc. 11th Intl. World-Wide-Web Conference, pp. 517-526, 2002
    • (2002) Proc. 11th Intl. World-Wide-Web Conference , pp. 517-526
    • Haveliwala, T.H.1
  • 52
    • 4243148480 scopus 로고    scopus 로고
    • Authoritative sources in a hyperlinked environment
    • J.M. Kleinberg, "Authoritative sources in a hyperlinked environment," J. ACM 46: 5, pp. 604-632, 1999.
    • (1999) J. ACM , vol.46 , Issue.5 , pp. 604-632
    • Kleinberg, J.M.1
  • 58
    • 0002663969 scopus 로고    scopus 로고
    • Sampling large databases for association rules
    • H. Toivonen, "Sampling large databases for association rules," Intl. Conf. on Very Large Databases, pp. 134-145, 1996.
    • (1996) Intl. Conf. on Very Large Databases , pp. 134-145
    • Toivonen, H.1
  • 66
    • 0347998609 scopus 로고    scopus 로고
    • An optimal deterministic algorithm for b-matching
    • B. Kalyanasundaram and K.R. Pruhs, "An optimal deterministic algorithm for b-matching," Theoretical Computer Science 233: 1-2, pp. 319-325, 2000.
    • (2000) Theoretical Computer Science , vol.233 , Issue.1-2 , pp. 319-325
    • Kalyanasundaram, B.1    Pruhs, K.R.2
  • 68
    • 20844435854 scopus 로고    scopus 로고
    • Towards the next generation of recommender systems: A survey of the state-of-the-art and possible extensions
    • G. Adomavicius and A. Tuzhilin, "Towards the next generation of recommender systems: a survey of the state-of-the-art and possible extensions," IEEE Trans. on Data and Knowledge Engineering 17: 6, pp. 734-749, 2005.
    • (2005) IEEE Trans. on Data and Knowledge Engineering , vol.17 , Issue.6 , pp. 734-749
    • Adomavicius, G.1    Tuzhilin, A.2
  • 69
    • 85032602188 scopus 로고    scopus 로고
    • C. Anderson, http://www.wired.com/wired/archive/12.10/tail.html 2004.
    • (2004)
    • Anderson, C.1
  • 72
    • 0037252945 scopus 로고    scopus 로고
    • Amazon.com recommendations: Itemto- item collaborative filtering
    • G. Linden, B. Smith, and J. York, "Amazon.com recommendations: itemto- item collaborative filtering," Internet Computing 7: 1, pp. 76-80, 2003.
    • (2003) Internet Computing , vol.7 , Issue.1 , pp. 76-80
    • Linden, G.1    Smith, B.2    York, J.3
  • 75
    • 34247540250 scopus 로고    scopus 로고
    • Games with a purpose
    • June
    • L. von Ahn, "Games with a purpose," IEEE Computer Magazine, pp. 96-98, June 2006.
    • (2006) IEEE Computer Magazine , pp. 96-98
    • von Ahn, L.1
  • 77
    • 84876806831 scopus 로고    scopus 로고
    • Transitive closure and recursive Datalog implemented on clusters
    • F.N. Afrati and J.D. Ullman, "Transitive closure and recursive Datalog implemented on clusters," in Proc. EDBT (2012).
    • (2012) Proc. EDBT
    • Afrati, F.N.1    Ullman, J.D.2
  • 79
    • 80054072531 scopus 로고    scopus 로고
    • HyperANF: Approximating the neighbourhood function of very large graphs on a budget
    • P. Boldi, M. Rosa, and S. Vigna, "HyperANF: approximating the neighbourhood function of very large graphs on a budget," Proc. WWW Conference (2011), pp. 625-634.
    • (2011) Proc. WWW Conference , pp. 625-634
    • Boldi, P.1    Rosa, M.2    Vigna, S.3
  • 80
    • 74049087026 scopus 로고    scopus 로고
    • Community detection in graphs
    • S. Fortunato, "Community detection in graphs," Physics Reports 486: 3-5 (2010), pp. 75-174.
    • (2010) Physics Reports , vol.486 , Issue.3-5 , pp. 75-174
    • Fortunato, S.1
  • 81
    • 0037062448 scopus 로고    scopus 로고
    • Community structure in social and biological networks
    • M. Girvan and M.E.J. Newman, "Community structure in social and biological networks," Proc. Natl. Acad. Sci. 99 (2002), pp. 7821-7826.
    • (2002) Proc. Natl. Acad. Sci , vol.99 , pp. 7821-7826
    • Girvan, M.1    Newman, M.E.J.2
  • 86
    • 0036205389 scopus 로고    scopus 로고
    • Similarity flooding: A versatile graph matching algorithm and its application to schema matching
    • S. Melnik, H. Garcia-Molina, and E. Rahm, "Similarity flooding: a versatile graph matching algorithm and its application to schema matching, Proc. Intl. Conf. on Data Engineering (2002), pp. 117-128.
    • (2002) Proc. Intl. Conf. on Data Engineering , pp. 117-128
    • Melnik, S.1    Garcia-Molina, H.2    Rahm, E.3
  • 89
    • 85032594309 scopus 로고    scopus 로고
    • Stanford Network Analysis Platform, http://snap.stanford.edu.
  • 90
    • 80052672399 scopus 로고    scopus 로고
    • Counting triangles and the curse of the last reducer
    • S. Suri and S. Vassilivitskii, "Counting triangles and the curse of the last reducer," Proc. WWW Conference (2011).
    • (2011) Proc. WWW Conference
    • Suri, S.1    Vassilivitskii, S.2
  • 91
    • 34748827346 scopus 로고    scopus 로고
    • Fast random walk with restart and its applications
    • H. Tong, C. Faloutsos, and J.-Y. Pan, "Fast random walk with restart and its applications," ICDM 2006, pp. 613-622.
    • (2006) ICDM , pp. 613-622
    • Tong, H.1    Faloutsos, C.2    Pan, J.-Y.3
  • 93
    • 26444432537 scopus 로고
    • Evaluation of recursive queries using join indices
    • P. Valduriez and H. Boral, "Evaluation of recursive queries using join indices," Expert Database Conf. (1986), pp. 271-293.
    • (1986) Expert Database Conf , pp. 271-293
    • Valduriez, P.1    Boral, H.2
  • 94
    • 34548583274 scopus 로고    scopus 로고
    • A tutorial on spectral clustering
    • 2007
    • U. von Luxburg, "A tutorial on spectral clustering," Statistics and Computing 17: 4 (2007), 2007, pp. 395-416.
    • (2007) Statistics and Computing , vol.17 , Issue.4 , pp. 395-416
    • von Luxburg, U.1
  • 95
    • 84874243514 scopus 로고    scopus 로고
    • Overlapping community detection at scale: A nonnegative matrix factorization approach
    • J. Yang and J. Leskovec, "Overlapping community detection at scale: a nonnegative matrix factorization approach," ACM Intl. Conf. on Web Search and Data Mining, 2013.
    • (2013) ACM Intl. Conf. on Web Search and Data Mining
    • Yang, J.1    Leskovec, J.2
  • 99
    • 33751097630 scopus 로고    scopus 로고
    • Fast Monte Carlo algorithms for matrices III: Computing a compressed approximate matrix decomposition
    • P. Drineas, R. Kannan, and M.W. Mahoney, "Fast Monte Carlo algorithms for matrices III: Computing a compressed approximate matrix decomposition," SIAM J. Computing 36: 1 (2006), pp. 184-206.
    • (2006) SIAM J. Computing , vol.36 , Issue.1 , pp. 184-206
    • Drineas, P.1    Kannan, R.2    Mahoney, M.W.3
  • 100
    • 0000288016 scopus 로고
    • Calculating the singular values and pseudoinverse of a matrix
    • G.H. Golub and W. Kahan, "Calculating the singular values and pseudoinverse of a matrix," J. SIAM Series B 2: 2 (1965), pp. 205-224.
    • (1965) J. SIAM Series B , vol.2 , Issue.2 , pp. 205-224
    • Golub, G.H.1    Kahan, W.2
  • 102
    • 33749575020 scopus 로고    scopus 로고
    • Tensor-CUR decompositions for tensor-based data
    • M.W. Mahoney, M. Maggioni, and P. Drineas, Tensor-CUR decompositions for tensor-based data, SIGKDD, pp. 327-336, 2006.
    • (2006) SIGKDD , pp. 327-336
    • Mahoney, M.W.1    Maggioni, M.2    Drineas, P.3
  • 103
    • 0000325341 scopus 로고
    • On lines and planes of closest fit to systems of points in space
    • K. Pearson, "On lines and planes of closest fit to systems of points in space," Philosophical Magazine 2: 11 (1901), pp. 559-572.
    • (1901) Philosophical Magazine , vol.2 , Issue.11 , pp. 559-572
    • Pearson, K.1
  • 105
    • 2542430932 scopus 로고    scopus 로고
    • Singular value decomposition and principal component analysis
    • D.P. Berrar, W. Dubitzky, and M. Granzow, eds, Kluwer, Norwell, MA
    • M.E. Wall, A. Reichtsteiner and L.M. Rocha, "Singular value decomposition and principal component analysis," in A Practical Approach to Microarray Data Analysis (D.P. Berrar, W. Dubitzky, and M. Granzow, eds.), pp. 91-109, Kluwer, Norwell, MA, 2003.
    • (2003) A Practical Approach to Microarray Data Analysis , pp. 91-109
    • Wall, M.E.1    Reichtsteiner, A.2    Rocha, L.M.3
  • 106
    • 0030819669 scopus 로고    scopus 로고
    • Empirical support for winnow and weighted-majority algorithms: Results on a calendar scheduling domain
    • A. Blum, "Empirical support for winnow and weighted-majority algorithms: results on a calendar scheduling domain," Machine Learning 26 (1997), pp. 5-23.
    • (1997) Machine Learning , vol.26 , pp. 5-23
    • Blum, A.1
  • 107
    • 84904136037 scopus 로고    scopus 로고
    • Large-scale machine learning with stochastic gradient descent
    • Springer
    • L. Bottou, "Large-scale machine learning with stochastic gradient descent," Proc. 19th Intl. Conf. on Computational Statistics (2010), pp. 177-187, Springer.
    • (2010) Proc. 19th Intl. Conf. on Computational Statistics , pp. 177-187
    • Bottou, L.1
  • 108
    • 84921478833 scopus 로고    scopus 로고
    • Stochastic gradient tricks, neural networks
    • edited by G. Montavon, G.B. Orr and K.-R. Mueller, Lecture Notes in Computer Science (LNCS 7700), Springer
    • L. Bottou, "Stochastic gradient tricks, neural networks," in Tricks of the Trade, Reloaded, pp. 430-445, edited by G. Montavon, G.B. Orr and K.-R. Mueller, Lecture Notes in Computer Science (LNCS 7700), Springer, 2012.
    • (2012) Tricks of the Trade, Reloaded , pp. 430-445
    • Bottou, L.1
  • 109
    • 27144489164 scopus 로고    scopus 로고
    • A tutorial on support vector machines for pattern recognition
    • C.J.C. Burges, "A tutorial on support vector machines for pattern recognition," Data Mining and Knowledge Discovery 2 (1998), pp. 121-167.
    • (1998) Data Mining and Knowledge Discovery , vol.2 , pp. 121-167
    • Burges, C.J.C.1
  • 111
    • 34249753618 scopus 로고
    • Support-vector networks
    • C. Cortes and V.N. Vapnik, "Support-vector networks," Machine Learning 20 (1995), pp. 273-297.
    • (1995) Machine Learning , vol.20 , pp. 273-297
    • Cortes, C.1    Vapnik, V.N.2
  • 112
    • 0033281425 scopus 로고    scopus 로고
    • Large margin classification using the perceptron algorithm
    • Y. Freund and R.E. Schapire, "Large margin classification using the perceptron algorithm," Machine Learning 37 (1999), pp. 277-296.
    • (1999) Machine Learning , vol.37 , pp. 277-296
    • Freund, Y.1    Schapire, R.E.2
  • 113
    • 33749563073 scopus 로고    scopus 로고
    • Training linear SVMs in linear time
    • T. Joachims, "Training linear SVMs in linear time." Proc. 12th ACM SIGKDD (2006), pp. 217-226.
    • (2006) Proc. 12th ACM SIGKDD , pp. 217-226
    • Joachims, T.1
  • 114
    • 34250091945 scopus 로고
    • Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm
    • N. Littlestone, "Learning quickly when irrelevant attributes abound: a new linear-threshold algorithm," Machine Learning 2 (1988), pp. 285-318.
    • (1988) Machine Learning , vol.2 , pp. 285-318
    • Littlestone, N.1
  • 116
    • 11144273669 scopus 로고
    • The perceptron: A probabilistic model for information storage and organization in the brain
    • F. Rosenblatt, "The perceptron: a probabilistic model for information storage and organization in the brain," Psychological Review 65: 6 (1958), pp. 386-408.
    • (1958) Psychological Review , vol.65 , Issue.6 , pp. 386-408
    • Rosenblatt, F.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.