메뉴 건너뛰기




Volumn 4, Issue 1-3, 2011, Pages 1-294

Synopses for massive data: Samples, histograms, wavelets, sketches

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATE QUERY PROCESSING; BASIC PRINCIPLES; DATA SETS; ERROR BOUND; HIGH-SPEED DATA; MASSIVE DATA; MASSIVE DATA SETS; OPTIMALITY; RANDOM SAMPLE; SPACE AND TIME;

EID: 84858061988     PISSN: 19317883     EISSN: 19317891     Source Type: Journal    
DOI: 10.1561/1900000004     Document Type: Article
Times cited : (432)

References (299)
  • 1
    • 79957809015 scopus 로고    scopus 로고
    • HadoopDB: An architectural hybrid of MapReduce and DBMS technologies for analytical workloads
    • A. Abouzeid, K. Bajda-Pawlikowski, D. J. Abadi, A. Rasin, and A. Silberschatz, "HadoopDB: An architectural hybrid of MapReduce and DBMS technologies for analytical workloads," PVLDB, vol. 2, no. 1, pp. 922-933, 2009.
    • (2009) PVLDB , vol.2 , Issue.1 , pp. 922-933
    • Abouzeid, A.1    Bajda-Pawlikowski, K.2    Abadi, D.J.3    Rasin, A.4    Silberschatz, A.5
  • 8
    • 79959689047 scopus 로고    scopus 로고
    • Streaming algorithms from precision sampling
    • abs/1011.1263
    • A. Andoni, R. Krauthgamer, and K. Onak, "Streaming algorithms from precision sampling," CoRR, p. abs/1011.1263, 2010.
    • (2010) CoRR
    • Andoni, A.1    Krauthgamer, R.2    Onak, K.3
  • 14
    • 10644244988 scopus 로고    scopus 로고
    • Sampling from a moving window over streaming data
    • B. Babcock, M. Datar, and R. Motwani, "Sampling from a moving window over streaming data," in SODA, pp. 633-634, 2002.
    • (2002) SODA , pp. 633-634
    • Babcock, B.1    Datar, M.2    Motwani, R.3
  • 15
    • 77954707631 scopus 로고    scopus 로고
    • Green: A framework for supporting energyconscious programming using controlled approximation
    • W. Baek and T. Chilimbi, "Green: A framework for supporting energyconscious programming using controlled approximation," in Proceedings of PLDI, pp. 198-209, 2010.
    • (2010) Proceedings of PLDI , pp. 198-209
    • Baek, W.1    Chilimbi, T.2
  • 18
    • 84945709924 scopus 로고
    • On the approximation of curves by line segments using dynamic programming
    • R. Bellman, "On the approximation of curves by line segments using dynamic programming," Communications of ACM, vol. 4, no. 6, p. 284, 1961.
    • (1961) Communications of ACM , vol.4 , Issue.6 , pp. 284
    • Bellman, R.1
  • 24
    • 0014814325 scopus 로고
    • Space/time trade-offs in hash coding with allowable errors
    • July
    • B. Bloom, "Space/time trade-offs in hash coding with allowable errors," Communications of the ACM, vol. 13, no. 7, pp. 422-426, July 1970.
    • (1970) Communications of the ACM , vol.13 , Issue.7 , pp. 422-426
    • Bloom, B.1
  • 26
    • 70450232823 scopus 로고    scopus 로고
    • Network applications of bloom filters: A survey
    • A. Z. Broder and M. Mitzenmacher, "Network applications of bloom filters: A survey," Internet Mathematics, vol. 1, no. 4, 2003.
    • (2003) Internet Mathematics , vol.1 , Issue.4
    • Broder, A.Z.1    Mitzenmacher, M.2
  • 27
    • 33749584309 scopus 로고    scopus 로고
    • Techniques for warehousing of sample data
    • DOI 10.1109/ICDE.2006.157, 1617374, Proceedings of the 22nd International Conference on Data Engineering, ICDE '06
    • P. G. Brown and P. J. Haas, "Techniques for warehousing of sample data," in Proceedings of the International Conference on Data Engineering, p. 6, Washington, DC, USA, 2006. (Pubitemid 44539798)
    • (2006) Proceedings - International Conference on Data Engineering , vol.2006 , pp. 6
    • Brown, P.G.1    Haas, P.J.2
  • 28
    • 0023842101 scopus 로고
    • The estimates of Laplace. an example: Research concerning the population of a large empire, 1785-1812
    • B. Bru, "The estimates of Laplace. an example: Research concerning the population of a large empire, 1785-1812," in Journal de la Société de statistique de Paris, vol. 129, pp. 6-45, 1988.
    • (1988) Journal de la Société de Statistique de Paris , vol.129 , pp. 6-45
    • Bru, B.1
  • 31
    • 34548304108 scopus 로고    scopus 로고
    • A fast and compact method for unveiling significant patterns in high speed networks
    • T. Bu, J. Cao, A. Chen, and P. P. C. Lee, "A fast and compact method for unveiling significant patterns in high speed networks," in IEEE INFOCOMM, 2007.
    • (2007) IEEE INFOCOMM
    • Bu, T.1    Cao, J.2    Chen, A.3    Lee, P.P.C.4
  • 33
    • 79958094117 scopus 로고    scopus 로고
    • Estimating range queries using aggregate data with integrity constraints: A probabilistic approach
    • F. Buccafurri, F. Furfaro, and D. Saccà, "Estimating range queries using aggregate data with integrity constraints: A probabilistic approach," in Proceedings of the International Conference on Database Theory, pp. 390-404, 2001. (Pubitemid 33213338)
    • (2001) Lecture Notes in Computer Science , Issue.1973 , pp. 390-404
    • Buccafurri, F.1    Furfaro, F.2    Sacca, D.3
  • 35
    • 4444306224 scopus 로고    scopus 로고
    • Fast range query estimation by n-level tree histograms
    • F. Buccafurri and G. Lax, "Fast range query estimation by n-level tree histograms," Data Knowledge in Engineering, vol. 51, no. 2, pp. 257-275, 2004.
    • (2004) Data Knowledge in Engineering , vol.51 , Issue.2 , pp. 257-275
    • Buccafurri, F.1    Lax, G.2
  • 37
    • 46749153524 scopus 로고    scopus 로고
    • Enhancing histograms by tree-like bucket indices
    • F. Buccafurri, G. Lax, D. Saccà, L. Pontieri, and D. Rosaci, "Enhancing histograms by tree-like bucket indices," VLDB Journal, vol. 17, no. 5, pp. 1041-1061, 2008.
    • (2008) VLDB Journal , vol.17 , Issue.5 , pp. 1041-1061
    • Buccafurri, F.1    Lax, G.2    Saccà, D.3    Pontieri, L.4    Rosaci, D.5
  • 38
    • 34548771720 scopus 로고    scopus 로고
    • Space efficient streaming algorithms for the maximum error histogram
    • DOI 10.1109/ICDE.2007.368961, 4221751, 23rd International Conference on Data Engineering, ICDE 2007
    • C. Buragohain, N. Shrivastava, and S. Suri, "Space efficient streaming algorithms for the maximum error histogram," in Proceedings of the International Conference on Data Engineering, pp. 1026-1035, 2007. (Pubitemid 47422106)
    • (2007) Proceedings - International Conference on Data Engineering , pp. 1026-1035
    • Buragohaint, C.1    Slirvastava, N.2    Suri, S.3
  • 42
    • 0035486513 scopus 로고    scopus 로고
    • Approximate query processing using wavelets
    • K. Chakrabarti, M. N. Garofalakis, R. Rastogi, and K. Shim, "Approximate query processing using wavelets," The VLDB Journal, vol. 10, no. 2-3, pp. 199-223, September 2001. (Best of VLDB'2000 Special Issue). (Pubitemid 33404198)
    • (2001) VLDB Journal , vol.10 , Issue.2-3 , pp. 199-223
    • Chakrabarti, K.1    Garofalakis, M.2    Rastogi, R.3    Shim, K.4
  • 48
  • 50
    • 0347761807 scopus 로고    scopus 로고
    • On random sampling over joins
    • S. Chaudhuri, R. Motwani, and V. Narasayya, "On random sampling over joins," SIGMOD Record, vol. 28, no. 2, pp. 263-274, 1999.
    • (1999) SIGMOD Record , vol.28 , Issue.2 , pp. 263-274
    • Chaudhuri, S.1    Motwani, R.2    Narasayya, V.3
  • 51
    • 0032089874 scopus 로고    scopus 로고
    • Random sampling for histogram construction: How much is enough?
    • S. Chaudhuri, R. Motwani, and V. R. Narasayya, "Random sampling for histogram construction: How much is enough?," in Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 436-447, 1998. (Pubitemid 128655989)
    • (1998) SIGMOD Record , vol.27 , Issue.2 , pp. 436-447
    • Chaudhuri, S.1    Motwani, R.2    Narasayya, V.3
  • 54
    • 79960173379 scopus 로고    scopus 로고
    • Structure-aware sampling on data streams
    • E. Cohen, G. Cormode, and N. G. Duffield, "Structure-aware sampling on data streams," in SIGMETRICS, pp. 197-208, 2011.
    • (2011) SIGMETRICS , pp. 197-208
    • Cohen, E.1    Cormode, G.2    Duffield, N.G.3
  • 62
    • 84858067790 scopus 로고    scopus 로고
    • Probabilistic histograms for probabilistic data
    • G. Cormode, A. Deligiannakis, M. N. Garofalakis, and A. McGregor, "Probabilistic histograms for probabilistic data," PVLDB, vol. 2, no. 1, pp. 526-537, 2009.
    • (2009) PVLDB , vol.2 , Issue.1 , pp. 526-537
    • Cormode, G.1    Deligiannakis, A.2    Garofalakis, M.N.3    McGregor, A.4
  • 69
    • 8344272783 scopus 로고    scopus 로고
    • What's new: Finding significant differences in network data streams
    • G. Cormode and S. Muthukrishnan, "What's new: Finding significant differences in network data streams," in Proceedings of IEEE Infocom, 2004.
    • (2004) Proceedings of IEEE Infocom
    • Cormode, G.1    Muthukrishnan, S.2
  • 70
    • 14844367057 scopus 로고    scopus 로고
    • An improved data stream summary: The count-min sketch and its applications
    • DOI 10.1016/j.jalgor.2003.12.001, PII S0196677403001913
    • G. Cormode and S. Muthukrishnan, "An improved data stream summary: The count-min sketch and its applications," Journal of Algorithms, vol. 55, no. 1, pp. 58-75, 2005. (Pubitemid 40357145)
    • (2005) Journal of Algorithms , vol.55 , Issue.1 , pp. 58-75
    • Cormode, G.1    Muthukrishnan, S.2
  • 75
    • 67650088338 scopus 로고    scopus 로고
    • Probabilistic databases: Diamonds in the dirt
    • N. N. Dalvi, C. Ré, and D. Suciu, "Probabilistic databases: Diamonds in the dirt," Communications of the ACM, vol. 52, no. 7, pp. 86-94, 2009.
    • (2009) Communications of the ACM , vol.52 , Issue.7 , pp. 86-94
    • Dalvi, N.N.1    Ré, C.2    Suciu, D.3
  • 78
    • 0003833285 scopus 로고
    • Philadelphia, PA: Society for Industrial and Applied Mathematics (SIAM)
    • I. Daubechies, Ten Lectures on Wavelets. Philadelphia, PA: Society for Industrial and Applied Mathematics (SIAM), 1992.
    • (1992) Ten Lectures on Wavelets
    • Daubechies, I.1
  • 84
    • 85009724776 scopus 로고    scopus 로고
    • Nonlinear approximation
    • R. A. DeVore, "Nonlinear approximation," Acta Numerica, vol. 7, pp. 51-150, 1998.
    • (1998) Acta Numerica , vol.7 , pp. 51-150
    • Devore, R.A.1
  • 86
    • 33244483644 scopus 로고    scopus 로고
    • Histograms revisited: When are histograms the best approximation method for aggregates over joins?
    • DOI 10.1145/1065167.1065196, Proceedings of the Twenty-Fourth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2005
    • A. Dobra, "Histograms revisited: When are histograms the best approximation method for aggregates over joins?," in Proceedings of ACM Principles of Database Systems, pp. 228-237, 2005. (Pubitemid 43275485)
    • (2005) Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems , pp. 228-237
    • Dobra, A.1
  • 91
    • 8344290018 scopus 로고    scopus 로고
    • Estimating flow distributions from sampled flow statistics
    • N. Duffield, C. Lund, and M. Thorup, "Estimating flow distributions from sampled flow statistics," in ACM SIGCOMM, 2003.
    • (2003) ACM SIGCOMM
    • Duffield, N.1    Lund, C.2    Thorup, M.3
  • 93
    • 4544360452 scopus 로고    scopus 로고
    • New directions in traffic measurement and accounting
    • C. Estan and G. Varghese, "New directions in traffic measurement and accounting," in ACM SIGCOMM, 2002.
    • (2002) ACM SIGCOMM
    • Estan, C.1    Varghese, G.2
  • 94
    • 84945709595 scopus 로고
    • Development of sampling plans by using sequential (item by item) selection techniques and digital computers
    • C. T. Fan, M. E. Muller, and I. Rezucha, "Development of sampling plans by using sequential (item by item) selection techniques and digital computers," Journal of the American Statistical Association, pp. 387-402, 1962.
    • (1962) Journal of the American Statistical Association , pp. 387-402
    • Fan, C.T.1    Muller, M.E.2    Rezucha, I.3
  • 96
    • 0346575970 scopus 로고
    • On adaptive sampling
    • P. Flajolet, "On adaptive sampling," Computing, vol. 43, no. 4, 1990.
    • (1990) Computing , vol.43 , Issue.4
    • Flajolet, P.1
  • 97
    • 0020828424 scopus 로고
    • Probabilistic counting algorithms for database applications
    • P. Flajolet and G. N. Martin, "Probabilistic counting algorithms for database applications," Journal of Computer and System Sciences, vol. 31, pp. 182-209, 1985.
    • (1985) Journal of Computer and System Sciences , vol.31 , pp. 182-209
    • Flajolet, P.1    Martin, G.N.2
  • 99
    • 33750706512 scopus 로고    scopus 로고
    • Compressed histograms with arbitrary bucket layouts for selectivity estimation
    • DOI 10.1016/j.ins.2006.07.013, PII S0020025506002003
    • D. Fuchs, Z. He, and B. S. Lee, "Compressed histograms with arbitrary bucket layouts for selectivity estimation," Information on Science, vol. 177, no. 3, pp. 680-702, 2007. (Pubitemid 44708142)
    • (2007) Information Sciences , vol.177 , Issue.3 , pp. 680-702
    • Fuchs, D.1    He, Z.2    Lee, B.S.3
  • 100
    • 34248217002 scopus 로고    scopus 로고
    • Counting distinct items over update streams
    • DOI 10.1016/j.tcs.2007.02.031, PII S0304397507001223, Algorithms and Computation
    • S. Ganguly, "Counting distinct items over update streams," Theoretical Computer Science, vol. 378, no. 3, pp. 211-222, 2007. (Pubitemid 46719681)
    • (2007) Theoretical Computer Science , vol.378 , Issue.3 , pp. 211-222
    • Ganguly, S.1
  • 104
    • 84858017984 scopus 로고    scopus 로고
    • CR-precis: A deterministic summary structure for update data streams
    • S. Ganguly and A. Majumder, "CR-precis: A deterministic summary structure for update data streams," in ESCAPE, 2007.
    • (2007) ESCAPE
    • Ganguly, S.1    Majumder, A.2
  • 109
    • 3042590040 scopus 로고    scopus 로고
    • Probabilistic wavelet synopses
    • March. (SIGMOD/ PODS'2002 Special Issue)
    • M. Garofalakis and P. B. Gibbons, "Probabilistic wavelet synopses," ACM Transactions on Database Systems, vol. 29, no. 1, March 2004. (SIGMOD/ PODS'2002 Special Issue).
    • (2004) ACM Transactions on Database Systems , vol.29 , Issue.1
    • Garofalakis, M.1    Gibbons, P.B.2
  • 111
    • 33745197947 scopus 로고    scopus 로고
    • Wavelet synopses for general error metrics
    • December. (SIGMOD/PODS'2004 Special Issue)
    • M. Garofalakis and A. Kumar, "Wavelet synopses for general error metrics," ACM Transactions on Database Systems, vol. 30, no. 4, December 2005. (SIGMOD/PODS'2004 Special Issue).
    • (2005) ACM Transactions on Database Systems , vol.30 , Issue.4
    • Garofalakis, M.1    Kumar, A.2
  • 112
    • 70349681982 scopus 로고    scopus 로고
    • PhD Thesis, Technische Universität Dresden
    • R. Gemulla, "Sampling algorithms for evolving datasets," PhD Thesis, Technische Universität Dresden, Available at http://nbn-resolving. de/urn:nbn:de:bsz:14-ds-1224861856184-11644, 2009.
    • (2009) Sampling Algorithms for Evolving Datasets
    • Gemulla, R.1
  • 114
    • 57149135881 scopus 로고    scopus 로고
    • Sampling time-based sliding windows in bounded space
    • R. Gemulla and W. Lehner, "Sampling time-based sliding windows in bounded space," in SIGMOD Conference, pp. 379-392, 2008.
    • (2008) SIGMOD Conference , pp. 379-392
    • Gemulla, R.1    Lehner, W.2
  • 117
    • 38949217037 scopus 로고    scopus 로고
    • Maintaining bounded-size sample synopses of evolving datasets
    • R. Gemulla, W. Lehner, and P. J. Haas, "Maintaining bounded-size sample synopses of evolving datasets," VLDB Journal, vol. 17, no. 2, pp. 173-202, 2008.
    • (2008) VLDB Journal , vol.17 , Issue.2 , pp. 173-202
    • Gemulla, R.1    Lehner, W.2    Haas, P.J.3
  • 121
    • 84944323337 scopus 로고    scopus 로고
    • Distinct sampling for highly-accurate answers to distinct values queries and event reports
    • P. Gibbons, "Distinct sampling for highly-accurate answers to distinct values queries and event reports," in International Conference on Very Large Data Bases, 2001.
    • (2001) International Conference on Very Large Data Bases
    • Gibbons, P.1
  • 124
    • 0032092365 scopus 로고    scopus 로고
    • New sampling-based summary statistics for improving approximate query answers
    • P. B. Gibbons and Y. Matias, "New sampling-based summary statistics for improving approximate query answers," in Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 331-342, New York, NY, USA, 1998. (Pubitemid 128655980)
    • (1998) SIGMOD Record , vol.27 , Issue.2 , pp. 331-342
    • Gibbons, P.B.1    Matias, Y.2
  • 126
  • 132
    • 33745198571 scopus 로고    scopus 로고
    • Manuscript, September
    • S. Guha, "A note on wavelet optimization," (Manuscript available from: http: //www.cis.upenn.edu/~sudipto/note.html.), September 2004.
    • (2004) A Note on Wavelet Optimization
    • Guha, S.1
  • 133
    • 66749158142 scopus 로고    scopus 로고
    • On the space-time of optimal, approximate and streaming algorithms for synopsis construction problems
    • S. Guha, "On the space-time of optimal, approximate and streaming algorithms for synopsis construction problems," VLDB Journal, vol. 17, no. 6, pp. 1509-1535, 2008.
    • (2008) VLDB Journal , vol.17 , Issue.6 , pp. 1509-1535
    • Guha, S.1
  • 138
    • 33745187544 scopus 로고    scopus 로고
    • Approximation and streaming algorithms for histogram construction problems
    • S. Guha, N. Koudas, and K. Shim, "Approximation and streaming algorithms for histogram construction problems," ACM Transactions on Database Systems, vol. 31, no. 1, pp. 396-438, 2006.
    • (2006) ACM Transactions on Database Systems , vol.31 , Issue.1 , pp. 396-438
    • Guha, S.1    Koudas, N.2    Shim, K.3
  • 140
    • 34249794597 scopus 로고    scopus 로고
    • A note on linear time algorithms for maximum error histograms
    • DOI 10.1109/TKDE.2007.1039
    • S. Guha and K. Shim, "A note on linear time algorithms for maximum error histograms," IEEE Transactions on Knowledge Data Engineering, vol. 19, no. 7, pp. 993-997, 2007. (Pubitemid 46853315)
    • (2007) IEEE Transactions on Knowledge and Data Engineering , vol.19 , Issue.7 , pp. 993-997
    • Guha, S.1    Shim, K.2
  • 145
    • 38749147634 scopus 로고    scopus 로고
    • The need for speed: Speeding up DB2 UDB using sampling
    • P. J. Haas, "The need for speed: Speeding up DB2 UDB using sampling," IDUG Solutions Journal, vol. 10, no. 2, pp. 32-34, 2003.
    • (2003) IDUG Solutions Journal , vol.10 , Issue.2 , pp. 32-34
    • Haas, P.J.1
  • 148
    • 62249189663 scopus 로고    scopus 로고
    • Discovering and exploiting statistical properties for query optimization in relational databases: A survey
    • P. J. Haas, I. F. Ilyas, G. M. Lohman, and V. Markl, "Discovering and exploiting statistical properties for query optimization in relational databases: A survey," Statistical Analysis and Data Mining, vol. 1, no. 4, pp. 223-250, 2009.
    • (2009) Statistical Analysis and Data Mining , vol.1 , Issue.4 , pp. 223-250
    • Haas, P.J.1    Ilyas, I.F.2    Lohman, G.M.3    Markl, V.4
  • 150
    • 33645081700 scopus 로고    scopus 로고
    • An estimator of the number of species from quadrat sampling
    • P. J. Haas, Y. Liu, and L. Stokes, "An estimator of the number of species from quadrat sampling," Biometrics, vol. 62, pp. 135-141, 2006.
    • (2006) Biometrics , vol.62 , pp. 135-141
    • Haas, P.J.1    Liu, Y.2    Stokes, L.3
  • 153
    • 0030169435 scopus 로고    scopus 로고
    • Selectivity and cost estimation for joins based on random sampling
    • DOI 10.1006/jcss.1996.0041
    • P. J. Haas, J. F. Naughton, S. Seshadri, and A. N. Swami, "Selectivity and cost estimation for joins based on random sampling," Journal of Computer and Systems Science, vol. 52, no. 3, pp. 550-569, 1996. (Pubitemid 126359777)
    • (1996) Journal of Computer and System Sciences , vol.52 , Issue.3 , pp. 550-569
    • Haas, P.J.1    Naughton, J.F.2    Seshadri, S.3    Swami, A.N.4
  • 155
    • 0032265203 scopus 로고    scopus 로고
    • Estimating the number of classes in a finite population
    • P. J. Haas and L. Stokes, "Estimating the number of classes in a finite population," Journal of American Statistical Association, vol. 93, no. 444, pp. 1475-1487, 1998. (Pubitemid 128385557)
    • (1998) Journal of the American Statistical Association , vol.93 , Issue.444 , pp. 1475-1487
    • Haas, P.J.1    Stokes, L.2
  • 157
    • 0025383763 scopus 로고
    • A guided tour of chernoff bounds
    • T. Hagerup and C. Rüb, "A guided tour of chernoff bounds," Information on Processing Letters, vol. 33, no. 6, pp. 305-308, 1990.
    • (1990) Information on Processing Letters , vol.33 , Issue.6 , pp. 305-308
    • Hagerup, T.1    Rüb, C.2
  • 159
    • 84972513578 scopus 로고
    • Some history and reminiscences on survey sampling
    • M. Hansen, "Some history and reminiscences on survey sampling," in Statistical Science, vol. 2, pp. 180-190, 1987.
    • (1987) Statistical Science , vol.2 , pp. 180-190
    • Hansen, M.1
  • 163
    • 43049099016 scopus 로고    scopus 로고
    • Proactive and reactive multi-dimensional histogram maintenance for selectivity estimation
    • DOI 10.1016/j.jss.2007.03.088, PII S0164121207000866, Selected Papers from the 2006 Brazilian Symposia on Databases and on Software Engineering
    • Z. He, B. S. Lee, and X. S. Wang, "Proactive and reactive multi-dimensional histogram maintenance for selectivity estimation," Journal of Systems and Software, vol. 81, no. 3, pp. 414-430, 2008. (Pubitemid 351625346)
    • (2008) Journal of Systems and Software , vol.81 , Issue.3 , pp. 414-430
    • He, Z.1    Lee, B.S.2    Wang, X.S.3
  • 165
    • 84906262024 scopus 로고    scopus 로고
    • Algorithmic challenges in search engines
    • M. Henzinger, "Algorithmic challenges in search engines," Internet Mathematics, vol. 1, no. 1, pp. 115-126, 2003.
    • (2003) Internet Mathematics , vol.1 , Issue.1 , pp. 115-126
    • Henzinger, M.1
  • 169
    • 84947403595 scopus 로고
    • Probability inequalities for sums of bounded random variables
    • W. Hoeffding, "Probability inequalities for sums of bounded random variables," Journal of the American Statistical Association, vol. 58, no. 301, p. 1330, 1963.
    • (1963) Journal of the American Statistical Association , vol.58 , Issue.301 , pp. 1330
    • Hoeffding, W.1
  • 170
    • 84947396376 scopus 로고
    • A generalization of sampling without replacement from a finite universe
    • D. G. Horvitz and D. J. Thompson, "A generalization of sampling without replacement from a finite universe," Journal of the American Statistical Association, vol. 47, pp. 663-695, 1952.
    • (1952) Journal of the American Statistical Association , vol.47 , pp. 663-695
    • Horvitz, D.G.1    Thompson, D.J.2
  • 174
    • 0034504507 scopus 로고    scopus 로고
    • Stable distributions, pseudorandom generators, embeddings and data stream computation
    • P. Indyk, "Stable distributions, pseudorandom generators, embeddings and data stream computation," in IEEE Conference on Foundations of Computer Science, 2000.
    • (2000) IEEE Conference on Foundations of Computer Science
    • Indyk, P.1
  • 181
    • 0027872183 scopus 로고
    • Optimal histograms for limiting worst-case error propagation in the size of join results
    • Y. E. Ioannidis and S. Christodoulakis, "Optimal histograms for limiting worst-case error propagation in the size of join results," ACM Transactions on Database Systems, vol. 18, no. 4, 1993.
    • (1993) ACM Transactions on Database Systems , vol.18 , Issue.4
    • Ioannidis, Y.E.1    Christodoulakis, S.2
  • 186
    • 0028496320 scopus 로고
    • An overview of wavelet based multiresolution analyses
    • B. Jawerth and W. Sweldens, "An overview of wavelet based multiresolution analyses," SIAM Review, vol. 36, no. 3, pp. 377-412, 1994.
    • (1994) SIAM Review , vol.36 , Issue.3 , pp. 377-412
    • Jawerth, B.1    Sweldens, W.2
  • 193
    • 33749648773 scopus 로고    scopus 로고
    • New sampling-based estimators for OLAP queries
    • DOI 10.1109/ICDE.2006.106, 1617386, Proceedings of the 22nd International Conference on Data Engineering, ICDE '06
    • R. Jin, L. Glimcher, C. Jermaine, and G. Agrawal, "New sampling-based estimators for OLAP queries," in Proceedings of the International Conference on Data Engineering, p. 18, Washington, DC, USA, 2006. (Pubitemid 44539810)
    • (2006) Proceedings - International Conference on Data Engineering , vol.2006 , pp. 18
    • Jin, R.1    Glimcher, L.2    Jermaine, C.3    Agrawal, G.4
  • 194
    • 0001654702 scopus 로고
    • Extensions of Lipshitz mapping into Hilbert space
    • W. Johnson and J. Lindenstrauss, "Extensions of Lipshitz mapping into Hilbert space," Contemporary Mathematics, vol. 26, pp. 189-206, 1984.
    • (1984) Contemporary Mathematics , vol.26 , pp. 189-206
    • Johnson, W.1    Lindenstrauss, J.2
  • 195
    • 58149477069 scopus 로고    scopus 로고
    • Sampling-based estimators for subset-based queries
    • Accepted for Publication
    • S. Joshi and C. Jermaine, "Sampling-based estimators for subset-based queries," VLDB Journal, Accepted for Publication, 2008.
    • (2008) VLDB Journal
    • Joshi, S.1    Jermaine, C.2
  • 196
    • 79960182864 scopus 로고    scopus 로고
    • Tight bounds for lp samplers, finding duplicates in streams, and related problems
    • H. Jowhari, M. Saglam, and G. Tardos, "Tight bounds for lp samplers, finding duplicates in streams, and related problems," in ACM Principles of Database Systems, 2011.
    • (2011) ACM Principles of Database Systems
    • Jowhari, H.1    Saglam, M.2    Tardos, G.3
  • 198
    • 77955274634 scopus 로고    scopus 로고
    • Optimality and scalability in lattice histogram construction
    • P. Karras, "Optimality and scalability in lattice histogram construction," PVLDB, vol. 2, no. 1, pp. 670-681, 2009.
    • (2009) PVLDB , vol.2 , Issue.1 , pp. 670-681
    • Karras, P.1
  • 199
    • 51149084988 scopus 로고    scopus 로고
    • Hierarchical synopses with optimal error guarantees
    • August
    • P. Karras and N. Mamoulis, "Hierarchical synopses with optimal error guarantees," ACM Transactions on Database Systems, vol. 33, no. 3, August 2008.
    • (2008) ACM Transactions on Database Systems , vol.33 , Issue.3
    • Karras, P.1    Mamoulis, N.2
  • 205
    • 77954697258 scopus 로고    scopus 로고
    • Consistent histograms in the presence of distinct value counts
    • R. Kaushik and D. Suciu, "Consistent histograms in the presence of distinct value counts," PVLDB, vol. 2, no. 1, pp. 850-861, 2009.
    • (2009) PVLDB , vol.2 , Issue.1 , pp. 850-861
    • Kaushik, R.1    Suciu, D.2
  • 207
    • 84950971965 scopus 로고    scopus 로고
    • Efficient array partitioning
    • Automata, Languages and Programming: 24th International Colloquium, ICALP '97
    • S. Khanna, S. Muthukrishnan, and S. Skiena, "Efficient array partitioning," in Proceedings of the International Colloquium on Automata, Languages and Programming, pp. 616-626, 1997. (Pubitemid 127097776)
    • (1997) Lecture Notes in Computer Science , Issue.1256 , pp. 616-626
    • Khanna, S.1    Muthukrishnan, S.2    Skiena, S.3
  • 215
    • 33745632145 scopus 로고    scopus 로고
    • CXHist: An on-line classification-based histogram for XML string selectivity estimation
    • VLDB 2005 - Proceedings of 31st International Conference on Very Large Data Bases
    • L. Lim, M. Wang, and J. S. Vitter, "CXHist: An on-line classification-based histogram for XML string selectivity estimation," in Proceedings of the International Conference on Very Large Data Bases, pp. 1187-1198, 2005. (Pubitemid 43991088)
    • (2005) VLDB 2005 - Proceedings of 31st International Conference on Very Large Data Bases , vol.3 , pp. 1187-1198
    • Lim, L.1    Wang, M.2    Vitter, J.S.3
  • 223
    • 33846807386 scopus 로고    scopus 로고
    • Optimal workload-based weighted wavelet synopses
    • DOI 10.1016/j.tcs.2006.11.018, PII S0304397506008668
    • Y. Matias and D. Urieli, "Optimal workload-based weighted wavelet synopses," Theoretical Computer Science, vol. 371, no. 3, pp. 227-246, 2007. (Pubitemid 46215859)
    • (2007) Theoretical Computer Science , vol.371 , Issue.3 , pp. 227-246
    • Matias, Y.1    Urieli, D.2
  • 224
    • 0032094250 scopus 로고    scopus 로고
    • Wavelet-based histograms for selectivity estimation
    • Y. Matias, J. S. Vitter, and M. Wang, "Wavelet-based histograms for selectivity estimation," in Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 448-459, Seattle, Washington, June 1998. (Pubitemid 128655990)
    • (1998) SIGMOD Record , vol.27 , Issue.2 , pp. 448-459
    • Matias, Y.1    Vitter, J.S.2    Wang, M.3
  • 227
    • 84858041785 scopus 로고    scopus 로고
    • Microsoft. Microsoft StreamInsight. http://msdn.microsoft.com/en-us/ library/ee362541.aspx, 2008.
    • (2008) Microsoft StreamInsight
  • 230
    • 77954737810 scopus 로고    scopus 로고
    • Preventing bad plans by bounding the impact of cardinality estimation errors
    • G. Moerkotte, T. Neumann, and G. Steidl, "Preventing bad plans by bounding the impact of cardinality estimation errors," PVLDB, vol. 2, no. 1, pp. 982-993, 2009.
    • (2009) PVLDB , vol.2 , Issue.1 , pp. 982-993
    • Moerkotte, G.1    Neumann, T.2    Steidl, G.3
  • 233
    • 84872851578 scopus 로고
    • Selection and sorting with limited storage
    • J. I. Munro and M. S. Paterson, "Selection and sorting with limited storage," Theoretical Computer Science, vol. 12, pp. 315-323, 1980.
    • (1980) Theoretical Computer Science , vol.12 , pp. 315-323
    • Munro, J.I.1    Paterson, M.S.2
  • 236
    • 30344485261 scopus 로고    scopus 로고
    • Data streams: Algorithms and applications
    • DOI 10.1561/0400000002
    • S. Muthukrishnan, "Data streams: Algorithms and applications," Foundations and Trends® in Theoretical Computer Science, vol. 1, no. 2, pp. 117-236, 2005. (Pubitemid 44111222)
    • (2005) Foundations and Trends in Theoretical Computer Science , vol.1 , Issue.2 , pp. 117-236
    • Muthukrishnan, S.1
  • 239
    • 27144509742 scopus 로고    scopus 로고
    • Workload-optimal histograms on streams
    • Algorithms - ESA 2005: 13th Annual European Symposium. Proceedings
    • S. Muthukrishnan, M. Strauss, and X. Zheng, "Workload-optimal histograms on streams," in Proceedings of ESA, pp. 734-745, 2005. (Pubitemid 41491571)
    • (2005) Lecture Notes in Computer Science , vol.3669 , pp. 734-745
    • Muthukrishnan, S.1    Strauss, M.2    Zheng, X.3
  • 240
    • 0000399707 scopus 로고
    • On the two different aspects of the representative method: The method of stratified sampling and the method of purposive selection
    • J. Neyman, "On the two different aspects of the representative method: The method of stratified sampling and the method of purposive selection," Journal of the Royal Statistical Society, vol. 97, pp. 558-625, 1934.
    • (1934) Journal of the Royal Statistical Society , vol.97 , pp. 558-625
    • Neyman, J.1
  • 241
    • 0004176090 scopus 로고    scopus 로고
    • Kendall's Advanced Theory of Statistics. Arnold, second ed.
    • A. O'Hagan and J. J. Forster, Bayesian Inference. Volume 2B of Kendall's Advanced Theory of Statistics. Arnold, second ed., 2004.
    • (2004) Bayesian Inference , vol.2 B
    • O'Hagan, A.1    Forster, J.J.2
  • 242
    • 0004140530 scopus 로고
    • Random sampling from databases
    • Lawrence Berekeley National Laboratory
    • F. Olken, "Random sampling from databases," Technical Report LBL-32883, Lawrence Berekeley National Laboratory, 1993.
    • (1993) Technical Report LBL-32883
    • Olken, F.1
  • 246
    • 0025446214 scopus 로고
    • Random sampling from hash files
    • F. Olken, D. Rotem, and P. Xu, "Random sampling from hash files," SIGMOD Record, vol. 19, no. 2, pp. 375-386, 1990.
    • (1990) SIGMOD Record , vol.19 , Issue.2 , pp. 375-386
    • Olken, F.1    Rotem, D.2    Xu, P.3
  • 248
    • 84863769684 scopus 로고    scopus 로고
    • Online aggregation for large MapReduce jobs
    • To appear
    • N. Pansare, V. Borkar, C. Jermaine, and T. Condie, "Online aggregation for large MapReduce jobs," PVLDB, vol. 5, 2011. (To appear).
    • (2011) PVLDB , vol.5
    • Pansare, N.1    Borkar, V.2    Jermaine, C.3    Condie, T.4
  • 249
    • 36849040673 scopus 로고    scopus 로고
    • Range-efficient counting of distinct elements in a massive data stream
    • DOI 10.1137/050643672
    • A. Pavan and S. Tirthapura, "Range-efficient counting of distinct elements in a massive data stream," SIAM Journal on Computing, vol. 37, no. 2, pp. 359-379, 2007. (Pubitemid 351584848)
    • (2007) SIAM Journal on Computing , vol.37 , Issue.2 , pp. 359-379
    • Pavan, A.1    Tirthapura, S.2
  • 250
    • 33745748044 scopus 로고    scopus 로고
    • Structure choices for two-dimensional histogram construction
    • H. T. A. Pham and K. C. Sevcik, "Structure choices for two-dimensional histogram construction," in Proceedings of CASCON, pp. 13-27, 2004.
    • (2004) Proceedings of CASCON , pp. 13-27
    • Pham, H.T.A.1    Sevcik, K.C.2
  • 252
    • 33749597352 scopus 로고    scopus 로고
    • XCluster synopses for structured XML content
    • DOI 10.1109/ICDE.2006.175, 1617431, Proceedings of the 22nd International Conference on Data Engineering, ICDE '06
    • N. Polyzotis and M. N. Garofalakis, "XCluster synopses for structured XML content," in Proceedings of the International Conference on Data Engineering, p. 63, 2006. (Pubitemid 44539855)
    • (2006) Proceedings - International Conference on Data Engineering , vol.2006 , pp. 63
    • Polyzotis, N.1    Garofalakis, M.2
  • 253
    • 33750183317 scopus 로고    scopus 로고
    • XSKETCH synopses for XML data graphs
    • DOI 10.1145/1166074.1166082
    • N. Polyzotis and M. N. Garofalakis, "XSKETCH synopses for XML data graphs," ACM Transactions on Database Systems, vol. 31, no. 3, pp. 1014-1063, 2006. (Pubitemid 44600831)
    • (2006) ACM Transactions on Database Systems , vol.31 , Issue.3 , pp. 1014-1063
    • Polyzotis, N.1    Garofalakis, M.2
  • 260
    • 0034355319 scopus 로고    scopus 로고
    • Online dynamic reordering
    • V. Raman, B. Raman, and J. M. Hellerstein, "Online dynamic reordering," VLDB Journal, vol. 9, no. 3, pp. 247-260, 2000.
    • (2000) VLDB Journal , vol.9 , Issue.3 , pp. 247-260
    • Raman, V.1    Raman, B.2    Hellerstein, J.M.3
  • 264
    • 0000318553 scopus 로고
    • Stochastic complexity and modeling
    • J. Rissanen, "Stochastic complexity and modeling," Annals of Statistics, vol. 14, no. 3, pp. 1080-1100, 1986.
    • (1986) Annals of Statistics , vol.14 , Issue.3 , pp. 1080-1100
    • Rissanen, J.1
  • 266
    • 58149463461 scopus 로고    scopus 로고
    • Hierarchically-compressed wavelet synopses
    • January
    • D. Sacharidis, A. Deligiannakis, and T. Sellis, "Hierarchically- compressed wavelet synopses," The VLDB Journal, vol. 18, no. 1, pp. 203-231, January 2009.
    • (2009) The VLDB Journal , vol.18 , Issue.1 , pp. 203-231
    • Sacharidis, D.1    Deligiannakis, A.2    Sellis, T.3
  • 267
    • 70349843819 scopus 로고    scopus 로고
    • Representing uncertain data: Models, properties, and algorithms
    • A. D. Sarma, O. Benjelloun, A. Y. Halevy, S. U. Nabar, and J. Widom, "Representing uncertain data: Models, properties, and algorithms," VLDB Journal, vol. 18, no. 5, pp. 989-1019, 2009.
    • (2009) VLDB Journal , vol.18 , Issue.5 , pp. 989-1019
    • Sarma, A.D.1    Benjelloun, O.2    Halevy, A.Y.3    Nabar, S.U.4    Widom, J.5
  • 271
  • 281
    • 1842435123 scopus 로고    scopus 로고
    • Tabulation based 4-universal hashing with applications to second moment estimation
    • M. Thorup and Y. Zhang, "Tabulation based 4-universal hashing with applications to second moment estimation," in ACM-SIAM Symposium on Discrete Algorithms, 2004.
    • (2004) ACM-SIAM Symposium on Discrete Algorithms
    • Thorup, M.1    Zhang, Y.2
  • 287
    • 79961180074 scopus 로고    scopus 로고
    • A multi-dimensional histogram for selectivity estimation and fast approximate query answering
    • H.Wang and K. C. Sevcik, "A multi-dimensional histogram for selectivity estimation and fast approximate query answering," in Proceedings of CASCON, pp. 328-342, 2003.
    • (2003) Proceedings of CASCON , pp. 328-342
    • Wang, H.1    Sevcik, K.C.2
  • 288
    • 43249085036 scopus 로고    scopus 로고
    • Histograms based on the minimum description length principle
    • H. Wang and K. C. Sevcik, "Histograms based on the minimum description length principle," VLDB Journal, vol. 17, no. 3, 2008.
    • (2008) VLDB Journal , vol.17 , Issue.3
    • Wang, H.1    Sevcik, K.C.2
  • 289
    • 0025449292 scopus 로고
    • A linear-time probabilistic counting algorithm for database applications
    • K. Y. Whang, B. T. Vander-Zanden, and H. M. Taylor, "A linear-time probabilistic counting algorithm for database applications," ACM Transactions on Database Systems, vol. 15, no. 2, p. 208, 1990.
    • (1990) ACM Transactions on Database Systems , vol.15 , Issue.2 , pp. 208
    • Whang, K.Y.1    Vander-Zanden, B.T.2    Taylor, H.M.3
  • 299
    • 84858043940 scopus 로고
    • Translations of Mathematical Monographs, American Mathematical Society
    • V. M. Zolotarev, One Dimensional Stable Distributions, volume 65 of Translations of Mathematical Monographs. American Mathematical Society, 1983.
    • (1983) One Dimensional Stable Distributions , vol.65
    • Zolotarev, V.M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.