Volume 17, Issue 2, 2008, Pages 173-201

Maintaining bounded-size sample synopses of evolving datasets

Author keywords

Database sampling; Reservoir sampling; Sample maintenance; Synopsis

EID: 38949217037     PISSN: 10668888     EISSN: None     Source Type: Journal    
DOI: 10.1007/s00778-007-0065-y     Document Type: Article
Times cited: 52

References (46)
  • 1
    • 10644244988
    • Sampling from a moving window over streaming data
    • Babcock, B., Datar, M., Motwani, R.: Sampling from a moving window over streaming data. In: Proc. SODA, pp. 633-634 (2002)
    • (2002) Proc. SODA , pp. 633-634
    • Babcock, B.1    Datar, M.2    Motwani, R.3
  • 3
    • 85012113083
    • BHUNT: Automatic discovery of fuzzy algebraic constraints in relational data
    • Brown, P., Haas, P.J.: BHUNT: automatic discovery of fuzzy algebraic constraints in relational data. In: Proc. VLDB, pp. 668-679 (2003)
    • (2003) Proc. VLDB , pp. 668-679
    • Brown, P.1    Haas, P.J.2
  • 4
    • 33749584309
    • Techniques for warehousing of sample data
    • Brown, P.G., Haas, P.J.: Techniques for warehousing of sample data. In: Proc. ICDE (2006)
    • (2006) Proc. ICDE
    • Brown, P.G.1    Haas, P.J.2
  • 7
    • 33745615174
    • Summarizing and mining inverse distributions on data streams via dynamic inverse sampling
    • Cormode, G., Muthukrishnan, S., Rozenbaum, I.: Summarizing and mining inverse distributions on data streams via dynamic inverse sampling. In: Proc. VLDB, pp. 25-36 (2005)
    • (2005) Proc. VLDB , pp. 25-36
    • Cormode, G.1    Muthukrishnan, S.2    Rozenbaum, I.3
  • 8
    • 84945709595
    • Development of sampling plans by using sequential (item by item) techniques and digital computers
    • Fan C., Muller M. and Rezucha I. (1962). Development of sampling plans by using sequential (item by item) techniques and digital computers. J. Am. Statist. Assoc. 57: 387-402
    • (1962) J. Am. Statist. Assoc. , vol.57 , pp. 387-402
    • Fan, C.1    Muller, M.2    Rezucha, I.3
  • 10
    • 33745587286
    • Deferred maintenance of disk-based random samples
    • Gemulla, R., Lehner, W.: Deferred maintenance of disk-based random samples. In: Proc. EDBT, pp. 423-441 (2006)
    • (2006) Proc. EDBT , pp. 423-441
    • Gemulla, R.1    Lehner, W.2
  • 11
    • 35448945510
    • A dip in the reservoir: Maintaining sample synopses of evolving datasets
    • Gemulla, R., Lehner, W., Haas, P.J.: A dip in the reservoir: Maintaining sample synopses of evolving datasets. In: Proc. VLDB, pp. 595-606 (2006)
    • (2006) Proc. VLDB , pp. 595-606
    • Gemulla, R.1    Lehner, W.2    Haas, P.J.3
  • 12
    • 35448945249
    • Maintaining Bernoulli samples over evolving multisets
    • Gemulla, R., Lehner, W., Haas, P.J.: Maintaining Bernoulli samples over evolving multisets. In: Proc. ACM PODS, pp. 93-102 (2007)
    • (2007) Proc. ACM PODS , pp. 93-102
    • Gemulla, R.1    Lehner, W.2    Haas, P.J.3
  • 14
    • 0032092365
    • New sampling-based summary statistics for improving approximate query answers
    • Gibbons, P.B., Matias, Y.: New sampling-based summary statistics for improving approximate query answers. In: Proc. ACM SIGMOD, pp. 331-342 (1998)
    • (1998) Proc. ACM SIGMOD , pp. 331-342
    • Gibbons, P.B.1    Matias, Y.2
  • 15
    • 0041401239
    • Fast incremental maintenance of approximate histograms
    • Gibbons P.B., Matias Y. and Poosala V. (2002). Fast incremental maintenance of approximate histograms. ACM Trans. Database Syst. 27: 182-184
    • (2002) ACM Trans. Database Syst. , vol.27 , pp. 182-184
    • Gibbons, P.B.1    Matias, Y.2    Poosala, V.3
  • 16
    • 38949106163
    • GSL: GNU Scientific Library
    • GSL: GNU Scientific Library. http://www.gnu.org/software/gsl/
  • 17
    • 3142745395
    • A bi-level Bernoulli scheme for database sampling
    • Haas, P., König, C.: A bi-level Bernoulli scheme for database sampling. In: Proc. ACM SIGMOD, pp. 275-286 (2004)
    • (2004) Proc. ACM SIGMOD , pp. 275-286
    • Haas, P.1    König, C.2
  • 18
    • 85046804058
    • Data stream sampling: Basic techniques and results
    • Garofalakis, M., Gehrke, J., Rastogi, R. (eds.) Springer, Heidelberg
    • Haas, P.J.: Data stream sampling: Basic techniques and results. In: Garofalakis, M., Gehrke, J., Rastogi, R. (eds.) Data Stream Management: Processing High Speed Data Streams, Springer, Heidelberg (2007)
    • (2007) Data Stream Management: Processing High Speed Data Streams
    • Haas, P.J.1
  • 22
    • 3142708793
    • CORDS: Automatic discovery of correlations and soft functional dependencies
    • Ilyas, I.F., Markl, V., Haas, P.J., Brown, P., Aboulnaga, A.: CORDS: automatic discovery of correlations and soft functional dependencies. In: Proc. ACM SIGMOD, pp. 647-658 (2004)
    • (2004) Proc. ACM SIGMOD , pp. 647-658
    • Ilyas, I.F.1    Markl, V.2    Haas, P.J.3    Brown, P.4    Aboulnaga, A.5
  • 23
    • 3142776431
    • Online maintenance of very large random samples
    • Jermaine, C., Pol, A., Arumugam, S.: Online maintenance of very large random samples. In: Proc. ACM SIGMOD, pp. 299-310 (2004)
    • (2004) Proc. ACM SIGMOD , pp. 299-310
    • Jermaine, C.1    Pol, A.2    Arumugam, S.3
  • 24
    • 85049139321
    • Static versus dynamic sampling for data mining
    • John, G.H., Langley, P.: Static versus dynamic sampling for data mining. In: Proc. KDD, pp. 367-370 (2005)
    • (2005) Proc. KDD , pp. 367-370
    • John, G.H.1    Langley, P.2
  • 26
    • 0345904587
    • Computer generation of hypergeometric random variables
    • Kachitvichyanukul V. and Schmeiser B. (1985). Computer generation of hypergeometric random variables. J. Stat. Comput. Simul 22: 127-145
    • (1985) J. Stat. Comput. Simul , vol.22 , pp. 127-145
    • Kachitvichyanukul, V.1    Schmeiser, B.2
  • 27
    • 0027927896
    • The power of sampling in knowledge discovery
    • Kivinen, J., Mannila, H.: The power of sampling in knowledge discovery. In: Proc. ACM PODS, pp. 77-85 (1994)
    • (1994) Proc. ACM PODS , pp. 77-85
    • Kivinen, J.1    Mannila, H.2
  • 30
    • 49749149893
    • Uniform random number generation
    • Elsevier Amsterdam
    • L'Ecuyer P. (2006). Uniform random number generation. In: Henderson, S.G. and Nelson, B.L. (eds) Simulation, pp 55-81. Elsevier, Amsterdam
    • (2006) Simulation , pp. 55-81
    • L'Ecuyer, P.1    Henderson, S.G.2    Nelson, B.L.3
  • 31
    • 33749598050
    • (Almost) hands-off information integration for the life sciences
    • Leser, U., Naumann, F.: (Almost) hands-off information integration for the life sciences. In: Proc. CIDR, pp. 131-143 (2005)
    • (2005) Proc. CIDR , pp. 131-143
    • Leser, U.1    Naumann, F.2
  • 32
    • 0031599142
    • Mersenne twister: A 623-dimensionally equidistributed uniform pseudo-random number generator
    • Issue 1
    • Matsumoto M. and Nishimura T. (1998). Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator. ACM Trans. Model. Comput. Simul. 8(1): 3-30
    • (1998) ACM Trans. Model. Comput. Simul. , vol.8 , pp. 3-30
    • Matsumoto, M.1    Nishimura, T.2
  • 33
    • 0020985860
    • A convenient algorithm for drawing a simple random sample
    • McLeod A.I. and Bellhouse D.R. (1983). A convenient algorithm for drawing a simple random sample. Appl. Statist. 32: 182-184
    • (1983) Appl. Statist. , vol.32 , pp. 182-184
    • McLeod, A.I.1    Bellhouse, D.R.2
  • 34
    • 0003781238
    • Cambridge University Press Cambridge
    • Norris J.R. (1997). Markov Chains. Cambridge University Press, Cambridge
    • (1997) Markov Chains
    • Norris, J.R.1
  • 35
    • 0004140530
    • Thesis LBL-32883, Information and Computing Sciences Division, Lawrence Berkeley National Laboratory
    • Olken, F.: Random sampling from databases. Thesis LBL-32883, Information and Computing Sciences Division, Lawrence Berkeley National Laboratory (1993)
    • (1993) Random Sampling from Databases
    • Olken, F.1
  • 36
    • 0026618403
    • Maintenance of materialized views of sampling queries
    • Olken, F., Rotem, D.: Maintenance of materialized views of sampling queries. In: Proc. ICDE (1992)
    • (1992) Proc. ICDE
    • Olken, F.1    Rotem, D.2
  • 37
    • 0030157406
    • Improved histograms for selectivity estimation of range predicates
    • Poosala, V., Haas, P.J., Ioannidis, Y.E., Shekita, E.J.: Improved histograms for selectivity estimation of range predicates. In: Proc. ACM SIGMOD, pp. 294-305 (1996)
    • (1996) Proc. ACM SIGMOD , pp. 294-305
    • Poosala, V.1    Haas, P.J.2    Ioannidis, Y.E.3    Shekita, E.J.4
  • 39
    • 0000016172
    • A stochastic approximation method
    • Robbins H. and Monro S. (1951). A stochastic approximation method. Ann. Math. Statist. 22: 400-407
    • (1951) Ann. Math. Statist. , vol.22 , pp. 400-407
    • Robbins, H.1    Monro, S.2
  • 44
    • 0021464004
    • Faster methods for random sampling
    • Issue 7
    • Vitter J.S. (1984). Faster methods for random sampling. Commun. ACM 27(7): 703-718
    • (1984) Commun. ACM , vol.27 , pp. 703-718
    • Vitter, J.S.1
  • 45
    • 0022026217
    • Random sampling with a reservoir
    • Issue 1
    • Vitter J.S. (1985). Random sampling with a reservoir. ACM Trans. Math. Softw. 11(1): 37-57
    • (1985) ACM Trans. Math. Softw. , vol.11 , pp. 37-57
    • Vitter, J.S.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.