SCOPUS Information Retrieval Platform

Proceedings of the ACM SIGMOD International Conference on Management of Data

Volume , Issue , 2010, Pages 855-866

Unbiased estimation of size and other aggregates over hidden web databases

(5) Dasgupta, Arjun a; Jin, Xin b; Jewell, Bradley a; Zhang, Nan b; Das, Gautam a

a UNIVERSITY OF TEXAS AT ARLINGTON (United States)

b GEORGE WASHINGTON UNIVERSITY (United States)

Author keywords

aggregate query processing; hidden databases

Indexed keywords

AGGREGATE QUERY PROCESSING; APPROXIMATE QUERY PROCESSING; FORM-LIKE INTERFACES; HIDDEN WEB DATABASE; NOVEL TECHNIQUES; SEARCH QUERIES; UNBIASED ESTIMATES; UNBIASED ESTIMATION; WEB INTERFACE;

AGGREGATES; QUERY PROCESSING;

DATABASE SYSTEMS;

EID: 77954730150 PISSN: 07308078 EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1807167.1807259 Document Type: Conference Paper

Times cited : (49)

References (28)

1
- 27544470630
- Modeling query-based access to text databases
- E. Agichtein, P. G. Ipeirotis, and L. Gravano. Modeling query-based access to text databases. In WebDB, 2003.
- (2003) WebDB
- Agichtein, E.¹ Ipeirotis, P.G.² Gravano, L.³

2
- 67649647516
- Crawling the content hidden behind web forms
- M. Alvarez, J. Raposo, A. Pan, F. Cacheda, F. Bellas, and V. Carneiro. Crawling the content hidden behind web forms. In ICCSA, 2007.
- (2007) ICCSA
- Alvarez, M.¹ Raposo, J.² Pan, A.³ Cacheda, F.⁴ Bellas, F.⁵ Carneiro, V.⁶

3
- 84884044458
- Handbook of capture-recapture analysis
- S. C. Amstrup, B. F. J. Manly, and T. L. McDonald. Handbook of capture-recapture analysis. Princeton University Press, 2005.
- (2005) Handbook of Capture-recapture Analysis
- Amstrup, S.C.¹ Manly, B.F.J.² McDonald, T.L.³

4
- 1142303671
- Dynamic sample selection for approximate query processing
- B. Babcock, S. Chaudhuri, and G. Das. Dynamic sample selection for approximate query processing. In SIGMOD, 2003.
- (2003) SIGMOD
- Babcock, B.¹ Chaudhuri, S.² Das, G.³

5
- 35348840330
- Efficient search engine measurements
- Z. Bar-Yossef and M. Gurevich. Efficient search engine measurements. In WWW, 2007.
- (2007) WWW
- Bar-Yossef, Z.¹ Gurevich, M.²

6
- 74549198813
- Mining search engine query logs via suggestion sampling
- Z. Bar-Yossef and M. Gurevich. Mining search engine query logs via suggestion sampling. In VLDB, 2008.
- (2008) VLDB
- Bar-Yossef, Z.¹ Gurevich, M.²

7
- 56349136928
- Random sampling from a search engine's corpus
- Z. Bar-Yossef and M. Gurevich. Random sampling from a search engine's corpus. Journal of the ACM, 55(5), 2008.
- (2008) Journal of the ACM , vol.55 , Issue.5
- Bar-Yossef, Z.¹ Gurevich, M.²

8
- 27544439829
- A technique for measuring the relative size and overlap of public web search engines
- K. Bharat and A. Broder. A technique for measuring the relative size and overlap of public web search engines. In WWW, 1998.
- (1998) WWW
- Bharat, K.¹ Broder, A.²

9
- 34547629212
- Estimating corpus size via queries
- A. Broder, M. Fontoura, V. Josifovski, R. Kumar, R. Motwani, S. U. Nabar, R. Panigrahy, A. Tomkins, and Y. Xu. Estimating corpus size via queries. In CIKM, 2006.
- (2006) CIKM
- Broder, A.¹ Fontoura, M.² Josifovski, V.³ Kumar, R.⁴ Motwani, R.⁵ Nabar, S.U.⁶ Panigrahy, R.⁷ Tomkins, A.⁸ Xu, Y.⁹

10
- 0036211203
- Evaluating top-k queries over web-accessible databases
- N. Bruno, L. Gravano, and A. Marian. Evaluating top-k queries over web-accessible databases. In ICDE, 2002.
- (2002) ICDE
- Bruno, N.¹ Gravano, L.² Marian, A.³

11
- 0002104204
- Query-based sampling of text databases
- J. P. Callan and M. E. Connell. Query-based sampling of text databases. ACM TOIS, 19(2):97-130, 2001.
- (2001) ACM TOIS , vol.19 , Issue.2 , pp. 97-130
- Callan, J.P.¹ Connell, M.E.²

12
- 0036372482
- Minimal probing: Supporting expensive predicates for top-k queries
- K. C.-C. Chang and S.-W. Hwang. Minimal probing: supporting expensive predicates for top-k queries. In SIGMOD, 2002.
- (2002) SIGMOD
- Chang, K.C.-C.¹ Hwang, S.-W.²

13
- 35448942785
- A random walk approach to sampling hidden databases
- A. Dasgupta, G. Das, and H. Mannila. A random walk approach to sampling hidden databases. In SIGMOD, 2007.
- (2007) SIGMOD
- Dasgupta, A.¹ Das, G.² Mannila, H.³

14
- 67649671784
- Leveraging count information in sampling hidden databases
- A. Dasgupta, N. Zhang, and G. Das. Leveraging count information in sampling hidden databases. In ICDE, 2009.
- (2009) ICDE
- Dasgupta, A.¹ Zhang, N.² Das, G.³

15
- 70849084625
- Privacy preservation of aggregates in hidden databases: Why and how?
- A. Dasgupta, N. Zhang, G. Das, and S. Chaudhuri. Privacy preservation of aggregates in hidden databases: Why and how? In SIGMOD, 2009.
- (2009) SIGMOD
- Dasgupta, A.¹ Zhang, N.² Das, G.³ Chaudhuri, S.⁴

16
- 0028447111
- Quickly generating billion-record synthetic databases
- J. Gray, P. Sundaresan, S. Englert, K. Baclawski, and P. J. Weinberger. Quickly generating billion-record synthetic databases. In SIGMOD, 1994.
- (1994) SIGMOD
- Gray, J.¹ Sundaresan, P.² Englert, S.³ Baclawski, K.⁴ Weinberger, P.J.⁵

17
- 18744389475
- A two-phase sampling technique for information extraction from hidden web databases
- Y.-L. Hedley, M. Younas, A. E. James, and M. Sanderson. A two-phase sampling technique for information extraction from hidden web databases. In WIDM, 2004.
- (2004) WIDM
- Hedley, Y.-L.¹ Younas, M.² James, A.E.³ Sanderson, M.⁴

18
- 33748195920
- Sampling, information extraction and summarisation of hidden web databases
- Y.-L. Hedley, M. Younas, A. E. James, and M. Sanderson. Sampling, information extraction and summarisation of hidden web databases. Data and Knowledge Engineering, 59(2):213-230, 2006.
- (2006) Data and Knowledge Engineering , vol.59 , Issue.2 , pp. 213-230
- Hedley, Y.-L.¹ Younas, M.² James, A.E.³ Sanderson, M.⁴

19
- 84947396376
- A generalization of sampling without replacement from a finite universe
- D. Horvitz and D. Thompson. A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association, 47:663-685, 1952.
- (1952) Journal of the American Statistical Association , vol.47 , pp. 663-685
- Horvitz, D.¹ Thompson, D.²

20
- 1842861284
- Extracting data behind web forms
- S. Liddle, D. Embley, D. Scott, and S. Yau. Extracting data behind web forms. In ER (Workshops), 2002.
- (2002) ER (Workshops)
- Liddle, S.¹ Embley, D.² Scott, D.³ Yau, S.⁴

21
- 0037818401
- Discovering the representative of a search engine
- K. Liu, C. Yu, and W. Meng. Discovering the representative of a search engine. In CIKM, 2002.
- (2002) CIKM
- Liu, K.¹ Yu, C.² Meng, W.³

22
- 70349243672
- Efficient estimation of the size of text deep web data source
- J. Lu. Efficient estimation of the size of text deep web data source. In CIKM, 2008.
- (2008) CIKM
- Lu, J.¹

23
- 27544458897
- Downloading textual hidden web content through keyword queries
- A. Ntoulas, P. Zerfos, and J. Cho. Downloading textual hidden web content through keyword queries. In JCDL, 2005.
- (2005) JCDL
- Ntoulas, A.¹ Zerfos, P.² Cho, J.³

24
- 67649663838
- Distributed search over the hidden web: Hierarchical database sampling and selection
- P. G. Ipeirotis and L. Gravano. Distributed search over the hidden web: Hierarchical database sampling and selection. In VLDB, 2002.
- (2002) VLDB
- Ipeirotis, P.G.¹ Gravano, L.²

25
- 84944325093
- Crawling the hidden web
- S. Raghavan and H. Garcia-Molina. Crawling the hidden web. In VLDB, 2001.
- (2001) VLDB
- Raghavan, S.¹ Garcia-Molina, H.²

26
- 0004007880
- Stochastic Simulation
- B. Ripley. Stochastic Simulation. Wiley & Sons, New York, 1987.
- (1987) Stochastic Simulation
- Ripley, B.¹

27
- 0003572871
- The estimation of animal abundance and related parameters
- G. A. F. Seber. The estimation of animal abundance and related parameters. MacMillan Press, New York, 1982.
- (1982) The Estimation of Animal Abundance and Related Parameters
- Seber, G.A.F.¹

28
- 33750285514
- Capturing collection size for distributed non-cooperative retrieval
- M. Shokouhi, J. Zobel, F. Scholer, and S. Tahaghoghi. Capturing collection size for distributed non-cooperative retrieval. In SIGIR, 2006.
- (2006) SIGIR
- Shokouhi, M.¹ Zobel, J.² Scholer, F.³ Tahaghoghi, S.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.