SCOPUS 정보 검색 플랫폼

Proceedings - International Conference on Data Engineering

Volumn , Issue , 2007, Pages 326-335

Organizing hidden-Web databases by clustering visible Web documents

(3) Barbosa, Luciano a Freire, Juliana a Silva, Altigran b

a UNIVERSITY OF UTAH (United States)

b FEDERAL UNIVERSITY OF AMAZONAS (Brazil)

Author keywords

[No Author keywords available]

Indexed keywords

CLUSTER ANALYSIS; INFORMATION RETRIEVAL; INTERFACES (COMPUTER); PROBLEM SOLVING; SCALABILITY; SEARCH ENGINES; WORLD WIDE WEB;

CONTENT-RICH FORMS; F-MEASURE; HIDDEN-WEB DATABASES; HIGH-QUALITY CLUSTERS; HYPERLINKED; KEYWORD-BASED SEARCH INTERFACES; VISIBLE INFORMATION; WEB DOCUMENTS;

DATABASE SYSTEMS;

EID: 34548729668 PISSN: 10844627 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICDE.2007.367878 Document Type: Conference Paper

Times cited : (48)

References (40)

1
- 0005540823
- ACM Press/Addison-Wesley
- R. A. Baeza-Yates and B. A. Ribeiro-Neto. Modern Information Retrieval. ACM Press/Addison-Wesley, 1999.
- (1999) Modern Information Retrieval
- Baeza-Yates, R.A.¹ Ribeiro-Neto, B.A.²

2
- 27544435323
- Siphoning Hidden-Web Data through Keyword-Based Interfaces
- L. Barbosa and J. Freire. Siphoning Hidden-Web Data through Keyword-Based Interfaces. In SBBD, pages 309-321, 2004.
- (2004) SBBD , pp. 309-321
- Barbosa, L.¹ Freire, J.²

3
- 34547416385
- Searching for Hidden-Web Databases
- L. Barbosa and J. Freire. Searching for Hidden-Web Databases. In WebDB, pages 1-6, 2005.
- (2005) WebDB , pp. 1-6
- Barbosa, L.¹ Freire, J.²

4
- 34548764453
- Crawling for Domain-Specific Hidden Web Resources
- A. Bergholz and B. Chidlovskii. Crawling for Domain-Specific Hidden Web Resources. In WISE, pages 125-133, 2003.
- (2003) WISE , pp. 125-133
- Bergholz, A.¹ Chidlovskii, B.²

5
- 0010251937
- The connectivity server: Fast access to linkage information on the Web
- K. Bharat, A. Broder, M. Henzinger, P. Kumar, and S. Venkatasubramanian. The connectivity server: Fast access to linkage information on the Web. Computer Networks, 30(1-7):469-477, 1998.
- (1998) Computer Networks , vol.30 , Issue.1-7 , pp. 469-477
- Bharat, K.¹ Broder, A.² Henzinger, M.³ Kumar, P.⁴ Venkatasubramanian, S.⁵

6
- 34548791827
- Brightplanet's searchable databases directory. http://www.completeplanet. com.
- Brightplanet's searchable databases directory

7
- 0038589165
- The anatomy of a large-scale hyper-textual Web search engine
- S. Brin and L. Page. The anatomy of a large-scale hyper-textual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107-117, 1998.
- (1998) Computer Networks and ISDN Systems , vol.30 , Issue.1-7 , pp. 107-117
- Brin, S.¹ Page, L.²

8
- 5444262639
- Structured Databases on the Web: Observations and Implications
- K. C.-C. Chang, B. He, C. Li, M. Patel, and Z. Zhang. Structured Databases on the Web: Observations and Implications. SIGMOD Record, 33(3):61-70, 2004.
- (2004) SIGMOD Record , vol.33 , Issue.3 , pp. 61-70
- Chang, K.C.-C.¹ He, B.² Li, C.³ Patel, M.⁴ Zhang, Z.⁵

9
- 84863338210
- Toward Large-Scale Integration: Building a MetaQuerier over Databases on the Web
- K. C.-C. Chang, B. He, and Z. Zhang. Toward Large-Scale Integration: Building a MetaQuerier over Databases on the Web. In CIDR, pages 44-55, 2005.
- (2005) CIDR , pp. 44-55
- Chang, K.C.-C.¹ He, B.² Zhang, Z.³

10
- 34548784453
- J. Cope, N. Craswell, and D. Hawking. Automated Discovery of Search. Interfaces on the Web. InADC, pages 181-189, 2003.
- J. Cope, N. Craswell, and D. Hawking. Automated Discovery of Search. Interfaces on the Web. InADC, pages 181-189, 2003.

11
- 13444259868
- The molecular biology database collection: 2005 update
- M. Galperin. The molecular biology database collection: 2005 update. Nucleic Acids Res, 33, 2005.
- (2005) Nucleic Acids Res , vol.33
- Galperin, M.¹

12
- 0031617713
- Inferring web communities from link topology
- D. Gibson, J. M. Kleinberg, and P. Raghavan. Inferring web communities from link topology. In UK Conference on Hypertext, pages 225-234, 1998.
- (1998) UK Conference on Hypertext , pp. 225-234
- Gibson, D.¹ Kleinberg, J.M.² Raghavan, P.³

13
- 0001511080
- Gloss: Textsource discovery over the internet
- L. Gravano, H. Garcia-Molina, and A. Tomasic. Gloss: Textsource discovery over the internet. ACM TODS, 24(2), 1999.
- (1999) ACM TODS , vol.24 , Issue.2
- Gravano, L.¹ Garcia-Molina, H.² Tomasic, A.³

14
- 0344127987
- QProber: A system for automatic classification of hidden-Web databases
- L. Gravano, P. G. Ipeirotis, and M. Sahami. QProber: A system for automatic classification of hidden-Web databases. ACM TOIS, 21(1):1-41, 2003.
- (2003) ACM TOIS , vol.21 , Issue.1 , pp. 1-41
- Gravano, L.¹ Ipeirotis, P.G.² Sahami, M.³

15
- 84985927584
- Why your data don't mix
- A. Y. Halevy. Why your data don't mix. ACM Queue, 3(8), 2005.
- (2005) ACM Queue , vol.3 , Issue.8
- Halevy, A.Y.¹

16
- 1142267350
- Statistical Schema Matching across Web Query Interfaces
- B. He and K. C.-C. Chang. Statistical Schema Matching across Web Query Interfaces. In SIGMOD, pages 217-228, 2003.
- (2003) SIGMOD , pp. 217-228
- He, B.¹ Chang, K.C.-C.²

17
- 18744376048
- Organizing structured web sources by query schemas: A clustering approach
- B. He, T. Tao, and K. C.-C. Chang. Organizing structured web sources by query schemas: a clustering approach. In CIKM, pages 22-31, 2004.
- (2004) CIKM , pp. 22-31
- He, B.¹ Tao, T.² Chang, K.C.-C.³

18
- 85012202641
- Wise-integrator: An automatic integrator of web search interfaces for e-commerce
- H. He, W. Meng, C. Yu, and Z. Wu. Wise-integrator: An automatic integrator of web search interfaces for e-commerce. In VLDB, pages 357-368, 2003.
- (2003) VLDB , pp. 357-368
- He, H.¹ Meng, W.² Yu, C.³ Wu, Z.⁴

19
- 6344235509
- Automatic integration of Web search interfaces with WISE-Integrator
- H. He, W. Meng, C T. Yu, and Z. Wu. Automatic integration of Web search interfaces with WISE-Integrator. VLDB Journal, 13(3):256-273, 2004.
- (2004) VLDB Journal , vol.13 , Issue.3 , pp. 256-273
- He, H.¹ Meng, W.² Yu, C.T.³ Wu, Z.⁴

20
- 0037191696
- Web document clustering using hyperlink structures
- X. He, H. Zha, C. H. Q. Ding, and H. D. Simon. Web document clustering using hyperlink structures. Computational Statistics & Data Analysis, 41(1):19-45, 2002.
- (2002) Computational Statistics & Data Analysis , vol.41 , Issue.1 , pp. 19-45
- He, X.¹ Zha, H.² Ding, C.H.Q.³ Simon, H.D.⁴

21
- 19944419350
- Automatically attaching semantic metadata to web services
- A. Hess and N. Kushmerick. Automatically attaching semantic metadata to web services. In Proceedings of IIWeb, pages 111-116, 2003.
- (2003) Proceedings of IIWeb , pp. 111-116
- Hess, A.¹ Kushmerick, N.²

22
- 34250652154
- Data management projects at Google
- W. Hsieh, J. Madhavan, and R. Pike. Data management projects at Google. In SIGMOD, pages 725-726, 2006.
- (2006) SIGMOD , pp. 725-726
- Hsieh, W.¹ Madhavan, J.² Pike, R.³

23
- 34548713521
- Multi-type features based web document clustering
- S. Huang, G.-R. Xue, B. Zhang, Z. Chen, Y. Yu, and W.-Y. Ma. Multi-type features based web document clustering. In WISE, pages 253-265, 2004.
- (2004) WISE , pp. 253-265
- Huang, S.¹ Xue, G.-R.² Zhang, B.³ Chen, Z.⁴ Yu, Y.⁵ Ma, W.-Y.⁶

24
- 0033297068
- Trawling the Web for emerging cyber-communities
- R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. Trawling the Web for emerging cyber-communities. Computer Networks, 31(11-16):1481-1493, 1999.
- (1999) Computer Networks , vol.31 , Issue.11-16 , pp. 1481-1493
- Kumar, R.¹ Raghavan, P.² Rajagopalan, S.³ Tomkins, A.⁴

25
- 0002862737
- Fast and effective text mining using linear-time document clustering
- B. Larsen and C. Aone. Fast and effective text mining using linear-time document clustering. In KDD, pages 16-22, 1999.
- (1999) KDD , pp. 16-22
- Larsen, B.¹ Aone, C.²

26
- 34548750424
- Profusion search engine directory. http://www.profusion.com/nav.
- Profusion search engine directory

27
- 84944325093
- Crawling the Hidden Web
- S. Raghavan and H. Garcia-Molina. Crawling the Hidden Web. In VLDB, pages 129-138, 2001.
- (2001) VLDB , pp. 129-138
- Raghavan, S.¹ Garcia-Molina, H.²

28
- 21344453736
- Indexing the invisible Web: A survey
- Y. Ru and E. Horowitz. Indexing the invisible Web: a survey. Online Information Review, 29(3):249-265, 2005.
- (2005) Online Information Review , vol.29 , Issue.3 , pp. 249-265
- Ru, Y.¹ Horowitz, E.²

29
- 0016572913
- A vector space model for automatic indexing
- G. Salton, A. Wong, and. C. S. Yang. A vector space model for automatic indexing. CACM, 18(11):613-620, 1975.
- (1975) CACM , vol.18 , Issue.11 , pp. 613-620
- Salton, G.¹ Wong, A.² Yang, C.S.³

30
- 34548744643
- Search engines directory. http://www.searchengineguide.com/searchengines. html.
- Search engines directory

31
- 2442439674
- A comparison of document clustering techniques
- M. Steinbach, G. Karypis, and V. Kumar. A comparison of document clustering techniques. In KDD Workshop on Text Mining, 2000.
- (2000) KDD Workshop on Text Mining
- Steinbach, M.¹ Karypis, G.² Kumar, V.³

32
- 25144439604
- Addison-Wesley
- P.-N. Tan, M. Steinbach, and V. Kumar. Introduction to Data Mining. Addison-Wesley, 2005.
- (2005) Introduction to Data Mining
- Tan, P.-N.¹ Steinbach, M.² Kumar, V.³

33
- 5444221956
- The UIUC Web integration repository. http://metaquerier.cs.uiuc.edu/ repository.
- The UIUC Web integration repository

34
- 0038156234
- Evaluating contents-link coupled web page clustering for web search results
- Y. Wang and M. Kitsuregawa. Evaluating contents-link coupled web page clustering for web search results. In CIKM, pages 499-506, 2002.
- (2002) CIKM , pp. 499-506
- Wang, Y.¹ Kitsuregawa, M.²

35
- 0029717331
- Hypursuit: A hierarchical network search engine that exploits content-link hypertext clustering
- R. Weiss, B. Vêlez, and M. A. Sheldon. Hypursuit: a hierarchical network search engine that exploits content-link hypertext clustering. In ACM Hypertext, pages .180-193, 1996.
- (1996) ACM Hypertext , pp. 180-193
- Weiss, R.¹ Vêlez, B.² Sheldon, M.A.³

36
- 33749617417
- Query selection techniques for efficient crawling of structured web sources
- P. Wu, J.-R. Wen, H. Liu, and W.-Y. Ma. Query selection techniques for efficient crawling of structured web sources. In ICDE, 2006.
- (2006) ICDE
- Wu, P.¹ Wen, J.-R.² Liu, H.³ Ma, W.-Y.⁴

37
- 84856825113
- Learning from the web to match query interfaces on the deep web
- W. Wu, A. Doan, and C. Yu. Learning from the web to match query interfaces on the deep web. In ICDE, 2006.
- (2006) ICDE
- Wu, W.¹ Doan, A.² Yu, C.³

38
- 3142679542
- An Interactive Clustering-based Approach to Integrating Source Query interfaces on the Deep Web
- W. Wu, C. Yu, A. Doan, and W. Meng. An Interactive Clustering-based Approach to Integrating Source Query interfaces on the Deep Web. In SIGMOD, pages 95-106, 2004.
- (2004) SIGMOD , pp. 95-106
- Wu, W.¹ Yu, C.² Doan, A.³ Meng, W.⁴

39
- 34548714733
- A methodology to retrieve text documents from multiple databases
- C Yu, K.-L. Liu, W. Meng, Z. Wu, and N. Rishe. A methodology to retrieve text documents from multiple databases. IEEE TKDE, 2002.
- (2002) IEEE TKDE
- Yu, C.¹ Liu, K.-L.² Meng, W.³ Wu, Z.⁴ Rishe, N.⁵

40
- 0032268443
- Web document clustering: A feasibility demonstration
- O. Zamir and O. Etzioni. Web document clustering: a feasibility demonstration. In SIGIR, pages 46-54, 1998.
- (1998) SIGIR , pp. 46-54
- Zamir, O.¹ Etzioni, O.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.