



2007, Pages 326-335

Organizing hidden-Web databases by clustering visible Web documents

Author keywords

[No Author keywords available]

Indexed keywords

CLUSTER ANALYSIS; INFORMATION RETRIEVAL; INTERFACES (COMPUTER); PROBLEM SOLVING; SCALABILITY; SEARCH ENGINES; WORLD WIDE WEB

EID: 34548729668     PISSN: 10844627     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICDE.2007.367878     Document Type: Conference Paper
Times cited: 48

References (40)
  • 2
    • 27544435323
    • Siphoning Hidden-Web Data through Keyword-Based Interfaces
    • L. Barbosa and J. Freire. Siphoning Hidden-Web Data through Keyword-Based Interfaces. In SBBD, pages 309-321, 2004.
    • (2004) SBBD , pp. 309-321
    • Barbosa, L.1    Freire, J.2
  • 3
    • 34547416385
    • Searching for Hidden-Web Databases
    • L. Barbosa and J. Freire. Searching for Hidden-Web Databases. In WebDB, pages 1-6, 2005.
    • (2005) WebDB , pp. 1-6
    • Barbosa, L.1    Freire, J.2
  • 4
    • 34548764453
    • Crawling for Domain-Specific Hidden Web Resources
    • A. Bergholz and B. Chidlovskii. Crawling for Domain-Specific Hidden Web Resources. In WISE, pages 125-133, 2003.
    • (2003) WISE , pp. 125-133
    • Bergholz, A.1    Chidlovskii, B.2
  • 7
    • 0038589165
    • The anatomy of a large-scale hypertextual Web search engine
    • S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107-117, 1998.
    • (1998) Computer Networks and ISDN Systems , vol.30 , Issue.1-7 , pp. 107-117
    • Brin, S.1    Page, L.2
  • 8
    • 5444262639
    • Structured Databases on the Web: Observations and Implications
    • K. C.-C. Chang, B. He, C. Li, M. Patel, and Z. Zhang. Structured Databases on the Web: Observations and Implications. SIGMOD Record, 33(3):61-70, 2004.
    • (2004) SIGMOD Record , vol.33 , Issue.3 , pp. 61-70
    • Chang, K.C.-C.1    He, B.2    Li, C.3    Patel, M.4    Zhang, Z.5
  • 9
    • 84863338210
    • Toward Large-Scale Integration: Building a MetaQuerier over Databases on the Web
    • K. C.-C. Chang, B. He, and Z. Zhang. Toward Large-Scale Integration: Building a MetaQuerier over Databases on the Web. In CIDR, pages 44-55, 2005.
    • (2005) CIDR , pp. 44-55
    • Chang, K.C.-C.1    He, B.2    Zhang, Z.3
  • 10
    • 34548784453
    • Automated Discovery of Search Interfaces on the Web
    • J. Cope, N. Craswell, and D. Hawking. Automated Discovery of Search Interfaces on the Web. In ADC, pages 181-189, 2003.
  • 11
    • 13444259868
    • The molecular biology database collection: 2005 update
    • M. Galperin. The molecular biology database collection: 2005 update. Nucleic Acids Res, 33, 2005.
    • (2005) Nucleic Acids Res , vol.33
    • Galperin, M.1
  • 13
    • 0001511080
    • Gloss: Text-source discovery over the internet
    • L. Gravano, H. Garcia-Molina, and A. Tomasic. Gloss: Text-source discovery over the internet. ACM TODS, 24(2), 1999.
    • (1999) ACM TODS , vol.24 , Issue.2
    • Gravano, L.1    Garcia-Molina, H.2    Tomasic, A.3
  • 14
    • 0344127987
    • QProber: A system for automatic classification of hidden-Web databases
    • L. Gravano, P. G. Ipeirotis, and M. Sahami. QProber: A system for automatic classification of hidden-Web databases. ACM TOIS, 21(1):1-41, 2003.
    • (2003) ACM TOIS , vol.21 , Issue.1 , pp. 1-41
    • Gravano, L.1    Ipeirotis, P.G.2    Sahami, M.3
  • 15
    • 84985927584
    • Why your data don't mix
    • A. Y. Halevy. Why your data don't mix. ACM Queue, 3(8), 2005.
    • (2005) ACM Queue , vol.3 , Issue.8
    • Halevy, A.Y.1
  • 16
    • 1142267350
    • Statistical Schema Matching across Web Query Interfaces
    • B. He and K. C.-C. Chang. Statistical Schema Matching across Web Query Interfaces. In SIGMOD, pages 217-228, 2003.
    • (2003) SIGMOD , pp. 217-228
    • He, B.1    Chang, K.C.-C.2
  • 17
    • 18744376048
    • Organizing structured web sources by query schemas: A clustering approach
    • B. He, T. Tao, and K. C.-C. Chang. Organizing structured web sources by query schemas: a clustering approach. In CIKM, pages 22-31, 2004.
    • (2004) CIKM , pp. 22-31
    • He, B.1    Tao, T.2    Chang, K.C.-C.3
  • 18
    • 85012202641
    • Wise-integrator: An automatic integrator of web search interfaces for e-commerce
    • H. He, W. Meng, C. Yu, and Z. Wu. Wise-integrator: An automatic integrator of web search interfaces for e-commerce. In VLDB, pages 357-368, 2003.
    • (2003) VLDB , pp. 357-368
    • He, H.1    Meng, W.2    Yu, C.3    Wu, Z.4
  • 19
    • 6344235509
    • Automatic integration of Web search interfaces with WISE-Integrator
    • H. He, W. Meng, C. T. Yu, and Z. Wu. Automatic integration of Web search interfaces with WISE-Integrator. VLDB Journal, 13(3):256-273, 2004.
    • (2004) VLDB Journal , vol.13 , Issue.3 , pp. 256-273
    • He, H.1    Meng, W.2    Yu, C.T.3    Wu, Z.4
  • 21
    • 19944419350
    • Automatically attaching semantic metadata to web services
    • A. Hess and N. Kushmerick. Automatically attaching semantic metadata to web services. In Proceedings of IIWeb, pages 111-116, 2003.
    • (2003) Proceedings of IIWeb , pp. 111-116
    • Hess, A.1    Kushmerick, N.2
  • 22
    • 34250652154
    • Data management projects at Google
    • W. Hsieh, J. Madhavan, and R. Pike. Data management projects at Google. In SIGMOD, pages 725-726, 2006.
    • (2006) SIGMOD , pp. 725-726
    • Hsieh, W.1    Madhavan, J.2    Pike, R.3
  • 23
    • 34548713521
    • Multi-type features based web document clustering
    • S. Huang, G.-R. Xue, B. Zhang, Z. Chen, Y. Yu, and W.-Y. Ma. Multi-type features based web document clustering. In WISE, pages 253-265, 2004.
    • (2004) WISE , pp. 253-265
    • Huang, S.1    Xue, G.-R.2    Zhang, B.3    Chen, Z.4    Yu, Y.5    Ma, W.-Y.6
  • 24
    • 0033297068
    • Trawling the Web for emerging cyber-communities
    • R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. Trawling the Web for emerging cyber-communities. Computer Networks, 31(11-16):1481-1493, 1999.
    • (1999) Computer Networks , vol.31 , Issue.11-16 , pp. 1481-1493
    • Kumar, R.1    Raghavan, P.2    Rajagopalan, S.3    Tomkins, A.4
  • 25
    • 0002862737
    • Fast and effective text mining using linear-time document clustering
    • B. Larsen and C. Aone. Fast and effective text mining using linear-time document clustering. In KDD, pages 16-22, 1999.
    • (1999) KDD , pp. 16-22
    • Larsen, B.1    Aone, C.2
  • 27
    • 84944325093
    • Crawling the Hidden Web
    • S. Raghavan and H. Garcia-Molina. Crawling the Hidden Web. In VLDB, pages 129-138, 2001.
    • (2001) VLDB , pp. 129-138
    • Raghavan, S.1    Garcia-Molina, H.2
  • 28
    • 21344453736
    • Indexing the invisible Web: A survey
    • Y. Ru and E. Horowitz. Indexing the invisible Web: a survey. Online Information Review, 29(3):249-265, 2005.
    • (2005) Online Information Review , vol.29 , Issue.3 , pp. 249-265
    • Ru, Y.1    Horowitz, E.2
  • 29
    • 0016572913
    • A vector space model for automatic indexing
    • G. Salton, A. Wong, and C. S. Yang. A vector space model for automatic indexing. CACM, 18(11):613-620, 1975.
    • (1975) CACM , vol.18 , Issue.11 , pp. 613-620
    • Salton, G.1    Wong, A.2    Yang, C.S.3
  • 34
    • 0038156234
    • Evaluating contents-link coupled web page clustering for web search results
    • Y. Wang and M. Kitsuregawa. Evaluating contents-link coupled web page clustering for web search results. In CIKM, pages 499-506, 2002.
    • (2002) CIKM , pp. 499-506
    • Wang, Y.1    Kitsuregawa, M.2
  • 35
    • 0029717331
    • Hypursuit: A hierarchical network search engine that exploits content-link hypertext clustering
    • R. Weiss, B. Vélez, and M. A. Sheldon. Hypursuit: a hierarchical network search engine that exploits content-link hypertext clustering. In ACM Hypertext, pages 180-193, 1996.
    • (1996) ACM Hypertext , pp. 180-193
    • Weiss, R.1    Vélez, B.2    Sheldon, M.A.3
  • 36
    • 33749617417
    • Query selection techniques for efficient crawling of structured web sources
    • P. Wu, J.-R. Wen, H. Liu, and W.-Y. Ma. Query selection techniques for efficient crawling of structured web sources. In ICDE, 2006.
    • (2006) ICDE
    • Wu, P.1    Wen, J.-R.2    Liu, H.3    Ma, W.-Y.4
  • 37
    • 84856825113
    • Learning from the web to match query interfaces on the deep web
    • W. Wu, A. Doan, and C. Yu. Learning from the web to match query interfaces on the deep web. In ICDE, 2006.
    • (2006) ICDE
    • Wu, W.1    Doan, A.2    Yu, C.3
  • 38
    • 3142679542
    • An Interactive Clustering-based Approach to Integrating Source Query interfaces on the Deep Web
    • W. Wu, C. Yu, A. Doan, and W. Meng. An Interactive Clustering-based Approach to Integrating Source Query interfaces on the Deep Web. In SIGMOD, pages 95-106, 2004.
    • (2004) SIGMOD , pp. 95-106
    • Wu, W.1    Yu, C.2    Doan, A.3    Meng, W.4
  • 39
    • 34548714733
    • A methodology to retrieve text documents from multiple databases
    • C. Yu, K.-L. Liu, W. Meng, Z. Wu, and N. Rishe. A methodology to retrieve text documents from multiple databases. IEEE TKDE, 2002.
    • (2002) IEEE TKDE
    • Yu, C.1    Liu, K.-L.2    Meng, W.3    Wu, Z.4    Rishe, N.5
  • 40
    • 0032268443
    • Web document clustering: A feasibility demonstration
    • O. Zamir and O. Etzioni. Web document clustering: a feasibility demonstration. In SIGIR, pages 46-54, 1998.
    • (1998) SIGIR , pp. 46-54
    • Zamir, O.1    Etzioni, O.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.