메뉴 건너뛰기




Volumn 23, Issue 4, 2011, Pages 612-626

TEXT: Automatic template extraction from heterogeneous web pages

Author keywords

clustering; MinHash; minimum description length principle; Template extraction

Indexed keywords

CLUSTERING; COMPREHENSIVE ANALYSIS; FAST APPROXIMATION; HIGH PRODUCTIVITY; MINHASH; MINIMUM DESCRIPTION LENGTH PRINCIPLE; NOVEL ALGORITHM; REAL LIFE DATASETS; STATE OF THE ART; TEMPLATE DETECTION; TEMPLATE EXTRACTION; TEMPLATE STRUCTURES; WEB APPLICATION; WEB DOCUMENT; WEB PAGE;

EID: 79951930082     PISSN: 10414347     EISSN: None     Source Type: Journal    
DOI: 10.1109/TKDE.2010.140     Document Type: Article
Times cited : (43)

References (26)
  • 11
    • 18844436436 scopus 로고    scopus 로고
    • Clustering Web pages based on their structure
    • DOI 10.1016/j.datak.2004.11.004, PII S0169023X04002137, Fifth ACM International Workshop on Web Information and Data Management (WIDM 2003)
    • V. Crescenzi, P. Merialdo, and P. Missier, "Clustering (Pubitemid 40683780)
    • (2005) Data and Knowledge Engineering , vol.54 , Issue.3 , pp. 279-299
    • Crescenzi, V.1    Merialdo, P.2    Missier, P.3
  • 16
    • 3142742483 scopus 로고    scopus 로고
    • Using the structure of web sites for automatic segmentation of tables
    • K. Lerman, L. Getoor, S. Minton, and C. Knoblock, "Using the Structure of Web Sites for Automatic Segmentation of Tables," Proc. ACM SIGMOD, 2004.
    • (2004) Proc. ACM SIGMOD
    • Lerman, K.1    Getoor, L.2    Minton, S.3    Knoblock, C.4
  • 17
  • 18
    • 57149147732 scopus 로고    scopus 로고
    • Crd: Fast co-clustering on large data sets utilizing sampling-based matrix decomposition
    • F. Pan, X. Zhang, and W. Wang, "Crd: Fast Co-Clustering on Large Data Sets Utilizing Sampling-Based Matrix Decomposition," Proc. ACM SIGMOD, 2008.
    • (2008) Proc. ACM SIGMOD
    • Pan, F.1    Zhang, X.2    Wang, W.3
  • 20
    • 0018015137 scopus 로고
    • Modeling by shortest data description
    • J. Rissanen, "Modeling by Shortest Data Description," Automatica, vol. 14, pp. 465-471, 1978.
    • (1978) Automatica , vol.14 , pp. 465-471
    • Rissanen, J.1
  • 21
    • 0003250456 scopus 로고
    • Stochastic complexity in statistical inquiry
    • J. Rissanen, Stochastic Complexity in Statistical Inquiry. World Scientific, 1989.
    • (1989) World Scientific
    • Rissanen, J.1
  • 26
    • 36849062139 scopus 로고    scopus 로고
    • Joint optimization of wrapper generation and template detection
    • S. Zheng, D. Wu, R. Song, and J.-R. Wen, "Joint Optimization of Wrapper Generation and Template Detection," Proc. ACM SIGKDD, 2007.
    • (2007) Proc. ACM SIGKDD
    • Zheng, S.1    Wu, D.2    Song, R.3    Wen, J.-R.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.