메뉴 건너뛰기




Volumn 16, Issue 1, 2004, Pages 41-55

Mining Web Informative Structures and Contents Based on Entropy Analysis

Author keywords

Anchor text; Entropy; Hubs and authorities; Information extraction; Informative structure; Link analysis

Indexed keywords

ALGORITHMS; ELECTRONIC COMMERCE; HTML; INFORMATION RETRIEVAL SYSTEMS; INTELLIGENT AGENTS; MARKETING; PROBLEM SOLVING; SEARCH ENGINES; WEBSITES;

EID: 0742268832     PISSN: 10414347     EISSN: None     Source Type: Journal    
DOI: 10.1109/TKDE.2004.1264821     Document Type: Article
Times cited : (72)

References (36)
  • 1
    • 0033661294 scopus 로고    scopus 로고
    • Does "Authority" Mean Quality? Predicting Expert Quality Ratings of Web Documents
    • B. Amento, L. Terveen, and W. Hill, "Does "Authority" Mean Quality? Predicting Expert Quality Ratings of Web Documents," Proc. 23th ACM. SIGIR, 2000.
    • (2000) Proc. 23th ACM. SIGIR
    • Amento, B.1    Terveen, L.2    Hill, W.3
  • 3
    • 0032283569 scopus 로고    scopus 로고
    • Improved Algorithms for Topic Distillation in a Hyperlinked Environment
    • K. Bharat and M.R. Henzinger, "Improved Algorithms for Topic Distillation in a Hyperlinked Environment," Proc. 21st ACM SIGIR, 1998.
    • (1998) Proc. 21st ACM SIGIR
    • Bharat, K.1    Henzinger, M.R.2
  • 4
    • 33745756440 scopus 로고    scopus 로고
    • Mirror and Mirror and on the Web: A Study of Host Pairs with Replicated Content
    • May
    • K. Bharat and A. Broder, "Mirror and Mirror and on the Web: A Study of Host Pairs with Replicated Content," Proc. Eighth Int'l World Wide Web Conf., May 1999.
    • (1999) Proc. Eighth Int'l World Wide Web Conf.
    • Bharat, K.1    Broder, A.2
  • 5
    • 0742329413 scopus 로고    scopus 로고
    • A Comparison of Techniques to Find Mirrored Hosts on the WWW
    • K. Bharat, A. Broder, J. Dean, and M.R. Henzinger, "A Comparison of Techniques to Find Mirrored Hosts on the WWW," IEEE Data Eng. Bull., vol. 23, no. 4, pp. 21-26, 2000.
    • (2000) IEEE Data Eng. Bull. , vol.23 , Issue.4 , pp. 21-26
    • Bharat, K.1    Broder, A.2    Dean, J.3    Henzinger, M.R.4
  • 7
    • 0002505172 scopus 로고    scopus 로고
    • The Anatomy of a Large-Scale Hypertextual Web Search Engine
    • S. Brin and L. Page, "The Anatomy of a Large-Scale Hypertextual Web Search Engine," Proc. Seventh World Wide Web Conf., 1998.
    • (1998) Proc. Seventh World Wide Web Conf.
    • Brin, S.1    Page, L.2
  • 10
    • 0031360033 scopus 로고    scopus 로고
    • Empirical Methods in Information Extraction
    • C. Cardie, "Empirical Methods in Information Extraction," AI Magazine, vol. 18, no. 4, pp. 5-79, 1997.
    • (1997) AI Magazine , vol.18 , Issue.4 , pp. 5-79
    • Cardie, C.1
  • 11
    • 0034791059 scopus 로고    scopus 로고
    • Enhanced Topic Distillation Using Text, Markup Tags, and Hyperlinks
    • S. Chakrabarti, M. Joshi, and V. Tawde, "Enhanced Topic Distillation Using Text, Markup Tags, and Hyperlinks," Proc. 24th ACM SIGIR, 2001.
    • (2001) Proc. 24th ACM SIGIR
    • Chakrabarti, S.1    Joshi, M.2    Tawde, V.3
  • 12
    • 84944134642 scopus 로고    scopus 로고
    • Integrating the Document Object Model with Hyperlinks for Enhanced Topic Distillation and Information Extraction
    • S. Chakrabarti, "Integrating the Document Object Model with Hyperlinks for Enhanced Topic Distillation and Information Extraction," Proc. 10th World Wide Web Conf., 2001.
    • (2001) Proc. 10th World Wide Web Conf.
    • Chakrabarti, S.1
  • 17
    • 0032028932 scopus 로고    scopus 로고
    • Efficient Data Mining for Path Traversal Patterns
    • Apr.
    • M.-S. Chen, J.-S. Park, and P.S. Yu, "Efficient Data Mining for Path Traversal Patterns," IEEE Trans. Knowledge and Data Eng., vol. 10, no. 2, pp. 209-221, Apr. 1998.
    • (1998) IEEE Trans. Knowledge and Data Eng. , vol.10 , Issue.2 , pp. 209-221
    • Chen, M.-S.1    Park, J.-S.2    Yu, P.S.3
  • 21
    • 0242698029 scopus 로고    scopus 로고
    • PhD Dissertation, Computer Science Dept., Carnegie Mellon Univ., Pittsburgh, PA
    • D. Freitag, "Machine Learning for Information Extraction," PhD Dissertation, Computer Science Dept., Carnegie Mellon Univ., Pittsburgh, PA, 1998.
    • (1998) Machine Learning for Information Extraction
    • Freitag, D.1
  • 22
    • 0032309862 scopus 로고    scopus 로고
    • Generating Finite-State Transducers for Semi-Structured Data Extraction from the Web
    • C.N. Hsu and M.T. Dung, "Generating Finite-State Transducers for Semi-Structured Data Extraction from the Web," Information Systems, vol. 23, no. 8, pp. 521-538, 1998.
    • (1998) Information Systems , vol.23 , Issue.8 , pp. 521-538
    • Hsu, C.N.1    Dung, M.T.2
  • 27
    • 0000642455 scopus 로고    scopus 로고
    • The Stochastic Approach for Link-Structure Analysis (SALSA) and the TKC Effect
    • R. Lempel and S. Moran, "The Stochastic Approach for Link-Structure Analysis (SALSA) and the TKC Effect," Proc. Ninth Int'l World Wide Web Conf., 2000.
    • (2000) Proc. Ninth Int'l World Wide Web Conf.
    • Lempel, R.1    Moran, S.2
  • 29
    • 0242456776 scopus 로고    scopus 로고
    • Discovering Informative Content Blocks from Web Documents
    • S.H. Lin and J.M. Ho, "Discovering Informative Content Blocks from Web Documents," Proc. Eighth ACM SIGKDD, 2002.
    • (2002) Proc. Eighth ACM SIGKDD
    • Lin, S.H.1    Ho, J.M.2
  • 34
    • 84856043672 scopus 로고
    • A Mathematical Theory of Communication
    • C.E. Shannon, "A Mathematical Theory of Communication," Bell System Technical J., vol. 27, pp. 398-403, 1948.
    • (1948) Bell System Technical J. , vol.27 , pp. 398-403
    • Shannon, C.E.1
  • 35
    • 0033699332 scopus 로고    scopus 로고
    • Discovering Structural Association of Semistructured Data
    • K. Wang and H. Liu, "Discovering Structural Association of Semistructured Data," IEEE Trans. Knowledge and Data Eng., vol. 12, no. 3, pp. 353-371, 2000.
    • (2000) IEEE Trans. Knowledge and Data Eng. , vol.12 , Issue.3 , pp. 353-371
    • Wang, K.1    Liu, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.