2009, Pages 302-307

Scalable attribute-value extraction from semi-structured text

Author keywords

Information extraction; World wide web

Indexed keywords

ATTRIBUTE-VALUE PAIRS; CANDIDATE GENERATION; F-MEASURE; INFORMATION EXTRACTION; SEMI-STRUCTURED TEXT; TWO PHASES; WEB PAGE

EID: 77951192004     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICDMW.2009.81     Document Type: Conference Paper
Times cited: 29

References (28)
  • 1
    • 0002210449
    • What's in a link: Foundations for semantic networks
    • D. G. Bobrow and A. M. Collins, Eds. New York: Academic Press
    • W. A. Woods, "What's in a link: Foundations for semantic networks," in Representation and Understanding: Studies in Cognitive Science, D. G. Bobrow and A. M. Collins, Eds. New York: Academic Press, 1975, pp. 35-82.
    • (1975) Representation and Understanding: Studies in Cognitive Science , pp. 35-82
    • Woods, W.A.1
  • 3
    • 0036768493
    • Weaving a web of ideas
    • S. M. Cherry, "Weaving a web of ideas," IEEE Spectrum, vol. 39, no. 9, pp. 65-69, 2002.
    • (2002) IEEE Spectrum , vol.39 , Issue.9 , pp. 65-69
    • Cherry, S.M.1
  • 4
    • 85099019865
    • Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition
    • Edmonton, Canada
    • E. F. Tjong Kim Sang and F. De Meulder, "Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition," in Proc. of CoNLL, Edmonton, Canada, 2003, pp. 142-147.
    • (2003) Proc. of CoNLL , pp. 142-147
    • Tjong Kim Sang, E.F.1    De Meulder, F.2
  • 7
    • 0035587215
    • Hierarchical wrapper induction for semistructured information sources
    • I. Muslea, S. Minton, and C. A. Knoblock, "Hierarchical wrapper induction for semistructured information sources," JAAMAS, vol. 4, pp. 93-114, 2001.
    • (2001) JAAMAS , vol.4 , pp. 93-114
    • Muslea, I.1    Minton, S.2    Knoblock, C.A.3
  • 8
    • 12344325204
    • Automatic annotation of data extracted from large web sites
    • San Diego, CA
    • L. Arlotta, V. Crescenzi, G. Mecca, and P. Merialdo, "Automatic annotation of data extracted from large web sites," in Proc. of WebDB, San Diego, CA, 2003.
    • (2003) Proc. of WebDB
    • Arlotta, L.1    Crescenzi, V.2    Mecca, G.3    Merialdo, P.4
  • 9
    • 74049093803
    • On precision and recall of multi-attribute data extraction from semistructured sources
    • Melbourne, FL
    • G. Yang, S. Mukherjee, and I. V. Ramakrishnan, "On precision and recall of multi-attribute data extraction from semistructured sources," in Proc. of ICDM, Melbourne, FL, 2003, pp. 395-402.
    • (2003) Proc. of ICDM , pp. 395-402
    • Yang, G.1    Mukherjee, S.2    Ramakrishnan, I.V.3
  • 10
    • 36849066312
    • Webpage understanding: An integrated approach
    • San Jose, CA
    • J. Zhu, Z. Nie, J.-R. Wen, B. Zhang, and H.-W. Hon, "Webpage understanding: An integrated approach," in Proc. of KDD, San Jose, CA, 2007, pp. 903-912.
    • (2007) Proc. of KDD , pp. 903-912
    • Zhu, J.1    Nie, Z.2    Wen, J.-R.3    Zhang, B.4    Hon, H.-W.5
  • 11
    • 0141946356
    • Extracting ontologies from world wide web via HTML tables
    • M. Yoshida, K. Torisawa, and J. Tsujii, "Extracting ontologies from world wide web via HTML tables," in Proc. of PACLING, 2001.
    • (2001) Proc. of PACLING
    • Yoshida, M.1    Torisawa, K.2    Tsujii, J.3
  • 12
    • 77953046656
    • A flexible learning system for wrapping tables and lists in HTML documents
    • Honolulu, HI
    • W. W. Cohen, M. Hurst, and L. S. Jensen, "A flexible learning system for wrapping tables and lists in HTML documents," in Proc. of WWW, Honolulu, HI, 2002, pp. 232-241.
    • (2002) Proc. of WWW , pp. 232-241
    • Cohen, W.W.1    Hurst, M.2    Jensen, L.S.3
  • 13
    • 33845353207
    • A machine learning based approach for table detection on the web
    • Honolulu, HI
    • Y. Wang and J. Hu, "A machine learning based approach for table detection on the web," in Proc. of WWW, Honolulu, HI, 2002, pp. 242-250.
    • (2002) Proc. of WWW , pp. 242-250
    • Wang, Y.1    Hu, J.2
  • 14
    • 3142742483
    • Using the structure of web sites for automatic segmentation of tables
    • Paris, France
    • K. Lerman, L. Getoor, S. Minton, and C. Knoblock, "Using the structure of web sites for automatic segmentation of tables," in Proc. of SIGMOD, Paris, France, 2004, pp. 119-130.
    • (2004) Proc. of SIGMOD , pp. 119-130
    • Lerman, K.1    Getoor, L.2    Minton, S.3    Knoblock, C.4
  • 15
    • 16244404907
    • Automating the extraction of data from HTML tables with unknown structure
    • D. W. Embley, C. Tao, and S. W. Liddle, "Automating the extraction of data from HTML tables with unknown structure," DKE, vol. 54, no. 1, pp. 3-28, 2005.
    • (2005) DKE , vol.54 , Issue.1 , pp. 3-28
    • Embley, D.W.1    Tao, C.2    Liddle, S.W.3
  • 16
    • 33748300912
    • Table-processing paradigms: A research survey
    • D. W. Embley, M. Hurst, D. Lopresti, and G. Nagy, "Table-processing paradigms: A research survey," IJDAR, vol. 8, no. 2, pp. 66-86, 2006.
    • (2006) IJDAR , vol.8 , Issue.2 , pp. 66-86
    • Embley, D.W.1    Hurst, M.2    Lopresti, D.3    Nagy, G.4
  • 18
    • 35348900845
    • Towards domain-independent information extraction from web tables
    • Banff, Canada
    • W. Gatterbauer, P. Bohunsky, M. Herzog, B. Kruepl, and B. Pollak, "Towards domain-independent information extraction from web tables," in Proc. of WWW, Banff, Canada, 2007, pp. 71-80.
    • (2007) Proc. of WWW , pp. 71-80
    • Gatterbauer, W.1    Bohunsky, P.2    Herzog, M.3    Kruepl, B.4    Pollak, B.5
  • 19
    • 36849094469
    • Corroborate and learn facts from the web
    • San Jose, CA
    • S. Zhao and J. Betz, "Corroborate and learn facts from the web," in Proc. of KDD, San Jose, CA, 2007, pp. 995-1003.
    • (2007) Proc. of KDD , pp. 995-1003
    • Zhao, S.1    Betz, J.2
  • 21
    • 84858385171
    • Preemptive information extraction using unrestricted relation discovery
    • New York City, NY
    • Y. Shinyama and S. Sekine, "Preemptive information extraction using unrestricted relation discovery," in Proc. of HLT-NAACL, New York City, NY, 2006, pp. 304-311.
    • (2006) Proc. of HLT-NAACL , pp. 304-311
    • Shinyama, Y.1    Sekine, S.2
  • 23
    • 63449137155
    • Autonomously semantifying Wikipedia
    • Lisbon, Portugal
    • F. Wu and D. S. Weld, "Autonomously semantifying Wikipedia," in Proc. of CIKM, Lisbon, Portugal, 2007, pp. 41-50.
    • (2007) Proc. of CIKM , pp. 41-50
    • Wu, F.1    Weld, D.S.2
  • 24
    • 84859197607
    • WebTables: Exploring the power of tables on the web
    • Auckland, New Zealand
    • M. J. Cafarella, A. Halevy, D. Z. Wang, E. Wu, and Y. Zhang, "WebTables: Exploring the power of tables on the web," in Proc. of VLDB, Auckland, New Zealand, 2008, pp. 538-549.
    • (2008) Proc. of VLDB , pp. 538-549
    • Cafarella, M.J.1    Halevy, A.2    Wang, D.Z.3    Wu, E.4    Zhang, Y.5
  • 25
    • 84859892286
    • Weakly-supervised acquisition of open-domain classes and class attributes from web documents and query logs
    • Columbus, OH
    • M. Pasca and B. Van Durme, "Weakly-supervised acquisition of open-domain classes and class attributes from web documents and query logs," in Proc. of ACL-HLT, Columbus, OH, 2008, pp. 19-27.
    • (2008) Proc. of ACL-HLT , pp. 19-27
    • Pasca, M.1    Van Durme, B.2
  • 26
    • 85030321143
    • MapReduce: Simplified data processing on large clusters
    • San Francisco, CA
    • J. Dean and S. Ghemawat, "MapReduce: Simplified data processing on large clusters," in Proc. of OSDI, San Francisco, CA, 2004.
    • (2004) Proc. of OSDI
    • Dean, J.1    Ghemawat, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.