메뉴 건너뛰기




Volumn 6, Issue 10, 2013, Pages 805-816

Extraction and integration of partially overlapping web sources

Author keywords

[No Author keywords available]

Indexed keywords

DATA MINING; EXTRACTION; WEBSITES; WEIRS;

EID: 84891126419     PISSN: None     EISSN: 21508097     Source Type: Journal    
DOI: 10.14778/2536206.2536209     Document Type: Article
Times cited : (61)

References (31)
  • 1
    • 0033652943 scopus 로고    scopus 로고
    • Snowball: extracting relations from large plain-text collections. In DL '00
    • E. Agichtein and L. Gravano. Snowball: extracting relations from large plain-text collections. In DL '00, 2000.
    • (2000)
    • Agichtein, E.1    Gravano, L.2
  • 3
    • 1142303684 scopus 로고    scopus 로고
    • Extracting structured data from web pages.
    • A. Arasu and H. Garcia-Molina. Extracting structured data from web pages. In SIGMOD, 2003.
    • (2003) SIGMOD
    • Arasu, A.1    Garcia-Molina, H.2
  • 5
    • 84860485957 scopus 로고    scopus 로고
    • Generic schema matching, ten years later.
    • P. A. Bernstein, J. Madhavan, and E. Rahm. Generic schema matching, ten years later. PVLDB, 4(11), 2011.
    • (2011) PVLDB , vol.4 , Issue.11
    • Bernstein, P.A.1    Madhavan, J.2    Rahm, E.3
  • 6
    • 77951136761 scopus 로고    scopus 로고
    • Supporting the automatic construction of entity aware search engines.
    • L. Blanco, V. Crescenzi, P. Merialdo, and P. Papotti. Supporting the automatic construction of entity aware search engines. In WIDM, 2008.
    • (2008) WIDM
    • Blanco, L.1    Crescenzi, V.2    Merialdo, P.3    Papotti, P.4
  • 7
    • 79955068748 scopus 로고    scopus 로고
    • Probabilistic models to reconcile complex data from inaccurate data sources.
    • L. Blanco, V. Crescenzi, P. Merialdo, and P. Papotti. Probabilistic models to reconcile complex data from inaccurate data sources. In CAiSE, 2010.
    • (2010) CAiSE
    • Blanco, L.1    Crescenzi, V.2    Merialdo, P.3    Papotti, P.4
  • 8
    • 85039641957 scopus 로고    scopus 로고
    • Extraction and integration of partially overlapping web sources. Tech. rep., DIA -Roma Tre -TR201, Dec. 2012.
    • M. Bronzi, V. Crescenzi, P. Merialdo, and P. Papotti. Extraction and integration of partially overlapping web sources. Tech. rep., DIA -Roma Tre -TR201, Dec. 2012.
    • Bronzi, M.1    Crescenzi, V.2    Merialdo, P.3    Papotti, P.4
  • 10
    • 84859197607 scopus 로고    scopus 로고
    • Webtables: exploring the power of tables on the web.
    • M. J. Cafarella, A. Y. Halevy, D. Z. Wang, E. Wu, and Y. Zhang. Webtables: exploring the power of tables on the web. PVLDB, 1(1), 2008.
    • (2008) PVLDB , vol.1 , Issue.1
    • Cafarella, M.J.1    Halevy, A.Y.2    Wang, D.Z.3    Wu, E.4    Zhang, Y.5
  • 11
    • 38349193107 scopus 로고    scopus 로고
    • Learning semantic definitions of online information sources.
    • M. J. Carman and C. A. Knoblock. Learning semantic definitions of online information sources. J. Artif. Int. Res., 30(1):1-50, 2007.
    • (2007) J. Artif. Int. Res. , vol.30 , Issue.1 , pp. 1-50
    • Carman, M.J.1    Knoblock, C.A.2
  • 13
    • 85011016482 scopus 로고    scopus 로고
    • Context aware wrapping: Synchronized data extraction.
    • S.-L. Chuang, K. C. Chang, and C. X. Zhai. Context aware wrapping: Synchronized data extraction. In VLDB, 2007.
    • (2007) VLDB
    • Chuang, S.-L.1    Chang, K.C.2    Zhai, C.X.3
  • 14
    • 11144240583 scopus 로고    scopus 로고
    • A comparison of string distance metrics for name-matching tasks.
    • W. W. Cohen, P. Ravikumar, and S. E. Fienberg. A comparison of string distance metrics for name-matching tasks. In IIWeb, 2003.
    • (2003) IIWeb
    • Cohen, W.W.1    Ravikumar, P.2    Fienberg, S.E.3
  • 16
  • 18
    • 84861026711 scopus 로고    scopus 로고
    • Automatic wrappers for large scale web extraction. PVLDB
    • N. N. Dalvi, R. Kumar, and M. A. Soliman. Automatic wrappers for large scale web extraction. PVLDB, 4(4), 2011.
    • (2011) , vol.4 , Issue.4
    • Dalvi, N.N.1    Kumar, R.2    Soliman, M.A.3
  • 19
    • 84863761202 scopus 로고    scopus 로고
    • An analysis of structured data on the web.
    • N. N. Dalvi, A. Machanavajjhala, and B. Pang. An analysis of structured data on the web. PVLDB, 5(7), 2012.
    • (2012) PVLDB , vol.5 , Issue.7
    • Dalvi, N.N.1    Machanavajjhala, A.2    Pang, B.3
  • 20
    • 77954301186 scopus 로고    scopus 로고
    • Harvesting relational tables from lists on the web.
    • H. Elmeleegy, J. Madhavan, and A. Y. Halevy. Harvesting relational tables from lists on the web. PVLDB, 2(1), 2009.
    • (2009) PVLDB , vol.2 , Issue.1
    • Elmeleegy, H.1    Madhavan, J.2    Halevy, A.Y.3
  • 21
    • 84866839638 scopus 로고    scopus 로고
    • and Mausam. Open information extraction: The second generation. In IJCAI
    • O. Etzioni, A. Fader, J. Christensen, S. Soderland, and Mausam. Open information extraction: The second generation. In IJCAI, 2011.
    • (2011)
    • Etzioni, O.1    Fader, A.2    Christensen, J.3    Soderland, S.4
  • 23
    • 84055203904 scopus 로고    scopus 로고
    • Exploiting content redundancy for web information extraction.
    • P. Gulhane, R. Rastogi, S. H. Sengamedu, and A. Tengli. Exploiting content redundancy for web information extraction. PVLDB, 3(1), 2010.
    • (2010) PVLDB , vol.3 , Issue.1
    • Gulhane, P.1    Rastogi, R.2    Sengamedu, S.H.3    Tengli, A.4
  • 24
    • 80052120564 scopus 로고    scopus 로고
    • From one tree to a forest: a unified solution for structured web data extraction.
    • Q. Hao, R. Cai, Y. Pang, and L. Zhang. From one tree to a forest: a unified solution for structured web data extraction. In SIGIR, 2011.
    • (2011) SIGIR
    • Hao, Q.1    Cai, R.2    Pang, Y.3    Zhang, L.4
  • 25
    • 75449088101 scopus 로고    scopus 로고
    • Uninterpreted schema matching with embedded value mapping under opaque column names and data values.
    • A. Jaiswal, D. Miller, and P. Mitra. Uninterpreted schema matching with embedded value mapping under opaque column names and data values. IEEE Trans. Knowl. Data Eng., 22(2):291-304, 2010.
    • (2010) IEEE Trans. Knowl. Data Eng. , vol.22 , Issue.2 , pp. 291-304
    • Jaiswal, A.1    Miller, D.2    Mitra, P.3
  • 26
    • 1142267348 scopus 로고    scopus 로고
    • On schema matching with opaque column names and data values.
    • J. Kang and J. F. Naughton. On schema matching with opaque column names and data values. In SIGMOD, 2003.
    • (2003) SIGMOD
    • Kang, J.1    Naughton, J.F.2
  • 27
    • 84875119323 scopus 로고    scopus 로고
    • Truth finding on the deep web: Is the problem solved?
    • X. Li, X. L. Dong, K. Lyons, W. Meng, and D. Srivastava. Truth finding on the deep web: Is the problem solved? PVLDB, 6(2), 2013.
    • (2013) PVLDB , vol.6 , Issue.2
    • Li, X.1    Dong, X.L.2    Lyons, K.3    Meng, W.4    Srivastava, D.5
  • 29
    • 34548080780 scopus 로고    scopus 로고
    • Introduction to Information Retrieval
    • Cambridge University Press
    • C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval, Cambridge University Press, 2008.
    • (2008)
    • Manning, C.D.1    Raghavan, P.2    Schütze, H.3
  • 31
    • 77649265164 scopus 로고    scopus 로고
    • Learning to adapt web information extraction knowledge and discovering new attributes via a bayesian approach.
    • T.-L. Wong and W. Lam. Learning to adapt web information extraction knowledge and discovering new attributes via a bayesian approach. IEEE Trans. Knowl. Data Eng., 22(4):523-536, 2010.
    • (2010) IEEE Trans. Knowl. Data Eng. , vol.22 , Issue.4 , pp. 523-536
    • Wong, T.-L.1    Lam, W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.