메뉴 건너뛰기




Volumn , Issue , 2013, Pages 261-271

A framework for learning Web wrappers from the crowd

Author keywords

Active learning; Crowdsourcing; Wrapper generation

Indexed keywords

CROWDSOURCING; WEBSITES;

EID: 84893059618     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2488388.2488412     Document Type: Conference Paper
Times cited : (23)

References (16)
  • 1
    • 0742284346 scopus 로고    scopus 로고
    • Queries revisited
    • D. Angluin. Queries revisited. Theor. Comput. Sci., 313(2):175-194, 2004.
    • (2004) Theor. Comput. Sci. , vol.313 , Issue.2 , pp. 175-194
    • Angluin, D.1
  • 2
    • 1142303684 scopus 로고    scopus 로고
    • Extracting structured data from web pages
    • ACM
    • A. Arasu and H. Garcia-Molina. Extracting structured data from web pages. In SIGMOD Conference, pages 337-348. ACM, 2003.
    • (2003) SIGMOD Conference , pp. 337-348
    • Arasu, A.1    Garcia-Molina, H.2
  • 3
    • 77955659703 scopus 로고    scopus 로고
    • The true sample complexity of active learning
    • M.-F. Balcan, S. Hanneke, and J. W. Vaughan. The true sample complexity of active learning. Machine Learning, 80(2-3):111-139, 2010.
    • (2010) Machine Learning , vol.80 , Issue.2-3 , pp. 111-139
    • Balcan, M.-F.1    Hanneke, S.2    Vaughan, J.W.3
  • 4
    • 85042021254 scopus 로고    scopus 로고
    • IEPAD: Information extraction based on pattern discovery
    • C.-H. Chang and S.-C. Lui. IEPAD: information extraction based on pattern discovery. In WWW, pages 681-688, 2001.
    • (2001) WWW , pp. 681-688
    • Chang, C.-H.1    Lui, S.-C.2
  • 5
    • 84893127452 scopus 로고    scopus 로고
    • Minimizing the costs of the training data for learning web wrappers
    • R. Creo, V. Crescenzi, D. Qiu, and P. Merialdo. Minimizing the costs of the training data for learning web wrappers. In VLDS, pages 35-40, 2012.
    • (2012) VLDS , pp. 35-40
    • Creo, R.1    Crescenzi, V.2    Qiu, D.3    Merialdo, P.4
  • 6
    • 12344333240 scopus 로고    scopus 로고
    • Automatic information extraction from large websites
    • V. Crescenzi and G. Mecca. Automatic information extraction from large websites. J. ACM, 51(5):731-779, 2004.
    • (2004) J. ACM , vol.51 , Issue.5 , pp. 731-779
    • Crescenzi, V.1    Mecca, G.2
  • 8
    • 84861026711 scopus 로고    scopus 로고
    • Automatic wrappers for large scale web extraction
    • N. N. Dalvi, R. Kumar, and M. A. Soliman. Automatic wrappers for large scale web extraction. PVLDB, 4(4):219-230, 2011.
    • (2011) PVLDB , vol.4 , Issue.4 , pp. 219-230
    • Dalvi, N.N.1    Kumar, R.2    Soliman, M.A.3
  • 9
    • 34250750133 scopus 로고    scopus 로고
    • Interactive wrapper generation with minimal user effort
    • ACM
    • U. Irmak and T. Suel. Interactive wrapper generation with minimal user effort. In WWW, pages 553-563. ACM, 2006.
    • (2006) WWW , pp. 553-563
    • Irmak, U.1    Suel, T.2
  • 11
    • 3142745227 scopus 로고    scopus 로고
    • The lixto data extraction project - Back and forth between theory and practice
    • ACM
    • G. Gottlob, C. Koch, R. Baumgartner, M. Herzog, and S. Flesca. The lixto data extraction project - back and forth between theory and practice. In PODS, pages 1-12. ACM, 2004.
    • (2004) PODS , pp. 1-12
    • Gottlob, G.1    Koch, C.2    Baumgartner, R.3    Herzog, M.4    Flesca, S.5
  • 13
    • 68949137209 scopus 로고    scopus 로고
    • Active learning literature survey
    • University of Wisconsin-Madison
    • B. Settles. Active learning literature survey. Computer Sciences Technical Report 1648, University of Wisconsin-Madison, 2009.
    • (2009) Computer Sciences Technical Report 1648
    • Settles, B.1
  • 15
    • 0032594959 scopus 로고    scopus 로고
    • An overview of statistical learning theory
    • V. Vapnik. An overview of statistical learning theory. IEEE Transactions on Neural Networks, 10(5):988-999, 1999.
    • (1999) IEEE Transactions on Neural Networks , vol.10 , Issue.5 , pp. 988-999
    • Vapnik, V.1
  • 16
    • 33750797710 scopus 로고    scopus 로고
    • Structured data extraction from the web based on partial tree alignment
    • Y. Zhai and B. Liu. Structured data extraction from the web based on partial tree alignment. IEEE Trans. Knowl. Data Eng., 18(12):1614-1628, 2006.
    • (2006) IEEE Trans. Knowl. Data Eng. , vol.18 , Issue.12 , pp. 1614-1628
    • Zhai, Y.1    Liu, B.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.