메뉴 건너뛰기




Volumn 2006, Issue , 2006, Pages 494-503

Simultaneous record detection and attribute labeling in web data extraction

Author keywords

Attribute labeling; Conditional Random Fields; Data record detection; Hierarchical Conditional Random Fields; Web page segmentation

Indexed keywords

DATA ACQUISITION; DATA MINING; HIERARCHICAL SYSTEMS; MATHEMATICAL MODELS; MULTITASKING; WORLD WIDE WEB;

EID: 33749623896     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1150402.1150457     Document Type: Conference Paper
Times cited : (141)

References (33)
  • 2
    • 83055165536 scopus 로고    scopus 로고
    • Collective information extraction with relational Markov networks
    • Bunescu, R. C., and Mooney, R. J. Collective information extraction with relational Markov networks. In Proc. of ACL 2004.
    • (2004) Proc. of ACL
    • Bunescu, R.C.1    Mooney, R.J.2
  • 3
    • 0035000412 scopus 로고    scopus 로고
    • A fully automated object extraction system for the world wide web
    • Buttler, D., Liu, L., and Pu, C. A Fully Automated Object Extraction System for the World Wide Web. In Proc. of IEEE ICDCS, 2001.
    • (2001) Proc. of IEEE ICDCS
    • Buttler, D.1    Liu, L.2    Pu, C.3
  • 5
    • 2342568689 scopus 로고    scopus 로고
    • IEPAD: Information extraction based on pattern discovery
    • Chang, C.-H., and Liu, S.-L. IEPAD: Information Extraction Based on Pattern Discovery. In Proc. of WWW, 2001.
    • (2001) Proc. of WWW
    • Chang, C.-H.1    Liu, S.-L.2
  • 6
    • 0004014502 scopus 로고    scopus 로고
    • A Gaussian prior for smoothing maximum entropy models
    • Carnegie Mellon University
    • Chen, S. F., and Rosenfeld, R. A Gaussian Prior for Smoothing Maximum Entropy Models. Technical Report CMU-CS-99-108, Carnegie Mellon University, 1999.
    • (1999) Technical Report , vol.CMU-CS-99-108
    • Chen, S.F.1    Rosenfeld, R.2
  • 7
    • 12244290581 scopus 로고    scopus 로고
    • Exploiting dictionaries in named entity extraction: Combining Semi-Markov extraction processes and data integration methods
    • Cohen, W. W., and Sarawagi, S. Exploiting Dictionaries in Named Entity Extraction: Combining Semi-Markov Extraction Processes and Data Integration Methods. In Proc. of SIGKDD, 2004.
    • (2004) Proc. of SIGKDD
    • Cohen, W.W.1    Sarawagi, S.2
  • 9
    • 84944327150 scopus 로고    scopus 로고
    • ROADRUNNER: Towards automatic data extraction from large web sites
    • Crescenzi, V., Mecca, G., and Merialdo, P. ROADRUNNER: Towards Automatic Data Extraction from Large Web Sites. In Proc. of VLDB, 2001.
    • (2001) Proc. of VLDB
    • Crescenzi, V.1    Mecca, G.2    Merialdo, P.3
  • 11
    • 0032119668 scopus 로고    scopus 로고
    • The hierarchical hidden Markov model: Analysis and applications
    • Fine, S., Singer Y., and Tishby, N. The hierarchical hidden Markov model: Analysis and applications. Machine Learning, 32:41-62, 1998.
    • (1998) Machine Learning , vol.32 , pp. 41-62
    • Fine, S.1    Singer, Y.2    Tishby, N.3
  • 12
    • 22944476409 scopus 로고    scopus 로고
    • Multi-level boundary classification for information extraction
    • Finn, A., and Kushmerick, N. Multi-level boundary classification for information extraction. In Proc. of ECML, 2004.
    • (2004) Proc. of ECML
    • Finn, A.1    Kushmerick, N.2
  • 15
    • 33745848658 scopus 로고    scopus 로고
    • A hierarchical field framework for unified context-based classification
    • Kumar, S., and Hebert, M. A Hierarchical Field Framework for Unified Context-Based Classification. In Proc. of ICCV, 2005.
    • (2005) Proc. of ICCV
    • Kumar, S.1    Hebert, M.2
  • 16
    • 0034172374 scopus 로고    scopus 로고
    • Wrapper induction: Efficiency and expressiveness
    • Kushmerick, N. Wrapper induction: efficiency and expressiveness. Artificial Intelligence, 118:15-68, 2000.
    • (2000) Artificial Intelligence , vol.118 , pp. 15-68
    • Kushmerick, N.1
  • 17
    • 0142192295 scopus 로고    scopus 로고
    • Conditional random fields: Probabilistic models for segmenting and labelling sequence data
    • Lafferty, J., McCallum, A., and Pereira, F. Conditional random fields: Probabilistic models for segmenting and labelling sequence data. In Proc. of ICML, 2001.
    • (2001) Proc. of ICML
    • Lafferty, J.1    McCallum, A.2    Pereira, F.3
  • 18
  • 20
    • 33947507692 scopus 로고    scopus 로고
    • Location-based activity recognition
    • Liao, L., Fox, D., and Kautz, H. Location-based activity recognition. In Proc. of NIPS, 2005.
    • (2005) Proc. of NIPS
    • Liao, L.1    Fox, D.2    Kautz, H.3
  • 21
    • 33646887390 scopus 로고
    • On the limited memory BFGS method for large scale optimization
    • Liu, D. C., and Nocedal, J. On The Limited Memory BFGS Method for Large Scale Optimization. Mathematical Programming 45, pp. 503-528, 1989.
    • (1989) Mathematical Programming , vol.45 , pp. 503-528
    • Liu, D.C.1    Nocedal, J.2
  • 22
    • 1042264823 scopus 로고    scopus 로고
    • A comparison of algorithms for maximum entropy parameter estimation
    • Malouf, R. A comparison of algorithms for maximum entropy parameter estimation. In Sixth Conf. on Natural Language Learning, pages 49-55, 2002.
    • (2002) Sixth Conf. on Natural Language Learning , pp. 49-55
    • Malouf, R.1
  • 23
    • 0035587215 scopus 로고    scopus 로고
    • Hierarchical wrapper induction for semi-structured information sources
    • 2001
    • Muslea, I., Minton, S., and Knoblock C. A. Hierarchical Wrapper Induction for Semi-structured Information Sources. Autonomous Agents and Multi-Agent 4, 1/2 (2001), 2001.
    • (2001) Autonomous Agents and Multi-agent , vol.4 , Issue.1-2
    • Muslea, I.1    Minton, S.2    Knoblock, C.A.3
  • 24
    • 0001868006 scopus 로고    scopus 로고
    • A mutually beneficial integration of data mining and information extraction
    • Nahm, U. Y., and Mooney, R. J. A Mutually Beneficial Integration of Data Mining and Information Extraction. In Proc. of AAAI, 2001.
    • (2001) Proc. of AAAI
    • Nahm, U.Y.1    Mooney, R.J.2
  • 25
    • 34047192804 scopus 로고    scopus 로고
    • Semi-Markov conditional random fields for information extraction
    • Sarawagi, S., and Cohen, W. W. Semi-Markov Conditional Random Fields for Information Extraction. In Proc. of NIPS, 2004.
    • (2004) Proc. of NIPS
    • Sarawagi, S.1    Cohen, W.W.2
  • 26
    • 84880805498 scopus 로고    scopus 로고
    • Hierarchical hidden Markov models for information extraction
    • Skounakis, M., Craven, M., and Ray S. Hierarchical Hidden Markov Models for Information Extraction. In Proc. of IJCAI, 2003.
    • (2003) Proc. of IJCAI
    • Skounakis, M.1    Craven, M.2    Ray, S.3
  • 27
  • 28
    • 14344253846 scopus 로고    scopus 로고
    • Dynamic conditional random fields: Factorized probabilistic models for labeling and segmenting sequence data
    • Sutton, C., Rohanimanesh, K., and McCallum, A. Dynamic Conditional Random Fields: Factorized Probabilistic Models for Labeling and Segmenting Sequence Data. In Proc. of ICML, 2004.
    • (2004) Proc. of ICML
    • Sutton, C.1    Rohanimanesh, K.2    McCallum, A.3
  • 29
    • 29244441994 scopus 로고    scopus 로고
    • An integrated, conditional model of information extraction and coreference with application to citation matching
    • Wellner, B., McCallum, A., Peng, F., and Hay, M. An Integrated, Conditional Model of Information Extraction and Coreference with Application to Citation Matching. In Proc. of UAI, 2004.
    • (2004) Proc. of UAI
    • Wellner, B.1    McCallum, A.2    Peng, F.3    Hay, M.4
  • 30
    • 77952370025 scopus 로고    scopus 로고
    • Eliminating noisy information in web pages for data mining
    • Yi, L., Liu, B., and Li, X. Eliminating Noisy Information in Web Pages for Data Mining. In Proc. of SIGKDD, 2003.
    • (2003) Proc. of SIGKDD
    • Yi, L.1    Liu, B.2    Li, X.3
  • 31
    • 33744821948 scopus 로고    scopus 로고
    • Web data extraction based on partial tree alignment
    • Zhai, Y., and Liu, B. Web Data Extraction Based on Partial Tree Alignment. In Proc. of WWW, 2005.
    • (2005) Proc. of WWW
    • Zhai, Y.1    Liu, B.2
  • 32
  • 33
    • 31844452562 scopus 로고    scopus 로고
    • 2D conditional random fields for web information extraction
    • Zhu, J., Nie, Z., Wen, J.-R., Zhang, B., and Ma, W.-Y. 2D Conditional Random Fields for Web Information Extraction. In Proc. of ICML, 2005.
    • (2005) Proc. of ICML
    • Zhu, J.1    Nie, Z.2    Wen, J.-R.3    Zhang, B.4    Ma, W.-Y.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.