메뉴 건너뛰기




Volumn , Issue , 2011, Pages 1285-1294

Enabling information extraction by inference of regular expressions from sample entities

Author keywords

information extraction; machine learning; minimum description length; regular expressions

Indexed keywords

ENTERPRISE DATABASE; EXPERT KNOWLEDGE; INFORMATION EXTRACTION; MACHINE-LEARNING; MINIMUM DESCRIPTION LENGTH; PRODUCT CATALOGS; PRODUCT NAME; RECALL AND PRECISION; REGULAR EXPRESSIONS; TEXT DATA;

EID: 83055186929     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2063576.2063763     Document Type: Conference Paper
Times cited : (61)

References (20)
  • 1
    • 77952077835 scopus 로고    scopus 로고
    • Inference of concise regular expressions and dtds
    • G. J. Bex, F. Neven, T. Schwentick, and S. Vansummeren. Inference of concise regular expressions and dtds. TODS, 35(2):1-47, 2010.
    • (2010) TODS , vol.35 , Issue.2 , pp. 1-47
    • Bex, G.J.1    Neven, F.2    Schwentick, T.3    Vansummeren, S.4
  • 2
    • 84946559439 scopus 로고    scopus 로고
    • Regular language inference for domain-specific named entity recognition
    • F. Brauer, R. Rieger, W. Barczynski, and A. Mocan. Regular language inference for domain-specific named entity recognition. In IADIS WWW/Internet, 2009.
    • (2009) IADIS WWW/Internet
    • Brauer, F.1    Rieger, R.2    Barczynski, W.3    Mocan, A.4
  • 3
    • 0032596556 scopus 로고    scopus 로고
    • Two dimensional generalization in information extraction
    • J. Y. Chai, A. W. Biermann, and C. I. Guinn. Two dimensional generalization in information extraction. In AAAI, pages 431-438, 1999.
    • (1999) AAAI , pp. 431-438
    • Chai, J.Y.1    Biermann, A.W.2    Guinn, C.I.3
  • 4
    • 84880859303 scopus 로고    scopus 로고
    • Adaptive information extraction from text by rule induction and generalisation
    • F. Ciravegna. Adaptive information extraction from text by rule induction and generalisation. In IJCAI, pages 1251-1256, 2001.
    • (2001) IJCAI , pp. 1251-1256
    • Ciravegna, F.1
  • 5
    • 60849094038 scopus 로고    scopus 로고
    • Algorithms for learning regular expressions from positive data
    • H. Fernau. Algorithms for learning regular expressions from positive data. Information and Computation, 207(4):521-541, 2009.
    • (2009) Information and Computation , vol.207 , Issue.4 , pp. 521-541
    • Fernau, H.1
  • 6
    • 0033907729 scopus 로고    scopus 로고
    • Machine learning for information extraction in informal domains
    • D. Freitag. Machine learning for information extraction in informal domains. Machine Learning, 39(2-3):169-202, 2000. (Pubitemid 30594820)
    • (2000) Machine Learning , vol.39 , Issue.2 , pp. 169-202
    • Freitag, D.1
  • 7
    • 0000216094 scopus 로고    scopus 로고
    • Xtract: A system for extracting document type descriptors from xml documents
    • M. N. Garofalakis, A. Gionis, R. Rastogi, S. Seshadri, and K. Shim. Xtract: A system for extracting document type descriptors from xml documents. In SIGMOD, pages 165-176, 2000.
    • (2000) SIGMOD , pp. 165-176
    • Garofalakis, M.N.1    Gionis, A.2    Rastogi, R.3    Seshadri, S.4    Shim, K.5
  • 8
    • 56849129940 scopus 로고    scopus 로고
    • A tutorial introduction to the minimum description length principle
    • math.ST/0406077
    • P. Grünwald. A tutorial introduction to the minimum description length principle. CoRR, math.ST/0406077, 2004.
    • (2004) CoRR
    • Grünwald, P.1
  • 9
    • 57149125427 scopus 로고    scopus 로고
    • Naga: Harvesting, searching and ranking knowledge
    • Association for Computing Machinery (ACM)
    • G. Kasneci, F. M. Suchanek, G. Ifrim, S. Elbassuoni, M. Ramanath, and G. Weikum. Naga: Harvesting, searching and ranking knowledge. In SIGMOD, pages 1285-1288. Association for Computing Machinery (ACM), 2008.
    • (2008) SIGMOD , pp. 1285-1288
    • Kasneci, G.1    Suchanek, F.M.2    Ifrim, G.3    Elbassuoni, S.4    Ramanath, M.5    Weikum, G.6
  • 10
    • 0142192295 scopus 로고    scopus 로고
    • Conditional random fields: Probabilistic models for segmenting and labeling sequence data
    • J. D. Lafferty, A. McCallum, and F. C. N. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In ICML, pages 282-289, 2001.
    • (2001) ICML , pp. 282-289
    • Lafferty, J.D.1    McCallum, A.2    Pereira, F.C.N.3
  • 12
    • 33749612526 scopus 로고    scopus 로고
    • Integrating unstructured data into relational databases
    • I. R. Mansuri and S. Sarawagi. Integrating unstructured data into relational databases. In ICDE, page 29, 2006.
    • (2006) ICDE , pp. 29
    • Mansuri, I.R.1    Sarawagi, S.2
  • 13
    • 70349138710 scopus 로고    scopus 로고
    • High-performance information extraction with alibaba
    • P. Palaga, L. Nguyen, U. Leser, and J. Hakenberg. High-performance information extraction with alibaba. In EDBT, pages 1140-1143, 2009.
    • (2009) EDBT , pp. 1140-1143
    • Palaga, P.1    Nguyen, L.2    Leser, U.3    Hakenberg, J.4
  • 14
    • 0242479542 scopus 로고    scopus 로고
    • Automata induction, grammar inference, and language acquisition
    • Marcel Dekker Inc., New York, Ny, USA
    • R. Parekh and V. Honavar. Automata induction, grammar inference, and language acquisition. In The Handbook of Natural Language Processing, pages 727-764. Marcel Dekker Inc., New York, Ny, USA, 2000.
    • (2000) The Handbook of Natural Language Processing , pp. 727-764
    • Parekh, R.1    Honavar, V.2
  • 15
    • 84944315993 scopus 로고    scopus 로고
    • Potter's wheel: An interactive data cleaning system
    • V. Raman and J. M. Hellerstein. Potter's wheel: An interactive data cleaning system. In VLDB, pages 381-390, 2001.
    • (2001) VLDB , pp. 381-390
    • Raman, V.1    Hellerstein, J.M.2
  • 17
    • 0032624184 scopus 로고    scopus 로고
    • Learning information extraction rules for semi-structured and free text
    • S. Soderland. Learning information extraction rules for semi-structured and free text. Machine Learning, 34(1-3):233-272, 1999.
    • (1999) Machine Learning , vol.34 , Issue.1-3 , pp. 233-272
    • Soderland, S.1
  • 18
    • 78549292887 scopus 로고    scopus 로고
    • A context pattern induction method for named entity extraction
    • P. P. Talukdar, T. Brants, M. Liberman, and F. Pereira. A context pattern induction method for named entity extraction. In CoNLL, pages 141-148, 2006.
    • (2006) CoNLL , pp. 141-148
    • Talukdar, P.P.1    Brants, T.2    Liberman, M.3    Pereira, F.4
  • 20
    • 25144508650 scopus 로고    scopus 로고
    • Discovering patterns to extract protein-protein interactions from the literature: Part II
    • DOI 10.1093/bioinformatics/bti493
    • H. Yu, X. Zhu, M. Huang, and M. Li. Discovering patterns to extract protein-protein interactions from the literature: Part ii. Bioinformatics, 21(15):3294-3300, 2005. (Pubitemid 41418444)
    • (2005) Bioinformatics , vol.21 , Issue.15 , pp. 3294-3300
    • Hao, Y.1    Zhu, X.2    Huang, M.3    Li, M.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.