Volume , Issue , 2014, Pages 164-175

Cleaning inconsistencies in information extraction via prioritized repairs

Author keywords

Database repairs; Document spanners; Extraction inconsistency; Information extraction; Prioritized repairs; Regular expressions

Indexed keywords

BIG DATA; CLEANING; DATABASE SYSTEMS; INFORMATION RETRIEVAL; PATTERN MATCHING; SEMANTICS

EID: 84904283591     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2594538.2594540     Document Type: Conference Paper
Times cited : (22)

References (41)
  • 3
    • 0032640897
    • Consistent query answers in inconsistent databases
    • M. Arenas, L. E. Bertossi, and J. Chomicki. Consistent query answers in inconsistent databases. In PODS, pages 68-79, 1999.
    • (1999) PODS , pp. 68-79
    • Arenas, M.1    Bertossi, L.E.2    Chomicki, J.3
  • 4
    • 84859079317
    • Event discovery in social media feeds
    • E. Benson, A. Haghighi, and R. Barzilay. Event discovery in social media feeds. In ACL, pages 389-398, 2011.
    • (2011) ACL , pp. 389-398
    • Benson, E.1    Haghighi, A.2    Barzilay, R.3
  • 5
    • 84874803089
    • Data cleaning and query answering with matching dependencies and matching functions
    • L. E. Bertossi, S. Kolahi, and L. V. S. Lakshmanan. Data cleaning and query answering with matching dependencies and matching functions. Theory Comput. Syst., 52(3):441-482, 2013.
    • (2013) Theory Comput. Syst , vol.52 , Issue.3 , pp. 441-482
    • Bertossi, L.E.1    Kolahi, S.2    Lakshmanan, L.V.S.3
  • 9
    • 3142696208
    • Gate, a general architecture for text engineering
    • H. Cunningham. Gate, a general architecture for text engineering. Computers and the Humanities, 36(2):223-254, 2002.
    • (2002) Computers and the Humanities , vol.36 , Issue.2 , pp. 223-254
    • Cunningham, H.1
  • 10
    • 8344280154
    • JAPE: A Java annotation patterns engine (second edition)
    • Department of Computer Science, University of Sheffield, November
    • H. Cunningham, D. Maynard, and V. Tablan. JAPE: a Java Annotation Patterns Engine (Second Edition). Research Memorandum CS-00-10, Department of Computer Science, University of Sheffield, November 2000.
    • (2000) Research Memorandum CS-00-10
    • Cunningham, H.1    Maynard, D.2    Tablan, V.3
  • 11
    • 0002626722
    • An overview of the frump system
    • W. G. Lehnert and M. H. Ringle, editors Lawrence Erlbaum Associates
    • G. DeJong. An overview of the frump system. In W. G. Lehnert and M. H. Ringle, editors, Strategies for natural language processing, pages 149-176. Lawrence Erlbaum Associates, 1982.
    • (1982) Strategies for Natural Language Processing , pp. 149-176
    • DeJong, G.1
  • 12
    • 84891141678
    • A temporal-probabilistic database model for information extraction
    • M. Dylla, I. Miliaraki, and M. Theobald. A temporal-probabilistic database model for information extraction. PVLDB, 6(14):1810-1821, 2013.
    • (2013) PVLDB , vol.6 , Issue.14 , pp. 1810-1821
    • Dylla, M.1    Miliaraki, I.2    Theobald, M.3
  • 14
    • 84880519492
    • Spanners: A formal framework for information extraction
    • R. Fagin, B. Kimelfeld, F. Reiss, and S. Vansummeren. Spanners: a formal framework for information extraction. In PODS, pages 37-48, 2013.
    • (2013) PODS , pp. 37-48
    • Fagin, R.1    Kimelfeld, B.2    Reiss, F.3    Vansummeren, S.4
  • 15
    • 57549084481
    • Dependencies revisited for improving data quality
    • W. Fan. Dependencies revisited for improving data quality. In PODS, pages 159-170, 2008.
    • (2008) PODS , pp. 159-170
    • Fan, W.1
  • 16
    • 79960451998
    • Dynamic constraints for record matching
    • W. Fan, H. Gao, X. Jia, J. Li, and S. Ma. Dynamic constraints for record matching. VLDB J., 20(4):495-520, 2011.
    • (2011) VLDB J , vol.20 , Issue.4 , pp. 495-520
    • Fan, W.1    Gao, H.2    Jia, X.3    Li, J.4    Ma, S.5
  • 17
    • 79959944062
    • Interaction between record matching and data repairing
    • W. Fan, J. Li, S. Ma, N. Tang, and W. Yu. Interaction between record matching and data repairing. In SIGMOD Conference, pages 469-480, 2011.
    • (2011) SIGMOD Conference , pp. 469-480
    • Fan, W.1    Li, J.2    Ma, S.3    Tang, N.4    Yu, W.5
  • 18
    • 7444236194
    • UIMA: An architectural approach to unstructured information processing in the corporate research environment
    • D. A. Ferrucci and A. Lally. UIMA: an architectural approach to unstructured information processing in the corporate research environment. Natural Language Engineering, 10(3-4):327-348, 2004.
    • (2004) Natural Language Engineering , vol.10 , Issue.3-4 , pp. 327-348
    • Ferrucci, D.A.1    Lally, A.2
  • 20
    • 85149129374
    • Toward general-purpose learning for information extraction
    • D. Freitag. Toward general-purpose learning for information extraction. In COLING-ACL, pages 404-408, 1998.
    • (1998) COLING-ACL , pp. 404-408
    • Freitag, D.1
  • 21
    • 77951157255
    • Execution anomaly detection in distributed systems through unstructured log analysis
    • Q. Fu, J.-G. Lou, Y. Wang, and J. Li. Execution anomaly detection in distributed systems through unstructured log analysis. In ICDM, pages 149-158, 2009.
    • (2009) ICDM , pp. 149-158
    • Fu, Q.1    Lou, J.-G.2    Wang, Y.3    Li, J.4
  • 22
    • 0002686947
    • Message understanding conference-6: A brief history
    • R. Grishman and B. Sundheim. Message understanding conference-6: A brief history. In COLING, pages 466-471, 1996.
    • (1996) COLING , pp. 466-471
    • Grishman, R.1    Sundheim, B.2
  • 23
    • 84857891942
    • Multi-head finite automata: Characterizations, concepts and open problems
    • volume 1 of EPTCS
    • M. Holzer, M. Kutrib, and A. Malcher. Multi-head finite automata: Characterizations, concepts and open problems. In CSP, volume 1 of EPTCS, pages 93-107, 2008.
    • (2008) CSP , pp. 93-107
    • Holzer, M.1    Kutrib, M.2    Malcher, A.3
  • 24
    • 84904320910
    • Institute of Electrical and Electronics Engineers and The Open Group. IEEE Std 1003.1, 2013 Edition
    • Institute of Electrical and Electronics Engineers and The Open Group. The Open Group Base Specifications Issue 7, 2013. IEEE Std 1003.1, 2013 Edition.
    • (2013) The Open Group Base Specifications , Issue.7
  • 25
    • 0142192295
    • Conditional random fields: Probabilistic models for segmenting and labeling sequence data
    • J. D. Lafferty, A. McCallum, and F. C. N. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In ICML, pages 282-289, 2001.
    • (2001) ICML , pp. 282-289
    • Lafferty, J.D.1    McCallum, A.2    Pereira, F.C.N.3
  • 28
    • 79959981102
    • Automatic rule refinement for information extraction
    • B. Liu, L. Chiticariu, V. Chu, H. V. Jagadish, and F. Reiss. Automatic rule refinement for information extraction. PVLDB, 3(1):588-597, 2010.
    • (2010) PVLDB , vol.3 , Issue.1 , pp. 588-597
    • Liu, B.1    Chiticariu, L.2    Chu, V.3    Jagadish, H.V.4    Reiss, F.5
  • 29
    • 84890119754
    • Extending inclusion dependencies with conditions
    • S. Ma, W. Fan, and L. Bravo. Extending inclusion dependencies with conditions. Theor. Comput. Sci., 515:64-95, 2014.
    • (2014) Theor. Comput. Sci , vol.515 , pp. 64-95
    • Ma, S.1    Fan, W.2    Bravo, L.3
  • 30
    • 0000747663
    • Maximum entropy Markov models for information extraction and segmentation
    • A. McCallum, D. Freitag, and F. C. N. Pereira. Maximum entropy markov models for information extraction and segmentation. In ICML, pages 591-598, 2000.
    • (2000) ICML , pp. 591-598
    • McCallum, A.1    Freitag, D.2    Pereira, F.C.N.3
  • 31
    • 83055176154
    • Tuffy: Scaling up statistical inference in Markov logic networks using an RDBMS
    • F. Niu, C. Ré, A. Doan, and J. W. Shavlik. Tuffy: Scaling up statistical inference in Markov Logic Networks using an RDBMS. PVLDB, 4(6):373-384, 2011.
    • (2011) PVLDB , vol.4 , Issue.6 , pp. 373-384
    • Niu, F.1    Ré, C.2    Doan, A.3    Shavlik, J.W.4
  • 32
    • 79951631701
    • Disambiguation in regular expression matching via position automata with augmented transitions
    • M. Domaratzki and K. Salomaa, editors volume 6482 of Lecture Notes in Computer Science
    • S. Okui and T. Suzuki. Disambiguation in regular expression matching via position automata with augmented transitions. In M. Domaratzki and K. Salomaa, editors, CIAA, volume 6482 of Lecture Notes in Computer Science, pages 231-240, 2010.
    • (2010) CIAA , pp. 231-240
    • Okui, S.1    Suzuki, T.2
  • 33
    • 36348979272
    • Joint inference in information extraction
    • AAAI Press
    • H. Poon and P. Domingos. Joint inference in information extraction. In AAAI, pages 913-918. AAAI Press, 2007.
    • (2007) AAAI , pp. 913-918
    • Poon, H.1    Domingos, P.2
  • 35
    • 0027709268
    • Automatically constructing a dictionary for information extraction tasks
    • E. Riloff. Automatically constructing a dictionary for information extraction tasks. In AAAI, pages 811-816, 1993.
    • (1993) AAAI , pp. 811-816
    • Riloff, E.1
  • 36
    • 77951560898
    • Declarative information extraction using datalog with embedded extraction predicates
    • W. Shen, A. Doan, J. F. Naughton, and R. Ramakrishnan. Declarative information extraction using datalog with embedded extraction predicates. In VLDB, pages 1033-1044, 2007.
    • (2007) VLDB , pp. 1033-1044
    • Shen, W.1    Doan, A.2    Naughton, J.F.3    Ramakrishnan, R.4
  • 37
  • 38
    • 84861227359
    • Prioritized repairing and consistent query answering in relational databases
    • S. Staworko, J. Chomicki, and J. Marcinkowski. Prioritized repairing and consistent query answering in relational databases. Ann. Math. Artif. Intell., 64(2-3):209-246, 2012.
    • (2012) Ann. Math. Artif. Intell , vol.64 , Issue.2-3 , pp. 209-246
    • Staworko, S.1    Chomicki, J.2    Marcinkowski, J.3
  • 39
    • 33745396295
    • Type inference for unique pattern matching
    • S. Vansummeren. Type inference for unique pattern matching. ACM Trans. Program. Lang. Syst., 28(3):389-428, 2006.
    • (2006) ACM Trans. Program. Lang. Syst , vol.28 , Issue.3 , pp. 389-428
    • Vansummeren, S.1
  • 40
    • 77950477064
    • Application of information technology: Medex: A medication information extraction system for clinical narratives
    • H. Xu, S. P. Stenner, S. Doan, K. B. Johnson, L. R. Waitman, and J. C. Denny. Application of information technology: Medex: a medication information extraction system for clinical narratives. JAMIA, 17(1):19-24, 2010.
    • (2010) JAMIA , vol.17 , Issue.1 , pp. 19-24
    • Xu, H.1    Stenner, S.P.2    Doan, S.3    Johnson, K.B.4    Waitman, L.R.5    Denny, J.C.6
  • 41
    • 35348866211
    • Navigating the intranet with high precision
    • H. Zhu, S. Raghavan, S. Vaithyanathan, and A. Löser. Navigating the intranet with high precision. In WWW, pages 491-500, 2007.
    • (2007) WWW , pp. 491-500
    • Zhu, H.1    Raghavan, S.2    Vaithyanathan, S.3    Löser, A.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.