메뉴 건너뛰기




Volumn , Issue , 2012, Pages 1-270

Data matching: Concepts and techniques for record linkage, entity resolution, and duplicate detection

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; DATA HANDLING; DATA MINING; DIGITAL LIBRARIES; INDEXING (MATERIALS WORKING); LEARNING SYSTEMS; OPEN SYSTEMS;

EID: 85031021895     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1007/978-3-642-31164-2     Document Type: Book
Times cited : (818)

References (303)
  • 1
    • 84920600570 scopus 로고    scopus 로고
    • Efficient record linkage using a double embedding scheme
    • Las Vegas
    • Adly, N.: Efficient record linkage using a double embedding scheme. In: DMIN, pp. 274-281. Las Vegas (2009)
    • (2009) DMIN , pp. 274-281
    • Adly, N.1
  • 2
    • 63349112872 scopus 로고    scopus 로고
    • Managing and Mining Uncertain Data
    • Springer
    • Aggarwal, C.C.: Managing and Mining Uncertain Data, Advances in Database Systems, vol. 35. Springer (2009)
    • (2009) Advances in Database Systems , vol.35
    • Aggarwal, C.C.1
  • 3
    • 0034592763 scopus 로고    scopus 로고
    • The IGrid index: Reversing the dimensionality curse for similarity indexing in high dimensional space
    • Boston
    • Aggarwal, C.C., Yu, P.S.: The IGrid index: Reversing the dimensionality curse for similarity indexing in high dimensional space. In: ACM SIGKDD, pp. 119-129. Boston (2000)
    • (2000) ACM SIGKDD , pp. 119-129
    • Aggarwal, C.C.1    Yu, P.S.2
  • 4
    • 80052799499 scopus 로고    scopus 로고
    • Privacy-preserving data mining: Models and algorithms
    • Springer
    • Aggarwal, C.C., Yu, P.S.: Privacy-preserving data mining: models and algorithms, Advances in Database Systems, vol. 34. Springer (2008)
    • (2008) Advances in Database Systems , vol.34
    • Aggarwal, C.C.1    Yu, P.S.2
  • 5
    • 12244298488 scopus 로고    scopus 로고
    • Mining reference tables for automatic text segmentation
    • Seattle
    • Agichtein, E., Ganti, V.: Mining reference tables for automatic text segmentation. In: ACM SIGKDD, pp. 20-29. Seattle (2004)
    • (2004) ACM SIGKDD , pp. 20-29
    • Agichtein, E.1    Ganti, V.2
  • 6
    • 1142303699 scopus 로고    scopus 로고
    • Information sharing across private databases
    • San Diego
    • Agrawal, R., Evfimievski, A., Srikant, R.: Information sharing across private databases. In: ACM SIGMOD, pp. 86-97. San Diego (2003)
    • (2003) ACM SIGMOD , pp. 86-97
    • Agrawal, R.1    Evfimievski, A.2    Srikant, R.3
  • 7
    • 33845363891 scopus 로고    scopus 로고
    • A fast linkage detection scheme for multi-source information integration
    • Tokyo
    • Aizawa, A., Oyama, K.: A fast linkage detection scheme for multi-source information integration. In: WIRI, pp. 30-39. Tokyo (2005)
    • (2005) WIRI , pp. 30-39
    • Aizawa, A.1    Oyama, K.2
  • 9
    • 85048686662 scopus 로고    scopus 로고
    • Interstate voter registration database matching: The Oregon-Washington 2008 pilot project
    • USENIX Association
    • Alvarez, R., Jonas, J., Winkler, W., Wright, R.: Interstate voter registration database matching: the Oregon-Washington 2008 pilot project. In: Workshop on Trustworthy Elections, pp. 17-17. USENIX Association (2009)
    • (2009) Workshop on Trustworthy Elections , pp. 17-17
    • Alvarez, R.1    Jonas, J.2    Winkler, W.3    Wright, R.4
  • 11
    • 77954717287 scopus 로고    scopus 로고
    • On active learning of record matching packages
    • Indianapolis
    • Arasu, A., Götz, M., Kaushik, R.: On active learning of record matching packages. In: ACM SIGMOD, pp. 783-794. Indianapolis (2010)
    • (2010) ACM SIGMOD , pp. 783-794
    • Arasu, A.1    Götz, M.2    Kaushik, R.3
  • 12
    • 70849095483 scopus 로고    scopus 로고
    • A grammar-based entity representation framework for data cleaning
    • Providence, Rhode Island
    • Arasu, A., Kaushik, R.: A grammar-based entity representation framework for data cleaning. In: ACM SIGMOD, pp. 233-244. Providence, Rhode Island (2009)
    • (2009) ACM SIGMOD , pp. 233-244
    • Arasu, A.1    Kaushik, R.2
  • 21
    • 35348849154 scopus 로고    scopus 로고
    • Scaling up all pairs similarity search
    • Banff, Canada
    • Bayardo, R., Ma, Y., Srikant, R.: Scaling up all pairs similarity search. In: WWW, pp. 131-140. Banff, Canada (2007)
    • (2007) WWW , pp. 131-140
    • Bayardo, R.1    Ma, Y.2    Srikant, R.3
  • 22
    • 67649641448 scopus 로고    scopus 로고
    • Space-constrained gram-based indexing for efficient approximate string search
    • Shanghai
    • Behm, A., Ji, S., Li, C., Lu, J.: Space-constrained gram-based indexing for efficient approximate string search. In: IEEE ICDE, pp. 604-615. Shanghai (2009)
    • (2009) IEEE ICDE , pp. 604-615
    • Behm, A.1    Ji, S.2    Li, C.3    Lu, J.4
  • 30
    • 34249831790 scopus 로고
    • Auction algorithms for network flow problems: A tutorial introduction
    • Bertsekas, D.P.: Auction algorithms for network flow problems: A tutorial introduction. Computational Optimization and Applications 1, 7-66 (1992)
    • (1992) Computational Optimization and Applications , vol.1 , pp. 7-66
    • Bertsekas, D.P.1
  • 33
    • 85089829325 scopus 로고    scopus 로고
    • Adaptive product normalization: Using online learning for record linkage in comparison shopping
    • Houston
    • Bilenko, M., Basu, S., Sahami, M.: Adaptive product normalization: Using online learning for record linkage in comparison shopping. In: IEEE ICDM, pp. 58-65. Houston (2005)
    • (2005) IEEE ICDM , pp. 58-65
    • Bilenko, M.1    Basu, S.2    Sahami, M.3
  • 34
    • 84878049861 scopus 로고    scopus 로고
    • Adaptive blocking: Learning to scale up record linkage
    • Hong Kong
    • Bilenko, M., Kamath, B., Mooney, R.J.: Adaptive blocking: Learning to scale up record linkage. In: IEEE ICDM, pp. 87-96. Hong Kong (2006)
    • (2006) IEEE ICDM , pp. 87-96
    • Bilenko, M.1    Kamath, B.2    Mooney, R.J.3
  • 35
    • 77952372966 scopus 로고    scopus 로고
    • Adaptive duplicate detection using learnable string similarity measures
    • Washington DC
    • Bilenko, M., Mooney, R.J.: Adaptive duplicate detection using learnable string similarity measures. In: ACM SIGKDD, pp. 39-48. Washington DC (2003)
    • (2003) ACM SIGKDD , pp. 39-48
    • Bilenko, M.1    Mooney, R.J.2
  • 37
    • 0036990263 scopus 로고    scopus 로고
    • Probabilistic record linkage and a method to calculate the positive predictive value
    • Blakely, T., Salmond, C.: Probabilistic record linkage and a method to calculate the positive predictive value. International Journal of Epidemiology 31:6, 1246-1252 (2002)
    • (2002) International Journal of Epidemiology , vol.31 , Issue.6 , pp. 1246-1252
    • Blakely, T.1    Salmond, C.2
  • 39
    • 0014814325 scopus 로고
    • Space/time trade-offs in hash coding with allowable errors
    • Bloom, B.: Space/time trade-offs in hash coding with allowable errors. Communications of the ACM 13(7), 422-426 (1970)
    • (1970) Communications of the ACM , vol.13 , Issue.7 , pp. 422-426
    • Bloom, B.1
  • 40
    • 84989573853 scopus 로고
    • Getty’s synonameTM and its cousins: A survey of applications of personal name-matching algorithms
    • Borgman, C.L., Siegfried, S.L.: Getty’s synonameTM and its cousins: A survey of applications of personal name-matching algorithms. Journal of the American Society for Information Science 43(7), 459-476 (1992)
    • (1992) Journal of the American Society for Information Science , vol.43 , Issue.7 , pp. 459-476
    • Borgman, C.L.1    Siegfried, S.L.2
  • 41
    • 0040748315 scopus 로고    scopus 로고
    • Automatic segmentation of text into structured records
    • Borkar, V., Deshmukh, K., Sarawagi, S.: Automatic segmentation of text into structured records. ACM SIGMOD Record 30(2), 175-186 (2001)
    • (2001) ACM SIGMOD Record , vol.30 , Issue.2 , pp. 175-186
    • Borkar, V.1    Deshmukh, K.2    Sarawagi, S.3
  • 43
    • 18744398431 scopus 로고    scopus 로고
    • Efficient query evaluation using a two-level retrieval process
    • New Orleans
    • Broder, A., Carmel, D., Herscovici, M., Soffer, A., Zien, J.: Efficient query evaluation using a two-level retrieval process. In: ACM CIKM, pp. 426-434. New Orleans (2003)
    • (2003) ACM CIKM , pp. 426-434
    • Broder, A.1    Carmel, D.2    Herscovici, M.3    Soffer, A.4    Zien, J.5
  • 44
    • 40449103107 scopus 로고    scopus 로고
    • Public good through data linkage: Measuring research outputs from the Western Australian data linkage system
    • Brook, E., Rosman, D., Holman, C.: Public good through data linkage: measuring research outputs from the Western Australian data linkage system. Australian and New Zealand journal of public health 32(1), 19-23 (2008)
    • (2008) Australian and New Zealand journal of public health , vol.32 , Issue.1 , pp. 19-23
    • Brook, E.1    Rosman, D.2    Holman, C.3
  • 45
    • 39049194535 scopus 로고    scopus 로고
    • Reverse geocoding: Concerns about patient confidentiality in the display of geospatial health data
    • American Medical Informatics Association
    • Brownstein, J.S., Cassa, C., Kohane, I.S., Mandl, K.D.: Reverse geocoding: Concerns about patient confidentiality in the display of geospatial health data. In: AMIA Annual Symposium Proceedings, p. 905. American Medical Informatics Association (2005)
    • (2005) AMIA Annual Symposium Proceedings , pp. 905
    • Brownstein, J.S.1    Cassa, C.2    Kohane, I.S.3    Mandl, K.D.4
  • 46
    • 33750089757 scopus 로고    scopus 로고
    • No place to hide-reverse identification of patients from published maps
    • Brownstein, J.S., Cassa, C., Mandl, K.D.: No place to hide-reverse identification of patients from published maps. New England Journal of Medicine 355(16), 1741-1742 (2006)
    • (2006) New England Journal of Medicine , vol.355 , Issue.16 , pp. 1741-1742
    • Brownstein, J.S.1    Cassa, C.2    Mandl, K.D.3
  • 47
    • 39049148128 scopus 로고    scopus 로고
    • Record linkage software in the public domain: A comparison of Link Plus, The Link King, and a basic deterministic algorithm
    • Campbell, K., Deck, D., Krupski, A.: Record linkage software in the public domain: a comparison of Link Plus, The Link King, and a basic deterministic algorithm. Health Informatics Journal 14(1), 5 (2008)
    • (2008) Health Informatics Journal , vol.14 , Issue.1 , pp. 5
    • Campbell, K.1    Deck, D.2    Krupski, A.3
  • 49
    • 51849162587 scopus 로고    scopus 로고
    • Common pitfalls using the normalized compression distance: What to watch out for in a compressor
    • Cebrián, M., Alfonseca, M., Ortega, A.: Common pitfalls using the normalized compression distance: What to watch out for in a compressor. Communications in Information and Systems 5(4), 367-384 (2005)
    • (2005) Communications in Information and Systems , vol.5 , Issue.4 , pp. 367-384
    • Cebrián, M.1    Alfonseca, M.2    Ortega, A.3
  • 52
    • 26444550791 scopus 로고    scopus 로고
    • Robust identification of fuzzy duplicates
    • Tokyo
    • Chaudhuri, S., Ganti, V., Motwani, R.: Robust identification of fuzzy duplicates. In: IEEE ICDE, pp. 865-876. Tokyo (2005)
    • (2005) IEEE ICDE , pp. 865-876
    • Chaudhuri, S.1    Ganti, V.2    Motwani, R.3
  • 54
    • 1942500388 scopus 로고    scopus 로고
    • Crime data mining: A general framework and some examples
    • Chen, H., Chung, W., Xu, J., Wang, G., Qin, Y., Chau, M.: Crime data mining: a general framework and some examples. IEEE Computer 37(4), 50-56 (2004)
    • (2004) IEEE Computer , vol.37 , Issue.4 , pp. 50-56
    • Chen, H.1    Chung, W.2    Xu, J.3    Wang, G.4    Qin, Y.5    Chau, M.6
  • 56
    • 26444478506 scopus 로고    scopus 로고
    • Probabilistic data generation for deduplication and data linkage
    • Brisbane
    • Christen, P.: Probabilistic data generation for deduplication and data linkage. In: IDEAL, Springer LNCS, vol. 3578, pp. 109-116. Brisbane (2005)
    • (2005) IDEAL, Springer LNCS, vol. 3578 , pp. 109-116
    • Christen, P.1
  • 57
    • 78449293191 scopus 로고    scopus 로고
    • A comparison of personal name matching: Techniques and practical issues
    • Hong Kong
    • Christen, P.: A comparison of personal name matching: Techniques and practical issues. In:Workshop on Mining Complex Data, held at IEEE ICDM. Hong Kong (2006)
    • (2006) Workshop on Mining Complex Data, held at IEEE ICDM
    • Christen, P.1
  • 58
    • 67650258952 scopus 로고    scopus 로고
    • Privacy-preserving data linkage and geocoding: Current approaches and research directions
    • Hong Kong
    • Christen, P.: Privacy-preserving data linkage and geocoding: Current approaches and research directions. In: Workshop on Privacy Aspects of Data Mining, held at IEEE ICDM. Hong Kong (2006)
    • (2006) Workshop on Privacy Aspects of Data Mining, held at IEEE ICDM
    • Christen, P.1
  • 59
    • 65449139594 scopus 로고    scopus 로고
    • Automatic record linkage using seeded nearest neighbour and support vector machine classification
    • Las Vegas
    • Christen, P.: Automatic record linkage using seeded nearest neighbour and support vector machine classification. In: ACM SIGKDD, pp. 151-159. Las Vegas (2008)
    • (2008) ACM SIGKDD , pp. 151-159
    • Christen, P.1
  • 60
    • 44649093306 scopus 로고    scopus 로고
    • Automatic training example selection for scalable unsupervised record linkage
    • Osaka
    • Christen, P.: Automatic training example selection for scalable unsupervised record linkage. In: PAKDD, Springer LNAI, vol. 5012, pp. 511-518. Osaka (2008)
    • (2008) PAKDD, Springer LNAI, vol. 5012 , pp. 511-518
    • Christen, P.1
  • 61
    • 65449178105 scopus 로고    scopus 로고
    • Febrl: An open source data cleaning, deduplication and record linkage system with a graphical user interface
    • Las Vegas
    • Christen, P.: Febrl: An open source data cleaning, deduplication and record linkage system with a graphical user interface. In: ACM SIGKDD, pp. 1065-1068. Las Vegas (2008)
    • (2008) ACM SIGKDD , pp. 1065-1068
    • Christen, P.1
  • 62
    • 74049138802 scopus 로고    scopus 로고
    • Development and user experiences of an open source data cleaning, deduplication and record linkage system
    • Christen, P.: Development and user experiences of an open source data cleaning, deduplication and record linkage system. SIGKDD Explorations 11(1), 39-48 (2009)
    • (2009) SIGKDD Explorations , vol.11 , Issue.1 , pp. 39-48
    • Christen, P.1
  • 64
    • 84857183817 scopus 로고    scopus 로고
    • A survey of indexing techniques for scalable record linkage and deduplication
    • Christen, P.: A survey of indexing techniques for scalable record linkage and deduplication. IEEE Transactions on Knowledge and Data Engineering X(Y) (2011)
    • (2011) IEEE Transactions on Knowledge and Data Engineering , vol.X , Issue.Y
    • Christen, P.1
  • 65
    • 84857149294 scopus 로고    scopus 로고
    • Automated probabilistic address standardisation and verification
    • Sydney
    • Christen, P., Belacic, D.: Automated probabilistic address standardisation and verification. In: AusDM, pp. 53-67. Sydney (2005)
    • (2005) AusDM , pp. 53-67
    • Christen, P.1    Belacic, D.2
  • 67
    • 85031020894 scopus 로고    scopus 로고
    • A probabilistic geocoding system based on a national address file
    • Cairns
    • Christen, P., Churches, T., Willmore, A.: A probabilistic geocoding system based on a national address file. In: AusDM. Cairns (2004)
    • (2004) AusDM
    • Christen, P.1    Churches, T.2    Willmore, A.3
  • 69
    • 67650216370 scopus 로고    scopus 로고
    • Towards scalable real-time entity resolution using a similarityaware inverted index approach
    • Glenelg, Australia
    • Christen, P., Gayler, R.: Towards scalable real-time entity resolution using a similarityaware inverted index approach. In: AusDM, CRPIT, vol. 87, pp. 51-60. Glenelg, Australia (2008)
    • (2008) AusDM, CRPIT, vol. 87 , pp. 51-60
    • Christen, P.1    Gayler, R.2
  • 70
    • 74549185155 scopus 로고    scopus 로고
    • Similarity-aware indexing for real-time entity resolution
    • Hong Kong
    • Christen, P., Gayler, R., Hawking, D.: Similarity-aware indexing for real-time entity resolution. In: ACM CIKM, pp. 1565-1568. Hong Kong (2009)
    • (2009) ACM CIKM , pp. 1565-1568
    • Christen, P.1    Gayler, R.2    Hawking, D.3
  • 72
    • 67650700151 scopus 로고    scopus 로고
    • Accurate synthetic generation of realistic personal information
    • Bangkok, Thailand
    • Christen, P., Pudjijono, A.: Accurate synthetic generation of realistic personal information. In: PAKDD, Springer LNAI, vol. 5476, pp. 507-514. Bangkok, Thailand (2009)
    • (2009) PAKDD, Springer LNAI, vol. 5476 , pp. 507-514
    • Christen, P.1    Pudjijono, A.2
  • 73
    • 0642275698 scopus 로고    scopus 로고
    • A proposed architecture and method of operation for improving the protection of privacy and confidentiality in disease registers
    • Churches, T.: A proposed architecture and method of operation for improving the protection of privacy and confidentiality in disease registers. BioMed Central Medical Research Methodology 3(1) (2003)
    • (2003) BioMed Central Medical Research Methodology , vol.3 , Issue.1
    • Churches, T.1
  • 74
    • 7444258692 scopus 로고    scopus 로고
    • Blind data linkage using n-gram similarity comparisons
    • Sydney
    • Churches, T., Christen, P.: Blind data linkage using n-gram similarity comparisons. In: PAKDD, Springer LNAI, vol. 3056, pp. 121-126. Sydney (2004)
    • (2004) PAKDD, Springer LNAI, vol. 3056 , pp. 121-126
    • Churches, T.1    Christen, P.2
  • 78
    • 4344570142 scopus 로고    scopus 로고
    • Practical introduction to record linkage for injury research
    • Clark, D.E.: Practical introduction to record linkage for injury research. Injury Prevention 10, 186-191 (2004)
    • (2004) Injury Prevention , vol.10 , pp. 186-191
    • Clark, D.E.1
  • 81
    • 0010355394 scopus 로고    scopus 로고
    • The WHIRL approach to data integration
    • Cohen, W.: The WHIRL approach to data integration. IEEE Intelligent Systems 13(3), 20-24 (1998)
    • (1998) IEEE Intelligent Systems , vol.13 , Issue.3 , pp. 20-24
    • Cohen, W.1
  • 82
    • 0000666461 scopus 로고    scopus 로고
    • Data integration using similarity joins and a word-based information representation language
    • Cohen, W.: Data integration using similarity joins and a word-based information representation language. ACM Transactions on Information Systems 18(3), 288-321 (2000)
    • (2000) ACM Transactions on Information Systems , vol.18 , Issue.3 , pp. 288-321
    • Cohen, W.1
  • 83
    • 0032091575 scopus 로고    scopus 로고
    • Integration of heterogeneous databases without common domains using queries based on textual similarity
    • Seattle
    • Cohen, W.: Integration of heterogeneous databases without common domains using queries based on textual similarity. In: ACM SIGMOD, pp. 201-212. Seattle (1998)
    • (1998) ACM SIGMOD , pp. 201-212
    • Cohen, W.1
  • 85
    • 0242540438 scopus 로고    scopus 로고
    • Learning to match and cluster large high-dimensional data sets for data integration
    • Edmonton
    • Cohen, W., Richman, J.: Learning to match and cluster large high-dimensional data sets for data integration. In: ACM SIGKDD, pp. 475-480. Edmonton (2002)
    • (2002) ACM SIGKDD , pp. 475-480
    • Cohen, W.1    Richman, J.2
  • 87
    • 33750469045 scopus 로고    scopus 로고
    • Spatial confidentiality and GIS: Re-engineering mortality locations from published maps about Hurricane Katrina
    • Curtis, A.J., Mills, J.W., Leitner, M.: Spatial confidentiality and GIS: Re-engineering mortality locations from published maps about Hurricane Katrina. International Journal of Health Geographics 5(1), 44-56 (2006)
    • (2006) International Journal of Health Geographics , vol.5 , Issue.1 , pp. 44-56
    • Curtis, A.J.1    Mills, J.W.2    Leitner, M.3
  • 89
    • 84941869105 scopus 로고
    • A technique for computer detection and correction of spelling errors
    • Damerau, F.J.: A technique for computer detection and correction of spelling errors. Communications of the ACM 7(3), 171-176 (1964)
    • (1964) Communications of the ACM , vol.7 , Issue.3 , pp. 171-176
    • Damerau, F.J.1
  • 92
    • 0348062787 scopus 로고    scopus 로고
    • Disclosure risk assessment in statistical microdata protection via advanced record linkage
    • Domingo-Ferrer, J., Torra, V.: Disclosure risk assessment in statistical microdata protection via advanced record linkage. Statistics and Computing 13(4), 343-354 (2003)
    • (2003) Statistics and Computing , vol.13 , Issue.4 , pp. 343-354
    • Domingo-Ferrer, J.1    Torra, V.2
  • 93
    • 29844452555 scopus 로고    scopus 로고
    • Reference reconciliation in complex information spaces
    • Baltimore
    • Dong, X., Halevy, A., Madhavan, J.: Reference reconciliation in complex information spaces. In: ACM SIGMOD, pp. 85-96. Baltimore (2005)
    • (2005) ACM SIGMOD , pp. 85-96
    • Dong, X.1    Halevy, A.2    Madhavan, J.3
  • 94
    • 84888417083 scopus 로고    scopus 로고
    • A comparison and generalization of blocking and windowing algorithms for duplicate detection
    • Lyon
    • Draisbach, U., Naumann, F.: A comparison and generalization of blocking and windowing algorithms for duplicate detection. In: Workshop on Quality in Databases, held at VLDB. Lyon (2009)
    • (2009) Workshop on Quality in Databases, held at VLDB
    • Draisbach, U.1    Naumann, F.2
  • 98
    • 84964941330 scopus 로고    scopus 로고
    • Private medical record linkage with approximate matching
    • American Medical Informatics Association
    • Durham, E., Xue, Y., Kantarcioglu, M., Malin, B.: Private medical record linkage with approximate matching. In: AMIA Annual Symposium Proceedings, p. 182. American Medical Informatics Association (2010)
    • (2010) AMIA Annual Symposium Proceedings , pp. 182
    • Durham, E.1    Xue, Y.2    Kantarcioglu, M.3    Malin, B.4
  • 102
  • 104
  • 105
    • 84976803260 scopus 로고
    • Fastmap: A fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets
    • San Jose
    • Faloutsos, C., Lin, K.I.: Fastmap: A fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets. In: ACM SIGMOD, pp. 163-174. San Jose (1995)
    • (1995) ACM SIGMOD , pp. 163-174
    • Faloutsos, C.1    Lin, K.I.2
  • 110
    • 33748542906 scopus 로고    scopus 로고
    • Privacy and confidentiality in an e-commerce world: Data mining, data warehousing, matching and disclosure limitation
    • Fienberg, S.: Privacy and confidentiality in an e-commerce world: Data mining, data warehousing, matching and disclosure limitation. Statistical Science 21(2), 143-154 (2006)
    • (2006) Statistical Science , vol.21 , Issue.2 , pp. 143-154
    • Fienberg, S.1
  • 113
    • 0026786263 scopus 로고
    • Tolerating spelling errors during patient validation
    • Friedman, C., Sideli, R.: Tolerating spelling errors during patient validation. Computers and Biomedical Research 25, 486-509 (1992)
    • (1992) Computers and Biomedical Research , vol.25 , pp. 486-509
    • Friedman, C.1    Sideli, R.2
  • 115
    • 84861452649 scopus 로고    scopus 로고
    • A supervised learning and group linking method for historical census household linkage
    • Ballarat, Australia
    • Fu, Z., Christen, P., Boot, M.: A supervised learning and group linking method for historical census household linkage. In: AusDM, CRPIT, vol. 125. Ballarat, Australia (2011)
    • (2011) AusDM, CRPIT , vol.125
    • Fu, Z.1    Christen, P.2    Boot, M.3
  • 116
    • 84861452098 scopus 로고    scopus 로고
    • Multiple instance learning for group record linkage
    • Kuala Lumpur, Malaysia
    • Fu, Z., Zhou, J., Christen, P., Boot, M.: Multiple instance learning for group record linkage. In: PAKDD, Springer LNAI. Kuala Lumpur, Malaysia (2012)
    • (2012) PAKDD, Springer LNAI
    • Fu, Z.1    Zhou, J.2    Christen, P.3    Boot, M.4
  • 118
    • 13244269176 scopus 로고    scopus 로고
    • OX-LINK: The Oxford medical record linkage system
    • Arlington, Virginia
    • Gill, L.: OX-LINK: The Oxford medical record linkage system. In: Proc. IntGI Record Linkage Workshop and Exposition, pp. 15-33. Arlington, Virginia (1997)
    • (1997) Proc. IntGI Record Linkage Workshop and Exposition , pp. 15-33
    • Gill, L.1
  • 119
    • 1642332418 scopus 로고    scopus 로고
    • Methods for automatic record matching and linking and their use in national statistics
    • National Statistics, London
    • Gill, L.: Methods for automatic record matching and linking and their use in national statistics. Tech. Rep. Methodology Series, no. 25, National Statistics, London (2001)
    • (2001) Tech. Rep. Methodology Series , vol.25
    • Gill, L.1
  • 123
    • 0003839182 scopus 로고    scopus 로고
    • Tech. rep., Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Israel
    • Goldreich, O.: Secure multi-party computation. Tech. rep., Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Israel (2002)
    • (2002) Secure multi-party computation
    • Goldreich, O.1
  • 124
    • 0037198576 scopus 로고    scopus 로고
    • An empirical comparison of record linkage procedures
    • Gomatam, S., Carter, R., Ariet, M., Mitchell, G.: An empirical comparison of record linkage procedures. Statistics in Medicine 21(10), 1485-1496 (2002)
    • (2002) Statistics in Medicine , vol.21 , Issue.10 , pp. 1485-1496
    • Gomatam, S.1    Carter, R.2    Ariet, M.3    Mitchell, G.4
  • 125
    • 32244443683 scopus 로고    scopus 로고
    • Syllable alignment: A novel model for phonetic string search
    • Gong, R., Chan, T.K.: Syllable alignment: A novel model for phonetic string search. IEICE Transactions on Information and Systems E89-D(1), 332-339 (2006)
    • (2006) IEICE Transactions on Information and Systems , vol.E89-D , Issue.1 , pp. 332-339
    • Gong, R.1    Chan, T.K.2
  • 130
    • 70350623193 scopus 로고    scopus 로고
    • Address standardization with latent semantic association
    • Paris
    • Guo, H., Zhu, H., Guo, Z., Zhang, X., Su, Z.: Address standardization with latent semantic association. In: ACM SIGKDD, pp. 1155-1164. Paris (2009)
    • (2009) ACM SIGKDD , pp. 1155-1164
    • Guo, H.1    Zhu, H.2    Guo, Z.3    Zhang, X.4    Su, Z.5
  • 131
    • 77956039068 scopus 로고    scopus 로고
    • Adaptive near-duplicate detection via similarity learning
    • Geneva, Switzerland
    • Hajishirzi, H., Yih, W., Kolcz, A.: Adaptive near-duplicate detection via similarity learning. In: ACM SIGIR, pp. 419-426. Geneva, Switzerland (2010)
    • (2010) ACM SIGIR , pp. 419-426
    • Hajishirzi, H.1    Yih, W.2    Kolcz, A.3
  • 133
    • 84976659284 scopus 로고
    • Approximate string matching
    • Hall, P.A., Dowling, G.R.: Approximate string matching. ACM Computing Surveys 12(4), 381-402 (1980)
    • (1980) ACM Computing Surveys , vol.12 , Issue.4 , pp. 381-402
    • Hall, P.A.1    Dowling, G.R.2
  • 136
    • 33745886270 scopus 로고    scopus 로고
    • Classifier technology and the illusion of progress
    • Hand, D.: Classifier technology and the illusion of progress. Statistical Science 21(1), 1-14 (2006)
    • (2006) Statistical Science , vol.21 , Issue.1 , pp. 1-14
    • Hand, D.1
  • 137
    • 70349826301 scopus 로고    scopus 로고
    • Creating probabilistic databases from duplicated data
    • Hassanzadeh, O., Miller, R.: Creating probabilistic databases from duplicated data. The VLDB Journal 18(5), 1141-1166 (2009)
    • (2009) The VLDB Journal , vol.18 , Issue.5 , pp. 1141-1166
    • Hassanzadeh, O.1    Miller, R.2
  • 139
    • 33750296887 scopus 로고    scopus 로고
    • Finding near-duplicate web pages: A large-scale evaluation of algorithms
    • Seattle
    • Henzinger, M.: Finding near-duplicate web pages: a large-scale evaluation of algorithms. In: ACM SIGIR, pp. 284-291. Seattle (2006)
    • (2006) ACM SIGIR , pp. 284-291
    • Henzinger, M.1
  • 140
    • 84976856849 scopus 로고
    • The merge/purge problem for large databases
    • San Jose
    • Hernandez, M.A., Stolfo, S.J.: The merge/purge problem for large databases. In: ACM SIGMOD, pp. 127-138. San Jose (1995)
    • (1995) ACM SIGMOD , pp. 127-138
    • Hernandez, M.A.1    Stolfo, S.J.2
  • 141
    • 0013331361 scopus 로고    scopus 로고
    • Real-world data is dirty: Data cleansing and the merge/purge problem
    • Hernandez, M.A., Stolfo, S.J.: Real-world data is dirty: Data cleansing and the merge/purge problem. Data Mining and Knowledge Discovery 2(1), 9-37 (1998)
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.1 , pp. 9-37
    • Hernandez, M.A.1    Stolfo, S.J.2
  • 149
    • 84950419860 scopus 로고
    • Advances in record-linkage methodology a applied to matching the 1985 Census of Tampa, Florida
    • Jaro, M.A.: Advances in record-linkage methodology a applied to matching the 1985 Census of Tampa, Florida. Journal of the American Statistical Association 84, 414-420 (1989)
    • (1989) Journal of the American Statistical Association , vol.84 , pp. 414-420
    • Jaro, M.A.1
  • 151
    • 84943425383 scopus 로고    scopus 로고
    • Efficient record linkage in large data sets
    • Tokyo
    • Jin, L., Li, C., Mehrotra, S.: Efficient record linkage in large data sets. In: DASFAA, pp. 137-146. Tokyo (2003)
    • (2003) DASFAA , pp. 137-146
    • Jin, L.1    Li, C.2    Mehrotra, S.3
  • 152
    • 0030412523 scopus 로고    scopus 로고
    • A comparison of approximate string matching algorithms
    • Jokinen, P., Tarhio, J., Ukkonen, E.: A comparison of approximate string matching algorithms. Software-Practice and Experience 26(12), 1439-1458 (1996)
    • (1996) Software-Practice and Experience , vol.26 , Issue.12 , pp. 1439-1458
    • Jokinen, P.1    Tarhio, J.2    Ukkonen, E.3
  • 153
    • 45849148052 scopus 로고    scopus 로고
    • Effective counterterrorism and the limited role of predictive data mining
    • Jonas, J., Harper, J.: Effective counterterrorism and the limited role of predictive data mining. Policy Analysis (584) (2006)
    • (2006) Policy Analysis , Issue.584
    • Jonas, J.1    Harper, J.2
  • 154
  • 155
    • 33745266392 scopus 로고    scopus 로고
    • Domain-independent data cleaning via analysis of entityrelationship graph
    • Kalashnikov, D., Mehrotra, S.: Domain-independent data cleaning via analysis of entityrelationship graph. ACM Transactions on Database Systems 31(2), 716-767 (2006)
    • (2006) ACM Transactions on Database Systems , vol.31 , Issue.2 , pp. 716-767
    • Kalashnikov, D.1    Mehrotra, S.2
  • 161
    • 0036450652 scopus 로고    scopus 로고
    • Research use of linked health data-A best practice protocol
    • Kelman, C.W., Bass, J., Holman, D.: Research use of linked health data-A best practice protocol. Aust NZ Journal of Public Health 26, 251-255 (2002)
    • (2002) Aust NZ Journal of Public Health , vol.26 , pp. 251-255
    • Kelman, C.W.1    Bass, J.2    Holman, D.3
  • 163
    • 63449096255 scopus 로고    scopus 로고
    • Parallel linkage
    • Lisboa, Portugal
    • Kim, H., Lee, D.: Parallel linkage. In: ACM CIKM, pp. 283-292. Lisboa, Portugal (2007)
    • (2007) ACM CIKM , pp. 283-292
    • Kim, H.1    Lee, D.2
  • 164
    • 77952280581 scopus 로고    scopus 로고
    • Harra: Fast iterative hashed record linkage for large-scale data collections
    • Lausanne, Switzerland
    • Kim, H., Lee, D.: Harra: fast iterative hashed record linkage for large-scale data collections. In: International Conference on Extending Database Technology, pp. 525-536. Lausanne, Switzerland (2010)
    • (2010) International Conference on Extending Database Technology , pp. 525-536
    • Kim, H.1    Lee, D.2
  • 166
    • 76249090414 scopus 로고    scopus 로고
    • The normalized compression distance as a distance measure in entity identification. Advances in Data Mining
    • Klenk, S., Thom, D., Heidemann, G.: The normalized compression distance as a distance measure in entity identification. Advances in Data Mining. Applications and Theoretical Aspects pp. 325-337 (2009)
    • (2009) Applications and Theoretical Aspects , pp. 325-337
    • Klenk, S.1    Thom, D.2    Heidemann, G.3
  • 168
    • 72649095071 scopus 로고    scopus 로고
    • Frameworks for entity matching: A comparison
    • Köpcke, H., Rahm, E.: Frameworks for entity matching: A comparison. Data and Knowledge Engineering 69(2), 197-210 (2010)
    • (2010) Data and Knowledge Engineering , vol.69 , Issue.2 , pp. 197-210
    • Köpcke, H.1    Rahm, E.2
  • 169
    • 80455148340 scopus 로고    scopus 로고
    • Evaluation of entity resolution approaches on real-world match problems
    • Köpcke, H., Thor, A., Rahm, E.: Evaluation of entity resolution approaches on real-world match problems. Proceedings of the VLDB Endowment 3(1-2), 484-493 (2010)
    • (2010) Proceedings of the VLDB Endowment , vol.3 , Issue.1-2 , pp. 484-493
    • Köpcke, H.1    Thor, A.2    Rahm, E.3
  • 170
    • 85123004356 scopus 로고    scopus 로고
    • Flexible string matching against large databases in practice
    • Toronto
    • Koudas, N., Marathe, A., Srivastava, D.: Flexible string matching against large databases in practice. In: VLDB, pp. 1086-1094. Toronto (2004)
    • (2004) VLDB , pp. 1086-1094
    • Koudas, N.1    Marathe, A.2    Srivastava, D.3
  • 172
    • 0026979939 scopus 로고
    • Techniques for automatically correcting words in text
    • Kukich, K.: Techniques for automatically correcting words in text. ACM Computing Surveys 24(4), 377-439 (1992)
    • (1992) ACM Computing Surveys , vol.24 , Issue.4 , pp. 377-439
    • Kukich, K.1
  • 173
    • 79961178764 scopus 로고    scopus 로고
    • A constraint satisfaction cryptanalysis of Bloom filters in private record linkage
    • Springer
    • Kuzu, M., Kantarcioglu, M., Durham, E., Malin, B.: A constraint satisfaction cryptanalysis of Bloom filters in private record linkage. In: Privacy Enhancing Technologies, pp. 226-245. Springer (2011)
    • (2011) Privacy Enhancing Technologies , pp. 226-245
    • Kuzu, M.1    Kantarcioglu, M.2    Durham, E.3    Malin, B.4
  • 175
    • 33645619459 scopus 로고
    • Tech. rep., Department of Computer Science, University of Newcastle upon Tyne
    • Lait, A., Randell, B.: An assessment of name matching algorithms. Tech. rep., Department of Computer Science, University of Newcastle upon Tyne (1993)
    • (1993) An assessment of name matching algorithms
    • Lait, A.1    Randell, B.2
  • 180
    • 36049037582 scopus 로고    scopus 로고
    • K-unlinkability: A privacy protection model for distributed data
    • Malin, B.: K-unlinkability: A privacy protection model for distributed data. Data and Knowledge Engineering 64(1), 294-311 (2008)
    • (2008) Data and Knowledge Engineering , vol.64 , Issue.1 , pp. 294-311
    • Malin, B.1
  • 184
    • 0000806922 scopus 로고    scopus 로고
    • Automating the construction of Internet portals with machine learning
    • McCallum, A., Nigam, K., Rennie, J., Seymore, K.: Automating the construction of Internet portals with machine learning. Information Retrieval 3(2), 127-163 (2000)
    • (2000) Information Retrieval , vol.3 , Issue.2 , pp. 127-163
    • McCallum, A.1    Nigam, K.2    Rennie, J.3    Seymore, K.4
  • 185
    • 0034592784 scopus 로고    scopus 로고
    • Efficient clustering of high-dimensional data sets with application to reference matching
    • Boston
    • McCallum, A., Nigam, K., Ungar, L.H.: Efficient clustering of high-dimensional data sets with application to reference matching. In: ACM SIGKDD, pp. 169-178. Boston (2000)
    • (2000) ACM SIGKDD , pp. 169-178
    • McCallum, A.1    Nigam, K.2    Ungar, L.H.3
  • 188
    • 36348932551 scopus 로고    scopus 로고
    • Learning blocking schemes for record linkage
    • Boston
    • Michelson, M., Knoblock, C.A.: Learning blocking schemes for record linkage. In: AAAI. Boston (2006)
    • (2006) AAAI
    • Michelson, M.1    Knoblock, C.A.2
  • 190
    • 0002089617 scopus 로고    scopus 로고
    • Matching algorithms within a duplicate detection system
    • Monge, A.E.: Matching algorithms within a duplicate detection system. IEEE Data Engineering Bulletin 23(4), 14-20 (2000)
    • (2000) IEEE Data Engineering Bulletin , vol.23 , Issue.4 , pp. 14-20
    • Monge, A.E.1
  • 191
    • 85018108837 scopus 로고    scopus 로고
    • The field-matching problem: Algorithm and applications
    • Portland
    • Monge, A.E., Elkan, C.P.: The field-matching problem: Algorithm and applications. In: ACM SIGKDD, pp. 267-270. Portland (1996)
    • (1996) ACM SIGKDD , pp. 267-270
    • Monge, A.E.1    Elkan, C.P.2
  • 194
    • 77953213147 scopus 로고    scopus 로고
    • Myths and fallacies of personally identifiable information
    • Narayanan, A., Shmatikov, V.: Myths and fallacies of personally identifiable information. Communications of the ACM 53(6), 24-26 (2010)
    • (2010) Communications of the ACM , vol.53 , Issue.6 , pp. 24-26
    • Narayanan, A.1    Shmatikov, V.2
  • 196
    • 0345566149 scopus 로고    scopus 로고
    • A guided tour to approximate string matching
    • Navarro, G.: A guided tour to approximate string matching. ACM Computing Surveys 33(1), 31-88 (2001)
    • (2001) ACM Computing Surveys , vol.33 , Issue.1 , pp. 31-88
    • Navarro, G.1
  • 197
    • 0001139918 scopus 로고
    • Record linkage: Making maximum use of the discriminating power of identifying information
    • Newcombe, H., Kennedy, J.: Record linkage: making maximum use of the discriminating power of identifying information. Communications of the ACM 5(11), 563-566 (1962)
    • (1962) Communications of the ACM , vol.5 , Issue.11 , pp. 563-566
    • Newcombe, H.1    Kennedy, J.2
  • 198
    • 0001592068 scopus 로고
    • Automatic linkage of vital records
    • Newcombe, H., Kennedy, J., Axford, S., James, A.: Automatic linkage of vital records. Science 130(3381), 954-959 (1959)
    • (1959) Science , vol.130 , Issue.3381 , pp. 954-959
    • Newcombe, H.1    Kennedy, J.2    Axford, S.3    James, A.4
  • 200
    • 47949115568 scopus 로고    scopus 로고
    • On the use of semantic blocking techniques for data cleansing and integration
    • Banff, Canada
    • Nin, J., Muntes-Mulero, V., Martinez-Bazan, N., Larriba-Pey, J.L.: On the use of semantic blocking techniques for data cleansing and integration. In: IDEAS, pp. 190-198. Banff, Canada (2007)
    • (2007) IDEAS , pp. 190-198
    • Nin, J.1    Muntes-Mulero, V.2    Martinez-Bazan, N.3    Larriba-Pey, J.L.4
  • 203
    • 85031041199 scopus 로고
    • Data matching and merging: An overview
    • Okner, B.: Data matching and merging: An overview. NBER Chapters pp. 49-54 (1974)
    • (1974) NBER Chapters , pp. 49-54
    • Okner, B.1
  • 204
    • 47249101877 scopus 로고    scopus 로고
    • Improving grouped-entity resolution using quasi-cliques
    • On, B.W., Elmacioglu, E., Lee, D., Kang, J., Pei, J.: Improving grouped-entity resolution using quasi-cliques. In: IEEE ICDM, pp. 1008-1015 (2006)
    • (2006) IEEE ICDM , pp. 1008-1015
    • On, B.W.1    Elmacioglu, E.2    Lee, D.3    Kang, J.4    Pei, J.5
  • 206
    • 56249109568 scopus 로고    scopus 로고
    • Synthetic identity fraud: Unseen identity challenge
    • Oscherwitz, T.: Synthetic identity fraud: unseen identity challenge. Bank Security News 3(7) (2005)
    • (2005) Bank Security News , vol.3 , Issue.7
    • Oscherwitz, T.1
  • 207
  • 211
    • 13344267227 scopus 로고    scopus 로고
    • The double-metaphone search algorithm
    • Philips, L.: The double-metaphone search algorithm. C/C++ User’s Journal 18(6) (2000)
    • (2000) C/C++ User’s Journal , vol.18 , Issue.6
    • Philips, L.1
  • 214
    • 84976776121 scopus 로고
    • Automatic spelling correction in scientific and scholarly text
    • Pollock, J.J., Zamora, A.: Automatic spelling correction in scientific and scholarly text. Communications of the ACM 27(4), 358-368 (1984)
    • (1984) Communications of the ACM , vol.27 , Issue.4 , pp. 358-368
    • Pollock, J.J.1    Zamora, A.2
  • 223
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner, L.: A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 77(2), 257-286 (1989)
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.1
  • 224
    • 0002490026 scopus 로고    scopus 로고
    • Data cleaning: Problems and current approaches
    • Rahm, E., Do, H.H.: Data cleaning: Problems and current approaches. IEEE Data Engineering Bulletin 23(4), 3-13 (2000)
    • (2000) IEEE Data Engineering Bulletin , vol.23 , Issue.4 , pp. 3-13
    • Rahm, E.1    Do, H.H.2
  • 225
  • 227
    • 77955652421 scopus 로고    scopus 로고
    • Linking historical censuses: A new approach
    • Ruggles, S.: Linking historical censuses: A new approach. History and Computing 14(1-2), 213-224 (2002)
    • (2002) History and Computing , vol.14 , Issue.1-2 , pp. 213-224
    • Ruggles, S.1
  • 231
    • 0242456811 scopus 로고    scopus 로고
    • Interactive deduplication using active learning
    • Edmonton
    • Sarawagi, S., Bhamidipaty, A.: Interactive deduplication using active learning. In: ACM SIGKDD, pp. 269-278. Edmonton (2002)
    • (2002) ACM SIGKDD , pp. 269-278
    • Sarawagi, S.1    Bhamidipaty, A.2
  • 232
    • 85088005959 scopus 로고    scopus 로고
    • Efficient set joins on similarity predicates
    • Paris
    • Sarawagi, S., Kirpal, A.: Efficient set joins on similarity predicates. In: ACM SIGMOD, pp. 754-765. Paris (2004)
    • (2004) ACM SIGMOD , pp. 754-765
    • Sarawagi, S.1    Kirpal, A.2
  • 233
    • 84863549453 scopus 로고    scopus 로고
    • The RecordLinkage package: Detecting errors in data
    • Sariyar, M., Borg, A.: The RecordLinkage package: Detecting errors in data. The R Journal 2(2), 61-67 (2010)
    • (2010) The R Journal , vol.2 , Issue.2 , pp. 61-67
    • Sariyar, M.1    Borg, A.2
  • 236
    • 84860685505 scopus 로고    scopus 로고
    • On the decidability and complexity of identity knowledge representation
    • Busan, South Korea
    • Schewe, K., Wang, Q.: On the decidability and complexity of identity knowledge representation. In: Database Systems for Advanced Applications, Springer LNCS 7238, pp. 288-302. Busan, South Korea (2012)
    • (2012) Database Systems for Advanced Applications, Springer LNCS 7238 , pp. 288-302
    • Schewe, K.1    Wang, Q.2
  • 241
    • 0016792139 scopus 로고
    • Methods for computer linkage of hospital admission-separation records into cumulative health histories
    • Smith, M., Newcombe, H.: Methods for computer linkage of hospital admission-separation records into cumulative health histories. Methods of Information in Medicine 14(3), 118-125 (1975)
    • (1975) Methods of Information in Medicine , vol.14 , Issue.3 , pp. 118-125
    • Smith, M.1    Newcombe, H.2
  • 242
    • 0018743442 scopus 로고
    • Accuracies of computer versus manual linkages of routine health records
    • Smith, M., Newcombe, H.: Accuracies of computer versus manual linkages of routine health records. Methods of Information in Medicine 18(2), 89-97 (1979)
    • (1979) Methods of Information in Medicine , vol.18 , Issue.2 , pp. 89-97
    • Smith, M.1    Newcombe, H.2
  • 247
  • 250
    • 84871606066 scopus 로고    scopus 로고
    • SOG: A synthetic occupancy generator to support entity resolution instruction and research
    • Potsdam, Germany
    • Talburt, J.R., Zhou, Y., Shivaiah, S.Y.: SOG: A synthetic occupancy generator to support entity resolution instruction and research. In: International Conference on Information Quality, pp. 91-105. Potsdam, Germany (2009)
    • (2009) International Conference on Information Quality , pp. 91-105
    • Talburt, J.R.1    Zhou, Y.2    Shivaiah, S.Y.3
  • 252
    • 0242456803 scopus 로고    scopus 로고
    • Learning domain-independent string transformation weights for high accuracy object identification
    • Edmonton
    • Tejada, S., Knoblock, C.A., Minton, S.: Learning domain-independent string transformation weights for high accuracy object identification. In: ACM SIGKDD, pp. 350-359. Edmonton (2002)
    • (2002) ACM SIGKDD , pp. 350-359
    • Tejada, S.1    Knoblock, C.A.2    Minton, S.3
  • 255
    • 70349844175 scopus 로고    scopus 로고
    • Privacy-preserving string comparisons in record linkage systems: A review
    • Trepetin, S.: Privacy-preserving string comparisons in record linkage systems: a review. Information Security Journal: A Global Perspective 17(5), 253-266 (2008)
    • (2008) Information Security Journal: A Global Perspective , vol.17 , Issue.5 , pp. 253-266
    • Trepetin, S.1
  • 256
    • 3042553478 scopus 로고    scopus 로고
    • Homeland Security and Geographic Information Systems: How GIS and mapping technology can save lives and protect property in post-September 11th America
    • US Federal Geographic Data Committee. Homeland Security and Geographic Information Systems: How GIS and mapping technology can save lives and protect property in post-September 11th America. Public Health GIS News and, Information (52), 21-23 (2003)
    • (2003) Public Health GIS News and, Information , Issue.52 , pp. 21-23
  • 258
    • 79952272717 scopus 로고
    • Triphone analysis: A combined method for the correction of orthographical and typographical errors
    • Austin
    • Van Berkel, B., De Smedt, K.: Triphone analysis: A combined method for the correction of orthographical and typographical errors. In: Second Conference on Applied Natural Language Processing, pp. 77-83. Austin (1988)
    • (1988) Second Conference on Applied Natural Language Processing , pp. 77-83
    • Van Berkel, B.1    De Smedt, K.2
  • 260
    • 84870477881 scopus 로고    scopus 로고
    • An efficient two-party protocol for approximate matching in private record linkage
    • Ballarat, Australia
    • Vatsalan, D., Christen, P., Verykios, V.: An efficient two-party protocol for approximate matching in private record linkage. In: AusDM, CRPIT, vol. 121. Ballarat, Australia (2011)
    • (2011) AusDM, CRPIT , pp. 121
    • Vatsalan, D.1    Christen, P.2    Verykios, V.3
  • 261
    • 0034228352 scopus 로고    scopus 로고
    • Automating the approximate record-matching process
    • Verykios, V., Elmagarmid, A., Houstis, E.: Automating the approximate record-matching process. Information Sciences 126(1-4), 83-98 (2000)
    • (2000) Information Sciences , vol.126 , Issue.1-4 , pp. 83-98
    • Verykios, V.1    Elmagarmid, A.2    Houstis, E.3
  • 263
    • 0038208065 scopus 로고    scopus 로고
    • A Bayesian decision model for cost optimal record matching
    • Verykios, V., George, M.V., Elfeky, M.G.: A Bayesian decision model for cost optimal record matching. The VLDB Journal 12(1), 28-40 (2003)
    • (2003) The VLDB Journal , vol.12 , Issue.1 , pp. 28-40
    • Verykios, V.1    George, M.V.2    Elfeky, M.G.3
  • 265
    • 74549152150 scopus 로고    scopus 로고
    • Robust record linkage blocking using suffix arrays
    • Hong Kong
    • de Vries, T., Ke, H., Chawla, S., Christen, P.: Robust record linkage blocking using suffix arrays. In: ACM CIKM, pp. 305-314. Hong Kong (2009)
    • (2009) ACM CIKM , pp. 305-314
    • de Vries, T.1    Ke, H.2    Chawla, S.3    Christen, P.4
  • 267
    • 1942443495 scopus 로고    scopus 로고
    • Automatically detecting deceptive criminal identities
    • Wang, G., Chen, H., Atabakhsh, H.: Automatically detecting deceptive criminal identities. Communications of the ACM 47(3), 70-76 (2004)
    • (2004) Communications of the ACM , vol.47 , Issue.3 , pp. 70-76
    • Wang, G.1    Chen, H.2    Atabakhsh, H.3
  • 270
    • 29844441371 scopus 로고    scopus 로고
    • Dogmatix tracks down duplicates in XML
    • Baltimore
    • Weis, M., Naumann, F.: Dogmatix tracks down duplicates in XML. In: ACM SIGMOD, pp. 431-442. Baltimore (2005)
    • (2005) ACM SIGMOD , pp. 431-442
    • Weis, M.1    Naumann, F.2
  • 276
    • 85031041797 scopus 로고    scopus 로고
    • Joint entity resolution
    • Arlington, Virginia
    • Whang, S.E., Garcia-Molina, H.: Joint entity resolution. In: IEEE ICDE. Arlington, Virginia (2012)
    • (2012) IEEE ICDE
    • Whang, S.E.1    Garcia-Molina, H.2
  • 278
    • 84883401319 scopus 로고    scopus 로고
    • Rattle: A data mining GUI for R
    • Williams, G.J.: Rattle: a data mining GUI for R. The R Journal 1(2), 45-55 (2009)
    • (2009) The R Journal , vol.1 , Issue.2 , pp. 45-55
    • Williams, G.J.1
  • 279
    • 0008976521 scopus 로고
    • String comparator metrics and enhanced decision rules in the Fellegi-Sunter model of record linkage
    • American Statistical Association
    • Winkler, W.: String comparator metrics and enhanced decision rules in the Fellegi-Sunter model of record linkage. In: Proceedings of the Section on Survey Research Methods, pp. 354-359. American Statistical Association (1990)
    • (1990) Proceedings of the Section on Survey Research Methods , pp. 354-359
    • Winkler, W.1
  • 287
    • 84893021148 scopus 로고    scopus 로고
    • Fast record linkage of very large files in support of decennial and administrative records projects
    • American Statistical Association
    • Winkler, W.E., Yancey, W.E., Porter, E.H.: Fast record linkage of very large files in support of decennial and administrative records projects. In: Proceedings of the Section on Survey Research Methods, pp. 2120-2130. American Statistical Association (2010)
    • (2010) Proceedings of the Section on Survey Research Methods , pp. 2120-2130
    • Winkler, W.E.1    Yancey, W.E.2    Porter, E.H.3
  • 289
    • 70849105253 scopus 로고    scopus 로고
    • Ed-join: An efficient algorithm for similarity joins with edit distance constraints
    • Xiao, C., Wang, W., Lin, X.: Ed-join: an efficient algorithm for similarity joins with edit distance constraints. Proceedings of the VLDB Endowment 1(1), 933-944 (2008)
    • (2008) Proceedings of the VLDB Endowment , vol.1 , Issue.1 , pp. 933-944
    • Xiao, C.1    Wang, W.2    Lin, X.3
  • 290
    • 67649644357 scopus 로고    scopus 로고
    • Efficient private record linkage
    • Yakout, M., Atallah, M., Elmagarmid, A.: Efficient private record linkage. In: IEEE ICDE, pp. 1283-1286 (2009)
    • (2009) IEEE ICDE , pp. 1283-1286
    • Yakout, M.1    Atallah, M.2    Elmagarmid, A.3
  • 299
    • 33845920025 scopus 로고    scopus 로고
    • Semantic matching across heterogeneous data sources
    • Zhao, H.: Semantic matching across heterogeneous data sources. Communications of the ACM 50(1), 45-50 (2007)
    • (2007) Communications of the ACM , vol.50 , Issue.1 , pp. 45-50
    • Zhao, H.1
  • 301
    • 1342281224 scopus 로고    scopus 로고
    • Linking hospital discharge and death records- accuracy and sources of bias
    • Zingmond, D., Ye, Z., Ettner, S., Liu, H.: Linking hospital discharge and death records- accuracy and sources of bias. Journal of Clinical Epidemiology 57, 21-29 (2004)
    • (2004) Journal of Clinical Epidemiology , vol.57 , pp. 21-29
    • Zingmond, D.1    Ye, Z.2    Ettner, S.3    Liu, H.4
  • 302
    • 0030379050 scopus 로고    scopus 로고
    • Phonetic string matching: Lessons from information retrieval
    • Zürich, Switzerland
    • Zobel, J., Dart, P.: Phonetic string matching: Lessons from information retrieval. In: ACM SIGIR, pp. 166-172. Zürich, Switzerland (1996)
    • (1996) ACM SIGIR , pp. 166-172
    • Zobel, J.1    Dart, P.2
  • 303
    • 33747729581 scopus 로고    scopus 로고
    • Inverted files for text search engines
    • Zobel, J., Moffat, A.: Inverted files for text search engines. ACM Computing Surveys 38(2), 6 (2006)
    • (2006) ACM Computing Surveys , vol.38 , Issue.2 , pp. 6
    • Zobel, J.1    Moffat, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.