Volume 9078, 2015, Pages 574-585

Unsupervised blocking key selection for real-time entity resolution

Author keywords

Automatic blocking; Key selection; Record linkage; Sorted neighbourhood indexing; Unsupervised learning

Indexed keywords

EFFICIENCY; INDEXING (OF INFORMATION); QUERY PROCESSING; UNSUPERVISED LEARNING; VIRTUAL REALITY

EID: 84945552843     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-319-18032-8_45     Document Type: Conference Paper
Times cited: 18

References (24)
  • 1
    • 33845363891
    • A fast linkage detection scheme for multi-source information integration
    • Tokyo
    • Aizawa, A., Oyama, K.: A fast linkage detection scheme for multi-source information integration. In: WIRI, Tokyo (2005)
    • (2005) WIRI
    • Aizawa, A.1    Oyama, K.2
  • 2
    • 84878049861
    • Adaptive blocking: Learning to scale up record linkage
    • Hong Kong
    • Bilenko, M., Kamath, B., Mooney, R.J.: Adaptive blocking: learning to scale up record linkage. In: IEEE ICDM, Hong Kong (2006)
    • (2006) IEEE ICDM
    • Bilenko, M.1    Kamath, B.2    Mooney, R.J.3
  • 3
    • 84881036145
    • Leveraging unlabeled data to scale blocking for record linkage
    • Barcelona
    • Cao, Y., Chen, Z., Zhu, J., Yue, P., Lin, C.Y., Yu, Y.: Leveraging unlabeled data to scale blocking for record linkage. In: IJCAI, Barcelona (2011)
    • (2011) IJCAI
    • Cao, Y.1    Chen, Z.2    Zhu, J.3    Yue, P.4    Lin, C.Y.5    Yu, Y.6
  • 5
    • 84920595044
    • A survey of indexing techniques for scalable record linkage and deduplication
    • Christen, P.: A survey of indexing techniques for scalable record linkage and deduplication. IEEE Transactions on Knowledge and Data Engineering 24(9) (2012)
    • (2012) IEEE Transactions on Knowledge and Data Engineering , vol.24 , Issue.9
    • Christen, P.1
  • 6
    • 84871075183
    • An automatic blocking mechanism for large-scale de-duplication tasks
    • Hawaii
    • Das Sarma, A., Jain, A., Machanavajjhala, A., Bohannon, P.: An automatic blocking mechanism for large-scale de-duplication tasks. In: ACM CIKM, Hawaii (2012)
    • (2012) ACM CIKM
    • Das Sarma, A.1    Jain, A.2    Machanavajjhala, A.3    Bohannon, P.4
  • 10
    • 84945538473
    • A machine learning approach to create blocking criteria for record linkage
    • Giang, P.H.: A machine learning approach to create blocking criteria for record linkage. Health Care Management Science (2014)
    • (2014) Health Care Management Science
    • Giang, P.H.1
  • 11
    • 84976856849
    • The merge/purge problem for large databases
    • San Jose
    • Hernandez, M.A., Stolfo, S.J.: The merge/purge problem for large databases. In: ACM SIGMOD, San Jose (1995)
    • (1995) ACM SIGMOD
    • Hernandez, M.A.1    Stolfo, S.J.2
  • 12
    • 84894647271
    • An unsupervised algorithm for learning blocking schemes
    • Dallas
    • Kejriwal, M., Miranker, D.P.: An unsupervised algorithm for learning blocking schemes. In: IEEE ICDM, Dallas (2013)
    • (2013) IEEE ICDM
    • Kejriwal, M.1    Miranker, D.P.2
  • 13
    • 77952280581
    • HARRA: Fast iterative hashed record linkage for large-scale data collections
    • Lausanne, Switzerland
    • Kim, H., Lee, D.: HARRA: fast iterative hashed record linkage for large-scale data collections. In: ICDT, Lausanne, Switzerland (2010)
    • (2010) ICDT
    • Kim, H.1    Lee, D.2
  • 14
    • 80455148340
    • Evaluation of entity resolution approaches on real-world match problems
    • Köpcke, H., Thor, A., Rahm, E.: Evaluation of entity resolution approaches on real-world match problems. VLDB Endowment 3(1–2) (2010)
    • (2010) VLDB Endowment , vol.3 , Issue.1-2
    • Köpcke, H.1    Thor, A.2    Rahm, E.3
  • 15
    • 84901276901
    • Noise-tolerant approximate blocking for dynamic real-time entity resolution
    • In: Tseng, V.S., Ho, T.B., Zhou, Z.-H., Chen, A.L.P., Kao, H.-Y. (eds.), Springer, Heidelberg
    • Liang, H., Wang, Y., Christen, P., Gayler, R.: Noise-tolerant approximate blocking for dynamic real-time entity resolution. In: Tseng, V.S., Ho, T.B., Zhou, Z.-H., Chen, A.L.P., Kao, H.-Y. (eds.) PAKDD 2014, Part II. LNCS (LNAI), vol. 8444, pp. 449–460. Springer, Heidelberg (2014)
    • (2014) PAKDD 2014, Part II. LNCS (LNAI) , vol.8444 , pp. 449-460
    • Liang, H.1    Wang, Y.2    Christen, P.3    Gayler, R.4
  • 16
    • 84874283130
    • Typimatch: Type-specific unsupervised learning of keys and key values for heterogeneous web data integration
    • Rome
    • Ma, Y., Tran, T.: Typimatch: type-specific unsupervised learning of keys and key values for heterogeneous web data integration. In: ACM WSDM, Rome (2013)
    • (2013) ACM WSDM
    • Ma, Y.1    Tran, T.2
  • 17
    • 0034592784
    • Efficient clustering of high-dimensional data sets with application to reference matching
    • Boston
    • McCallum, A., Nigam, K., Ungar, L.: Efficient clustering of high-dimensional data sets with application to reference matching. In: ACM SIGKDD, Boston (2000)
    • (2000) ACM SIGKDD
    • McCallum, A.1    Nigam, K.2    Ungar, L.3
  • 18
    • 36348932551
    • Learning blocking schemes for record linkage
    • Boston
    • Michelson, M., Knoblock, C.A.: Learning blocking schemes for record linkage. In: AAAI, Boston (2006)
    • (2006) AAAI
    • Michelson, M.1    Knoblock, C.A.2
  • 19
    • 84937597822
    • Forest-based dynamic sorted neighborhood indexing for real-time entity resolution
    • Shanghai
    • Ramadan, B., Christen, P.: Forest-based dynamic sorted neighborhood indexing for real-time entity resolution. In: ACM CIKM, Shanghai (2014)
    • (2014) ACM CIKM
    • Ramadan, B.1    Christen, P.2
  • 20
    • 84904155169
    • Dynamic sorted neighborhood indexing for real-time entity resolution
    • In: Wang, H., Sharaf, M.A. (eds.), Springer, Heidelberg
    • Ramadan, B., Christen, P., Liang, H.: Dynamic sorted neighborhood indexing for real-time entity resolution. In: Wang, H., Sharaf, M.A. (eds.) ADC 2014. LNCS, vol. 8506, pp. 1–12. Springer, Heidelberg (2014)
    • (2014) ADC 2014. LNCS , vol.8506 , pp. 1-12
    • Ramadan, B.1    Christen, P.2    Liang, H.3
  • 21
    • 84892875650
    • Dynamic similarity-aware inverted indexing for real-time entity resolution
    • In: Li, J., Cao, L., Wang, C., Tan, K.C., Liu, B., Pei, J., Tseng, V.S. (eds.), Springer, Heidelberg
    • Ramadan, B., Christen, P., Liang, H., Gayler, R.W., Hawking, D.: Dynamic similarity-aware inverted indexing for real-time entity resolution. In: Li, J., Cao, L., Wang, C., Tan, K.C., Liu, B., Pei, J., Tseng, V.S. (eds.) PAKDD 2013 Workshops. LNCS (LNAI), vol. 7867, pp. 47–58. Springer, Heidelberg (2013)
    • (2013) PAKDD 2013 Workshops. LNCS (LNAI) , vol.7867 , pp. 47-58
    • Ramadan, B.1    Christen, P.2    Liang, H.3    Gayler, R.W.4    Hawking, D.5
  • 22
    • 84937477990
    • Geco: An online personal data generator and corruptor
    • New York
    • Tran, K.N., Vatsalan, D., Christen, P.: Geco: an online personal data generator and corruptor. In: ACM CIKM, New York (2013)
    • (2013) ACM CIKM
    • Tran, K.N.1    Vatsalan, D.2    Christen, P.3
  • 23
    • 84945565021
    • Automatic blocking key selection for duplicate detection based on unigram combinations
    • Istanbul
    • Vogel, T., Naumann, F.: Automatic blocking key selection for duplicate detection based on unigram combinations. In: VLDB Workshops, Istanbul (2012)
    • (2012) VLDB Workshops
    • Vogel, T.1    Naumann, F.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.