메뉴 건너뛰기




Volumn 50, Issue 6, 2013, Pages 1147-1162

An important aspect of big data: Data usability

Author keywords

Big data; Data accuracy; Data completeness; Data consistency; Data currency; Data usability; Entity identity

Indexed keywords

BIG DATUM; DATA ACCURACY; DATA COMPLETENESS; DATA CONSISTENCY; DATA CURRENCY; DATA USABILITY; ENTITY IDENTITY;

EID: 84880000776     PISSN: 10001239     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (73)

References (159)
  • 1
    • 0031988304 scopus 로고    scopus 로고
    • The impact of poor data quality on the typical enterprise
    • Redman T. The impact of poor data quality on the typical enterprise [J]. Communications of the ACM, 1998, 41(2): 79-82
    • (1998) Communications of the ACM , vol.41 , Issue.2 , pp. 79-82
    • Redman, T.1
  • 2
    • 39049191963 scopus 로고    scopus 로고
    • Missing prenatal records at a birth center: A communication problem quantified
    • Maryland: American Medical Informatics Association
    • Miller D W, Yeast J D, Evans R L. Missing prenatal records at a birth center: A communication problem quantified [C]//Proc of AMIA Annual Symp Proceedings. Maryland: American Medical Informatics Association, 2005: 535-539
    • (2005) Proc of AMIA Annual Symp Proceedings , pp. 535-539
    • Miller, D.W.1    Yeast, J.D.2    Evans, R.L.3
  • 3
    • 48249116542 scopus 로고    scopus 로고
    • Gartner warns firms of 'dirty data'
    • Swartz N. Gartner warns firms of 'dirty data' [J]. Information Management Journal, 2007, 41(3): 6
    • (2007) Information Management Journal , vol.41 , Issue.3 , pp. 6
    • Swartz, N.1
  • 5
    • 78649853923 scopus 로고    scopus 로고
    • Data Warehousing Special Report: Data quality and the bottom line
    • Applications Development Trends
    • Eckerson W. Data Warehousing Special Report: Data quality and the bottom line [R]. Applications Development Trends, 2002
    • (2002)
    • Eckerson, W.1
  • 7
    • 84892646568 scopus 로고    scopus 로고
    • Credit card statistics, industry facts, debt statistics
    • 2013-04-20
    • Woolsey B, Schulz M. Credit card statistics, industry facts, debt statistics [OL]. [2013-04-20]. http://www.creditcards.com/credit-card-news/credit-card-industry- facts-personal-debt-statistics-1276.php
    • Woolsey, B.1    Schulz, M.2
  • 8
    • 0003578571 scopus 로고    scopus 로고
    • Enterprise information portals
    • New York: Merrill Lynch
    • Shilakes C, Tylman J. Enterprise information portals [R]. New York: Merrill Lynch, 1998
    • (1998)
    • Shilakes, C.1    Tylman, J.2
  • 9
    • 0002490026 scopus 로고    scopus 로고
    • Data cleaning: Problems and current approaches
    • Rahm E, Do H H. Data cleaning: Problems and current approaches [J]. IEEE Data Engineering Bulletin, 2000, 23(4): 3-13
    • (2000) IEEE Data Engineering Bulletin , vol.23 , Issue.4 , pp. 3-13
    • Rahm, E.1    Do, H.H.2
  • 12
    • 84859258624 scopus 로고    scopus 로고
    • Global detection of complex copying relationships between sources
    • Dong X L, Berti-Equille L, Hu Yifan, et al. Global detection of complex copying relationships between sources [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 1358-1369
    • (2010) Proceedings of the VLDB Endowment , vol.3 , Issue.1-2 , pp. 1358-1369
    • Dong, X.L.1    Berti-Equille, L.2    Hu, Y.3
  • 14
    • 84863067746 scopus 로고    scopus 로고
    • Data fusion: resolving data conflicts for integration
    • Dong X L, Naumann F. Data fusion: resolving data conflicts for integration [J]. Proceedings of the VLDB Endowment, 2009, 2(2): 1654-1655
    • (2009) Proceedings of the VLDB Endowment , vol.2 , Issue.2 , pp. 1654-1655
    • Dong, X.L.1    Naumann, F.2
  • 15
    • 70350212979 scopus 로고    scopus 로고
    • Sampling based (ε, δ)-approximate aggregation algorithm in sensor networks
    • Piscataway, NJ: IEEE
    • Cheng Siyao, Li Jianzhong. Sampling based (ε, δ)-approximate aggregation algorithm in sensor networks [C]//Proc of IEEE ICDCS'09. Piscataway, NJ: IEEE, 2009: 273-280
    • (2009) Proc of IEEE ICDCS'09 , pp. 273-280
    • Cheng, S.1    Li, J.2
  • 16
    • 84855744805 scopus 로고    scopus 로고
    • (ε, δ)-approximate aggregation algorithms in dynamic sensor networks
    • Li Jianzhong, Cheng Siyao. (ε, δ)-approximate aggregation algorithms in dynamic sensor networks [J]. IEEE Trans on Parallel and Distributed Systems, 2012, 23(3): 385-396
    • (2012) IEEE Trans on Parallel and Distributed Systems , vol.23 , Issue.3 , pp. 385-396
    • Li, J.1    Cheng, S.2
  • 17
    • 84883127209 scopus 로고    scopus 로고
    • o(ε)-approximation to physical world by sensor networks
    • Piscataway, NJ: IEEE
    • Cheng Siyao, Li Jianzhong, Cai Zhipeng. o(ε)-approximation to physical world by sensor networks [C]//Proc of IEEE INFOCOM'13. Piscataway, NJ: IEEE, 2013: 3184-3192
    • (2013) Proc of IEEE INFOCOM'13 , pp. 3184-3192
    • Cheng, S.1    Li, J.2    Cai, Z.3
  • 18
    • 84861622956 scopus 로고    scopus 로고
    • Location aware peak value queries in sensor networks
    • NJ: IEEE
    • Cheng Siyao, Li Jianzhong, Liu Yu. Location aware peak value queries in sensor networks [C]//Proc of IEEE INFOCOM'12. Piscataway, NJ: IEEE, 2012: 486-494
    • (2012) Proc of IEEE INFOCOM'12 , pp. 486-494
    • Cheng, S.1    Li, J.2    Liu, Y.3
  • 19
    • 34548731840 scopus 로고    scopus 로고
    • Conditional functional dependencies for data cleaning
    • Piscataway, NJ: IEEE
    • Bohannon P, Fan Wenfei, Geerts F, et al. Conditional functional dependencies for data cleaning [C]//Proc of IEEE ICDE'07. Piscataway, NJ: IEEE, 2007: 746-755
    • (2007) Proc of IEEE ICDE'07 , pp. 746-755
    • Bohannon, P.1    Fan, W.2    Geerts, F.3
  • 20
  • 21
    • 52649161210 scopus 로고    scopus 로고
    • Increasing the expressivity of conditional functional dependencies without extra complexity
    • Piscataway, NJ: IEEE
    • Bravo L, Fan Wenfei, Geerts F, et al. Increasing the expressivity of conditional functional dependencies without extra complexity [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 516-525
    • (2008) Proc of IEEE ICDE'08 , pp. 516-525
    • Bravo, L.1    Fan, W.2    Geerts, F.3
  • 22
    • 79953184736 scopus 로고    scopus 로고
    • Propagating functional dependencies with conditions
    • Fan Wenfei, Ma Shuai, Hu Yanli, et al. Propagating functional dependencies with conditions [J]. Proceedings of the VLDB Endowment, 2008, 1(1): 391-407
    • (2008) Proceedings of the VLDB Endowment , vol.1 , Issue.1 , pp. 391-407
    • Fan, W.1    Ma, S.2    Hu, Y.3
  • 25
    • 67649655745 scopus 로고    scopus 로고
    • Metric functional dependencies
    • Piscataway, NJ: IEEE
    • Koudas N, Saha A, Srivastava D, et al. Metric functional dependencies [C]//Proc of IEEE ICDE'09. Piscataway, NJ: IEEE, 2009: 1275-1278
    • (2009) Proc of IEEE ICDE'09 , pp. 1275-1278
    • Koudas, N.1    Saha, A.2    Srivastava, D.3
  • 28
    • 57449119633 scopus 로고    scopus 로고
    • Checks and balances: Monitoring data quality problems in network traffic database
    • San Francisco, CA: Morgan Kaufmann
    • Korn F, Muthukrishnan S, Zhu Y. Checks and balances: Monitoring data quality problems in network traffic databases [C]//Proc of the 29th Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2003: 536-547
    • (2003) Proc of the 29th Int Conf on Very Large Databases , pp. 536-547
    • Korn, F.1    Muthukrishnan, S.2    Zhu, Y.3
  • 32
    • 0021513522 scopus 로고
    • Incomplete information in relational databases
    • Imieliński T, Lipski Jr W. Incomplete information in relational databases [J]. Journal of the ACM (JACM), 1984, 31(4): 761-791
    • (1984) Journal of the ACM (JACM) , vol.31 , Issue.4 , pp. 761-791
    • Imieliński, T.1    Lipski Jr., W.2
  • 34
    • 0004919827 scopus 로고
    • Closed world databases opened through null values
    • San Francisco, CA: Morgan Kaufmann
    • Gottlob G, Zicari R. Closed world databases opened through null values [C]//Proc of the 14th Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 1988: 50-61
    • (1988) Proc of the 14th Int Conf on Very Large Databases , pp. 50-61
    • Gottlob, G.1    Zicari, R.2
  • 39
    • 61349087255 scopus 로고    scopus 로고
    • Cleaning uncertain data with quality guarantees
    • Cheng R, Chen J, Xie X. Cleaning uncertain data with quality guarantees [J]. Proceedings of the VLDB Endowment, 2008, 1(1): 722-735
    • (2008) Proceedings of the VLDB Endowment , vol.1 , Issue.1 , pp. 722-735
    • Cheng, R.1    Chen, J.2    Xie, X.3
  • 40
    • 14744293228 scopus 로고    scopus 로고
    • Minimal-change integrity maintenance using tuple deletions
    • Chomicki J, Marcinkowski J. Minimal-change integrity maintenance using tuple deletions [J]. Information and Computation, 2005, 197(1): 90-121
    • (2005) Information and Computation , vol.197 , Issue.1 , pp. 90-121
    • Chomicki, J.1    Marcinkowski, J.2
  • 41
    • 0001500141 scopus 로고    scopus 로고
    • Temporal constraints: A survey
    • Schwalb E, Vila L. Temporal constraints: A survey [J]. Constraints, 1998, 3(2/3): 129-149
    • (1998) Constraints , vol.3 , Issue.2-3 , pp. 129-149
    • Schwalb, E.1    Vila, L.2
  • 42
    • 80051951228 scopus 로고    scopus 로고
    • Recognizing patterns in streams with imprecise timestamps
    • Zhang Haopeng, Diao Yanlei, Immerman N. Recognizing patterns in streams with imprecise timestamps [J]. Proceedings of the VLDB Endowment, 2010, 3(1): 244-255
    • (2010) Proceedings of the VLDB Endowment , vol.3 , Issue.1 , pp. 244-255
    • Zhang, H.1    Diao, Y.2    Immerman, N.3
  • 45
    • 0001592068 scopus 로고
    • Automatic linkage of vital records
    • Newcombe H B, Kennedy J M, Axford S J, et al. Automatic linkage of vital records [J]. Science, 1959, 130(3381): 954-959
    • (1959) Science , vol.130 , Issue.3381 , pp. 954-959
    • Newcombe, H.B.1    Kennedy, J.M.2    Axford, S.J.3
  • 47
    • 84976856849 scopus 로고
    • The merge/purge problem for large databases
    • Hernández M A, Stolfo S J. The merge/purge problem for large databases [J]. Proc of ACM SIGMOD Record, 1995, 24(2): 127-138
    • (1995) Proc of ACM SIGMOD Record , vol.24 , Issue.2 , pp. 127-138
    • Hernández, M.A.1    Stolfo, S.J.2
  • 48
    • 0013331361 scopus 로고    scopus 로고
    • Real-world data is dirty: Data cleansing and the merge/purge problem
    • Hernández M A, Stolfo S J. Real-world data is dirty: Data cleansing and the merge/purge problem [J]. Data Mining and Knowledge Discovery, 1998, 2(1): 9-37
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.1 , pp. 9-37
    • Hernández, M.A.1    Stolfo, S.J.2
  • 50
    • 77949817550 scopus 로고    scopus 로고
    • A survey of entity resolution and record linkage methodologies
    • Brizan D G, Tansel A U. A survey of entity resolution and record linkage methodologies [J]. Communications of the IIMA, 2006, 6(3): 41-50
    • (2006) Communications of the IIMA , vol.6 , Issue.3 , pp. 41-50
    • Brizan, D.G.1    Tansel, A.U.2
  • 52
    • 0030083481 scopus 로고    scopus 로고
    • Entity identification in database integration
    • Lim E P, Srivastava J, Prabhakar S, et al. Entity identification in database integration [J]. Information Sciences, 1996, 89(1): 1-38
    • (1996) Information Sciences , vol.89 , Issue.1 , pp. 1-38
    • Lim, E.P.1    Srivastava, J.2    Prabhakar, S.3
  • 53
    • 52649137537 scopus 로고    scopus 로고
    • Transformation-based framework for record matching
    • Piscataway, NJ: IEEE
    • Arasu A, Chaudhuri S, Kaushik R. Transformation-based framework for record matching [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 40-49
    • (2008) Proc of IEEE ICDE'08 , pp. 40-49
    • Arasu, A.1    Chaudhuri, S.2    Kaushik, R.3
  • 56
    • 67649649597 scopus 로고    scopus 로고
    • Large-scale deduplication with constraints using dedupalog
    • Piscataway, NJ: IEEE
    • Arasu A, Ré C, Suciu D. Large-scale deduplication with constraints using dedupalog [C]//Proc of IEEE ICDE'09. Piscataway, NJ: IEEE, 2009: 952-963
    • (2009) Proc of IEEE ICDE'09 , pp. 952-963
    • Arasu, A.1    Ré, C.2    Suciu, D.3
  • 59
    • 84865086832 scopus 로고    scopus 로고
    • Reasoning about record matching rules
    • Fan Wenfei, Jia Xibei, Li Jianzhong, et al. Reasoning about record matching rules [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 407-418
    • (2009) Proceedings of the VLDB Endowment , vol.2 , Issue.1 , pp. 407-418
    • Fan, W.1    Jia, X.2    Li, J.3
  • 60
    • 79960451998 scopus 로고    scopus 로고
    • Dynamic constraints for record matching
    • Fan Wenfei, Gao Hong, Jia Xibei, et al. Dynamic constraints for record matching [J]. The VLDB Journal, 2011, 20(4): 495-520
    • (2011) The VLDB Journal , vol.20 , Issue.4 , pp. 495-520
    • Fan, W.1    Gao, H.2    Jia, X.3
  • 64
    • 79957793038 scopus 로고    scopus 로고
    • Graph homomorphism revisited for graph matching
    • Fan Wenfei, Li Jianzhong, Ma Shuai, et al. Graph homomorphism revisited for graph matching [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 1161-1172
    • (2010) Proceedings of the VLDB Endowment , vol.3 , Issue.1-2 , pp. 1161-1172
    • Fan, W.1    Li, J.2    Ma, S.3
  • 65
    • 79960006256 scopus 로고    scopus 로고
    • Graph pattern matching: From intractable to polynomial time
    • Fan Wenfei, Li Jianzhong, Ma Shuai, et al. Graph pattern matching: from intractable to polynomial time [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 264-275
    • (2010) Proceedings of the VLDB Endowment , vol.3 , Issue.1-2 , pp. 264-275
    • Fan, W.1    Li, J.2    Ma, S.3
  • 66
    • 79960022349 scopus 로고    scopus 로고
    • Incremental graph pattern matching
    • New York: ACM
    • Fan Wenfei, Li Jianzhong, Luo Jizhou, et al. Incremental graph pattern matching [C]//Proc of ACM SIGMOD. New York: ACM, 2011: 925-936
    • (2011) Proc of ACM SIGMOD , pp. 925-936
    • Fan, W.1    Li, J.2    Luo, J.3
  • 69
    • 0004043396 scopus 로고    scopus 로고
    • An efficient domain-independent algorithm for detecting approximately duplicate database record
    • Berlin: Springer
    • Monge A, Elkan C. An efficient domain-independent algorithm for detecting approximately duplicate database records [C]//Proc of Research Issues on Data Mining and Knowledge Discovery. Berlin: Springer, 1997: 1-7
    • (1997) Proc of Research Issues on Data Mining and Knowledge Discovery , pp. 1-7
    • Monge, A.1    Elkan, C.2
  • 70
    • 0000666461 scopus 로고    scopus 로고
    • Data integration using similarity joins and a word-based information representation language
    • Cohen W W. Data integration using similarity joins and a word-based information representation language [J]. ACM Trans on Information Systems (TOIS), 2000, 18(3): 288-321
    • (2000) ACM Trans on Information Systems (TOIS) , vol.18 , Issue.3 , pp. 288-321
    • Cohen, W.W.1
  • 72
    • 26444550791 scopus 로고    scopus 로고
    • Robust identification of fuzzy duplicates
    • Piscataway, NJ: IEEE
    • Chaudhuri S, Ganti V, Motwani R. Robust identification of fuzzy duplicates [C]//Proc of IEEE ICDE'05. Piscataway, NJ: IEEE, 2005: 865-876
    • (2005) Proc of IEEE ICDE'05 , pp. 865-876
    • Chaudhuri, S.1    Ganti, V.2    Motwani, R.3
  • 73
    • 79953162324 scopus 로고    scopus 로고
    • Merging the results of approximate match operations
    • San Francisco, CA: Morgan Kaufmann
    • Guha S, Koudas N, Marathe A, et al. Merging the results of approximate match operations [C]//Proc of the 30th Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2004: 636-647
    • (2004) Proc of the 30th Int Conf on Very Large Databases , pp. 636-647
    • Guha, S.1    Koudas, N.2    Marathe, A.3
  • 75
    • 84878044770 scopus 로고    scopus 로고
    • Entity resolution with markov logic
    • Piscataway, NJ: IEEE
    • Singla P, Domingos P. Entity resolution with markov logic [C]//Proc of IEEE ICDM'06. Piscataway, NJ: IEEE, 2006: 572-582
    • (2006) Proc of IEEE ICDM'06 , pp. 572-582
    • Singla, P.1    Domingos, P.2
  • 76
    • 52649127789 scopus 로고    scopus 로고
    • Approximate joins for data-centric XML
    • Piscataway, NJ: IEEE
    • Augsten N, Bohlen M, Dyreson C, et al. Approximate joins for data-centric XML [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 814-823
    • (2008) Proc of IEEE ICDE'08 , pp. 814-823
    • Augsten, N.1    Bohlen, M.2    Dyreson, C.3
  • 78
    • 70849115286 scopus 로고    scopus 로고
    • Efficient approximate entity extraction with edit distance constraints
    • New York: ACM
    • Wang Wei, Xiao Chuan, Lin Xuemin, et al. Efficient approximate entity extraction with edit distance constraints [C]//Proc of the 35th SIGMOD Int Conf on Management of Data. New York: ACM, 2009: 759-770
    • (2009) Proc of the 35th SIGMOD Int Conf on Management of Data , pp. 759-770
    • Wang, W.1    Xiao, C.2    Lin, X.3
  • 81
    • 79959944062 scopus 로고    scopus 로고
    • Interaction between record matching and data repairing
    • New York: ACM
    • Fan Wenfei, Li Jianzhong, Ma Shuai, et al. Interaction between record matching and data repairing [C]//Proc of the 2011 Int Conf on Management of Data. New York: ACM, 2011: 469-480
    • (2011) Proc of the 2011 Int Conf on Management of Data , pp. 469-480
    • Fan, W.1    Li, J.2    Ma, S.3
  • 82
    • 70349313097 scopus 로고    scopus 로고
    • Analyses and validation of conditional dependencies with built-in predicates
    • Berlin: Springer
    • Chen W, Fan W, Ma S. Analyses and validation of conditional dependencies with built-in predicates [C]//Proc of DEXA'09. Berlin: Springer, 2009: 576-591
    • (2009) Proc of DEXA'09 , pp. 576-591
    • Chen, W.1    Fan, W.2    Ma, S.3
  • 83
    • 46649106686 scopus 로고    scopus 로고
    • Conditional functional dependencies for capturing data inconsistencies
    • Fan Wenfei, Geerts F, Jia Xibei, et al. Conditional functional dependencies for capturing data inconsistencies [J]. ACM Trans on Database Systems (TODS), 2008, 33(2): 1-48
    • (2008) ACM Trans on Database Systems (TODS) , vol.33 , Issue.2 , pp. 1-48
    • Fan, W.1    Geerts, F.2    Jia, X.3
  • 84
    • 77952749687 scopus 로고    scopus 로고
    • Detecting inconsistencies in distributed data
    • Piscataway, NJ: IEEE
    • Fan Wenfei, Geerts F, Ma Shuai, et al. Detecting inconsistencies in distributed data [C]//Proc of IEEE ICDE'10. Piscataway, NJ: IEEE, 2010: 64-75
    • (2010) Proc of IEEE ICDE'10 , pp. 64-75
    • Fan, W.1    Geerts, F.2    Ma, S.3
  • 85
    • 84864198280 scopus 로고    scopus 로고
    • Incremental detection of inconsistencies in distributed data
    • Piscataway, NJ: IEEE
    • Fan W, Li J, Tang N, et al. Incremental detection of inconsistencies in distributed data [C]//Proc of IEEE ICDE'10. Piscataway, NJ: IEEE, 2012: 318-329
    • (2012) Proc of IEEE ICDE'10 , pp. 318-329
    • Fan, W.1    Li, J.2    Tang, N.3
  • 86
    • 72649102401 scopus 로고    scopus 로고
    • Mining document collections to facilitate accurate approximate entity matching
    • Chaudhuri S, Ganti V, Xin D. Mining document collections to facilitate accurate approximate entity matching [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 395-406
    • (2009) Proceedings of the VLDB Endowment , vol.2 , Issue.1 , pp. 395-406
    • Chaudhuri, S.1    Ganti, V.2    Xin, D.3
  • 87
    • 67649669734 scopus 로고    scopus 로고
    • A latent topic model for complete entity resolution
    • Piscataway, NJ: IEEE
    • Shu Liangcai, Long Bo, Meng Weiyi. A latent topic model for complete entity resolution [C]//Proc of IEEE ICDE'09. Piscataway, NJ: IEEE, 2009: 880-891
    • (2009) Proc of IEEE ICDE'09 , pp. 880-891
    • Shu, L.1    Long, B.2    Meng, W.3
  • 88
    • 65449139594 scopus 로고    scopus 로고
    • Automatic record linkage using seeded nearest neighbor and support vector machine classification
    • New York: ACM
    • Christen P. Automatic record linkage using seeded nearest neighbor and support vector machine classification [C]//Proc of the 14th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining. New York: ACM, 2008: 151-159
    • (2008) Proc of the 14th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining , pp. 151-159
    • Christen, P.1
  • 93
    • 77952280581 scopus 로고    scopus 로고
    • HARRA: Fast iterative hashed record linkage for large-scale data collections
    • New York: ACM
    • Kim H, Lee D. HARRA: Fast iterative hashed record linkage for large-scale data collections [C]//Proc of the 13th Int Conf on Extending Database Technology. New York: ACM, 2010: 525-536
    • (2010) Proc of the 13th Int Conf on Extending Database Technology , pp. 525-536
    • Kim, H.1    Lee, D.2
  • 95
    • 84878049861 scopus 로고    scopus 로고
    • Adaptive blocking: Learning to scale up record linkage
    • Piscataway, NJ: IEEE
    • Bilenko M, Kamath B, Mooney R J. Adaptive blocking: Learning to scale up record linkage [C]//Proc of IEEE ICDM'06. Piscataway, NJ: IEEE, 2006: 87-96
    • (2006) Proc of IEEE ICDM'06 , pp. 87-96
    • Bilenko, M.1    Kamath, B.2    Mooney, R.J.3
  • 97
    • 5444258997 scopus 로고    scopus 로고
    • A comparison of fast blocking methods for record linkage
    • New York: ACM
    • Baxter R, Christen P, Churches T. A comparison of fast blocking methods for record linkage [C]//Proc of ACM SIGKDD Workshop. New York: ACM, 2003: 25-27
    • (2003) Proc of ACM SIGKDD Workshop , pp. 25-27
    • Baxter, R.1    Christen, P.2    Churches, T.3
  • 101
  • 102
    • 33749597967 scopus 로고    scopus 로고
    • A primitive operator for similarity joins in data cleaning
    • Piscataway, NJ: IEEE
    • Chaudhuri S, Ganti V, Kaushik R. A primitive operator for similarity joins in data cleaning [C]//Proc of IEEE ICDE'06. Piscataway, NJ: IEEE, 2006: 5-5
    • (2006) Proc of IEEE ICDE'06 , pp. 5-5
    • Chaudhuri, S.1    Ganti, V.2    Kaushik, R.3
  • 103
    • 67649641448 scopus 로고    scopus 로고
    • Space-constrained gram-based indexing for efficient approximate string search
    • Piscataway, NJ: IEEE
    • Behm A, Ji S, Li C, et al. Space-constrained gram-based indexing for efficient approximate string search [C]//Proc of IEEE ICDE'09. Piscataway, NJ: IEEE, 2009: 604-615
    • (2009) Proc of IEEE ICDE'09 , pp. 604-615
    • Behm, A.1    Ji, S.2    Li, C.3
  • 104
    • 80052344031 scopus 로고    scopus 로고
    • Efficient similarity joins for near-duplicate detection
    • Xiao Chuan, Wang Wei, Lin Xuemin, et al. Efficient similarity joins for near-duplicate detection [J]. ACM Trans on Database Systems (TODS), 2011, 36(3): 15
    • (2011) ACM Trans on Database Systems (TODS) , vol.36 , Issue.3 , pp. 15
    • Xiao, C.1    Wang, W.2    Lin, X.3
  • 106
    • 57149130672 scopus 로고    scopus 로고
    • Cost-based variable-length-gram selection for string collections to support approximate queries efficiently
    • New York: ACM
    • Yang Xiaochun, Wang Bin, Li Chen. Cost-based variable-length-gram selection for string collections to support approximate queries efficiently [C]//Proc of the 2008 ACM SIGMOD Int Conf on Management of Data. New York: ACM, 2008: 353-364
    • (2008) Proc of the 2008 ACM SIGMOD Int Conf on Management of Data , pp. 353-364
    • Yang, X.1    Wang, B.2    Li, C.3
  • 107
    • 85011032600 scopus 로고    scopus 로고
    • VGRAM: Improving performance of approximate queries on string collections using variable-length grams
    • San Francisco, CA: Morgan Kaufmann
    • Li Chen, Wang Bin, Yang Xiaochun. VGRAM: Improving performance of approximate queries on string collections using variable-length grams [C]//Proc of the 33rd Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2007: 303-314
    • (2007) Proc of the 33rd Int Conf on Very Large Databases , pp. 303-314
    • Li, C.1    Wang, B.2    Yang, X.3
  • 108
    • 52649086729 scopus 로고    scopus 로고
    • Efficient merging and filtering algorithms for approximate string searches
    • Piscataway, NJ: IEEE
    • Li Chen, Lu Jiaheng, Lu Yiming. Efficient merging and filtering algorithms for approximate string searches [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 257-266
    • (2008) Proc of IEEE ICDE'08 , pp. 257-266
    • Li, C.1    Lu, J.2    Lu, Y.3
  • 109
    • 52649161208 scopus 로고    scopus 로고
    • A fast similarity join algorithm using graphics processing units
    • Piscataway, NJ: IEEE
    • Lieberman M D, Sankaranarayanan J, Samet H. A fast similarity join algorithm using graphics processing units [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 1111-1120
    • (2008) Proc of IEEE ICDE'08 , pp. 1111-1120
    • Lieberman, M.D.1    Sankaranarayanan, J.2    Samet, H.3
  • 111
    • 77952779390 scopus 로고    scopus 로고
    • Hashing tree-structured data: Methods and applications
    • Piscataway, NJ: IEEE
    • Tatikonda S, Parthasarathy S. Hashing tree-structured data: Methods and applications [C]//Proc of IEEE ICDE'10. Piscataway, NJ: IEEE, 2010: 429-440
    • (2010) Proc of IEEE ICDE'10 , pp. 429-440
    • Tatikonda, S.1    Parthasarathy, S.2
  • 112
    • 74049138802 scopus 로고    scopus 로고
    • Development and user experiences of an open source data cleaning, deduplication and record linkage system
    • Christen P. Development and user experiences of an open source data cleaning, deduplication and record linkage system [J]. ACM SIGKDD Explorations Newsletter, 2009, 11(1): 39-48
    • (2009) ACM SIGKDD Explorations Newsletter , vol.11 , Issue.1 , pp. 39-48
    • Christen, P.1
  • 114
    • 85011029434 scopus 로고    scopus 로고
    • Example-driven design of efficient record matching queries
    • San Francisco, CA: Morgan Kaufmann
    • Chaudhuri S, Chen B C, Ganti V, et al. Example-driven design of efficient record matching queries [C]//Proc of the 33rd Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2007: 327-338
    • (2007) Proc of the 33rd Int Conf on Very Large Databases , pp. 327-338
    • Chaudhuri, S.1    Chen, B.C.2    Ganti, V.3
  • 119
    • 80455148340 scopus 로고    scopus 로고
    • Evaluation of entity resolution approaches on real-world match problems
    • Köpcke H, Thor A, Rahm E. Evaluation of entity resolution approaches on real-world match problems [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 484-493
    • (2010) Proceedings of the VLDB Endowment , vol.3 , Issue.1-2 , pp. 484-493
    • Köpcke, H.1    Thor, A.2    Rahm, E.3
  • 120
    • 80052419079 scopus 로고    scopus 로고
    • Comparative evaluation of entity resolution approaches with FEVER
    • Köpcke H, Thor A, Rahm E. Comparative evaluation of entity resolution approaches with FEVER [J]. Proceedings of the VLDB Endowment, 2009, 2(2): 1574-1577
    • (2009) Proceedings of the VLDB Endowment , vol.2 , Issue.2 , pp. 1574-1577
    • Köpcke, H.1    Thor, A.2    Rahm, E.3
  • 121
    • 72649095071 scopus 로고    scopus 로고
    • Frameworks for entity matching: A comparison
    • Köpcke H, Rahm E. Frameworks for entity matching: A comparison [J]. Data & Knowledge Engineering, 2010, 69(2): 197-210
    • (2010) Data & Knowledge Engineering , vol.69 , Issue.2 , pp. 197-210
    • Köpcke, H.1    Rahm, E.2
  • 123
    • 29844436973 scopus 로고    scopus 로고
    • A cost-based model and effective heuristic for repairing constraints by value modification
    • New York: ACM
    • Bohannon P, Fan Wenfei, Flaster M, et al. A cost-based model and effective heuristic for repairing constraints by value modification [C]//Proc of the 2005 ACM SIGMOD Int Conf on Management of Data. New York: ACM, 2005: 143-154
    • (2005) Proc of the 2005 ACM SIGMOD Int Conf on Management of Data , pp. 143-154
    • Bohannon, P.1    Fan, W.2    Flaster, M.3
  • 124
    • 84959912087 scopus 로고    scopus 로고
    • Improving data quality: Consistency and accuracy
    • San Francisco, CA: Morgan Kaufmann
    • Cong Gao, Fan Wenfei, Geerts F, et al. Improving data quality: Consistency and accuracy [C]//Proc of the 33rd Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2007: 315-326
    • (2007) Proc of the 33rd Int Conf on Very Large Databases , pp. 315-326
    • Cong, G.1    Fan, W.2    Geerts, F.3
  • 125
    • 80052917068 scopus 로고    scopus 로고
    • Sampling the repairs of functional dependency violations under hard constraints
    • Beskales G, Ilyas I F, Golab L. Sampling the repairs of functional dependency violations under hard constraints [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 197-207
    • (2010) Proceedings of the VLDB Endowment , vol.3 , Issue.1-2 , pp. 197-207
    • Beskales, G.1    Ilyas, I.F.2    Golab, L.3
  • 126
    • 77954736322 scopus 로고    scopus 로고
    • Consistent query answers in inconsistent probabilistic databases
    • New York: ACM
    • Lian Xiang, Chen Lei, Song Shaoxu. Consistent query answers in inconsistent probabilistic databases [C]//Proc of the 2010 Int Conf on Management of Data. New York: ACM, 2010: 303-314
    • (2010) Proc of the 2010 Int Conf on Management of Data , pp. 303-314
    • Lian, X.1    Chen, L.2    Song, S.3
  • 127
    • 52649155017 scopus 로고    scopus 로고
    • A sampling-based approach to information recovery
    • Piscataway, NJ: IEEE
    • Xie Junyi, Yang Jun, Chen Yuguo, et al. A sampling-based approach to information recovery [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 476-485
    • (2008) Proc of IEEE ICDE'08 , pp. 476-485
    • Xie, J.1    Yang, J.2    Chen, Y.3
  • 130
    • 34249872509 scopus 로고    scopus 로고
    • In-network outlier cleaning for data collection in sensor networks
    • New York: VLDB Endowment
    • Zhuang Yongzhen, Chen Lei. In-network outlier cleaning for data collection in sensor networks [C]//Proc of VLDB Workshop on CleanDB. New York: VLDB Endowment, 2006: 41-48
    • (2006) Proc of VLDB Workshop on CleanDB , pp. 41-48
    • Zhuang, Y.1    Chen, L.2
  • 132
    • 77954695997 scopus 로고    scopus 로고
    • Modeling and querying possible repairs in duplicate detection
    • Beskales G, Soliman M A, Ilyas I F, et al. Modeling and querying possible repairs in duplicate detection [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 598-609
    • (2009) Proceedings of the VLDB Endowment , vol.2 , Issue.1 , pp. 598-609
    • Beskales, G.1    Soliman, M.A.2    Ilyas, I.F.3
  • 133
    • 33749588820 scopus 로고    scopus 로고
    • Clean answers over dirty databases: A probabilistic approach
    • Piscataway, NJ: IEEE
    • Andritsos P, Fuxman A, Miller R J. Clean answers over dirty databases: A probabilistic approach [C]//Proc of IEEE ICDE'06. Piscataway, NJ: IEEE, 2006: 30-30
    • (2006) Proc of IEEE ICDE'06 , pp. 30-30
    • Andritsos, P.1    Fuxman, A.2    Miller, R.J.3
  • 135
    • 72649086387 scopus 로고    scopus 로고
    • Framework for evaluating clustering algorithms in duplicate detection
    • Hassanzadeh O, Chiang F, Lee H C, et al. Framework for evaluating clustering algorithms in duplicate detection [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 1282-1293
    • (2009) Proceedings of the VLDB Endowment , vol.2 , Issue.1 , pp. 1282-1293
    • Hassanzadeh, O.1    Chiang, F.2    Lee, H.C.3
  • 137
    • 79960023714 scopus 로고    scopus 로고
    • Record linkage with uniqueness constraints and erroneous values
    • Guo Songtao, Dong X L, Srivastava D, et al. Record linkage with uniqueness constraints and erroneous values [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 417-428
    • (2010) Proceedings of the VLDB Endowment , vol.3 , Issue.1-2 , pp. 417-428
    • Guo, S.1    Dong, X.L.2    Srivastava, D.3
  • 140
    • 33745628835 scopus 로고    scopus 로고
    • ConQuer: A system for efficient querying over inconsistent databases
    • San Francisco, CA: Morgan Kaufmann
    • Fuxman A, Fuxman D, Miller R J. ConQuer: A system for efficient querying over inconsistent databases [C]//Proc of the 31st Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2005: 1354-1357
    • (2005) Proc of the 31st Int Conf on Very Large Databases , pp. 1354-1357
    • Fuxman, A.1    Fuxman, D.2    Miller, R.J.3
  • 141
    • 0024941096 scopus 로고
    • Integrity=validity+completeness
    • Motro A. Integrity=validity+completeness [J]. ACM Trans on Database Systems (TODS), 1989, 14(4): 480-502
    • (1989) ACM Trans on Database Systems (TODS) , vol.14 , Issue.4 , pp. 480-502
    • Motro, A.1
  • 142
    • 0002842314 scopus 로고    scopus 로고
    • Obtaining complete answers from incomplete databases
    • San Francisco, CA: Morgan Kaufmann
    • Levy A. Obtaining complete answers from incomplete databases [C]//Proc of the 22nd Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 1996: 402-412
    • (1996) Proc of the 22nd Int Conf on Very Large Databases , pp. 402-412
    • Levy, A.1
  • 143
    • 52649139904 scopus 로고    scopus 로고
    • Skyline query processing for incomplete data
    • Piscataway, NJ: IEEE
    • Khalefa M E, Mokbel M F, Levandoski J J. Skyline query processing for incomplete data [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 556-565
    • (2008) Proc of IEEE ICDE'08 , pp. 556-565
    • Khalefa, M.E.1    Mokbel, M.F.2    Levandoski, J.J.3
  • 144
    • 67649637305 scopus 로고    scopus 로고
    • Resolution-aware query answering for business intelligence
    • Piscataway, NJ: IEEE
    • Sismanis Y, Wang L, Fuxman A, et al. Resolution-aware query answering for business intelligence [C]//Proc of IEEE ICDE'09. Piscataway, NJ: IEEE, 2009: 976-987
    • (2009) Proc of IEEE ICDE'09 , pp. 976-987
    • Sismanis, Y.1    Wang, L.2    Fuxman, A.3
  • 149
    • 77955171415 scopus 로고    scopus 로고
    • Mining frequent subgraph patterns from uncertain graph data
    • Zou Zhaonian, Li Jianzhong, Gao Hong, et al. Mining frequent subgraph patterns from uncertain graph data [J]. IEEE Trans on Knowledge and Data Engineering, 2010, 22(9): 1203-1218
    • (2010) IEEE Trans on Knowledge and Data Engineering , vol.22 , Issue.9 , pp. 1203-1218
    • Zou, Z.1    Li, J.2    Gao, H.3
  • 150
  • 151
    • 84869506153 scopus 로고    scopus 로고
    • Mining frequent subgraphs over uncertain graph databases under probabilistic semantics
    • Li Jianzhong, Zou Zhaonian, Gao Hong. Mining frequent subgraphs over uncertain graph databases under probabilistic semantics [J]. The VLDB Journal, 2012, 21(6): 753-777
    • (2012) The VLDB Journal , vol.21 , Issue.6 , pp. 753-777
    • Li, J.1    Zou, Z.2    Gao, H.3
  • 152
    • 77952764293 scopus 로고    scopus 로고
    • Finding top-k maximal cliques in an uncertain graph
    • Piscataway, NJ: IEEE
    • Zou Zhaonian, Li Jianzhong, Gao Hong, et al. Finding top-k maximal cliques in an uncertain graph [C]//Proc of IEEE ICDE'10. Piscataway, NJ: IEEE, 2010: 649-652
    • (2010) Proc of IEEE ICDE'10 , pp. 649-652
    • Zou, Z.1    Li, J.2    Gao, H.3
  • 153
    • 84874024305 scopus 로고    scopus 로고
    • Reliable clustering on uncertain graphs
    • Piscataway, NJ: IEEE
    • Liu Lin, Jin Ruoming, Aggrawal C C, et al. Reliable clustering on uncertain graphs [C]//Proc of IEEE ICDM'12. Piscataway, NJ: IEEE, 2012: 459-468
    • (2012) Proc of IEEE ICDM'12 , pp. 459-468
    • Liu, L.1    Jin, R.2    Aggrawal, C.C.3
  • 154
    • 80052652443 scopus 로고    scopus 로고
    • Distance-constraint reachability computation in uncertain graphs
    • Jin Ruoming, Liu Lin, Ding Bolin, et al. Distance-constraint reachability computation in uncertain graphs [J]. Proceedings of the VLDB Endowment, 2011, 4(9): 551-562
    • (2011) Proceedings of the VLDB Endowment , vol.4 , Issue.9 , pp. 551-562
    • Jin, R.1    Liu, L.2    Ding, B.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.