메뉴 건너뛰기




Volumn , Issue , 2015, Pages 4-12

Evaluation of a machine learning duplicate detection method for bioinformatics databases

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; BIOINFORMATICS; DATA HANDLING; DATA MINING; DATABASE SYSTEMS; INFORMATION SCIENCE;

EID: 84960860880     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2811163.2811175     Document Type: Conference Paper
Times cited : (22)

References (33)
  • 3
    • 0028354228 scopus 로고
    • Blood pressure measurement error: Its effect on cross-sectional and trend analyses
    • S. Bennett. Blood pressure measurement error: its effect on cross-sectional and trend analyses. Journal of clinical epidemiology, 47(3):293-301, 1994.
    • (1994) Journal of Clinical Epidemiology , vol.47 , Issue.3 , pp. 293-301
    • Bennett, S.1
  • 6
    • 0030271920 scopus 로고    scopus 로고
    • Go hunting in sequence databases but watch out for the traps
    • P. Bork and A. Bairoch. Go hunting in sequence databases but watch out for the traps. Trends in Genetics, 12(10):425-427, 1996.
    • (1996) Trends in Genetics , vol.12 , Issue.10 , pp. 425-427
    • Bork, P.1    Bairoch, A.2
  • 10
    • 0035424599 scopus 로고    scopus 로고
    • Intrinsic errors in genome annotation
    • D. Devos and A. Valencia. Intrinsic errors in genome annotation. TRENDS in Genetics, 17(8):429-431, 2001.
    • (2001) TRENDS in Genetics , vol.17 , Issue.8 , pp. 429-431
    • Devos, D.1    Valencia, A.2
  • 11
    • 33845667955 scopus 로고    scopus 로고
    • Duplicate record detection: A survey. Knowledge and data engineering
    • A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios. Duplicate record detection: A survey. Knowledge and Data Engineering, IEEE Transactions on, 19(1):1-16, 2007.
    • (2007) IEEE Transactions on , vol.19 , Issue.1 , pp. 1-16
    • Elmagarmid, A.K.1    Ipeirotis, P.G.2    Verykios, V.S.3
  • 12
    • 84865627282 scopus 로고    scopus 로고
    • Data quality: Theory and practice
    • Springer
    • W. Fan. Data quality: Theory and practice. In Web-Age Information Management, pages 1-16. Springer, 2012.
    • (2012) Web-Age Information Management , pp. 1-16
    • Fan, W.1
  • 15
    • 84958071711 scopus 로고    scopus 로고
    • Introduction to arules-A computational environment for mining association rules and frequent item sets
    • M. Hahsler, B. Grün, K. Hornik, and C. Buchta. Introduction to arules-a computational environment for mining association rules and frequent item sets. The Comprehensive R Archive Network, 2009.
    • (2009) The Comprehensive R Archive Network
    • Hahsler, M.1    Grün, B.2    Hornik, K.3    Buchta, C.4
  • 16
    • 0031829372 scopus 로고    scopus 로고
    • Removing near-neighbour redundancy from large protein sequence collections
    • L. Holm and C. Sander. Removing near-neighbour redundancy from large protein sequence collections. Bioinformatics, 14(5):423-429, 1998.
    • (1998) Bioinformatics , vol.14 , Issue.5 , pp. 423-429
    • Holm, L.1    Sander, C.2
  • 17
    • 84960859412 scopus 로고    scopus 로고
    • Duplicate detection in biological data using association rule mining
    • P34180
    • J. L. Koh, M. L. Lee, A. M. Khan, P. T. Tan, and V. Brusic. Duplicate detection in biological data using association rule mining. Locus, 501(P34180):S22388, 2004.
    • (2004) Locus , vol.501 , pp. S22388
    • Koh, J.L.1    Lee, M.L.2    Khan, A.M.3    Tan, P.T.4    Brusic, V.5
  • 19
    • 33745634395 scopus 로고    scopus 로고
    • Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences
    • W. Li and A. Godzik. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics, 22(13):1658-1659, 2006.
    • (2006) Bioinformatics , vol.22 , Issue.13 , pp. 1658-1659
    • Li, W.1    Godzik, A.2
  • 20
    • 0036699189 scopus 로고    scopus 로고
    • Sequence clustering strategies improve remote homology recognitions while reducing search times
    • W. Li, L. Jaroszewski, and A. Godzik. Sequence clustering strategies improve remote homology recognitions while reducing search times. Protein engineering, 15(8):643-649, 2002.
    • (2002) Protein Engineering , vol.15 , Issue.8 , pp. 643-649
    • Li, W.1    Jaroszewski, L.2    Godzik, A.3
  • 23
    • 84867684415 scopus 로고    scopus 로고
    • Protein sequence redundancy reduction: Comparison of various method
    • K. Sikic and O. Carugo. Protein sequence redundancy reduction: comparison of various method. Bioinformation, 5(6):234, 2010.
    • (2010) Bioinformation , vol.5 , Issue.6 , pp. 234
    • Sikic, K.1    Carugo, O.2
  • 24
    • 78049440735 scopus 로고    scopus 로고
    • Detecting duplicate biological entities using markov random field-based edit distance
    • M. Song and A. Rudniy. Detecting duplicate biological entities using markov random field-based edit distance. Knowledge and information systems, 25(2):371-387, 2010.
    • (2010) Knowledge and Information Systems , vol.25 , Issue.2 , pp. 371-387
    • Song, M.1    Rudniy, A.2
  • 25
    • 34347388470 scopus 로고    scopus 로고
    • Uniref: Comprehensive and non-redundant uniprot reference clusters
    • B. E. Suzek, H. Huang, P. McGarvey, R. Mazumder, and C. H. Wu. Uniref: comprehensive and non-redundant uniprot reference clusters. Bioinformatics, 23(10):1282-1288, 2007.
    • (2007) Bioinformatics , vol.23 , Issue.10 , pp. 1282-1288
    • Suzek, B.E.1    Huang, H.2    McGarvey, P.3    Mazumder, R.4    Wu, C.H.5
  • 26
    • 0032964297 scopus 로고    scopus 로고
    • Blast 2 sequences a new tool for comparing protein and nucleotide sequences
    • T. A. Tatusova and T. L. Madden. Blast 2 sequences, a new tool for comparing protein and nucleotide sequences. FEMS microbiology letters, 174(2):247-250, 1999.
    • (1999) FEMS Microbiology Letters , vol.174 , Issue.2 , pp. 247-250
    • Tatusova, T.A.1    Madden, T.L.2
  • 28
    • 0035545848 scopus 로고    scopus 로고
    • Learning object identification rules for information integration
    • S. Tejada, C. A. Knoblock, and S. Minton. Learning object identification rules for information integration. Information Systems, 26(8):607-633, 2001.
    • (2001) Information Systems , vol.26 , Issue.8 , pp. 607-633
    • Tejada, S.1    Knoblock, C.A.2    Minton, S.3
  • 31
    • 34748898358 scopus 로고    scopus 로고
    • The current state of business intelligence
    • H. J. Watson and B. H. Wixom. The current state of business intelligence. Computer, 40(9):96-99, 2007.
    • (2007) Computer , vol.40 , Issue.9 , pp. 96-99
    • Watson, H.J.1    Wixom, B.H.2
  • 33
    • 84931084134 scopus 로고    scopus 로고
    • Starcode: Sequence clustering based on all-pairs search
    • E. V. Zorita, P. Cuscó, and G. Filion. Starcode: sequence clustering based on all-pairs search. Bioinformatics, page btv053, 2015.
    • (2015) Bioinformatics , pp. btv053
    • Zorita, E.V.1    Cuscó, P.2    Filion, G.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.