메뉴 건너뛰기




Volumn 26, Issue 7, 2010, Pages 953-959

Accelerated similarity searching and clustering of large compound sets by geometric embedding and locality sensitive hashing

Author keywords

[No Author keywords available]

Indexed keywords

ARTICLE; AUTOMATED PATTERN RECOGNITION; BIOLOGY; CLUSTER ANALYSIS; INFORMATION RETRIEVAL; METHODOLOGY;

EID: 77951946180     PISSN: 13674803     EISSN: 14602059     Source Type: Journal    
DOI: 10.1093/bioinformatics/btq067     Document Type: Article
Times cited : (30)

References (45)
  • 1
    • 0038825237 scopus 로고    scopus 로고
    • Stochastic proximity embedding
    • Agrafiotis, D. (2003) Stochastic proximity embedding. J. Comput. Chem., 24, 1215-1221.
    • (2003) J. Comput. Chem. , vol.24 , pp. 1215-1221
    • Agrafiotis, D.1
  • 2
    • 0001916126 scopus 로고    scopus 로고
    • An efficient implementation of distance-based diversity measures based on kd trees
    • Agrafiotis, D. and Lobanov, V. (1999) An efficient implementation of distance-based diversity measures based on kd trees. J. Chem. Inf. Comput. Sci., 39, 51-58
    • (1999) J. Chem. Inf. Comput. Sci. , vol.39 , pp. 51-58
    • Agrafiotis, D.1    Lobanov, V.2
  • 3
    • 0035871891 scopus 로고    scopus 로고
    • Multidimensional scaling and visualization of large molecular similarity tables
    • Agrafiotis, D. et al. (2001) Multidimensional scaling and visualization of large molecular similarity tables. J. Comput. Chem., 22, 488-500
    • (2001) J. Comput. Chem. , vol.22 , pp. 488-500
    • Agrafiotis, D.1
  • 4
    • 0037059052 scopus 로고    scopus 로고
    • A self-organizing principle for learning nonlinear manifolds
    • Agrafiotis, D.K. and Xu, H. (2002) A self-organizing principle for learning nonlinear manifolds. Proc. Natl Acad. Sci. USA, 99, 15869-15872
    • (2002) Proc. Natl Acad. Sci. USA , vol.99 , pp. 15869-15872
    • Agrafiotis, D.K.1    Xu, H.2
  • 5
    • 8444225024 scopus 로고    scopus 로고
    • NIH molecular libraries initiative
    • Austin, C.P. et al. (2004) NIH molecular libraries initiative. Science, 306, 1138-1139
    • (2004) Science , vol.306 , pp. 1138-1139
    • Austin, C.P.1
  • 6
    • 49449099341 scopus 로고    scopus 로고
    • Speeding up chemical database searches using a proximity filter based on the logical exclusive OR
    • Baldi, P. et al. (2008) Speeding up chemical database searches using a proximity filter based on the logical exclusive OR. J. Chem. Inf. Model., 48, 1367-1378
    • (2008) J. Chem. Inf. Model. , vol.48 , pp. 1367-1378
    • Baldi, P.1
  • 7
    • 0016557674 scopus 로고
    • Multidimensional binary search trees used for associative searching
    • Bentley, J. (1975) Multidimensional binary search trees used for associative searching. Comm. ACM, 18, 509-517
    • (1975) Comm. ACM , vol.18 , pp. 509-517
    • Bentley, J.1
  • 8
    • 0038670812 scopus 로고    scopus 로고
    • Searching in high-dimensional spaces: index structures for improving the performance of multimedia databases
    • Bohm, C. et al. (2001) Searching in high-dimensional spaces: index structures for improving the performance of multimedia databases. ACM Comput. Surv., 33, 322-373
    • (2001) ACM Comput. Surv. , vol.33 , pp. 322-373
    • Bohm, C.1
  • 10
    • 46249123633 scopus 로고    scopus 로고
    • A maximum common substructure-based algorithm for searching and predicting drug-like compounds
    • Cao, Y. et al. (2008) A maximum common substructure-based algorithm for searching and predicting drug-like compounds. Bioinformatics, 24, i366
    • (2008) Bioinformatics , vol.24
    • Cao, Y.1
  • 11
    • 0015592191 scopus 로고
    • A heuristic relaxation method for nonlinear mapping in cluster analysis
    • Chang, C. and Lee, R. (1973) A heuristic relaxation method for nonlinear mapping in cluster analysis. IEEE Trans. Syst. Man Cybernet., 3, 197200
    • (1973) IEEE Trans. Syst. Man Cybernet. , vol.3 , pp. 197200
    • Chang, C.1    Lee, R.2
  • 12
    • 34548742854 scopus 로고    scopus 로고
    • ChemDB update-full-text search and virtual chemical space
    • Chen, J.H. et al. (2007) ChemDB update-full-text search and virtual chemical space. Bioinformatics, 23, 2348-2351
    • (2007) Bioinformatics , vol.23 , pp. 2348-2351
    • Chen, J.H.1
  • 13
    • 0036827080 scopus 로고    scopus 로고
    • Performance of similarity measures in 2D fragment-based similarity searching: comparison of structural descriptors and similarity coefficients
    • Chen, X. and Reynolds, C. (2002) Performance of similarity measures in 2D fragment-based similarity searching: comparison of structural descriptors and similarity coefficients. J. Chem. Inf. Comput. Sci., 42, 1407-1414
    • (2002) J. Chem. Inf. Comput. Sci. , vol.42 , pp. 1407-1414
    • Chen, X.1    Reynolds, C.2
  • 14
    • 33846155913 scopus 로고    scopus 로고
    • Structure-based maximal affinity model predicts small-molecule druggability
    • Cheng, A.C. et al. (2007) Structure-based maximal affinity model predicts small-molecule druggability. Nat. Biotechnol., 25, 71-75
    • (2007) Nat. Biotechnol. , vol.25 , pp. 71-75
    • Cheng, A.C.1
  • 16
    • 33645265985 scopus 로고    scopus 로고
    • Clustering methods and their uses in computational chemistry
    • Downs, G. and Barnard, J. (2002) Clustering methods and their uses in computational chemistry. Rev. Comput. Chem., 18, 1-40
    • (2002) Rev. Comput. Chem. , vol.18 , pp. 1-40
    • Downs, G.1    Barnard, J.2
  • 17
    • 84976803260 scopus 로고
    • Fast Map: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets
    • ACM, New York, NY
    • Faloutsos, C. and Lin, K. (1995) Fast Map: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets. In Proceedings of the ACM SIGMOD Conference on Management of Data. ACM, New York, NY, pp. 163-174
    • (1995) Proceedings of the ACM SIGMOD Conference on Management of Data , pp. 163-174
    • Faloutsos, C.1    Lin, K.2
  • 18
    • 0034215869 scopus 로고    scopus 로고
    • Dynamic vp-tree indexing for n-nearest neighbor search given pair-wise distances
    • Fu, A. et al. (2000) Dynamic vp-tree indexing for n-nearest neighbor search given pair-wise distances. VLDB J., 9, 154-173
    • (2000) VLDB J. , vol.9 , pp. 154-173
    • Fu, A.1
  • 19
    • 0001944742 scopus 로고    scopus 로고
    • Similarity search in high dimensions via hashing
    • Morgan Kaufmann Publishers Inc., San Francisco
    • Gionis, A. et al. (1999) Similarity search in high dimensions via hashing. In Proceedings of the International Conference on Very Large Data Bases. Morgan Kaufmann Publishers Inc., San Francisco, pp. 518-529
    • (1999) Proceedings of the International Conference on Very Large Data Bases , pp. 518-529
    • Gionis, A.1
  • 20
    • 26944493065 scopus 로고    scopus 로고
    • ChemMine Acompound mining database for chemical genomics
    • Girke, T. et al. (2005) ChemMine.Acompound mining database for chemical genomics. Plant Physiol., 138, 573-577
    • (2005) Plant Physiol. , vol.138 , pp. 573-577
    • Girke, T.1
  • 21
    • 20444378945 scopus 로고    scopus 로고
    • The principle of complementarity: chemical versus biological space
    • Haggarty, S.J. (2005) The principle of complementarity: chemical versus biological space. Curr. Opin. Chem. Biol., 9, 296-303
    • (2005) Curr. Opin. Chem. Biol. , vol.9 , pp. 296-303
    • Haggarty, S.J.1
  • 22
    • 0036025426 scopus 로고    scopus 로고
    • Enhanced CACTVS browser of the open NCI database
    • Ihlenfeldt, W.D. et al. (2002) Enhanced CACTVS browser of the open NCI database. J. Chem. Inf . Comput. Sci., 42, 46-57
    • (2002) J. Chem. Inf . Comput. Sci. , vol.42 , pp. 46-57
    • Ihlenfeldt, W.D.1
  • 23
    • 13844312649 scopus 로고    scopus 로고
    • ZINC-a free database of commercially available compounds for virtual screening
    • Irwin, J.J. and Shoichet, B.K. (2005) ZINC-a free database of commercially available compounds for virtual screening. J. Chem. Inf. Model., 45, 177-182
    • (2005) J. Chem. Inf. Model. , vol.45 , pp. 177-182
    • Irwin, J.J.1    Shoichet, B.K.2
  • 24
    • 0031162081 scopus 로고    scopus 로고
    • The SR-tree: an index structure for high-dimensional nearest neighbor queries
    • ACM, New York, NY
    • Katayama, N. (1997) The SR-tree: an index structure for high-dimensional nearest neighbor queries. In Proceedings of the ACM SIGMOD Conference on Management of Data. ACM, New York, NY, pp. 369-380
    • (1997) Proceedings of the ACM SIGMOD Conference on Management of Data , pp. 369-380
    • Katayama, N.1
  • 26
    • 84955245129 scopus 로고    scopus 로고
    • Multi-probe LSH: efficient indexing for high-dimensional similarity search
    • VLDB Endowment
    • Lv, Q. et al. (2007) Multi-probe LSH: efficient indexing for high-dimensional similarity search. In Proceedings of the International Conference on Very Large Data Bases. VLDB Endowment, pp. 950-961.
    • (2007) Proceedings of the International Conference on Very Large Data Bases , pp. 950-961
    • Lv, Q.1
  • 27
    • 77951954385 scopus 로고    scopus 로고
    • NIH Chemical Genomics Center PubChem Fingerprint for JChem. Available at (last accessed date March 1, 2010)
    • NIH Chemical Genomics Center (2009) PubChem Fingerprint for JChem. Available at http://ncgc.nih.gov/resources/software.html (last accessed date March 1, 2010).
    • (2009)
  • 28
    • 0036603905 scopus 로고    scopus 로고
    • Chemical space navigation in lead discovery
    • Oprea, T.I. (2002) Chemical space navigation in lead discovery. Curr. Opin. Chem. Biol., 6, 384-389
    • (2002) Curr. Opin. Chem. Biol. , vol.6 , pp. 384-389
    • Oprea, T.I.1
  • 29
    • 34447558299 scopus 로고    scopus 로고
    • Systems chemical biology
    • Oprea, T.I. et al. (2007) Systems chemical biology. Nat. Chem. Biol., 3, 447-450
    • (2007) Nat. Chem. Biol. , vol.3 , pp. 447-450
    • Oprea, T.I.1
  • 30
    • 0037348826 scopus 로고    scopus 로고
    • Comparison of chemical clustering methods using graph-and fingerprint-based similarity measures
    • Raymond, J.W. et al. (2003) Comparison of chemical clustering methods using graph-and fingerprint-based similarity measures. J. Mol. Graph Model., 21, 421-433
    • (2003) J. Mol. Graph Model. , vol.21 , pp. 421-433
    • Raymond, J.W.1
  • 31
    • 0034704222 scopus 로고    scopus 로고
    • Nonlinear dimensionality reduction by locally linear embedding
    • Roweis, S. and Saul, L. (2000) Nonlinear dimensionality reduction by locally linear embedding. Science, 290, 2323
    • (2000) Science , vol.290 , pp. 2323
    • Roweis, S.1    Saul, L.2
  • 32
    • 3342907009 scopus 로고    scopus 로고
    • Exploring the chemogenomic knowledge space with annotated chemical libraries
    • Savchuk, N.P. et al. (2004) Exploring the chemogenomic knowledge space with annotated chemical libraries. Curr. Opin. Chem. Biol., 8, 412-417
    • (2004) Curr. Opin. Chem. Biol. , vol.8 , pp. 412-417
    • Savchuk, N.P.1
  • 33
    • 38549121773 scopus 로고    scopus 로고
    • ChemBank: a small-molecule screening and cheminformatics resource database
    • (Database issue)
    • Seiler, K.P. et al. (2008) ChemBank: a small-molecule screening and cheminformatics resource database. Nucleic Acids. Res., 36 (Database issue), 351-359
    • (2008) Nucleic Acids. Res. , vol.36 , pp. 351-359
    • Seiler, K.P.1
  • 34
    • 0036740917 scopus 로고    scopus 로고
    • Why do we need so many chemical similarity search methods?
    • Sheridan, R.P. and Kearsley, S.K. (2002) Why do we need so many chemical similarity search methods? Drug Discov. Today, 7, 903-911
    • (2002) Drug Discov. Today , vol.7 , pp. 903-911
    • Sheridan, R.P.1    Kearsley, S.K.2
  • 35
    • 33244494237 scopus 로고    scopus 로고
    • Visualization and interpretation of high content screening data
    • Smellie, A. et al. (2006) Visualization and interpretation of high content screening data. J. Chem. Inf. Model., 46, 201-207
    • (2006) J. Chem. Inf. Model. , vol.46 , pp. 201-207
    • Smellie, A.1
  • 36
    • 0037432765 scopus 로고    scopus 로고
    • From knowing to controlling: a path from genomics to drugs using small molecule probes
    • Strausberg, R.L. and Schreiber, S.L. (2003) From knowing to controlling: a path from genomics to drugs using small molecule probes. Science, 300, 294-295
    • (2003) Science , vol.300 , pp. 294-295
    • Strausberg, R.L.1    Schreiber, S.L.2
  • 37
    • 34247228558 scopus 로고    scopus 로고
    • Bounds and algorithms for fast exact searches of chemical fingerprints in linear and sub-linear time
    • Swamidass, S. and Baldi, P. (2007) Bounds and algorithms for fast exact searches of chemical fingerprints in linear and sub-linear time. J. Chem. Inf. Model., 47, 302
    • (2007) J. Chem. Inf. Model. , vol.47 , pp. 302
    • Swamidass, S.1    Baldi, P.2
  • 38
    • 0034704229 scopus 로고    scopus 로고
    • A global geometric framework for nonlinear dimensionality reduction
    • Tenenbaum, J. et al. (2000) A global geometric framework for nonlinear dimensionality reduction. Science, 290, 2319
    • (2000) Science , vol.290 , pp. 2319
    • Tenenbaum, J.1
  • 39
    • 0000662711 scopus 로고
    • An O (n logn) algorithm for the all-nearest-neighbors problem
    • Vaidya, P. (1989) An O (n logn) algorithm for the all-nearest-neighbors problem. Discrete Comput. Geom., 4, 101-115
    • (1989) Discrete Comput. Geom. , vol.4 , pp. 101-115
    • Vaidya, P.1
  • 40
    • 0000681228 scopus 로고    scopus 로고
    • A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces
    • IEEE
    • Weber, R. et al. (1998) A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In Proceedings of the International Conference on Very Large Data Bases. IEEE, pp. 194-205
    • (1998) Proceedings of the International Conference on Very Large Data Bases , pp. 194-205
    • Weber, R.1
  • 42
    • 21244468757 scopus 로고    scopus 로고
    • Searching techniques for databases of two-and three-dimensional chemical structures
    • Willett, P. (2005) Searching techniques for databases of two-and three-dimensional chemical structures. J. Med. Chem., 48, 4183-4199
    • (2005) J. Med. Chem. , vol.48 , pp. 4183-4199
    • Willett, P.1
  • 43
    • 5344244908 scopus 로고    scopus 로고
    • Chemical similarity searching
    • Willett, P. (1998) Chemical similarity searching. J. Chem. Inf. Comput. Sci., 38, 983-996
    • (1998) J. Chem. Inf. Comput. Sci. , vol.38 , pp. 983-996
    • Willett, P.1
  • 44
    • 0344686480 scopus 로고    scopus 로고
    • Nearest neighbor search in general metric spaces using a tree data structure with a simple heuristic
    • Xu, H. and Agrafiotis, D. (2003). Nearest neighbor search in general metric spaces using a tree data structure with a simple heuristic. J. Chem. Inf. Comput. Sci., 43, 1933-1941
    • (2003) J. Chem. Inf. Comput. Sci. , vol.43 , pp. 1933-1941
    • Xu, H.1    Agrafiotis, D.2
  • 45
    • 0031345518 scopus 로고    scopus 로고
    • L-BFGS-B: Fortran subroutines for large-scale bound constrained optimization
    • Zhu, C. (1997) L-BFGS-B: Fortran subroutines for large-scale bound constrained optimization. ACM Trans. Math. Softw., 23, 550-560
    • (1997) ACM Trans. Math. Softw. , vol.23 , pp. 550-560
    • Zhu, C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.