메뉴 건너뛰기




Volumn 52, Issue 4, 2012, Pages 891-900

Speeding up chemical searches using the inverted index: The convergence of chemoinformatics and text search methods

Author keywords

[No Author keywords available]

Indexed keywords

INDEXING (OF INFORMATION); INFORMATION RETRIEVAL;

EID: 84862022229     PISSN: 15499596     EISSN: 1549960X     Source Type: Journal    
DOI: 10.1021/ci200552r     Document Type: Article
Times cited : (20)

References (35)
  • 1
    • 70349305672 scopus 로고    scopus 로고
    • Study on the reaction mechanism and kinetics of the thermal decomposition of nitroethane
    • Wang, Q.; Ng, D.; Mannan, M. S. Study on the Reaction Mechanism and Kinetics of the Thermal Decomposition of Nitroethane. Ind. Eng. Chem. Res. 2009, 48, 8745-8751.
    • (2009) Ind. Eng. Chem. Res. , vol.48 , pp. 8745-8751
    • Wang, Q.1    Ng, D.2    Mannan, M.S.3
  • 2
    • 27944507949 scopus 로고    scopus 로고
    • ChemDB: A public database of small molecules and related chemoinformatics resources
    • DOI 10.1093/bioinformatics/bti683
    • Chen, J.; Swamidass, S. J.; Dou, Y.; Bruand, J.; Baldi, P. ChemDB: a Public Database of Small Molecules and Related Chemoinformatics Resources. Bioinformatics 2005, 21, 4133-4139. (Pubitemid 41672103)
    • (2005) Bioinformatics , vol.21 , Issue.22 , pp. 4133-4139
    • Chen, J.1    Swamidass, S.J.2    Dou, Y.3    Bruand, J.4    Baldi, P.5
  • 3
    • 80052899880 scopus 로고    scopus 로고
    • SketchSort: Fast all pairs similarity search for large databases of molecular fingerprints
    • Tabei, Y.; Tsuda, K. SketchSort: Fast All Pairs Similarity Search for Large Databases of Molecular Fingerprints. Mol. Inf. 2011, 30, 801-807.
    • (2011) Mol. Inf. , vol.30 , pp. 801-807
    • Tabei, Y.1    Tsuda, K.2
  • 5
    • 0036567220 scopus 로고    scopus 로고
    • A modification of the Jaccard-Tanimoto similarity index for diverse selection of chemical compounds using binary strings
    • DOI 10.1198/004017002317375064
    • Fligner, M. A.; Verducci, J. S.; Blower, P. E. A Modification of the Jaccard/Tanimoto Similarity Index for Diverse Selection of Chemical Compounds Using Binary Strings. Technometrics 2002, 44, 110-119. (Pubitemid 34537163)
    • (2002) Technometrics , vol.44 , Issue.2 , pp. 110-119
    • Fligner, M.A.1    Verducci, J.S.2    Blower, P.E.3
  • 6
    • 0001232509 scopus 로고    scopus 로고
    • On the properties of bit string-based measures of chemical similarity
    • Flower, D. R. On the Properties of Bit String-Based Measures of Chemical Similarity. J. Chem. Inf. Comput. Sci. 1998, 38, 379-386. (Pubitemid 128594448)
    • (1998) Journal of Chemical Information and Computer Sciences , vol.38 , Issue.3 , pp. 379-386
    • Flower, D.R.1
  • 8
    • 0043201432 scopus 로고    scopus 로고
    • Profile scaling increases the similarity search performance of molecular fingerprints containing numerical descriptors and structural keys
    • Xue, L.; Godden, J. F.; Stahura, F. L.; Bajorath, J. Profile Scaling Increases the Similarity Search Performance of Molecular Fingerprints Containing Numerical Descriptors and Structural Keys. J. Chem. Inf. Comput. Sci. 2003, 43, 1218-1225.
    • (2003) J. Chem. Inf. Comput. Sci. , vol.43 , pp. 1218-1225
    • Xue, L.1    Godden, J.F.2    Stahura, F.L.3    Bajorath, J.4
  • 9
    • 10044240762 scopus 로고    scopus 로고
    • Similarity search profiling reveals effects of fingerprint scaling in virtual screening
    • Xue, L.; Stahura, F. L.; Bajorath, J. Similarity Search Profiling Reveals Effects of Fingerprint Scaling in Virtual Screening. J. Chem. Inf. Comput. Sci. 2004, 44, 2032-2039.
    • (2004) J. Chem. Inf. Comput. Sci. , vol.44 , pp. 2032-2039
    • Xue, L.1    Stahura, F.L.2    Bajorath, J.3
  • 10
    • 37249011239 scopus 로고    scopus 로고
    • Lossless compression of chemical fingerprints using integer entropy codes improves storage and retrieval
    • DOI 10.1021/ci700200n
    • Baldi, P.; Benz, R. W.; Hirschberg, D.; Swamidass, S. Lossless Compression of Chemical Fingerprints Using Integer Entropy Codes Improves Storage and Retrieval. J. Chem. Inf. Model. 2007, 47, 2098-2109. (Pubitemid 350275076)
    • (2007) Journal of Chemical Information and Modeling , vol.47 , Issue.6 , pp. 2098-2109
    • Baldi, P.1    Benz, R.W.2    Hirschberg, D.S.3    Swamidass, S.J.4
  • 11
    • 0036249270 scopus 로고    scopus 로고
    • Grouping of coefficients for the calculation of inter-molecular similarity and dissimilarity using 2D fragment bit-strings
    • Holliday, J. D.; Hu, C. Y.; Willett, P. Grouping of Coefficients for the Calculation of Inter-Molecular Similarity and Dissimilarity Using 2D Fragment Bit-Strings. Comb. Chem. High. Throughput Screen. 2002, 5, 155-66. (Pubitemid 34475167)
    • (2002) Combinatorial Chemistry and High Throughput Screening , vol.5 , Issue.2 , pp. 155-166
    • Holliday, J.D.1    Hu, C.-Y.2    Willett, P.3
  • 12
    • 74049146045 scopus 로고    scopus 로고
    • Large scale study of multiple-molecule queries
    • online, accessed Apr. 2012
    • Nasr, R.; Swamidass, S. J.; Baldi, P. Large Scale Study of Multiple-Molecule Queries. J. Cheminf. [online] 2009, 1, No. article 7, http://www.jcheminf.com/content/1/1/7 (accessed Apr. 2012).
    • (2009) J. Cheminf. , vol.1 , pp. 7
    • Nasr, R.1    Swamidass, S.J.2    Baldi, P.3
  • 13
    • 34250851446 scopus 로고    scopus 로고
    • Mathematical correction for fingerprint similarity measures to improve chemical retrieval
    • DOI 10.1021/ci600526a
    • Swamidass, S. J.; Baldi, P. Mathematical Correction for Fingerprint Similarity Measures to Improve Chemical Retrieval. J. Chem. Inf. Model. 2007, 47, 952-964. (Pubitemid 46973710)
    • (2007) Journal of Chemical Information and Modeling , vol.47 , Issue.3 , pp. 952-964
    • Swamidass, S.J.1    Baldi, P.2
  • 14
    • 49449099341 scopus 로고    scopus 로고
    • Speeding up chemical database searches using a proximity filter based on the logical exclusive OR
    • Baldi, P.; Hirschberg, D. S.; Nasr, R. J. Speeding up Chemical Database Searches Using a Proximity Filter Based on the Logical Exclusive OR. J. Chem. Inf. Model. 2008, 48, 1367-1378.
    • (2008) J. Chem. Inf. Model. , vol.48 , pp. 1367-1378
    • Baldi, P.1    Hirschberg, D.S.2    Nasr, R.J.3
  • 15
    • 34247228558 scopus 로고    scopus 로고
    • Bounds and algorithms for fast exact searches of chemical fingerprints in linear and sublinear time
    • DOI 10.1021/ci600358f
    • Swamidass, S. J.; Baldi, P. Bounds and Algorithms for Fast Exact Searches of Chemical Fingerprints in Linear and Sublinear Time. J. Chem. Inf. Model. 2007, 47, 302-317. (Pubitemid 46615935)
    • (2007) Journal of Chemical Information and Modeling , vol.47 , Issue.2 , pp. 302-317
    • Swamidass, S.J.1    Baldi, P.2
  • 16
    • 0015531930 scopus 로고
    • Some approaches to best-match file searching
    • Burkhard, W.; Keller, R. Some Approaches to Best-Match File Searching. Commun. ACM 1973, 16, 230-236.
    • (1973) Commun. ACM , vol.16 , pp. 230-236
    • Burkhard, W.1    Keller, R.2
  • 17
    • 0017494854 scopus 로고
    • The choice of reference points in best-match file searching
    • Shapiro, M. The choice of Reference Points in Best-Match File Searching. Commun. ACM 1977, 20, 339-343.
    • (1977) Commun. ACM , vol.20 , pp. 339-343
    • Shapiro, M.1
  • 18
    • 69549086567 scopus 로고    scopus 로고
    • An intersection inequality sharper than the tanimoto triangle inequality for efficiently searching large databases
    • Baldi, P.; Hirschberg, D. S. An Intersection Inequality Sharper than the Tanimoto Triangle Inequality for Efficiently Searching Large Databases. J. Chem. Inf. Model. 2009, 49, 1866-1870.
    • (2009) J. Chem. Inf. Model. , vol.49 , pp. 1866-1870
    • Baldi, P.1    Hirschberg, D.S.2
  • 19
    • 79952112725 scopus 로고    scopus 로고
    • Hashing algorithms and data structures for rapid searches of fingerprint vectors
    • Nasr, R.; Hirschberg, D. S.; Baldi, P. Hashing Algorithms and Data Structures for Rapid Searches of Fingerprint Vectors. J. Chem. Inf. Model. 2010, 50, 1358-1368.
    • (2010) J. Chem. Inf. Model. , vol.50 , pp. 1358-1368
    • Nasr, R.1    Hirschberg, D.S.2    Baldi, P.3
  • 20
    • 80052893485 scopus 로고    scopus 로고
    • Tree and hashing data structures to speed up chemical searches: Analysis and experiments
    • Nasr, R.; Kristensen, T.; Baldi, P. Tree and Hashing Data Structures to Speed up Chemical Searches: Analysis and Experiments. Mol. Inf. 2011, 30, 791-800.
    • (2011) Mol. Inf. , vol.30 , pp. 791-800
    • Nasr, R.1    Kristensen, T.2    Baldi, P.3
  • 23
    • 84983388436 scopus 로고
    • The binary vector as the basis of an inverted index file
    • King, D. The Binary Vector as the Basis of an Inverted Index File. J. Libr. Autom. 1974, 7, 307-14.
    • (1974) J. Libr. Autom. , vol.7 , pp. 307-314
    • King, D.1
  • 24
    • 33947482764 scopus 로고
    • Searching X-ray diffraction powder data with an inverted coordinate index
    • Matthews, F. Searching X-Ray Diffraction Powder Data with an Inverted Coordinate Index. J. Chem. Doc. 1963, 3, 213-216.
    • (1963) J. Chem. Doc. , vol.3 , pp. 213-216
    • Matthews, F.1
  • 25
    • 0041838759 scopus 로고
    • Organic search and display using a connectivity matrix derived from wiswesser notation
    • Thomson, L. H.; Hyde, E.; Matthews, F. W. Organic Search and Display using a Connectivity Matrix Derived from Wiswesser Notation. J. Chem. Doc. 1967, 7, 204-209.
    • (1967) J. Chem. Doc. , vol.7 , pp. 204-209
    • Thomson, L.H.1    Hyde, E.2    Matthews, F.W.3
  • 26
    • 4143080103 scopus 로고
    • An integrated chemical structure storage and search system operating at du pont
    • Hoffman, W. S. An Integrated Chemical Structure Storage and Search System Operating at Du Pont. J. Chem. Doc. 1968, 8, 3-13.
    • (1968) J. Chem. Doc. , vol.8 , pp. 3-13
    • Hoffman, W.S.1
  • 27
    • 0002728955 scopus 로고
    • Implementation of nearest-neighbor searching in an online chemical structure search system
    • Willett, P.; Winterman, V.; Bawden, D. Implementation of Nearest-Neighbor Searching in an Online Chemical Structure Search System. J. Chem. Inf. Comput. Sci. 1986, 26, 36-41.
    • (1986) J. Chem. Inf. Comput. Sci. , vol.26 , pp. 36-41
    • Willett, P.1    Winterman, V.2    Bawden, D.3
  • 28
  • 31
    • 78049429864 scopus 로고    scopus 로고
    • When is chemical similarity significant? The statistical distribution of chemical similarity scores and its extreme values
    • Baldi, P.; Nasr, R. When is Chemical Similarity Significant? The Statistical Distribution of Chemical Similarity Scores and Its Extreme Values. J. Chem. Inf. Model. 2010, 50, 1205-1222.
    • (2010) J. Chem. Inf. Model. , vol.50 , pp. 1205-1222
    • Baldi, P.1    Nasr, R.2
  • 32
    • 77952772341 scopus 로고    scopus 로고
    • Extended-connectivity fingerprints
    • Rogers, D.; Hahn, M. Extended-Connectivity Fingerprints. J. Chem. Inf. Model. 2010, 50, 742-754.
    • (2010) J. Chem. Inf. Model. , vol.50 , pp. 742-754
    • Rogers, D.1    Hahn, M.2
  • 33
    • 33749598013 scopus 로고    scopus 로고
    • Cheminformatics analysis and learning in a data pipelining environment
    • DOI 10.1007/s11030-006-9041-5
    • Hassan, M.; Brown, R. D.; Varma-O'Brien, S.; Rogers, D. Cheminformatics Analysis and Learning in a Data Pipelining Environment. Mol. Diversity 2006, 10, 283-299. (Pubitemid 44546683)
    • (2006) Molecular Diversity , vol.10 , Issue.3 , pp. 283-299
    • Hassan, M.1    Brown, R.D.2    Varma-O'Brien, S.3    Rogers, D.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.