메뉴 건너뛰기




Volumn 2, Issue , 2012, Pages 33-64

Developing Best Practices for Descriptor-based Property Prediction: Appropriate Matching of Datasets, Descriptors, Methods, and Expectations

Author keywords

Linear free energy relationship (LFER); Multiple linear regression (MLR); Property encoded shape distributions (PESD) descriptors; Quantitative structure activity relationship (QSAR); Root mean squared error (RMSE); Ultrafast shape recognition (USR)

Indexed keywords


EID: 84876581653     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1002/9783527645121.ch2     Document Type: Chapter
Times cited : (13)

References (153)
  • 1
    • 77950635141 scopus 로고
    • On the connection between chemical constitution and physiological action; with special reference to the physiological action of the salts of the ammonium bases derived from strychnia, Brucia, Thebaia, Codeia, Morphia, and Nicotia
    • Brown, A.C. and Fraser, T.R. (1868) On the connection between chemical constitution and physiological action; with special reference to the physiological action of the salts of the ammonium bases derived from strychnia, Brucia, Thebaia, Codeia, Morphia, and Nicotia. Journal of Anatomy and Physiology, 2 (2), 224-242.
    • (1868) Journal of Anatomy and Physiology , vol.2 , Issue.2 , pp. 224-242
    • Brown, A.C.1    Fraser, T.R.2
  • 4
    • 58149095139 scopus 로고    scopus 로고
    • Promises and pitfalls of quantitative structure- activity relationship approaches for predicting metabolism and toxicity
    • Zvinavashe, E., Murk, A.J., and Rietjens, I.M.C.M. (2008) Promises and pitfalls of quantitative structure- activity relationship approaches for predicting metabolism and toxicity. Chemical Research in Toxicology, 21 (12), 2229-2236.
    • (2008) Chemical Research in Toxicology , vol.21 , Issue.12 , pp. 2229-2236
    • Zvinavashe, E.1    Murk, A.J.2    Rietjens, I.M.C.M.3
  • 7
    • 84886083264 scopus 로고    scopus 로고
    • Frontmatter, John Wiley & Sons, Inc, New York
    • Hochberg, Y. and Tamhane, A.C. (2008) Frontmatter, John Wiley & Sons, Inc, New York.
    • (2008)
    • Hochberg, Y.1    Tamhane, A.C.2
  • 8
    • 14644390912 scopus 로고    scopus 로고
    • Using AUC and accuracy in evaluating learning algorithms
    • IEEE Transactions on
    • Jin, H. and Ling, C.X. (2005) Using AUC and accuracy in evaluating learning algorithms. Knowledge and Data Engineering, IEEE Transactions on, 17 (3), 299-310.
    • (2005) Knowledge and Data Engineering , vol.17 , Issue.3 , pp. 299-310
    • Jin, H.1    Ling, C.X.2
  • 9
  • 11
    • 34250628103 scopus 로고    scopus 로고
    • Principles of QSAR models validation: Internal and external
    • Gramatica, P. (2007) Principles of QSAR models validation: Internal and external. QSAR & Combinatorial Science, 26 (5), 694-701.
    • (2007) QSAR & Combinatorial Science , vol.26 , Issue.5 , pp. 694-701
    • Gramatica, P.1
  • 12
    • 67949118928 scopus 로고    scopus 로고
    • Hownot to develop a quantitative structure-activity or structure-property relationship (QSAR/ QSPR)
    • Dearden, J.C., Cronin, M.T.D., and Kaiser, K.L.E. (2009)Hownot to develop a quantitative structure-activity or structure-property relationship (QSAR/ QSPR). SAR and QSAR in Environmental Research, 20 (3/4), 241-266.
    • (2009) SAR and QSAR in Environmental Research , vol.20 , Issue.3-4 , pp. 241-266
    • Dearden, J.C.1    Cronin, M.T.D.2    Kaiser, K.L.E.3
  • 14
    • 42749092988 scopus 로고    scopus 로고
    • The importance of the domain of applicability in QSAR modeling
    • Weaver, S. and Gleeson, M.P. (2008) The importance of the domain of applicability in QSAR modeling. Journal of Molecular Graphics and Modelling, 26 (8), 1315-1326.
    • (2008) Journal of Molecular Graphics and Modelling , vol.26 , Issue.8 , pp. 1315-1326
    • Weaver, S.1    Gleeson, M.P.2
  • 15
    • 77956964002 scopus 로고    scopus 로고
    • Best practices for QSAR model development, validation, and exploitation
    • Tropsha, A. (2010) Best practices for QSAR model development, validation, and exploitation. Molecular Informatics, 29 (6-7), 476-488.
    • (2010) Molecular Informatics , vol.29 , Issue.6-7 , pp. 476-488
    • Tropsha, A.1
  • 16
    • 0036589313 scopus 로고    scopus 로고
    • Predictive QSAR modeling based on diversity sampling of experimental datasets for the training and test set selection
    • Golbraikh, A. and Tropsha, A. (2002) Predictive QSAR modeling based on diversity sampling of experimental datasets for the training and test set selection. Journal of Computer-Aided Molecular Design, 16 (5), 357-369.
    • (2002) Journal of Computer-Aided Molecular Design , vol.16 , Issue.5 , pp. 357-369
    • Golbraikh, A.1    Tropsha, A.2
  • 18
    • 0004797467 scopus 로고
    • Exploring QSAR: Fundamentals and applications in chemistry and biology
    • Hansch, C. and Leo, A. (1995) Exploring QSAR: Fundamentals and applications in chemistry and biology. American Chemical Society, 1, 1-557.
    • (1995) American Chemical Society , vol.1 , pp. 1-557
    • Hansch, C.1    Leo, A.2
  • 19
    • 33746928751 scopus 로고    scopus 로고
    • Local lazy regression: Making use of the neighborhood to improve QSAR predictions
    • Guha, R., Dutta, D., Jurs, P.C., and Chen, T. (2006) Local lazy regression: Making use of the neighborhood to improve QSAR predictions. Journal of Chemical Information and Modeling, 46 (4), 1836-1847.
    • (2006) Journal of Chemical Information and Modeling , vol.46 , Issue.4 , pp. 1836-1847
    • Guha, R.1    Dutta, D.2    Jurs, P.C.3    Chen, T.4
  • 20
    • 31544483641 scopus 로고    scopus 로고
    • Classification ensembles for unbalanced class sizes in predictive toxicology
    • Chen, J.J., Tsai, C.A., Young, J.F., and Kodell, R.L. (2005) Classification ensembles for unbalanced class sizes in predictive toxicology. SAR and QSAR in Environmental Research, 16 (6), 517-529.
    • (2005) SAR and QSAR in Environmental Research , vol.16 , Issue.6 , pp. 517-529
    • Chen, J.J.1    Tsai, C.A.2    Young, J.F.3    Kodell, R.L.4
  • 23
    • 33947453229 scopus 로고
    • Polar and steric substituent constants for aliphatic and obenzoate groups from rates of esterification and hydrolysis of esters
    • Taft, R.W. (1952) Polar and steric substituent constants for aliphatic and obenzoate groups from rates of esterification and hydrolysis of esters. Journal of the American Chemical Society, 74, 3120-3128.
    • (1952) Journal of the American Chemical Society , vol.74 , pp. 3120-3128
    • Taft, R.W.1
  • 24
    • 0005910987 scopus 로고
    • The correlation of biological activity of plant growth regulators and chloromycetin derivatives withHammett constants and partition coefficients
    • Hansch, C., Muir, R.M., Fujita, T., Maloney, P.P., Geiger, F., and Streich, M. (1963) The correlation of biological activity of plant growth regulators and chloromycetin derivatives withHammett constants and partition coefficients. Journal of the American Chemical Society, 85, 2817-2824.
    • (1963) Journal of the American Chemical Society , vol.85 , pp. 2817-2824
    • Hansch, C.1    Muir, R.M.2    Fujita, T.3    Maloney, P.P.4    Geiger, F.5    Streich, M.6
  • 25
    • 0035263818 scopus 로고    scopus 로고
    • Chem-bioinformatics and QSAR:Areview of QSARlacking positive hydrophobic terms
    • Hansch, C., Kurup, A.,Garg, R., andGao, H. (2001) Chem-bioinformatics and QSAR:Areview of QSARlacking positive hydrophobic terms. Chemical Reviews, 101 (3), 619-672.
    • (2001) Chemical Reviews , vol.101 , Issue.3 , pp. 619-672
    • Hansch, C.1    Kurup A.Garg, R.2    Gao, H.3
  • 26
    • 0346207528 scopus 로고    scopus 로고
    • Comparative QSAR studies on anti-HIV drugs
    • Garg, R., Gupta, S.P., Gao, H., Babu, M.S., Debnath, A.K., and Hansch, C. (1999) Comparative QSAR studies on anti-HIV drugs. Chemical Reviews, 99, 3525-3601.Garg, R., Gupta, S.P., Gao, H., Babu, M.S.
    • (1999) Chemical Reviews , vol.99 , pp. 3525-3601
    • Debnath, A.K.1    Hansch, C.2
  • 27
    • 44249108095 scopus 로고    scopus 로고
    • Utilizing high throughput screening data for predictive toxicology models: Protocols and application toMLSCNassays
    • Guha, R. and Schürer, S. (2008) Utilizing high throughput screening data for predictive toxicology models: Protocols and application toMLSCNassays. Journal of Computer-Aided Molecular Design, 22 (6), 367-384.
    • (2008) Journal of Computer-Aided Molecular Design , vol.22 , Issue.6 , pp. 367-384
    • Guha, R.1    Schürer, S.2
  • 28
    • 71849099675 scopus 로고    scopus 로고
    • PubChem BioAssays as a data source for predictive models
    • Chen, B. and Wild, D.J. (2010) PubChem BioAssays as a data source for predictive models. Journal of MolecularGraphics and Modelling, 28 (5), 420-426.
    • (2010) Journal of MolecularGraphics and Modelling , vol.28 , Issue.5 , pp. 420-426
    • Chen, B.1    Wild, D.J.2
  • 29
    • 27444447278 scopus 로고    scopus 로고
    • Biospectra analysis: Model proteome characterization for linking molecular structure and biological response
    • Fliri, A.F., Loging, W.T., Thadeio, P.F., and Volkmann, R.A. (2005) Biospectra analysis: Model proteome characterization for linking molecular structure and biological response. Journal of Medicinal Chemistry, 48, 6918-6925.
    • (2005) Journal of Medicinal Chemistry , vol.48 , pp. 6918-6925
    • Fliri, A.F.1    Loging, W.T.2    Thadeio, P.F.3    Volkmann, R.A.4
  • 32
    • 33646561524 scopus 로고    scopus 로고
    • Complex graph matrix representations and characterizations of proteomic maps and chemically induced changes to proteomes
    • Balasubramanian, K., Khokhani, K., and Basak, S.C. (2006) Complex graph matrix representations and characterizations of proteomic maps and chemically induced changes to proteomes. Journal of Proteome Research, 5, 1133-1142.
    • (2006) Journal of Proteome Research , vol.5 , pp. 1133-1142
    • Balasubramanian, K.1    Khokhani, K.2    Basak, S.C.3
  • 35
    • 77957232531 scopus 로고    scopus 로고
    • Recent advances in QSAR studies: methods and applications
    • eds T. Puzyn, J. Leszczynski, and M.T. Cronin), Springer, Berlin
    • Puzyn, T., Leszczynski, J., and Cronin, M.T. (2009) Recent advances in QSAR studies: methods and applications, in Recent Advances inQSARStudies:Methods and Applications, vol. 1 (eds T. Puzyn, J. Leszczynski, and M.T. Cronin), Springer, Berlin, pp. 383-403.
    • (2009) in Recent Advances inQSARStudies:Methods and Applications , vol.1 , pp. 1383-403
    • Puzyn, T.1    Leszczynski, J.2    Cronin, M.T.3
  • 37
    • 85047683680 scopus 로고    scopus 로고
    • In silico prediction of ADMET properties: How far have we come?
    • Dearden, J.C. (2007) In silico prediction of ADMET properties: How far have we come? Expert Opinion on DrugMetabolism & Toxicology, 3 (5), 635-639.
    • (2007) Expert Opinion on DrugMetabolism & Toxicology , vol.3 , Issue.5 , pp. 635-639
    • Dearden, J.C.1
  • 38
  • 39
    • 84886028965 scopus 로고
    • Information Theoretic Indices for Characterization of Chemical Structures, Research Studies Press, Chichester, UK
    • Bonchev, D. (1983) Information Theoretic Indices for Characterization of Chemical Structures, Research Studies Press, Chichester, UK.
    • (1983)
    • Bonchev, D.1
  • 40
    • 77958091914 scopus 로고    scopus 로고
    • A history of graph entropy measures
    • Dehmer, M. and Mowshowitz, A. (2011) A history of graph entropy measures. Information Sciences, 181 (1), 57-78.
    • (2011) Information Sciences , vol.181 , Issue.1 , pp. 57-78
    • Dehmer, M.1    Mowshowitz, A.2
  • 42
    • 1042265247 scopus 로고    scopus 로고
    • Approaches to measure chemical similarity - a review
    • Nikolova, N. and Jaworska, J. (2003) Approaches to measure chemical similarity - a review. The QSAR and Combinatorial Science, 22, 1006-1026.
    • (2003) The QSAR and Combinatorial Science , vol.22 , pp. 1006-1026
    • Nikolova, N.1    Jaworska, J.2
  • 46
    • 0000224701 scopus 로고    scopus 로고
    • The coding of the threedimensional structure of molecules by molecular transforms and its application to structure-spectra correlations and studies of biological activity
    • Schuur, J.H., Selzer, P., and Gasteiger, J. (1996) The coding of the threedimensional structure of molecules by molecular transforms and its application to structure-spectra correlations and studies of biological activity. Journal of Chemical Information and Computer Sciences, 36 (2), 334-344.
    • (1996) Journal of Chemical Information and Computer Sciences , vol.36 , Issue.2 , pp. 334-344
    • Schuur, J.H.1    Selzer, P.2    Gasteiger, J.3
  • 51
    • 34547260921 scopus 로고    scopus 로고
    • Ultrafast shape recognition to search compound databases for similar molecular shapes
    • Ballester, P.J. and Richards, W.G. (2007) Ultrafast shape recognition to search compound databases for similar molecular shapes. Journal of Computational Chemistry, 28, 1711-1723.
    • (2007) Journal of Computational Chemistry , vol.28 , pp. 1711-1723
    • Ballester, P.J.1    Richards, W.G.2
  • 53
    • 61449202735 scopus 로고    scopus 로고
    • Ultrafast shape recognition: Evaluating a new ligand-based virtual screening technology
    • Ballester, P.J., Finn, P.W., and Richards, W.G. (2009) Ultrafast shape recognition: Evaluating a new ligand-based virtual screening technology. Journal of Molecular Graphics & Modelling, 27, 836-845.
    • (2009) Journal of Molecular Graphics & Modelling , vol.27 , pp. 836-845
    • Ballester, P.J.1    Finn, P.W.2    Richards, W.G.3
  • 54
    • 2942592378 scopus 로고    scopus 로고
    • QSAR and QSPR based solely on surface properties?
    • Clark, T. (2004) QSAR and QSPR based solely on surface properties? Journal of Molecular Graphics & Modelling, 22, 519-525.
    • (2004) Journal of Molecular Graphics & Modelling , vol.22 , pp. 519-525
    • Clark, T.1
  • 55
    • 0035498339 scopus 로고    scopus 로고
    • Development of quantitative structure-property relationship models for early ADME evaluation in drug discovery: Part 2
    • Liu, R., Sun, H., and So, S.S. (2001) Development of quantitative structure- property relationship models for early ADME evaluation in drug discovery: Part 2. Blood-brain barrier penetration. Journal of Chemical Information and Computer Sciences, 41 (6), 1623-1632.
    • (2001) Blood-brain barrier penetration. Journal of Chemical Information and Computer Sciences , vol.41 , Issue.6 , pp. 1623-1632
    • Liu, R.1    Sun, H.2    So, S.S.3
  • 56
    • 54949132575 scopus 로고    scopus 로고
    • QSPR prediction of pKa for benzoic acids in different solvents
    • Jover, J., Bosque, R., and Sales, J. (2008) QSPR prediction of pKa for benzoic acids in different solvents. QSAR and Combinatorial Science, 27 (5), 563-581.
    • (2008) QSAR and Combinatorial Science , vol.27 , Issue.5 , pp. 563-581
    • Jover, J.1    Bosque, R.2    Sales, J.3
  • 57
    • 70350367267 scopus 로고    scopus 로고
    • Relationship between structure and permeability in artificial membranes: Theoretical whole molecule descriptors in development of QSAR models
    • Tulp, I., Sild, S., and Maran, U. (2009) Relationship between structure and permeability in artificial membranes: Theoretical whole molecule descriptors in development of QSAR models. QSAR and Combinatorial Science, 28 (8), 811-814.
    • (2009) QSAR and Combinatorial Science , vol.28 , Issue.8 , pp. 811-814
    • Tulp, I.1    Sild, S.2    Maran, U.3
  • 58
    • 54249131345 scopus 로고    scopus 로고
    • Benchmarking the reliability of QikProp: Correlation between experimental and predicted values
    • Ioakimidis, L., Thoukydidis, L., Mirza, A., Naeem, S., and Reynisson, J. (2008) Benchmarking the reliability of QikProp: Correlation between experimental and predicted values. QSAR and Combinatorial Science, 27 (4), 445-456.
    • (2008) QSAR and Combinatorial Science , vol.27 , Issue.4 , pp. 445-456
    • Ioakimidis, L.1    Thoukydidis, L.2    Mirza, A.3    Naeem, S.4    Reynisson, J.5
  • 59
    • 73349097886 scopus 로고    scopus 로고
    • Rapid comparison of protein binding site surfaces with property encoded shape distributions
    • Das, S., Kokardekar, A., and Breneman, C.M. (2009) Rapid comparison of protein binding site surfaces with property encoded shape distributions. Journal of Chemical Information and Modeling, 49 (12), 2863-2872.
    • (2009) Journal of Chemical Information and Modeling , vol.49 , Issue.12 , pp. 2863-2872
    • Das, S.1    Kokardekar, A.2    Breneman, C.M.3
  • 60
    • 84886076412 scopus 로고
    • Shape in Chemistry, Wiley-VCH, New York
    • Mezey, P.G. (1993) Shape in Chemistry, Wiley-VCH, New York.
    • (1993)
    • Mezey, P.G.1
  • 61
    • 0000033460 scopus 로고    scopus 로고
    • The holographic electron density theorem and quantum similarity measures
    • Mezey, P.G. (1999) The holographic electron density theorem and quantum similarity measures. Molecular Physics, 96 (2), 169-178.
    • (1999) Molecular Physics , vol.96 , Issue.2 , pp. 169-178
    • Mezey, P.G.1
  • 62
    • 0028321320 scopus 로고
    • Ab initio quality electron densities for proteins: A MEDLA approach
    • Walker, P.D. and Mezey, P.G. (1994) Ab initio quality electron densities for proteins: A MEDLA approach. Journal of the American Chemical Society, 116, 12022-12032.
    • (1994) Journal of the American Chemical Society , vol.116 , pp. 12022-12032
    • Walker, P.D.1    Mezey, P.G.2
  • 63
    • 0000886672 scopus 로고
    • Electron density modeling of large systems using the transferable atom equivalent method
    • Breneman, C.M., Thompson, T.R., Rhem, M., and Dung, M. (1995) Electron density modeling of large systems using the transferable atom equivalent method. Computers and Chemistry, 19 (3), 161.
    • (1995) Computers and Chemistry , vol.19 , Issue.3 , pp. 161
    • Breneman, C.M.1    Thompson, T.R.2    Rhem, M.3    Dung, M.4
  • 67
    • 0345735309 scopus 로고    scopus 로고
    • Shape signatures, a new approach to computer-aided ligandand receptor-based drug design
    • Zauhar, R.J., Moyna, G., Tian, L., Li, Z., and Welsh, W.J. (2003) Shape signatures, a new approach to computer-aided ligandand receptor-based drug design. Journal of Medicinal Chemistry, 46, 5674-5690.
    • (2003) Journal of Medicinal Chemistry , vol.46 , pp. 5674-5690
    • Zauhar, R.J.1    Moyna, G.2    Tian, L.3    Li, Z.4    Welsh, W.J.5
  • 68
    • 0034710718 scopus 로고    scopus 로고
    • GRid- INdependent descriptors (GRIND): A novel class of alignment-independent three-dimensional molecular descriptors
    • Pastor, M., Cruciani, G., McLay, I., Pickett, S., and Clementi, S. (2000) GRid- INdependent descriptors (GRIND): A novel class of alignment-independent three-dimensional molecular descriptors. Journal of Medicinal Chemistry, 43, 3233-3243.
    • (2000) Journal of Medicinal Chemistry , vol.43 , pp. 3233-3243
    • Pastor, M.1    Cruciani, G.2    McLay, I.3    Pickett, S.4    Clementi, S.5
  • 69
    • 0034625096 scopus 로고    scopus 로고
    • Molecular fields in quantitative structure-permeation relationships: The VolSurf approach
    • Cruciani, G., Crivori, P., Carrupt, P.A., and Testa, B. (2000) Molecular fields in quantitative structure-permeation relationships: The VolSurf approach. Journal of Molecular Structure: THEOCHEM, 503 (1-2), 17.
    • (2000) Journal of Molecular Structure: THEOCHEM , vol.503 , Issue.1-2 , pp. 17
    • Cruciani, G.1    Crivori, P.2    Carrupt, P.A.3    Testa, B.4
  • 70
    • 0018709674 scopus 로고
    • Chance factors in studies of quantitative- structure property relationships
    • Topliss, J.G. and Edwards, R.P. (1979) Chance factors in studies of quantitative- structure property relationships. Journal of Medicinal Chemistry, 22, 1238-1244.
    • (1979) Journal of Medicinal Chemistry , vol.22 , pp. 1238-1244
    • Topliss, J.G.1    Edwards, R.P.2
  • 72
    • 84886070416 scopus 로고
    • Multivariate Data Analysis, Wiley, New York
    • Cooley, W., Lohnes, P., and Analysis, M.D. (1971) Multivariate Data Analysis, Wiley, New York.
    • (1971)
    • Cooley, W.1    Lohnes, P.2    Analysis, M.D.3
  • 73
    • 84886054112 scopus 로고    scopus 로고
    • Modern Multidimensional Scaling: Theory and Applications, Springer, New York
    • Borg, I. and Groenen, P.J.F. (1997) Modern Multidimensional Scaling: Theory and Applications, Springer, New York.
    • (1997)
    • Borg, I.1    Groenen, P.J.F.2
  • 74
    • 84887006810 scopus 로고
    • A nonlinear mapping for data structure analysis
    • Sammon, J.W. (1969) A nonlinear mapping for data structure analysis. IEEE Transactions on Computers, 18, 401-409.
    • (1969) IEEE Transactions on Computers , vol.18 , pp. 401-409
    • Sammon, J.W.1
  • 78
    • 0000181046 scopus 로고    scopus 로고
    • Nonlinear mapping of massive data sets by fuzzy clustering and neural networks
    • Rassokhin, D.N., Lobanov, V.S., and Agrafiotis, D.K. (2001) Nonlinear mapping of massive data sets by fuzzy clustering and neural networks. Journal of Computational Chemistry, 22 (4), 373-386.
    • (2001) Journal of Computational Chemistry , vol.22 , Issue.4 , pp. 373-386
    • Rassokhin, D.N.1    Lobanov, V.S.2    Agrafiotis, D.K.3
  • 79
    • 0035871891 scopus 로고    scopus 로고
    • Multidimensional scaling and visualization of large molecular similarity tables
    • Agrafiotis, D.K., Rassokhin, D.N., and Lobanov, V.S. (2001) Multidimensional scaling and visualization of large molecular similarity tables. Journal of Computational Chemistry, 22 (5), 488-500.
    • (2001) Journal of Computational Chemistry , vol.22 , Issue.5 , pp. 488-500
    • Agrafiotis, D.K.1    Rassokhin, D.N.2    Lobanov, V.S.3
  • 80
    • 0034704229 scopus 로고    scopus 로고
    • A global geometric framework for nonlinear dimensionality reduction
    • Tenenbaum, J.B., Silva, V.d., and Langford, J.C. (2000) A global geometric framework for nonlinear dimensionality reduction. Science, 290 (5500), 2319-2323.
    • (2000) Science , vol.290 , Issue.5500 , pp. 2319-2323
    • Tenenbaum, J.B.1    Silva, V.d.2    Langford, J.C.3
  • 81
    • 0034704222 scopus 로고    scopus 로고
    • Nonlinear dimensionality reduction by locally linear embedding
    • Roweis, S.T. and Saul, L.K. (2000) Nonlinear dimensionality reduction by locally linear embedding. Science, 290 (5500), 2323-2326.
    • (2000) Science , vol.290 , Issue.5500 , pp. 2323-2326
    • Roweis, S.T.1    Saul, L.K.2
  • 84
    • 78650209298 scopus 로고    scopus 로고
    • Stochastic proximity embedding: Methods and applications
    • Agrafiotis, D.K., Xu, H., Zhu, F., Bandyopadhyay, D., and Liu, P. (2010) Stochastic proximity embedding: Methods and applications. Molecular Informatics, 29 (11), 758-770.
    • (2010) Molecular Informatics , vol.29 , Issue.11 , pp. 758-770
    • Agrafiotis, D.K.1    Xu, H.2    Zhu, F.3    Byopadhyay, D.4    Liu, P.5
  • 85
    • 34547156051 scopus 로고    scopus 로고
    • A measure of domain of applicability for QSAR modelling based on intelligent K-means clustering
    • Stanforth, R.W., Kolossov, E., and Mirkin, B. (2007) A measure of domain of applicability for QSAR modelling based on intelligent K-means clustering. QSAR & Combinatorial Science, 26 (7), 837-844.
    • (2007) QSAR & Combinatorial Science , vol.26 , Issue.7 , pp. 837-844
    • Stanforth, R.W.1    Kolossov, E.2    Mirkin, B.3
  • 87
    • 0033338339 scopus 로고    scopus 로고
    • Semi-Supervised Clustering Using Genetic Algorithms, Artificial Neural Networks in Engineering
    • Bennett, K., Demiriz, A., and Embrechts, M. (1999) Semi-Supervised Clustering Using Genetic Algorithms, Artificial Neural Networks in Engineering.
    • (1999)
    • Bennett, K.1    Demiriz, A.2    Embrechts, M.3
  • 88
    • 0025981114 scopus 로고
    • An application of unsupervised neural network methodology Kohonen topology-preserving mapping to QSAR analysis
    • Rose,V.S.,Croall, I.F., andMacfie, H.J.H. (1991) An application of unsupervised neural network methodology Kohonen topology-preserving mapping to QSAR analysis. Quantitative Structure-Activity Relationships, 10 (1), 6-15.
    • (1991) Quantitative Structure-Activity Relationships , vol.10 , Issue.1 , pp. 6-15
    • RoseV.S.Croall, I.F.1    Macfie, H.J.H.2
  • 91
    • 85152373811 scopus 로고
    • Notes on the history and nature of partial least-squares (PLS) modelling
    • Geladi, P. (1988) Notes on the history and nature of partial least-squares (PLS) modelling. Journal ofChemometrics, 2, 231.
    • (1988) Journal ofChemometrics , vol.2 , pp. 231
    • Geladi, P.1
  • 92
    • 0035498337 scopus 로고    scopus 로고
    • QSAR and k-nearest neighbor classification analysis of selective cyclooxygenase-2 inhibitors using topologically-based numerical descriptors
    • Kauffman, G.W. and Jurs, P.C. (2001) QSAR and k-nearest neighbor classification analysis of selective cyclooxygenase-2 inhibitors using topologically-based numerical descriptors. Journal of Chemical Information and Computer Sciences, 41 (6), 1553-1560.
    • (2001) Journal of Chemical Information and Computer Sciences , vol.41 , Issue.6 , pp. 1553-1560
    • Kauffman, G.W.1    Jurs, P.C.2
  • 96
    • 0030900210 scopus 로고    scopus 로고
    • Neural network modeling for estimation of the aqueous solubility of structurally related drugs
    • Huuskonen, J., Salo, M., and Jyrki, Taskinen, (1997) Neural network modeling for estimation of the aqueous solubility of structurally related drugs. Journal of Pharmaceutical Sciences, 86 (4), 450-454.
    • (1997) Journal of Pharmaceutical Sciences , vol.86 , Issue.4 , pp. 450-454
    • Huuskonen, J.1    Salo, M.2    Jyrki Taskinen3
  • 98
    • 56049095031 scopus 로고    scopus 로고
    • Onthe interpretation and interpretability of quantitative structure- activity relationship models
    • Guha, R. (2008)Onthe interpretation and interpretability of quantitative structure- activity relationship models. Journal of Computer-Aided Molecular Design, 22 (12), 857-871.
    • (2008) Journal of Computer-Aided Molecular Design , vol.22 , Issue.12 , pp. 857-871
    • Guha, R.1
  • 99
    • 0030211964 scopus 로고    scopus 로고
    • Bagging predictors
    • Breiman, L. (1996) Bagging predictors. Machine Learning, 24 (2), 123-140.
    • (1996) Machine Learning , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 100
    • 84983110889 scopus 로고
    • A decision-theoretic generalization of on-line learning and an application to boosting, in Proceedings of the Second European Conference on Computational Learning Theory, Springer, Berlin
    • Freund, Y. and Schapire, R.E. (1995) A decision-theoretic generalization of on-line learning and an application to boosting, in Proceedings of the Second European Conference on Computational Learning Theory, Springer, Berlin, pp. 23-37.
    • (1995) , pp. 23-37
    • Freund, Y.1    Schapire, R.E.2
  • 101
    • 0030196364 scopus 로고    scopus 로고
    • Stacked regressions
    • Breiman, L. (1996) Stacked regressions. Machine Learning, 24 (1), 49-64.
    • (1996) Machine Learning , vol.24 , Issue.1 , pp. 49-64
    • Breiman, L.1
  • 105
    • 33645856496 scopus 로고    scopus 로고
    • A QSAR model of hERG binding using a large, diverse, and internally consistent training set
    • Seierstad, M. and Agrafiotis, D.K. (2006) A QSAR model of hERG binding using a large, diverse, and internally consistent training set. Chemical Biology & Drug Design, 67 (4), 284-296.
    • (2006) Chemical Biology & Drug Design , vol.67 , Issue.4 , pp. 284-296
    • Seierstad, M.1    Agrafiotis, D.K.2
  • 110
    • 0033576680 scopus 로고    scopus 로고
    • Consensus scoring: A method for obtaining improved hit rates from docking databases of three-dimensional structures into proteins
    • Charifson, P.S., Corkery, J.J., Murcko, M.A., and Walters, W.P. (1999) Consensus scoring: A method for obtaining improved hit rates from docking databases of three-dimensional structures into proteins. Journal of Medicinal Chemistry, 42 (25), 5100-5109.
    • (1999) Journal of Medicinal Chemistry , vol.42 , Issue.25 , pp. 5100-5109
    • Charifson, P.S.1    Corkery, J.J.2    Murcko, M.A.3    Walters, W.P.4
  • 111
    • 0036606204 scopus 로고    scopus 로고
    • ConsDock: A new program for the consensus analysis of protein-ligand interactions
    • Paul, N. and Rognan, D. (2002) ConsDock: A new program for the consensus analysis of protein-ligand interactions. Proteins: Structure, Function, and Bioinformatics, 47 (4), 521-533.
    • (2002) Proteins: Structure,Function, and Bioinformatics , vol.47 , Issue.4 , pp. 521-533
    • Paul, N.1    Rognan, D.2
  • 114
    • 2442682055 scopus 로고    scopus 로고
    • A hybrid decision tree/genetic algorithm method for data mining
    • Carvalho, D.R. and Freitas, A.A. (2004) A hybrid decision tree/genetic algorithm method for data mining. Information Sciences, 163 (1-3), 13-35.
    • (2004) Information Sciences , vol.163 , Issue.1-3 , pp. 13-35
    • Carvalho, D.R.1    Freitas, A.A.2
  • 115
    • 33645923096 scopus 로고    scopus 로고
    • Computational methods in developing quantitative structure-activity relationships (QSAR): A review
    • Dudek, A.Z., Arodz, T., and Galvez, J. (2006) Computational methods in developing quantitative structure-activity relationships (QSAR): A review. Combinatorial Chemistry and High Throughput Screening, 9, 213-228.
    • (2006) Combinatorial Chemistry and High Throughput Screening , vol.9 , pp. 213-228
    • Dudek, A.Z.1    Arodz, T.2    Galvez, J.3
  • 117
    • 0035478854 scopus 로고    scopus 로고
    • Random Forests
    • Breiman, L. (2001) Random Forests. Machine Learning, 45 (1), 5-32.
    • (2001) Machine Learning , vol.45 , Issue.1 , pp. 5-32
    • Breiman, L.1
  • 120
    • 34249753618 scopus 로고
    • Support-vector networks
    • Cortes, C. and Vapnik, V.N. (1995) Support-vector networks. Machine Learning, 20, 273-297.
    • (1995) Machine Learning , vol.20 , pp. 273-297
    • Cortes, C.1    Vapnik, V.N.2
  • 122
    • 0038259120 scopus 로고    scopus 로고
    • Kernel partial least squares regression in reproducing kernel Hilbert space
    • Rosipal, R. and Trejo, L.J. (2001) Kernel partial least squares regression in reproducing kernel Hilbert space. Machine Learning Resources, 2, 97-123.
    • (2001) Machine Learning Resources , vol.2 , pp. 97-123
    • Rosipal, R.1    Trejo, L.J.2
  • 123
    • 0001907967 scopus 로고    scopus 로고
    • Support vector machines: Hype or hallelujah
    • Bennett, K. and Campbell, C. (2000) Support vector machines: Hype or hallelujah. SIGKDD Explorations, 2 (2), 1-13.
    • (2000) SIGKDD Explorations , vol.2 , Issue.2 , pp. 1-13
    • Bennett, K.1    Campbell, C.2
  • 125
    • 0034676318 scopus 로고    scopus 로고
    • Classification of multidrug-resistance reversal agents using structure-based descriptors and linear discriminant analysis
    • Bakken, G.A. and Jurs, P.C. (2000) Classification of multidrug-resistance reversal agents using structure-based descriptors and linear discriminant analysis. Journal of Medicinal Chemistry, 43 (23), 4534-4541.
    • (2000) Journal of Medicinal Chemistry , vol.43 , Issue.23 , pp. 4534-4541
    • Bakken, G.A.1    Jurs, P.C.2
  • 126
    • 84886043237 scopus 로고    scopus 로고
    • Optimization approaches to semisupervised learning, in Applications and Algorithms of Complementarity (eds M.C. Ferris, O.L. Mangasarian, and J.S. Pang), Kluwer Academic, Boston.
    • Bennett, K. and Demiriz, A. (2000) Optimization approaches to semisupervised learning, in Applications and Algorithms of Complementarity (eds M.C. Ferris, O.L. Mangasarian, and J.S. Pang), Kluwer Academic, Boston.
    • (2000)
    • Bennett, K.1    Demiriz, A.2
  • 127
    • 0034740222 scopus 로고    scopus 로고
    • Drug design bymachine learning: Support vector machines for pharmaceutical data analysis
    • Burbidge, R., Trotter, M., Buxton, B., and Holden, S. (2001)Drug design bymachine learning: Support vector machines for pharmaceutical data analysis. Computational Chemistry, 26, 5-14.Burbidge, R., Trotter, M., Buxton, B., and Holden, S. (2001)Drug design bymachine learning: Support vector machines for pharmaceutical data analysis. Computational Chemistry, 26, 5-14.
    • (2001) Computational Chemistry , vol.26 , pp. 5-14
    • Burbidge, R.1    Trotter, M.2    Buxton, B.3    Holden, S.4
  • 129
    • 2442514721 scopus 로고    scopus 로고
    • An optimization perspective on partial least squares
    • (eds J.A.K. Suykens, G. Horvath, S. Basu, C. Micchelli, and J. Vandewalle), IOS Press, Amsterdam
    • Bennett, K.P. and Embrechts, M.J. (2003) An optimization perspective on partial least squares, in Advances in Learning Theory: Methods, Models and Applications, vol. 190 (eds J.A.K. Suykens, G. Horvath, S. Basu, C. Micchelli, and J. Vandewalle), IOS Press, Amsterdam, pp. 227-250.
    • (2003) Advances in Learning Theory: Methods, Models and Applications , vol.190 , pp. 227-250
    • Bennett, K.P.1    Embrechts, M.J.2
  • 130
    • 84886011608 scopus 로고    scopus 로고
    • QSAR model stability: How much information is in the data?
    • American Chemical Society National Meeting, New Orleans, LA
    • Ryan, D., McLellan, M., and Breneman, C.M. (2008) QSAR model stability: How much information is in the data? American Chemical Society National Meeting, New Orleans, LA.
    • (2008)
    • Ryan, D.1    McLellan, M.2    Breneman, C.M.3
  • 131
    • 44449107147 scopus 로고    scopus 로고
    • Support-vector-machine-based ranking significantly improves the effectiveness of similarity searching using 2D fingerprints and multiple reference compounds
    • Geppert, H., Horváth, T., Gärtner, T., Wrobel, S., and Bajorath, J. (2008) Support-vector-machine-based ranking significantly improves the effectiveness of similarity searching using 2D fingerprints and multiple reference compounds. Journal of Chemical Information and Modeling, 48 (4), 742-746.
    • (2008) Journal of Chemical Information and Modeling , vol.48 , Issue.4 , pp. 742-746
    • Geppert, H.1    Horváth, T.2    Gärtner, T.3    Wrobel, S.4    Bajorath, J.5
  • 132
    • 77952768125 scopus 로고    scopus 로고
    • Ranking chemical structures for drug discovery: A new machine learning approach
    • Agarwal, S., Dugar, D., and Sengupta, S. (2010) Ranking chemical structures for drug discovery: A new machine learning approach. Journal of Chemical Information and Modeling, 50 (5), 716-731.
    • (2010) Journal of Chemical Information and Modeling , vol.50 , Issue.5 , pp. 716-731
    • Agarwal, S.1    Dugar, D.2    Sengupta, S.3
  • 133
    • 56449128429 scopus 로고    scopus 로고
    • Multiple instance ranking
    • Proceedings of the 25th International Conference on Machine Learning, ACM, Helsinki, Finland, pp.
    • Bergeron, C., Zaretzki, J., Breneman, C., and Bennett, K.P. (2008) Multiple instance ranking. Proceedings of the 25th International Conference on Machine Learning, ACM, Helsinki, Finland, pp. 48-55.
    • (2008) , pp. 48-55
    • Bergeron, C.1    Zaretzki, J.2    Breneman, C.3    Bennett, K.P.4
  • 134
    • 20544451024 scopus 로고    scopus 로고
    • An approach toward the problem of outliers in QSAR
    • Verma, R.P. and Hansch, C. (2005) An approach toward the problem of outliers in QSAR. Bioorganic and Medicinal Chemistry, 13 (15), 4597-4621.
    • (2005) Bioorganic and Medicinal Chemistry , vol.13 , Issue.15 , pp. 4597-4621
    • Verma, R.P.1    Hansch, C.2
  • 135
    • 33746931581 scopus 로고    scopus 로고
    • On outliers and activity cliffs - why QSAR often disappoints
    • Maggiora, G.M. (2006) On outliers and activity cliffs - why QSAR often disappoints. Journal of Chemical Information and Modeling, 46 (4), 1535.
    • (2006) Journal of Chemical Information and Modeling , vol.46 , Issue.4 , pp. 1535
    • Maggiora, G.M.1
  • 136
    • 78049349961 scopus 로고    scopus 로고
    • Trust, but verify: On the importance of chemical structure curation in cheminformatics and QSAR modeling research
    • Fourches, D., Muratov, E., and Tropsha, A. (2010) Trust, but verify: On the importance of chemical structure curation in cheminformatics and QSAR modeling research. Journal of Chemical Information and Modeling, 50 (7), 1189-1204.
    • (2010) Journal of Chemical Information and Modeling , vol.50 , Issue.7 , pp. 1189-1204
    • Fourches, D.1    Muratov, E.2    Tropsha, A.3
  • 137
    • 0032291174 scopus 로고    scopus 로고
    • Computationally intelligent data mining for the automated design and discovery of novel pharmaceuticals
    • November 1-4, 1998 (eds C.H. Dagli, M. Akay, A.L. Buczak, O. Ersoy, and B.R. Fernandex), ASME Press, St. Louis, MO, pp.
    • Embrechts, M.J., Robert Kewley, J., and Breneman, C. (1998) Computationally intelligent data mining for the automated design and discovery of novel pharmaceuticals, in Smart Engineering Systems: Neural Networks, Fuzzy Logic, Evolutionary Programming, Data Mining and Rough Sets, November 1-4, 1998 (eds C.H. Dagli, M. Akay, A.L. Buczak, O. Ersoy, and B.R. Fernandex), ASME Press, St. Louis, MO, pp. 397-403.
    • (1998) Smart Engineering Systems: Neural Networks, Fuzzy Logic, Evolutionary Programming, Data Mining and Rough Sets , pp. 397-403
    • Embrechts, M.J.1    Robert Kewley, J.2    Breneman, C.3
  • 139
    • 33845550710 scopus 로고
    • Multivariate quantitative structure- activity relationships (QSAR): Conditions for their applicability
    • Wold, S. and Dunn, W.J. (1983) Multivariate quantitative structure- activity relationships (QSAR): Conditions for their applicability. Journal of Chemical Information and Computer Sciences, 23 (1), 6-13.
    • (1983) Journal of Chemical Information and Computer Sciences , vol.23 , Issue.1 , pp. 6-13
    • Wold, S.1    Dunn, W.J.2
  • 140
    • 0038724207 scopus 로고    scopus 로고
    • The importance of being earnest: Validation is the absolute essential for successful application and interpretation of QSPR models
    • Tropsha, A., Gramatica, P., and Gombar, V.K. (2003) The importance of being earnest: Validation is the absolute essential for successful application and interpretation of QSPR models. QSAR and Combinatorial Science, 22 (1), 69-77.
    • (2003) QSAR and Combinatorial Science , vol.22 , Issue.1 , pp. 69-77
    • Tropsha, A.1    Gramatica, P.2    Gombar, V.K.3
  • 141
    • 36749045167 scopus 로고    scopus 로고
    • Exploring the impact of size of training sets for the development of predictive QSAR models
    • Roy, P.P., Leonard, J.T., and Roy, K. (2008) Exploring the impact of size of training sets for the development of predictive QSAR models. Chemometrics and Intelligent Laboratory Systems, 90 (1), 31-42.
    • (2008) Chemometrics and Intelligent Laboratory Systems , vol.90 , Issue.1 , pp. 31-42
    • Roy, P.P.1    Leonard, J.T.2    Roy, K.3
  • 142
    • 33845277149 scopus 로고    scopus 로고
    • QSAR prediction of estrogen activity for a large set of diverse chemicals under the guidance of OECD principles
    • Liu, H., Papa, E., and Gramatica, P. (2006) QSAR prediction of estrogen activity for a large set of diverse chemicals under the guidance of OECD principles. Chemical Research in Toxicology, 19 (11), 1540-1548.
    • (2006) Chemical Research in Toxicology , vol.19 , Issue.11 , pp. 1540-1548
    • Liu, H.1    Papa, E.2    Gramatica, P.3
  • 143
    • 84987100711 scopus 로고
    • Crossvalidation, bootstrapping, and partial least squares compared with multiple regression in conventional QSAR studies
    • Cramer, R.D., Bunce, J.D., Patterson, D.E., and Frank, I.E. (1988) Crossvalidation, bootstrapping, and partial least squares compared with multiple regression in conventional QSAR studies. Quantitative Structure- Activity Relationships, 7 (1), 18-25.
    • (1988) Quantitative Structure- Activity Relationships , vol.7 , Issue.1 , pp. 18-25
    • Cramer, R.D.1    Bunce, J.D.2    Patterson, D.E.3    Frank, I.E.4
  • 145
    • 20844450385 scopus 로고    scopus 로고
    • Statistical variation in progressive scrambling
    • Clark, R. and Fox, P. (2004) Statistical variation in progressive scrambling. Journal of Computer-Aided Molecular Design, 18 (7), 563-576.
    • (2004) Journal of Computer-Aided Molecular Design , vol.18 , Issue.7 , pp. 563-576
    • Clark, R.1    Fox, P.2
  • 146
    • 27744590591 scopus 로고    scopus 로고
    • QSAR applicability domain estimation by projection of the training set in descriptor space: a review
    • Jaworska, J., Nikolova-Jeliazkova, N., and Aldenberg, T. (2005) QSAR applicability domain estimation by projection of the training set in descriptor space: a review. Alternatives to Laboratory Animals, 33 (5), 445-459.
    • (2005) Alternatives to Laboratory Animals , vol.33 , Issue.5 , pp. 445-459
    • Jaworska, J.1    Nikolova-Jeliazkova, N.2    Aldenberg, T.3
  • 148
    • 54249125512 scopus 로고    scopus 로고
    • Critical assessment of QSAR models of environmental toxicity against Tetrahymena pyriformis: Focusing on applicability domain and overfitting by variable selection
    • Tetko, I.V., Sushko, I., Pandey, A.K., Zhu, H., Tropsha, A., Papa, E., Öberg, T., Todeschini, R., Fourches, D., and Varnek, A. (2008) Critical assessment of QSAR models of environmental toxicity against Tetrahymena pyriformis: Focusing on applicability domain and overfitting by variable selection. Journal of Chemical Information and Modeling, 48 (9), 1733-1746.
    • (2008) Journal of Chemical Information and Modeling , vol.48 , Issue.9 , pp. 1733-1746
    • Tetko, I.V.1    Sushko, I.2    Pey, A.K.3    Zhu, H.4    Tropsha, A.5    Papa, E.6    Öberg, T.7    Todeschini, R.8    Fourches, D.9    Varnek, A.10
  • 149
    • 77649229098 scopus 로고    scopus 로고
    • Binding affinity prediction with property-encoded shape distribution signatures
    • Das, S., Krein,M.P., and Breneman, C.M. (2010) Binding affinity prediction with property-encoded shape distribution signatures. Journal of Chemical Information and Modeling, 50 (2), 298-308.
    • (2010) Journal of Chemical Information and Modeling , vol.50 , Issue.2 , pp. 298-308
    • Das, S.1    Krein, M.P.2    Breneman, C.M.3
  • 150
    • 42149090634 scopus 로고    scopus 로고
    • Structure-activity landscape index: Identifying and quantifying activity cliffs
    • Guha, R. and Van Drie, J.H. (2008) Structure-activity landscape index: Identifying and quantifying activity cliffs. Journal of Chemical Information and Modeling, 48, 646-658.
    • (2008) Journal of Chemical Information and Modeling , vol.48 , pp. 646-658
    • Guha, R.1    Van Drie, J.H.2
  • 151
    • 36148989137 scopus 로고    scopus 로고
    • SAR index: quantifying the nature of structure-activity relationships
    • Peltason, L. and Bajorath, J. (2007) SAR index: quantifying the nature of structure-activity relationships. Journal of Medicinal Chemistry, 50 (23), 5571-5578.
    • (2007) Journal of Medicinal Chemistry , vol.50 , Issue.23 , pp. 5571-5578
    • Peltason, L.1    Bajorath, J.2
  • 152
    • 77954082092 scopus 로고    scopus 로고
    • Rationalizing three-dimensional activity landscapes and the influence of molecular representations on landscape topology and the formation of activity cliffs
    • Peltason, L., Iyer, P., and Bajorath, J. (2010) Rationalizing three-dimensional activity landscapes and the influence of molecular representations on landscape topology and the formation of activity cliffs. Journal of Chemical Information and Modeling, 50 (6), 1021-1033.
    • (2010) Journal of Chemical Information and Modeling , vol.50 , Issue.6 , pp. 1021-1033
    • Peltason, L.1    Iyer, P.2    Bajorath, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.