-
1
-
-
0030801002
-
Gapped blast and psi-blast: A new generation of protein data
-
Altschul S, Madden T, Schafer A, Zhang J, Zhang Z, Miller W, Lipman D. Gapped blast and psi-blast: a new generation of protein data. Nucleic Acids Res 1997;25:3389-3402.
-
(1997)
Nucleic Acids Res
, vol.25
, pp. 3389-3402
-
-
Altschul, S.1
Madden, T.2
Schafer, A.3
Zhang, J.4
Zhang, Z.5
Miller, W.6
Lipman, D.7
-
2
-
-
0028043552
-
Position-based sequence weights
-
Henikoff S, Henikoff J. Position-based sequence weights. J Mol Biol 1994;243:574-578.
-
(1994)
J Mol Biol
, vol.243
, pp. 574-578
-
-
Henikoff, S.1
Henikoff, J.2
-
3
-
-
0028181441
-
Hidden markov models in computational biology: Applications to protein modeling
-
Krogh A, Brown M, Mian I, Sjolander K, Haussler D. Hidden markov models in computational biology: applications to protein modeling. J Mol Biol 1994;235:1501-1531.
-
(1994)
J Mol Biol
, vol.235
, pp. 1501-1531
-
-
Krogh, A.1
Brown, M.2
Mian, I.3
Sjolander, K.4
Haussler, D.5
-
4
-
-
1542714925
-
Mismatch string kernels for discriminative protein classification
-
Leslie C, Eskin E, Cohen A, Weston J, Noble W. Mismatch string kernels for discriminative protein classification. Bioinformatics 2004;20:467-476.
-
(2004)
Bioinformatics
, vol.20
, pp. 467-476
-
-
Leslie, C.1
Eskin, E.2
Cohen, A.3
Weston, J.4
Noble, W.5
-
5
-
-
0742287001
-
Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships
-
Liao L, Noble W. Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships. J Comput Biol 2003;10:857-868.
-
(2003)
J Comput Biol
, vol.10
, pp. 857-868
-
-
Liao, L.1
Noble, W.2
-
6
-
-
26944462332
-
A novel approach to extracting features from motif content and protein composition for protein sequence classification
-
Zhao X, Cheung Y, Huang D. A novel approach to extracting features from motif content and protein composition for protein sequence classification. Neural Networks 2005;18:1019-1028.
-
(2005)
Neural Networks
, vol.18
, pp. 1019-1028
-
-
Zhao, X.1
Cheung, Y.2
Huang, D.3
-
7
-
-
33748437922
-
Classifying protein sequences using hydropathy blocks
-
Huang D, Zhao X, Huang G, Cheung Y. Classifying protein sequences using hydropathy blocks. Pattern Recog 2006;39:2293-2300.
-
(2006)
Pattern Recog
, vol.39
, pp. 2293-2300
-
-
Huang, D.1
Zhao, X.2
Huang, G.3
Cheung, Y.4
-
8
-
-
0035014847
-
Multi-class protein fold recognition using support vector machines and neural networks
-
Ding CH, Dubchak I. Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics 2001;17:349-358.
-
(2001)
Bioinformatics
, vol.17
, pp. 349-358
-
-
Ding, C.H.1
Dubchak, I.2
-
10
-
-
0000487102
-
Estimating the support of a high-dimensional distribution
-
Scholkopf B, Platt JC, Shawe-Taylor JC, Smola AJ, Williamson RC. Estimating the support of a high-dimensional distribution. Neural Comput 2001;13:1443-1471.
-
(2001)
Neural Comput
, vol.13
, pp. 1443-1471
-
-
Scholkopf, B.1
Platt, J.C.2
Shawe-Taylor, J.C.3
Smola, A.J.4
Williamson, R.C.5
-
11
-
-
0037753593
-
-
PhD thesis. Delft University of Technology, Delft, The Netherlands;
-
Tax D. One-class classification. PhD thesis. Delft University of Technology, Delft, The Netherlands; 2001.
-
(2001)
One-class classification
-
-
Tax, D.1
-
12
-
-
16644402628
-
Feature selection for text categorization on imbalanced data
-
Zheng Z, Wu X, Srihari R. Feature selection for text categorization on imbalanced data. ACM SIGKDD Explor Newslett 2004;6:80-89.
-
(2004)
ACM SIGKDD Explor Newslett
, vol.6
, pp. 80-89
-
-
Zheng, Z.1
Wu, X.2
Srihari, R.3
-
15
-
-
27144479454
-
Learning from imbalanced data sets with boosting and data generation: The databoost-im approach
-
Guo H, Viktor HL. Learning from imbalanced data sets with boosting and data generation: the databoost-im approach. SIGKDD Explor 2004;6:30-39.
-
(2004)
SIGKDD Explor
, vol.6
, pp. 30-39
-
-
Guo, H.1
Viktor, H.L.2
-
17
-
-
0343081513
-
Reduction techniques for instance-based learning algorithms
-
Wilson DR, Martinez TR. Reduction techniques for instance-based learning algorithms. Mach Learn 2000;38:257-286.
-
(2000)
Mach Learn
, vol.38
, pp. 257-286
-
-
Wilson, D.R.1
Martinez, T.R.2
-
19
-
-
9444297357
-
SMOTEBoost: Improving prediction of the minority class in boosting
-
In the, Cavtat, Croatia;
-
Chawla NV, Lazarevic A, Hall L, Bowyer KW. SMOTEBoost: improving prediction of the minority class in boosting. In the Proceedings of the 7th European Conf on principles and practice of knowledge discovery in databases (PKDD), Cavtat, Croatia; 2003. pp 107-119.
-
(2003)
Proceedings of the 7th European Conf on principles and practice of knowledge discovery in databases (PKDD)
, pp. 107-119
-
-
Chawla, N.V.1
Lazarevic, A.2
Hall, L.3
Bowyer, K.W.4
-
21
-
-
0031361611
-
Machine learning research: Four current directions
-
Dietterich TG. Machine learning research: four current directions. AI Magazine 1998;18:97-136.
-
(1998)
AI Magazine
, vol.18
, pp. 97-136
-
-
Dietterich, T.G.1
-
22
-
-
33747880465
-
Ensemble classifier for protein fold pattern recognition
-
Shen H, Chou K. Ensemble classifier for protein fold pattern recognition. Bioinformatics 2006;22:1717-1722.
-
(2006)
Bioinformatics
, vol.22
, pp. 1717-1722
-
-
Shen, H.1
Chou, K.2
-
23
-
-
14944354760
-
Multi-class protein fold classification using a new ensemble machine learning approach
-
Tan A, Gilbert D, Deville Y. Multi-class protein fold classification using a new ensemble machine learning approach. Genome Informatics 2003;14:206-217.
-
(2003)
Genome Informatics
, vol.14
, pp. 206-217
-
-
Tan, A.1
Gilbert, D.2
Deville, Y.3
-
24
-
-
14044254198
-
Automated protein classification using consensus decision
-
Stanford, CA;
-
Can T, Camoglu O, Singh A, Wang Y. Automated protein classification using consensus decision. Computational Systems Bioinformatics Conference, Stanford, CA; 2004. pp 224-235.
-
(2004)
Computational Systems Bioinformatics Conference
, pp. 224-235
-
-
Can, T.1
Camoglu, O.2
Singh, A.3
Wang, Y.4
-
25
-
-
4444273377
-
Protein homology detection using string alignment kernels
-
Saigo H, Vert JP, Ueda N, Akutsu T. Protein homology detection using string alignment kernels. Bioinformatics 2004;20:1682-1689.
-
(2004)
Bioinformatics
, vol.20
, pp. 1682-1689
-
-
Saigo, H.1
Vert, J.P.2
Ueda, N.3
Akutsu, T.4
-
26
-
-
0019887799
-
Identification of common molecular subsequences
-
Smith TF, Waterman MS. Identification of common molecular subsequences. J Mol Biol 1981;147:195-197.
-
(1981)
J Mol Biol
, vol.147
, pp. 195-197
-
-
Smith, T.F.1
Waterman, M.S.2
-
27
-
-
0014757386
-
A general method applicable to the search for similarities in the amino acid sequence of two proteins
-
Needleman SB, Wunsch CD. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 1970;48:443-453.
-
(1970)
J Mol Biol
, vol.48
, pp. 443-453
-
-
Needleman, S.B.1
Wunsch, C.D.2
-
28
-
-
33846064578
-
A Protein Classification Benchmark collection for machine learning
-
Database issue:D232-D236
-
Sonego P, Pacurar M, Dhir S, Kertsz-Farkas A, Kocsor A, Gspri Z, Leunissen JA, Pongor S. A Protein Classification Benchmark collection for machine learning. Nucleic Acids Res 2007;35 (Database issue):D232-D236.
-
(2007)
Nucleic Acids Res
, pp. 35
-
-
Sonego, P.1
Pacurar, M.2
Dhir, S.3
Kertsz-Farkas, A.4
Kocsor, A.5
Gspri, Z.6
Leunissen, J.A.7
Pongor, S.8
-
29
-
-
0346494941
-
SCOP database in 2004: Refinements integrate structure and sequence family data
-
Andreeva A, Howorth D, Brenner SE, Hubbard TJ, Chothia C, Murzin AG. SCOP database in 2004: refinements integrate structure and sequence family data. Nucleic Acids Res 2004;32:D226-D229.
-
(2004)
Nucleic Acids Res
, vol.32
-
-
Andreeva, A.1
Howorth, D.2
Brenner, S.E.3
Hubbard, T.J.4
Chothia, C.5
Murzin, A.G.6
-
30
-
-
13444272079
-
The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis
-
Pearl F, Todd A, Sillitoe I, Dibley M, Redfern O, Lewis T, Bennett C, Marsden R, Grant A, Lee D, Akpor A, Maibaum M, Harrison A, Dallman T, Reeves G, Diboun I, Addou S, Lise S, Johnston C, Sillero A, Thornton J, Orengo C. The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis. Nucleic Acids Res 1005;33:D247-D251.
-
Nucleic Acids Res
, vol.1005
, Issue.33
-
-
Pearl, F.1
Todd, A.2
Sillitoe, I.3
Dibley, M.4
Redfern, O.5
Lewis, T.6
Bennett, C.7
Marsden, R.8
Grant, A.9
Lee, D.10
Akpor, A.11
Maibaum, M.12
Harrison, A.13
Dallman, T.14
Reeves, G.15
Diboun, I.16
Addou, S.17
Lise, S.18
Johnston, C.19
Sillero, A.20
Thornton, J.21
Orengo, C.22
more..
-
32
-
-
39749101564
-
-
Quinlan JR. C4.5: Programs for machine learning. San Francisco, CA: Morgan Kaufmann Publishers; 1993.
-
Quinlan JR. C4.5: Programs for machine learning. San Francisco, CA: Morgan Kaufmann Publishers; 1993.
-
-
-
|