Volume 29, Issue 7, 2012, Pages 551-564

A novel model to predict O-glycosylation sites using a highly unbalanced dataset

Author keywords

Amino acid index; Feature selection; PP LDA; Protein glycosylation prediction

Indexed keywords

ACCURACY; AMINO TERMINAL SEQUENCE; ARTICLE; ARTIFICIAL NEURAL NETWORK; CARBOXY TERMINAL SEQUENCE; CONTROLLED STUDY; DISCRIMINANT ANALYSIS; GLYCOSYLATION; HYDROPHOBICITY; INTERMETHOD COMPARISON; MATHEMATICAL MODEL; PREDICTION; PRIORITY JOURNAL; PROBABILITY; SUPPORT VECTOR MACHINE; VALIDATION PROCESS;

EID: 84865796100     PISSN: 02820080     EISSN: 15734986     Source Type: Journal    
DOI: 10.1007/s10719-012-9434-x     Document Type: Article
Times cited : (9)

References (47)
  • 1
    • 35248882544
    • Modulation of protein biophysical properties by chemical glycosylation: Biochemical insights and biomedical implications
    • DOI 10.1007/s00018-007-6551-y
    • Sola, R.J., Rodriguez-Martinez, J.A., Griebenow, K.: Modulation of protein biophysical properties by chemical glycosylation: biochemical insights and biomedical implications. Cell. Mol. Life Sci. 64(16), 2133-2152 (2007) (Pubitemid 350092414)
    • (2007) Cellular and Molecular Life Sciences , vol.64 , Issue.16 , pp. 2133-2152
    • Sola, R.J.1    Rodriguez-Martinez, J.A.2    Griebenow, K.3
  • 2
    • 33845417633
    • Strategies for analysis of glycoprotein glycosylation
    • DOI 10.1016/j.bbapap.2006.10.007, PII S1570963906003487, Posttranslational Modifications in Proteomics
    • Geyer, H., Geyer, R.: Strategies for analysis of glycoprotein glycosylation. BBA-Proteins Proteom 1764(12), 1853-1869 (2006) (Pubitemid 44895032)
    • (2006) Biochimica et Biophysica Acta - Proteins and Proteomics , vol.1764 , Issue.12 , pp. 1853-1869
    • Geyer, H.1    Geyer, R.2
  • 3
    • 0036370537
    • Prediction of glycosylation across the human proteome and the correlation to protein function
    • Gupta, R., Brunak, S.: Prediction of glycosylation across the human proteome and the correlation to protein function. Pac. Symp. Biocomput. 310-322 (2002)
    • (2002) Pac. Symp. Biocomput. , pp. 310-322
    • Gupta, R.1    Brunak, S.2
  • 4
    • 0027034140
    • Glycosylation
    • Hart, G.W.: Glycosylation. Curr. Opin. Cell Biol. 4(6), 1017-1023 (1992)
    • (1992) Curr. Opin. Cell Biol. , vol.4 , Issue.6 , pp. 1017-1023
    • Hart, G.W.1
  • 5
    • 33748195979
    • Glycosylation in cellular mechanisms of health and disease
    • DOI 10.1016/j.cell.2006.08.019, PII S0092867406010865
    • Ohtsubo, K., Marth, J.D.: Glycosylation in cellular mechanisms of health and disease. Cell 126(5), 855-867 (2006) (Pubitemid 44310784)
    • (2006) Cell , vol.126 , Issue.5 , pp. 855-867
    • Ohtsubo, K.1    Marth, J.D.2
  • 6
    • 78651273016
    • Glycan changes: Cancer metastasis and anti-cancer vaccines
    • Li, M., Song, L.J., Qin, X.Y.: Glycan changes: cancer metastasis and anti-cancer vaccines. J. Biosciences. 35(4), 665-673 (2010)
    • (2010) J. Biosciences. , vol.35 , Issue.4 , pp. 665-673
    • Li, M.1    Song, L.J.2    Qin, X.Y.3
  • 7
    • 18544390506
    • Post-translational modifications of tau protein in Alzheimer's disease
    • Gong, C.X., et al.: Post-translational modifications of tau protein in Alzheimer's disease. J. Neural Transm. 112(6), 813-838 (2005)
    • (2005) J. Neural Transm. , vol.112 , Issue.6 , pp. 813-838
    • Gong, C.X.1
  • 8
    • 79551484543
    • Highly glycosylated tumour antigens: Interactions with the immune system
    • Saeland, E., van Kooyk, Y.: Highly glycosylated tumour antigens: interactions with the immune system. Biochem. Soc. Trans. 39, 388-392 (2011)
    • (2011) Biochem. Soc. Trans. , vol.39 , pp. 388-392
    • Saeland, E.1    Van Kooyk, Y.2
  • 9
    • 0035146448
    • Database analysis of O-glycosylation sites in proteins
    • Christlet, T., Veluraja, K.: Database analysis of O-glycosylation sites in proteins. Biophys. J. 80(2), 952-960 (2001) (Pubitemid 32128346)
    • (2001) Biophysical Journal , vol.80 , Issue.2 , pp. 952-960
    • Christlet, T.H.T.1    Veluraja, K.2
  • 10
    • 33749860977
    • Post-translational modifications in the context of therapeutic proteins
    • DOI 10.1038/nbt1252, PII NBT1252
    • Walsh, G., Jefferis, R.: Post-translational modifications in the context of therapeutic proteins. Nat. Biotechnol. 24, 1241-1252 (2006) (Pubitemid 44564769)
    • (2006) Nature Biotechnology , vol.24 , Issue.10 , pp. 1241-1252
    • Walsh, G.1    Jefferis, R.2
  • 11
    • 2942564430
    • Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence
    • DOI 10.1002/pmic.200300771
    • Blom, N., et al.: Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence. Proteomics 4(6), 1633-1649 (2004) (Pubitemid 38738322)
    • (2004) Proteomics , vol.4 , Issue.6 , pp. 1633-1649
    • Blom, N.1    Sicheritz-Ponten, T.2    Gupta, R.3    Gammeltoft, S.4    Brunak, S.5
  • 12
    • 0027280810
    • The specificity of UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase as inferred from a database of in vivo substrates and from the in vitro glycosylation of proteins and peptides
    • Elhammer, A., et al.: The specificity of UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase as inferred from a database of in vivo substrates and from the in vitro glycosylation of proteins and peptides. J. Biol. Chem. 268, 10029-10038 (1993) (Pubitemid 23150178)
    • (1993) Journal of Biological Chemistry , vol.268 , Issue.14 , pp. 10029-10038
    • Elhammer, A.P.1    Poorman, R.A.2    Brown, E.3    Maggiora, L.L.4    Hoogerheide, J.G.5    Kezdy, F.J.6
  • 13
    • 0025865091
    • Amino acid distributions around O-linked glycosylation sites
    • Wilson, B., Gavel, Y., von Heijne, G.: Amino acid distributions around O-linked glycosylation sites. Biochem. J. 275, 529-534 (1991)
    • (1991) Biochem. J. , vol.275 , pp. 529-534
    • Wilson, B.1    Gavel, Y.2    Von Heijne, G.3
  • 14
    • 0026470310
    • The influence of flanking sequence on the O-glycosylation of threonine in vitro
    • O'Connell, B.C., Hagen, F.K., Tabak, L.A.: The influence of flanking sequence on the O-glycosylation of threonine in vitro. J. Biol. Chem. 267(35), 25010-25018 (1992)
    • (1992) J. Biol. Chem. , vol.267 , Issue.35 , pp. 25010-25018
    • O'Connell, B.C.1    Hagen, F.K.2    Tabak, L.A.3
  • 15
    • 0030764614
    • Discovery of the shortest sequence motif for high level mucin-type O-glycosylation
    • DOI 10.1074/jbc.272.27.16884
    • Yoshida, A., et al.: Discovery of the shortest sequence motif for high level mucin-type O-glycosylation. J. Biol. Chem. 272(27), 16884-16888 (1997) (Pubitemid 27289787)
    • (1997) Journal of Biological Chemistry , vol.272 , Issue.27 , pp. 16884-16888
    • Yoshida, A.1    Suzuki, M.2    Ikenaga, H.3    Takeuchi, M.4
  • 16
    • 33745800293
    • Interpreting the protein language using proteomics
    • DOI 10.1038/nrm1939, PII NRM1939
    • Jensen, O.N.: Interpreting the protein language using proteomics. Nat. Rev. Mol. Cell Biol. 7(6), 391-403 (2006) (Pubitemid 44050093)
    • (2006) Nature Reviews Molecular Cell Biology , vol.7 , Issue.6 , pp. 391-403
    • Jensen, O.N.1
  • 17
    • 0029076524
    • A sequence-coupled vector-projection model for predicting the specificity of GalNAc-transferase
    • Chou, K.: A sequence-coupled vector-projection model for predicting the specificity of GalNAc-transferase. Protein Sci. 4, 1365-1383 (1995)
    • (1995) Protein Sci. , vol.4 , pp. 1365-1383
    • Chou, K.1
  • 18
    • 58149512313
    • GalNAc-transferase specificity prediction based on feature selection method
    • Lu, L., et al.: GalNAc-transferase specificity prediction based on feature selection method. Peptides 30(2), 359-364 (2009)
    • (2009) Peptides , vol.30 , Issue.2 , pp. 359-364
    • Lu, L.1
  • 19
    • 77952778270
    • Prediction of posttranslational modification of proteins from their amino acid sequence
    • Eisenhaber, B., Eisenhaber, F.: Prediction of posttranslational modification of proteins from their amino acid sequence. Methods Mol. Biol. 609, 365-384 (2010)
    • (2010) Methods Mol. Biol. , vol.609 , pp. 365-384
    • Eisenhaber, B.1    Eisenhaber, F.2
  • 20
    • 0028800739
    • A vector projection method for predicting the specificity of GalNAc-transferase
    • Chou, K., et al.: A vector projection method for predicting the specificity of GalNAc-transferase. Proteins 21, 118-126 (1995)
    • (1995) Proteins , vol.21 , pp. 118-126
    • Chou, K.1
  • 21
    • 0029003322
    • Prediction of O-glycosylation of mammalian proteins: Specificity patterns of UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
    • Hansen, J., et al.: Prediction of O-glycosylation of mammalian proteins: specificity patterns of UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase. Biochem. J. 308, 801-813 (1995)
    • (1995) Biochem. J. , vol.308 , pp. 801-813
    • Hansen, J.1
  • 22
    • 33646887253
    • Predicting O-glycosylation sites in mammalian proteins by using SVMs
    • DOI 10.1016/j.compbiolchem.2006.02.002, PII S1476927106000107
    • Li, S., et al.: Predicting O-glycosylation sites in mammalian proteins by using SVMs. Comput. Biol. Chem. 30(3), 203-208 (2006) (Pubitemid 43783164)
    • (2006) Computational Biology and Chemistry , vol.30 , Issue.3 , pp. 203-208
    • Li, S.1    Liu, B.2    Zeng, R.3    Cai, Y.4    Li, Y.5
  • 23
    • 13644257223
    • Prediction, conservation analysis, and structural characterization of mammalian mucin-type O-glycosylation sites
    • DOI 10.1093/glycob/cwh151
    • Julenius, K., et al.: Prediction, conservation analysis, and structural characterization of mammalian mucin-type O-glycosylation sites. Glycobiology 15, 153-164 (2005) (Pubitemid 40227920)
    • (2005) Glycobiology , vol.15 , Issue.2 , pp. 153-164
    • Julenius, K.1    Molgaard, A.2    Gupta, R.3    Brunak, S.4
  • 24
    • 42949170961
    • Prediction of mucin-type O-glycosylation sites in mammalian proteins using the composition of k-spaced amino acid pairs
    • Chen, Y.-Z., et al.: Prediction of mucin-type O-glycosylation sites in mammalian proteins using the composition of k-spaced amino acid pairs. BMC Bioinformatics 9(1), 101 (2008)
    • (2008) BMC Bioinformatics , vol.9 , Issue.1 , pp. 101
    • Chen, Y.-Z.1
  • 26
    • 38849163717
    • Glycosylation site prediction using ensembles of support vector machine classifiers
    • Caragea, C., et al.: Glycosylation site prediction using ensembles of Support Vector Machine classifiers. BMC Bioinformatics 8(1), 438 (2007)
    • (2007) BMC Bioinformatics , vol.8 , Issue.1 , pp. 438
    • Caragea, C.1
  • 27
    • 72149102185
    • Affinity enrichment and characterization of mucin core-1 type glycopeptides from bovine serum
    • Darula, Z., Medzihradszky, K.F.: Affinity enrichment and characterization of mucin core-1 type glycopeptides from bovine serum. Mol. Cell. Proteomics 8(11), 2515-2526 (2009)
    • (2009) Mol. Cell. Proteomics , vol.8 , Issue.11 , pp. 2515-2526
    • Darula, Z.1    Medzihradszky, K.F.2
  • 28
    • 0343621535
    • AAindex: Amino acid index database
    • Kawashima, S., Kanehisa, M.: AAindex: amino acid index database. Nucl. Acids. Res. 28(1), 374 (2000) (Pubitemid 30047811)
    • (2000) Nucleic Acids Research , vol.28 , Issue.1 , pp. 374
    • Kawashima, S.1    Kanehisa, M.2
  • 30
    • 0029922443
    • Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins
    • Tomii, K., Kanehisa, M.: Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins. Protein Eng. 9(1), 27-36 (1996) (Pubitemid 26099761)
    • (1996) Protein Engineering , vol.9 , Issue.1 , pp. 27-36
    • Tomii, K.1    Kanehisa, M.2
  • 31
    • 0344141207
    • Selection of variables for interpreting multivariate gas sensor data
    • DOI 10.1016/S0003-2670(98)00739-9, PII S0003267098007399
    • Eklöv, T., Mårtensson, P., Lundström, I.: Selection of variables for interpreting multivariate gas sensor data. Anal. Chim. Acta 381(2-3), 221-232 (1999) (Pubitemid 29073299)
    • (1999) Analytica Chimica Acta , vol.381 , Issue.2-3 , pp. 221-232
    • Eklov, T.1    Martensson, P.2    Lundstrom, I.3
  • 32
    • 0035914576
    • Comparison of different methods for variable selection
    • Xu, L., Zhang, W.-J.: Comparison of different methods for variable selection. Anal. Chim. Acta 446(1-2), 475-481 (2001)
    • (2001) Anal. Chim. Acta , vol.446 , Issue.1-2 , pp. 475-481
    • Xu, L.1    Zhang, W.-J.2
  • 33
    • 33244497332
    • Coupling fast variable selection methods to neural network-based classifiers: Application to multisensor systems
    • DOI 10.1016/j.snb.2005.04.046, PII S0925400505005599
    • Gualdrón, O., et al.: Coupling fast variable selection methods to neural network-based classifiers: application to multisensor systems. Sens. Actuator. B-Chem. 114(1), 522-529 (2006) (Pubitemid 43276223)
    • (2006) Sensors and Actuators, B: Chemical , vol.114 , Issue.1 , pp. 522-529
    • Gualdron, O.1    Llobet, E.2    Brezmes, J.3    Vilanova, X.4    Correig, X.5
  • 34
    • 34147154787
    • Probing the anticancer activity of nucleoside analogues: A QSAR model approach using an internally consistent training set
    • Morales Helguera, A., et al.: Probing the anticancer activity of nucleoside analogues: a QSAR model approach using an internally consistent training set. J. Med. Chem. 50(7), 1537-1545 (2007)
    • (2007) J. Med. Chem. , vol.50 , Issue.7 , pp. 1537-1545
    • Morales Helguera, A.1
  • 35
    • 0032594959
    • An overview of statistical learning theory
    • Vapnik, V.N.: An overview of statistical learning theory. IEEE Trans. Neural Netw. 10(5), 988-999 (1999)
    • (1999) IEEE Trans. Neural Netw. , vol.10 , Issue.5 , pp. 988-999
    • Vapnik, V.N.1
  • 37
    • 0020068152
    • Self-organized formation of topologically correct feature maps
    • DOI 10.1007/BF00337288
    • Kohonen, T.: Self-organized formation of topologically correct feature maps. Biol. Cybern. 43(1), 59-69 (1982) (Pubitemid 12139984)
    • (1982) Biological Cybernetics , vol.43 , Issue.1 , pp. 59-69
    • Kohonen, T.1
  • 38
    • 34447622472
    • Application of self-organizing competitive neural network in fault diagnosis of suck rod pumping system
    • DOI 10.1016/j.petrol.2006.11.008, PII S0920410506003032
    • Xu, P., Xu, S.J., Yin, H.W.: Application of self-organizing competitive neural network in fault diagnosis of suck rod pumping system. J. Pet. Sci. Eng. 58(1-2), 43-48 (2007) (Pubitemid 47087165)
    • (2007) Journal of Petroleum Science and Engineering , vol.58 , Issue.1-2 , pp. 43-48
    • Xu, P.1    Xu, S.2    Yin, H.3
  • 40
  • 41
    • 34248181215
    • Predicting the protein SUMO modification sites based on Properties Sequential Forward Selection (PSFS)
    • DOI 10.1016/j.bbrc.2007.04.097, PII S0006291X07008042
    • Liu, B., et al.: Predicting the protein SUMO modification sites based on Properties Sequential Forward Selection (PSFS). Biochem. Biophys. Res. Commun. 358(1), 136-139 (2007) (Pubitemid 46719092)
    • (2007) Biochemical and Biophysical Research Communications , vol.358 , Issue.1 , pp. 136-139
    • Liu, B.1    Li, S.2    Wang, Y.3    Lu, L.4    Li, Y.5    Cai, Y.6
  • 42
    • 80051556369
    • Prediction of mucin-type O-glycosylation sites by a two-staged strategy
    • Cai, Y., He, J., Lu, L.: Prediction of mucin-type O-glycosylation sites by a two-staged strategy. Mol. Divers. 15(2), 427-433 (2011)
    • (2011) Mol. Divers. , vol.15 , Issue.2 , pp. 427-433
    • Cai, Y.1    He, J.2    Lu, L.3
  • 43
    • 79951510489
    • Prediction and analysis of protein palmitoylation sites
    • Hu, L.L., et al.: Prediction and analysis of protein palmitoylation sites. Biochimie 93(3), 489-496 (2011)
    • (2011) Biochimie , vol.93 , Issue.3 , pp. 489-496
    • Hu, L.L.1
  • 44
    • 33745561205
    • An introduction to variable and feature selection
    • Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157-1182 (2003)
    • (2003) J. Mach. Learn. Res. , vol.3 , pp. 1157-1182
    • Guyon, I.1    Elisseeff, A.2
  • 45
    • 0034300875
    • A new LDA-based face recognition system which can solve the small sample size problem
    • Chen, L.F., et al.: A new LDA-based face recognition system which can solve the small sample size problem. Pattern. Recogn. 33, 1713-1726 (2000)
    • (2000) Pattern. Recogn. , vol.33 , pp. 1713-1726
    • Chen, L.F.1
  • 46
    • 0036489046
    • Comparison of discrimination methods for the classification of tumors using gene expression data
    • Dudoit, S., Fridlyand, J., Speed, T.P.: Comparison of discrimination methods for the classification of tumors using gene expression data. J. Am. Stat. Assoc. 97(457), 77-87 (2002)
    • (2002) J. Am. Stat. Assoc. , vol.97 , Issue.457 , pp. 77-87
    • Dudoit, S.1    Fridlyand, J.2    Speed, T.P.3
  • 47
    • 0031973966
    • Folding type-specific secondary structure propensities of amino acids, derived from alpha-helical, beta-sheet, alpha/beta, and alpha+beta proteins of known structures
    • Jiang, B., et al.: Folding type-specific secondary structure propensities of amino acids, derived from alpha-helical, beta-sheet, alpha/beta, and alpha+beta proteins of known structures. Biopolymers 45(1), 35-49 (1998)
    • (1998) Biopolymers , vol.45 , Issue.1 , pp. 35-49
    • Jiang, B.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.