메뉴 건너뛰기




Volumn 6, Issue 1, 2008, Pages 223-240

Feature selection in validating mass spectrometry database search results

Author keywords

Feature selection; Machine learning; Mass spectrometry; Proteomics; Random forest; Support vector machine

Indexed keywords

ACCURACY; ALGORITHM; AMINO ACID SEQUENCE; AREA UNDER THE CURVE; COMPUTER PROGRAM; CONFERENCE PAPER; FALSE POSITIVE RESULT; IONIZATION; MACHINE LEARNING; PEPTIDE ANALYSIS; PHYSICAL CHEMISTRY; PROTEIN ANALYSIS; PROTEIN DATABASE; RANDOMIZATION; SCORING SYSTEM; SENSITIVITY AND SPECIFICITY; STATISTICAL ANALYSIS; SUPPORT VECTOR MACHINE; TANDEM MASS SPECTROMETRY; VALIDATION PROCESS;

EID: 44049085330     PISSN: 02197200     EISSN: None     Source Type: Journal    
DOI: 10.1142/S0219720008003345     Document Type: Conference Paper
Times cited : (10)

References (42)
  • 1
    • 0000857494 scopus 로고
    • An approach to correlate tandem mass-spectral data of peptides with amino-acid-sequences in a protein database
    • Eng JK, Mccormack AL, Yates JR, An approach to correlate tandem mass-spectral data of peptides with amino-acid-sequences in a protein database, J Am Soc Mass Spectrom 5:976-989, 1994.
    • (1994) J Am Soc Mass Spectrom , vol.5 , pp. 976-989
    • Eng, J.K.1    Mccormack, A.L.2    Yates, J.R.3
  • 2
    • 0033434080 scopus 로고    scopus 로고
    • Probability-based protein identification by searching sequence databases using mass spectrometry data
    • Perkins DN, Pappin DJC, Creasy DM, Cottrell JS, Probability-based protein identification by searching sequence databases using mass spectrometry data, Electrophoresis 20:3551-3567, 1999.
    • (1999) Electrophoresis , vol.20 , pp. 3551-3567
    • Perkins, D.N.1    Pappin, D.J.C.2    Creasy, D.M.3    Cottrell, J.S.4
  • 3
    • 0033565832 scopus 로고    scopus 로고
    • Role of accurate mass measurement (±10 ppm) in protein identification strategies employing MS or MS/MS and database searching
    • Clauser KR, Baker P, Burlingame AL, Role of accurate mass measurement (±10 ppm) in protein identification strategies employing MS or MS/MS and database searching, Anal Chem 71:2871-2882, 1999.
    • (1999) Anal Chem , vol.71 , pp. 2871-2882
    • Clauser, K.R.1    Baker, P.2    Burlingame, A.L.3
  • 4
    • 3142702204 scopus 로고    scopus 로고
    • Matching proteins with tandem mass spectra
    • TANDEM
    • Craig R, Beavis RC, TANDEM: Matching proteins with tandem mass spectra, Bioinformatics 20:1466-1467, 2004.
    • (2004) Bioinformatics , vol.20 , pp. 1466-1467
    • Craig, R.1    Beavis, R.C.2
  • 5
    • 27744497057 scopus 로고    scopus 로고
    • Protein and peptide identification algorithms using MS for use in high-throughput, automated pipelines
    • Shadforth I, Crowther D, Bessant C, Protein and peptide identification algorithms using MS for use in high-throughput, automated pipelines, Proteomics 5:4082-4095, 2005.
    • (2005) Proteomics , vol.5 , pp. 4082-4095
    • Shadforth, I.1    Crowther, D.2    Bessant, C.3
  • 6
    • 19944408971 scopus 로고    scopus 로고
    • Limitations of current proteomics technologies
    • Garbis S, Lubec G, Fountoulakis M, Limitations of current proteomics technologies, J Chromatogr A 1077:1-18, 2005.
    • (2005) J Chromatogr A , vol.1077 , pp. 1-18
    • Garbis, S.1    Lubec, G.2    Fountoulakis, M.3
  • 7
    • 0036209134 scopus 로고    scopus 로고
    • Qscore: An algorithm for evaluating SEQUEST database search results
    • Moore RE, Young MK, Lee TD, Qscore: An algorithm for evaluating SEQUEST database search results, J Am Soc Mass Spectrom 13:378-386, 2002.
    • (2002) J Am Soc Mass Spectrom , vol.13 , pp. 378-386
    • Moore, R.E.1    Young, M.K.2    Lee, T.D.3
  • 8
    • 0037108887 scopus 로고    scopus 로고
    • Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search
    • Keller A, Nesvizhskii AI, Kolker E, Aebersold R, Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search, Anal Chem 74:5383-5392, 2002.
    • (2002) Anal Chem , vol.74 , pp. 5383-5392
    • Keller, A.1    Nesvizhskii, A.I.2    Kolker, E.3    Aebersold, R.4
  • 9
    • 3042737683 scopus 로고    scopus 로고
    • Improving reproducibility and sensitivity in identifying human proteins by shotgun proteomics
    • Resing KA et al., Improving reproducibility and sensitivity in identifying human proteins by shotgun proteomics, Anal Chem 76 3556-3568, 2004.
    • (2004) Anal Chem , vol.76 , pp. 3556-3568
    • Resing, K.A.1
  • 10
    • 2442526682 scopus 로고    scopus 로고
    • SILVER helps assign peptides to tandem mass spectra using intensity-based scoring
    • Gibbons FD, Elias JE, Gygi SP, Roth FP, SILVER helps assign peptides to tandem mass spectra using intensity-based scoring, J Am Soc Mass Spectrom 15:910-912, 2004.
    • (2004) J Am Soc Mass Spectrom , vol.15 , pp. 910-912
    • Gibbons, F.D.1    Elias, J.E.2    Gygi, S.P.3    Roth, F.P.4
  • 11
    • 0042972838 scopus 로고    scopus 로고
    • A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: Support vector machine classification of peptide MS/MS spectra and SEQUEST scores
    • Anderson DC, Li WQ, Payan DG, Noble WS, A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: Support vector machine classification of peptide MS/MS spectra and SEQUEST scores, J Proteome Res 2:137-146, 2003.
    • (2003) J Proteome Res , vol.2 , pp. 137-146
    • Anderson, D.C.1    Li, W.Q.2    Payan, D.G.3    Noble, W.S.4
  • 12
    • 33645467062 scopus 로고    scopus 로고
    • Improved classification of mass spectrometry database search results using newer machine learning approaches
    • Ulintz PJ, Zhu J, Qin ZHS, Andrews PC, Improved classification of mass spectrometry database search results using newer machine learning approaches, Mol Cell Proteomics 5:497-509, 2006.
    • (2006) Mol Cell Proteomics , vol.5 , pp. 497-509
    • Ulintz, P.J.1    Zhu, J.2    Qin, Z.H.S.3    Andrews, P.C.4
  • 13
    • 0742305695 scopus 로고    scopus 로고
    • Intensity-based protein identification by machine learning from a library of tandem mass spectra
    • Elias JE, Gibbons FD, King OD, Roth FP, Gygi SP, Intensity-based protein identification by machine learning from a library of tandem mass spectra, Nat Biotechnol 22:214-219, 2004.
    • (2004) Nat Biotechnol , vol.22 , pp. 214-219
    • Elias, J.E.1    Gibbons, F.D.2    King, O.D.3    Roth, F.P.4    Gygi, S.P.5
  • 14
    • 1842423492 scopus 로고    scopus 로고
    • A computational method for assessing peptide-identification reliability in tandem mass spectrometry analysis with SEQUEST
    • Razumovskaya J, Olman V, Xu D, Uberbacher EC, Verbermoes N, Xu Y, A computational method for assessing peptide-identification reliability in tandem mass spectrometry analysis with SEQUEST, Proteomics 4 961-969, 2004.
    • (2004) Proteomics , vol.4 , pp. 961-969
    • Razumovskaya, J.1    Olman, V.2    Xu, D.3    Uberbacher, E.C.4    Verbermoes, N.5    Xu, Y.6
  • 15
    • 38049057893 scopus 로고    scopus 로고
    • Fang JW, Grzymala-Busse JW, Mining mass spectrometry database search results - A rough set approach, in Kryszkiewicz M, Peters JF, Rybinski H, Skowron A (eds.), Rough Sets and Intelligent Systems Paradigms: International Conference, RSEISP 2007, Warsaw, Poland, June 28-30, 2007, Proceedings, Lecture Notes in Computer Science, 4585, Springer, Berlin, pp. 340-349, 2007.
    • Fang JW, Grzymala-Busse JW, Mining mass spectrometry database search results - A rough set approach, in Kryszkiewicz M, Peters JF, Rybinski H, Skowron A (eds.), Rough Sets and Intelligent Systems Paradigms: International Conference, RSEISP 2007, Warsaw, Poland, June 28-30, 2007, Proceedings, Lecture Notes in Computer Science, Vol. 4585, Springer, Berlin, pp. 340-349, 2007.
  • 16
    • 12744280774 scopus 로고    scopus 로고
    • Software for automatically validating the quality of MS/MS spectrum from SEQUEST results
    • AMASS
    • Sun W, Li FX, Wang J, Zheng DX, Gao YH, AMASS: Software for automatically validating the quality of MS/MS spectrum from SEQUEST results, Mol Cell Proteomics 3:1194-1199, 2004.
    • (2004) Mol Cell Proteomics , vol.3 , pp. 1194-1199
    • Sun, W.1    Li, F.X.2    Wang, J.3    Zheng, D.X.4    Gao, Y.H.5
  • 17
    • 21244443812 scopus 로고    scopus 로고
    • The use of proteotypic peptide libraries for protein identification
    • Craig R, Cortens JP, Beavis RC, The use of proteotypic peptide libraries for protein identification, Rapid Commun Mass Spectrom 19 1844-1850, 2005.
    • (2005) Rapid Commun Mass Spectrom , vol.19 , pp. 1844-1850
    • Craig, R.1    Cortens, J.P.2    Beavis, R.C.3
  • 18
    • 33747626954 scopus 로고    scopus 로고
    • Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries
    • Frewen BE, Merrihew GE, Su CC, Noble WS, MacCoss MJ, Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries, Anal Chem 78:5678-5684, 2006.
    • (2006) Anal Chem , vol.78 , pp. 5678-5684
    • Frewen, B.E.1    Merrihew, G.E.2    Su, C.C.3    Noble, W.S.4    MacCoss, M.J.5
  • 19
    • 0242653718 scopus 로고    scopus 로고
    • Mining a tandem mass spectrometry database to determine the trends and global factors influencing peptide fragmentation
    • Kapp EA, Schutz F, Reid GE, Eddes JS, Moritz RL, O'Hair RAJ, Speed TP, Simpson RJ, Mining a tandem mass spectrometry database to determine the trends and global factors influencing peptide fragmentation, Anal Chem 75:6251-6264, 2003.
    • (2003) Anal Chem , vol.75 , pp. 6251-6264
    • Kapp, E.A.1    Schutz, F.2    Reid, G.E.3    Eddes, J.S.4    Moritz, R.L.5    O'Hair, R.A.J.6    Speed, T.P.7    Simpson, R.J.8
  • 20
    • 0029810698 scopus 로고    scopus 로고
    • Influence of peptide composition, gas-phase basicity, and chemical modification on fragmentation efficiency: Evidence for the mobile proton model
    • Dongre AR, Jones JL, Somogyi A, Wysocki VH, Influence of peptide composition, gas-phase basicity, and chemical modification on fragmentation efficiency: Evidence for the mobile proton model, J Am Chem Soc 118:8365-8374, 1996.
    • (1996) J Am Chem Soc , vol.118 , pp. 8365-8374
    • Dongre, A.R.1    Jones, J.L.2    Somogyi, A.3    Wysocki, V.H.4
  • 21
    • 1442324456 scopus 로고    scopus 로고
    • Influence of basic residue content on fragment ion peak intensities in low-energy collision-induced dissociation spectra of peptides
    • Tabb DL, Huang YY, Wysocki VH, Yates JR, Influence of basic residue content on fragment ion peak intensities in low-energy collision-induced dissociation spectra of peptides, Anal Chem 76:1243-1248, 2004.
    • (2004) Anal Chem , vol.76 , pp. 1243-1248
    • Tabb, D.L.1    Huang, Y.Y.2    Wysocki, V.H.3    Yates, J.R.4
  • 22
    • 0037353846 scopus 로고    scopus 로고
    • Statistical characterization of ion trap tandem mass spectra from doubly charged tryptic peptides
    • Tabb DL, Smith LL, Breci LA, Wysocki VH, Lin D, Yates JR III, Statistical characterization of ion trap tandem mass spectra from doubly charged tryptic peptides, Anal Chem 75:1155-1163, 2003.
    • (2003) Anal Chem , vol.75 , pp. 1155-1163
    • Tabb, D.L.1    Smith, L.L.2    Breci, L.A.3    Wysocki, V.H.4    Lin, D.5    Yates III, J.R.6
  • 23
    • 33846133955 scopus 로고    scopus 로고
    • Computational prediction of proteotypic peptides for quantitiative proteomics
    • Mallick P et al., Computational prediction of proteotypic peptides for quantitiative proteomics, Nat Biotechnol 25:125-131, 2007.
    • (2007) Nat Biotechnol , vol.25 , pp. 125-131
    • Mallick, P.1
  • 24
    • 33745561205 scopus 로고    scopus 로고
    • An introduction to variable and feature selection
    • Guyon I, Elisseeff A, An introduction to variable and feature selection, J Mach Learn Res 3:1157-1182, 2003.
    • (2003) J Mach Learn Res , vol.3 , pp. 1157-1182
    • Guyon, I.1    Elisseeff, A.2
  • 25
    • 0036161259 scopus 로고    scopus 로고
    • Gene selection for cancer classification using support vector machines
    • Guyon I, Weston J, Barnhill S, Vapnik V, Gene selection for cancer classification using support vector machines, Mach Learn 46 389-422, 2002.
    • (2002) Mach Learn , vol.46 , pp. 389-422
    • Guyon, I.1    Weston, J.2    Barnhill, S.3    Vapnik, V.4
  • 26
    • 30644464444 scopus 로고    scopus 로고
    • Gene selection and classification of microarray data using random forest
    • Diaz-Uriate R, de Andres SA, Gene selection and classification of microarray data using random forest, BMC Bioinformatics 7:3, 2006.
    • (2006) BMC Bioinformatics , vol.7 , pp. 3
    • Diaz-Uriate, R.1    de Andres, S.A.2
  • 27
    • 0018800255 scopus 로고
    • Local interactions as a structure determinant for protein molecules
    • Krigbaum WR, Komoriya A, Local interactions as a structure determinant for protein molecules, Biochim Biophys Acta 576:204-228, 1979.
    • (1979) Biochim Biophys Acta , vol.576 , pp. 204-228
    • Krigbaum, W.R.1    Komoriya, A.2
  • 29
    • 0021691817 scopus 로고
    • Analysis of membrane and surface protein sequences with the hydrophobic moment plot
    • Eisenberg D, Schwarz E, Komaromy M, Wall R, Analysis of membrane and surface protein sequences with the hydrophobic moment plot, J Mol Biol 179:125-142, 1984.
    • (1984) J Mol Biol , vol.179 , pp. 125-142
    • Eisenberg, D.1    Schwarz, E.2    Komaromy, M.3    Wall, R.4
  • 30
    • 0001040367 scopus 로고
    • An algorithm for protein secondary structure prediction based on class prediction
    • Deleage G, Roux B, An algorithm for protein secondary structure prediction based on class prediction, Protein Eng 1:289-294, 1987.
    • (1987) Protein Eng , vol.1 , pp. 289-294
    • Deleage, G.1    Roux, B.2
  • 31
  • 32
    • 33845280446 scopus 로고
    • Comparing the polarities of the amino acids: Side-chain distribution coefficients between the vapor phase, cyclohexane, 1-octanol, and neutral aqueous solution
    • Radzicka A, Wolfenden R, Comparing the polarities of the amino acids: Side-chain distribution coefficients between the vapor phase, cyclohexane, 1-octanol, and neutral aqueous solution, Biochemistry 27:1664-1670, 1988.
    • (1988) Biochemistry , vol.27 , pp. 1664-1670
    • Radzicka, A.1    Wolfenden, R.2
  • 34
    • 0035478854 scopus 로고    scopus 로고
    • Random forests
    • Breiman L, Random forests, Mach Learn 45:5-32, 2001.
    • (2001) Mach Learn , vol.45 , pp. 5-32
    • Breiman, L.1
  • 36
    • 0037498037 scopus 로고    scopus 로고
    • Prediction of aqueous solubility and partition coefficient optimized by a genetic algorithm based descriptor selection method
    • Wegner J, Zell A, Prediction of aqueous solubility and partition coefficient optimized by a genetic algorithm based descriptor selection method, J Chem Inf Comput Sci 43:1077-1084, 2003.
    • (2003) J Chem Inf Comput Sci , vol.43 , pp. 1077-1084
    • Wegner, J.1    Zell, A.2
  • 38
    • 0037208311 scopus 로고    scopus 로고
    • Recursive median partitioning for virtual screening of large databases
    • Godden JW, Furr JR, Bajorath J, Recursive median partitioning for virtual screening of large databases, J Chem Inf Comput Sci 43 182-188, 2003.
    • (2003) J Chem Inf Comput Sci , vol.43 , pp. 182-188
    • Godden, J.W.1    Furr, J.R.2    Bajorath, J.3
  • 39
    • 0042525842 scopus 로고    scopus 로고
    • Neural-network construction and selection in nonlinear modeling
    • Rivals I, Personnaz L, Neural-network construction and selection in nonlinear modeling, IEEE Trans Neural Netw 14:804-819, 2003.
    • (2003) IEEE Trans Neural Netw , vol.14 , pp. 804-819
    • Rivals, I.1    Personnaz, L.2
  • 40
  • 42
    • 0034501099 scopus 로고    scopus 로고
    • Special feature: Commentary - Mobile and localized protons: A framework for understanding peptide dissociation
    • Wysocki VH, Tsaprailis G, Smith LL, Breci LA, Special feature: Commentary - Mobile and localized protons: A framework for understanding peptide dissociation, J Mass Spectrom 35:1399-1406, 2000.
    • (2000) J Mass Spectrom , vol.35 , pp. 1399-1406
    • Wysocki, V.H.1    Tsaprailis, G.2    Smith, L.L.3    Breci, L.A.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.