Volume 4316 LNBI, 2006, Pages 65-77

Automatic annotation of protein functional class from sparse and imbalanced data sets

Author keywords

Feature selection; Gene annotation; Gene ontology; Imbalanced data; Interpro

Indexed keywords

DATA STRUCTURES; FEATURE EXTRACTION; GENES; ONTOLOGY

EID: 34547466817     PISSN: 03029743     EISSN: 16113349     Source Type: Conference Proceeding    
DOI: 10.1007/11960669_7     Document Type: Conference Paper
Times cited : (8)

References (27)
  • 1
    • 27344448597
    • Feature Selection and the Class Imbalance Problem in Predicting Protein Function from Sequence
    • Al-Shahib, A., Breitling, R. and Gilbert, D.: Feature Selection and the Class Imbalance Problem in Predicting Protein Function from Sequence. Applied Bioinformatics Vol. 4 (2005) 195-203
    • (2005) Applied Bioinformatics, vol. 4, pp. 195-203
    • Al-Shahib, A.1    Breitling, R.2    Gilbert, D.3
  • 3
    • 34547440278
    • C4.5, Class Imbalance, and Cost Sensitivity: Why Under-sampling beats Oversampling
    • Drummond, C. and Holte, R.C.: C4.5, Class Imbalance, and Cost Sensitivity: Why Under-sampling beats Oversampling. ICML'2003 Workshop on Learning from Imbalanced Datasets II (2003)
  • 4
  • 5
    • 0041620215
    • Automated Gene Ontology annotation for anonymous sequence data
    • Hennig, S., Groth, D. and Lehrach, H.: Automated Gene Ontology annotation for anonymous sequence data. Nucleic Acids Research (2003) 3712-3715
    • (2003) Nucleic Acids Research, pp. 3712-3715
    • Hennig, S.1    Groth, D.2    Lehrach, H.3
  • 6
    • 44649092482
    • Comparing Naive Bayes, Decision Trees, and SVM using Accuracy and AUC
    • Huang, J., Lu, J. and Ling, C.X.: Comparing Naive Bayes, Decision Trees, and SVM using Accuracy and AUC. Proc. of the Third IEEE Inter. Conf. on Data Mining (ICDM) (2003) 553-556
    • (2003) Proc. of the Third IEEE Inter. Conf. on Data Mining (ICDM), pp. 553-556
    • Huang, J.1    Lu, J.2    Ling, C.X.3
  • 7
    • 33845536164
    • The class imbalance problem: A systematic study
    • Japkowicz, N. and Stephen, S.: The class imbalance problem: A systematic study. Intelligent Data Analysis Vol. 6 (2002)
    • (2002) Intelligent Data Analysis, vol. 6
    • Japkowicz, N.1    Stephen, S.2
  • 9
    • 84888281174
    • Genome scale prediction of protein functional class from sequence using data mining
    • King, R.D., Karwath, A., Clare, A. and Dehaspe, L.: Genome scale prediction of protein functional class from sequence using data mining. Proc. of the Sixth ACM SIGKDD Inter. Conf. on Knowledge Discovery and Data Mining (2003)
    • (2003) Proc. of the Sixth ACM SIGKDD Inter. Conf. on Knowledge Discovery and Data Mining
    • King, R.D.1    Karwath, A.2    Clare, A.3    Dehaspe, L.4
  • 10
    • 0001972236
    • Addressing the Curse of Imbalanced Training Sets: One-sided Selection
    • Kubat, M. and Matwin, S.: Addressing the Curse of Imbalanced Training Sets: One-sided Selection. Proc. of the Fourteenth Inter. Conf. on Machine Learning (ICML) (1997) 179-186
    • (1997) Proc. of the Fourteenth Inter. Conf. on Machine Learning (ICML), pp. 179-186
    • Kubat, M.1    Matwin, S.2
  • 11
    • 85161651554
    • Data mining for direct marketing: Problems and solutions
    • Ling, C. and Li, C.: Data mining for direct marketing: Problems and solutions. Proc. of the Fourth Inter. Conf. on Knowledge Discovery and Data Mining (KDD) (1998) 73-79
    • (1998) Proc. of the Fourth Inter. Conf. on Knowledge Discovery and Data Mining (KDD), pp. 73-79
    • Ling, C.1    Li, C.2
  • 12
    • 13244268370
    • GOtcha: A new method for prediction of protein function assessed by the annotation of seven genomes
    • Martin, D.M., Berriman, M. and Barton, G.J.: GOtcha: A new method for prediction of protein function assessed by the annotation of seven genomes. BMC Bioinformatics Vol. 5 (2004)
    • (2004) BMC Bioinformatics, vol. 5
    • Martin, D.M.1    Berriman, M.2    Barton, G.J.3
  • 16
    • 20844458491
    • Mining with rarity: A unifying framework
    • Weiss, G.M.: Mining with rarity: A unifying framework. ACM SIGKDD Explorations Newsletter Vol. 6 (2004) 7-19
    • (2004) ACM SIGKDD Explorations Newsletter, vol. 6, pp. 7-19
    • Weiss, G.M.1
  • 17
    • 0003141935
    • A comparative study on feature selection in text categorization
    • Yang, Y. and Pedersen, J.O.: A comparative study on feature selection in text categorization. Proc. of the Fourteenth Inter. Conf. on Machine Learning (ICML) (1997) 412-420
    • (1997) Proc. of the Fourteenth Inter. Conf. on Machine Learning (ICML), pp. 412-420
    • Yang, Y.1    Pedersen, J.O.2
  • 18
    • 1942451938
    • Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution
    • Yu, L. and Liu, H.: Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution. Proc. of the Twentieth Inter. Conf. on Machine Learning (ICML) (2003)
    • (2003) Proc. of the Twentieth Inter. Conf. on Machine Learning (ICML)
    • Yu, L.1    Liu, H.2
  • 19
    • 0043122923
    • OntoBlast function: From sequence similarities directly to potential functional annotations by ontology terms
    • Zehetner, G.: OntoBlast function: From sequence similarities directly to potential functional annotations by ontology terms. Nucleic Acids Research (2003) 3799-3803
    • (2003) Nucleic Acids Research, pp. 3799-3803
    • Zehetner, G.1
  • 21
    • 16644402628
    • Feature selection for text categorization on imbalanced data
    • Zheng, Z., Wu, X. and Srihari, R.: Feature selection for text categorization on imbalanced data. ACM SIGKDD Explorations Newsletter Vol. 6 (2004) 80-89
    • (2004) ACM SIGKDD Explorations Newsletter, vol. 6, pp. 80-89
    • Zheng, Z.1    Wu, X.2    Srihari, R.3
  • 22
    • 34547492802
    • Gene Ontology (GO) Consortium
    • Gene Ontology (GO) Consortium, http://www.geneontology.org/
  • 23
    • 34547408329
    • InterPro
    • InterPro, http://www.ebi.ac.uk/interpro/
  • 24
    • 34547404729
    • MATLAB
    • MATLAB, http://www.mathworks.com/
  • 25
    • 34547472097
    • Pattern Recognition Toolbox for MATLAB
    • Pattern Recognition Toolbox for MATLAB, http://cmp.felk.cvut.cz/~xfrancv/stprtool/
  • 26
    • 34547399509
    • UniProt
    • UniProt, www.uniprot.org/
  • 27
    • 34547414982
    • WEKA
    • WEKA, http://www.cs.waikato.ac.nz/~ml/


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.