메뉴 건너뛰기




Volumn 31, Issue 11, 2010, Pages 1310-1323

Analytical evaluation of term weighting schemes for text categorization

Author keywords

Contour lines; Relative weights; Term occurrence probability; Term weighting; Text categorization

Indexed keywords

CONTOUR LINE; RELATIVE WEIGHTS; TERM OCCURRENCES; TERM WEIGHTING; TEXT CATEGORIZATION;

EID: 77953131475     PISSN: 01678655     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patrec.2010.03.012     Document Type: Article
Times cited : (56)

References (28)
  • 1
    • 77953130813 scopus 로고
    • Implementation of the smart information retrieval system. Technical Report, Cornell University, Ithaca, USA
    • Buckley, C., 1985. Implementation of the smart information retrieval system. Technical Report, Cornell University, Ithaca, USA.
    • (1985)
    • Buckley, C.1
  • 2
    • 58349094507 scopus 로고    scopus 로고
    • Feature selection for text classification with naive Bayes
    • Chen J., Huang H., Tian S., and Qu Y. Feature selection for text classification with naive Bayes. Expert Syst. Appl. 36 (2009) 5432-5435
    • (2009) Expert Syst. Appl. , vol.36 , pp. 5432-5435
    • Chen, J.1    Huang, H.2    Tian, S.3    Qu, Y.4
  • 4
    • 0037998887 scopus 로고    scopus 로고
    • Supervised term weighting for automated text categorization
    • New York, NY, USA. ACM, pp
    • Debole, F., Sebastiani, F., 2003. Supervised term weighting for automated text categorization. In: SAC '03: Proc. 2003 ACM Symp. on Applied Computing, New York, NY, USA. ACM, pp. 784-788.
    • (2003) SAC '03: Proc. 2003 ACM Symp. on Applied Computing , pp. 784-788
    • Debole, F.1    Sebastiani, F.2
  • 5
    • 17644390231 scopus 로고    scopus 로고
    • An analysis of the relative hardness of Reuters-21578 subsets
    • Debole F., and Sebastiani F. An analysis of the relative hardness of Reuters-21578 subsets. J. Amer. Soc. Inform. Sci. Technol. 56 6 (2004) 584-596
    • (2004) J. Amer. Soc. Inform. Sci. Technol. , vol.56 , Issue.6 , pp. 584-596
    • Debole, F.1    Sebastiani, F.2
  • 7
    • 77950498503 scopus 로고    scopus 로고
    • Erenel, Z., Alti{dotless}nçay, H., Varoǧlu, E., 2009. A symmetric term weighting scheme for text categorization based on term occurrence probabilities. In: Proc. Fifth Internat. Conf. on Soft Computing, Computing with Words and Perceptions in System Analysis, Decision and Control (ICSCCW), Famagusta. Northern Cyprus.
    • Erenel, Z., Alti{dotless}nçay, H., Varoǧlu, E., 2009. A symmetric term weighting scheme for text categorization based on term occurrence probabilities. In: Proc. Fifth Internat. Conf. on Soft Computing, Computing with Words and Perceptions in System Analysis, Decision and Control (ICSCCW), Famagusta. Northern Cyprus.
  • 8
    • 2942731012 scopus 로고    scopus 로고
    • An extensive empirical study of feature selection metrics for text classification
    • Forman G. An extensive empirical study of feature selection metrics for text classification. J. Mach. Learn. Res. 3 (2003) 1289-1305
    • (2003) J. Mach. Learn. Res. , vol.3 , pp. 1289-1305
    • Forman, G.1
  • 9
    • 84880904111 scopus 로고    scopus 로고
    • Feature selection for text classification
    • Chapman and Hall/CRC Press
    • Forman G. Feature selection for text classification. Computational Methods of Feature Selection (2007), Chapman and Hall/CRC Press
    • (2007) Computational Methods of Feature Selection
    • Forman, G.1
  • 11
    • 0002409860 scopus 로고    scopus 로고
    • A probabilistic analysis of the Rocchio algorithm with tfidf for text categorization
    • Joachims, T., 1997. A probabilistic analysis of the Rocchio algorithm with tfidf for text categorization. In: Proc. 14th Internat. Conf. on Machine Learning, pp. 143-151.
    • (1997) Proc. 14th Internat. Conf. on Machine Learning , pp. 143-151
    • Joachims, T.1
  • 12
    • 84957069814 scopus 로고    scopus 로고
    • Text categorization with support vector machines: Learning with many relevant features
    • Joachims, T., 1998. Text categorization with support vector machines: learning with many relevant features. In: Proc. 10th European Conf. Machine Learning, pp. 137-142.
    • (1998) Proc. 10th European Conf. Machine Learning , pp. 137-142
    • Joachims, T.1
  • 13
    • 0002714543 scopus 로고    scopus 로고
    • Making large-scale SVM learning practical
    • Schölkoph B., Burges C.J.C., and Smola A.J. (Eds), MIT Press, Cambridge, MA
    • Joachims T. Making large-scale SVM learning practical. In: Schölkoph B., Burges C.J.C., and Smola A.J. (Eds). Advances in Kernel Methods-Support Vector Learning (1999), MIT Press, Cambridge, MA 169-184
    • (1999) Advances in Kernel Methods-Support Vector Learning , pp. 169-184
    • Joachims, T.1
  • 14
    • 62249199807 scopus 로고    scopus 로고
    • Supervised and traditional term weighting methods for automatic text categorization
    • Lan M., Tan C.L., Su J., and Lu Y. Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans. Pattern Anal. Machine Intell. 31 4 (2009) 721-735
    • (2009) IEEE Trans. Pattern Anal. Machine Intell. , vol.31 , Issue.4 , pp. 721-735
    • Lan, M.1    Tan, C.L.2    Su, J.3    Lu, Y.4
  • 15
    • 0036161242 scopus 로고    scopus 로고
    • Text categorization with support vector machines.how to represent texts in input space?
    • Leopold E., and Kindermann J. Text categorization with support vector machines.how to represent texts in input space?. Machine Learning 46 1-3 (2002) 423-444
    • (2002) Machine Learning , vol.46 , Issue.1-3 , pp. 423-444
    • Leopold, E.1    Kindermann, J.2
  • 16
    • 17044405923 scopus 로고    scopus 로고
    • Toward integrating feature selection algorithms for classification and clustering
    • Liu H., and Yu L. Toward integrating feature selection algorithms for classification and clustering. IEEE Trans. Knowledge Data Eng. 17 4 (2005) 491-502
    • (2005) IEEE Trans. Knowledge Data Eng. , vol.17 , Issue.4 , pp. 491-502
    • Liu, H.1    Yu, L.2
  • 17
    • 53849085839 scopus 로고    scopus 로고
    • Imbalanced text classification: A term weighting approach
    • Liu Y., Loh H.T., and Sun A. Imbalanced text classification: A term weighting approach. Expert Systems with Applications 36 (2009) 690-701
    • (2009) Expert Systems with Applications , vol.36 , pp. 690-701
    • Liu, Y.1    Loh, H.T.2    Sun, A.3
  • 18
    • 0037375142 scopus 로고    scopus 로고
    • Feature selection on hierarchy of web documents
    • Mladenic D., and Grobelnik M. Feature selection on hierarchy of web documents. Decision Support Syst. 35 1 (2003) 45-87
    • (2003) Decision Support Syst. , vol.35 , Issue.1 , pp. 45-87
    • Mladenic, D.1    Grobelnik, M.2
  • 20
    • 84948481845 scopus 로고
    • An algorithm for suffix stripping
    • Porter M.F. An algorithm for suffix stripping. Program 14 3 (1980) 130-137
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.F.1
  • 21
    • 0037818380 scopus 로고    scopus 로고
    • High-performing feature selection for text classification
    • Information and Knowledge Management, pp
    • Rogati, M., Yang, Y., 2002. High-performing feature selection for text classification. In: Proc. Eleventh Internat. Conf. Information and Knowledge Management, pp. 659-661.
    • (2002) Proc. Eleventh Internat. Conf , pp. 659-661
    • Rogati, M.1    Yang, Y.2
  • 22
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • Sebastiani F. Machine learning in automated text categorization. ACM Comput. Surveys 34 1 (2002) 1-47
    • (2002) ACM Comput. Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 23
    • 0242692524 scopus 로고    scopus 로고
    • Web page feature selection and classification using neural networks
    • Selamat A., and Omatu S. Web page feature selection and classification using neural networks. Inform. Sci. 158 (2004) 69-88
    • (2004) Inform. Sci. , vol.158 , pp. 69-88
    • Selamat, A.1    Omatu, S.2
  • 24
    • 17844387127 scopus 로고    scopus 로고
    • Neighbor-weighted k-nearest neighbor for unbalanced text corpus
    • Tan S. Neighbor-weighted k-nearest neighbor for unbalanced text corpus. Expert Syst. Appl. 28 (2005) 667-671
    • (2005) Expert Syst. Appl. , vol.28 , pp. 667-671
    • Tan, S.1
  • 25
    • 84871981420 scopus 로고    scopus 로고
    • Exploiting likely-positive and unlabeled data to improve the identification of protein-protein interaction articles
    • Tsai R.T., Hung H., Dai H., Lin Y., and Hsu W. Exploiting likely-positive and unlabeled data to improve the identification of protein-protein interaction articles. BMC Bioinform. 9 (2008)
    • (2008) BMC Bioinform. , vol.9
    • Tsai, R.T.1    Hung, H.2    Dai, H.3    Lin, Y.4    Hsu, W.5
  • 26
    • 0003141935 scopus 로고    scopus 로고
    • A comparative study on feature selection in text categorization
    • Morgan Kaufmann Publishers, San Francisco, US, pp
    • Yang, Y., Pedersen, J.O., 1997. A comparative study on feature selection in text categorization. In: Proc. ICML'97, 14th Internat. Conf. on Machine Learning. Morgan Kaufmann Publishers, San Francisco, US, pp. 412-420.
    • (1997) Proc. ICML'97, 14th Internat. Conf. on Machine Learning , pp. 412-420
    • Yang, Y.1    Pedersen, J.O.2
  • 28
    • 16644402628 scopus 로고    scopus 로고
    • Feature selection for text categorization on imbalanced data
    • Zheng Z., Wu X., and Srihari R. Feature selection for text categorization on imbalanced data. SIGKDD Explor. Newsl. 6 1 (2004) 80-89
    • (2004) SIGKDD Explor. Newsl. , vol.6 , Issue.1 , pp. 80-89
    • Zheng, Z.1    Wu, X.2    Srihari, R.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.