Volume 22, Issue 1, 2008, Pages 106-111

Performance of KNN and SVM classifiers on full word Arabic articles

Author keywords

Arabic text categorization; CHI statistics; Full word features; KNN; SVM; tf.idf weighting

Indexed keywords

COMPUTATIONAL METHODS; FEATURE EXTRACTION; LEARNING ALGORITHMS; STATISTICS; SUPPORT VECTOR MACHINES

EID: 39449129104     PISSN: 14740346     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.aei.2007.12.001     Document Type: Article
Times cited: 121

References (24)
  • 1
    • 84957069814
    • T. Joachims, Text categorization with support vector machines: learning with many relevant features, in: Proceedings of 10th European Conference on Machine Learning, (ECML), Chemnitz, Germany, pp. 137-142, 1998.
  • 2
    • 39449110422
    • B. Dasarathy, Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques, McGraw-Hill Computer Science Series, IEEE Computer Society Press, Los Alamitos, California, 1991.
  • 3
    • 0006506807
    • T. Joachims, Making large-scale SVM learning practical, in: B. Schölkopf, C. Burges, A. Smola (Eds.), Advances in Kernel Methods: Support Vector Learning, MIT Press, Massachusetts, USA, 1999.
  • 4
    • 52149100731
    • R. Duwairi, An eager k-nearest-neighbor classifier for Arabic text categorization, in: Proceedings of the International Conference on Data Mining (DMIN), Nevada, USA, pp. 187-192, 2005.
  • 5
    • 39449133978
    • L. Khreisat, Arabic text classification using N-gram frequency statistics: a comparative study, in: Proceedings of the International Conference on Data Mining (DMIN), Nevada, USA, pp. 78-82, 2006.
  • 7
    • 39449138137
    • He Ji, Ah-Hwee Tan, Chew-Lim Tan, A comparative study on Chinese text categorization methods, in: Proceedings of the International Workshop on Text and Web Mining (PRICAI), Melbourne, Australia, pp. 24-35, 2000.
  • 8
    • 39449111262
    • N. Fuhr, S. Hartmann, G. Lustig, M. Schwantner, K. Tzeras, AIR/X: a rule-based multistage indexing system for large subject fields, in: Proceedings of Conference on Intelligent Text and Image Handling (RIAO), Barcelona, Spain, pp. 606-623, 1991.
  • 9
    • 0028461554
    • Y. Yang, C. Chute, An example-based mapping method for text categorization and retrieval, ACM Transactions on Information Systems (TOIS) 12 (3) (1994) 252-277.
  • 10
    • 0026986166
    • B. Masand, G. Linoff, D. Waltz, Classifying news stories using memory based reasoning, in: 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), Copenhagen, Denmark, pp. 59-64, 1992.
  • 11
    • 84984720462
    • Y. Yang, Expert network: effective and efficient learning from human decisions in text categorization and retrieval, in: 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), Dublin, Ireland, pp. 13-22, 1994.
  • 12
    • 39449133351
    • Y. Yang, J.P. Pedersen, Feature selection in statistical learning of text categorization, in: Proceedings of the 14th International Conference on Machine Learning (ICML), Tennessee, USA, pp. 412-420, 1997.
  • 13
    • 27144441097
    • Y. Yang, An evaluation of statistical approaches to text categorization, Journal of Information Retrieval 1 (1/2) (1999) 69-90.
  • 14
    • 0032282385
    • W. Lam, C.Y. Ho, Using a generalized instance set for automatic text categorization, in: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), Melbourne, Australia, pp. 81-89, 1998.
  • 15
    • 39449103615
    • I. Moulinier, Is learning bias an issue on the text categorization problem?, Technical report, LAFORIA-LIP6, Université Paris VI, 1997.
  • 16
    • 39449084118
    • D.D. Lewis, M. Ringuette, Comparison of two learning algorithms for text categorization, in: Proceedings of the 3rd Annual Symposium on Document Analysis and Information Retrieval (SDAIR), Nevada, USA, pp. 81-93, 1994.
  • 17
    • 39449084686
    • C. Apte, F. Damerau, S. Weiss, Text mining with decision rules and decision trees, in: Proceedings of the Conference on Automated Learning and Discovery (CONALD), Workshop 6: Learning from Text and the Web, Pittsburgh, USA, June 1998.
  • 18
    • 39449129885
    • Y. Yang, J. Pedersen, A comparative study on feature selection in text categorization, in: Proceedings of the 14th International Conference on Machine Learning, (ICML), Tennessee, USA, pp. 412-420, 1997.
  • 19
    • 39449121646
    • T. Joachims, A probabilistic analysis of the Rocchio Algorithm with TFIDF for text categorization, in: Proceedings of the 14th International Conference on Machine Learning, (ICML), Tennessee, USA, pp. 143-151, 1997.
  • 20
    • 39449117995
    • A. McCallum, Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering, http://www.cs.cmu.edu/mccallum/bow, 1996.
  • 21
    • 39449105707
    • A. Yahya, On the complexity of the initial stages of Arabic text processing, First Great Lakes Computer Science Conference, Kalamazoo, Michigan, USA, October 1989.
  • 22
    • 39449119492
    • Gist: Support vector machine and kernel principal components analysis software toolkit, Version 2.0.9. Authors: William Stafford Noble and Paul Pavlidis. Copyright (C) 1999-2002, Columbia University.
  • 24
    • 39449113920
    • http://research.microsoft.com/~jplatt/svm.html. Visited on May 12, 2007.


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.