Volume 3, Issue 1, 2009, Pages 72-77

A machine learning approach for Arabic text classification using N-gram frequency statistics

Author keywords

Arabic; Categorization; Classification; Data mining; Machine learning; N gram


EID: 58149462758     PISSN: 17511577     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.joi.2008.11.005     Document Type: Article
Times cited: 69

References (27)
  • 1
    • 84878415655
    • Ad Dustour. (2008). www.addustour.com.
  • 2
    • 84878393391
    • Al Arab, A. Y. (2008). http://www.alarabalyawm.net/.
  • 3
    • 84878410007
    • Al Ghad. (2008). http://www.alghad.jo/.
  • 4
    • 0013182812
    • Al-Fedaghi, S., & Al-Anzi, F. (1989). A new algorithm to generate Arabic root-pattern forms. Proceedings of the 11th National Computer Conference, King Fahd University of Petroleum & Minerals, Dhahran, Saudi Arabia, pp. 04-07.
  • 5
    • 84878386478
    • Al Ra'I. (2008). Daily. www.alrai.com.
  • 8
    • 0013181904
    • Beesley, K. (1996). Arabic finite-state morphological analysis and generation. Proceedings of the COLING, vol. 1, pp. 89-94.
  • 11
    • 0028911698
    • Damashek, M. (1995). Gauging similarity with N-grams: Language-independent categorization of text. Science, 267 (10 February), pp. 843-848.
  • 15
    • 0038217041
    • Egghe, L. (2000). The distribution of N-grams. Scientometrics, 47(2) (February), pp. 237-252 (16).
  • 16
    • 84878378140
    • Egyptian Demographic Center. (2000). http://www.frcu.eun.eg/www/homepage/cdc/cdc.htm.
  • 18
    • 84878382133
    • El-Kourdi, M., Bensaid, A., & Rachidi, T. (2004). Automatic Arabic document categorization based on the Naïve-Bayes Algorithm. Workshop on computational approaches to Arabic script-based languages, COLING-2004, University of Geneva, Geneva, Switzerland, August.
  • 19
    • 84878421489
    • Khoja, S. (2007). Personal communication.
  • 20
    • 58149461920
    • Khreisat, L. (2006). Arabic text classification using N-gram frequency statistics: A comparative study. Proceedings of the DMIN'06, Las Vegas, USA.
  • 21
    • 0039955285
    • Kiraz, G. (1994). Multi-tape two-level morphology: A case study in Semitic non-linear morphology. Proceedings of COLING, vol. 1, pp. 180-186.
  • 23
    • 84878396781
    • Savoy, J., & Rasolofo, Y. (2002). Report on the TREC-11 experiment: Arabic, named page and topic distillation searches, TREC-11.
  • 24
    • 84878402974
    • Sawaf, H., Zaplo, J., & Ney, H. (2001). Statistical classification methods for Arabic news articles. Arabic natural language processing in ACL2001, Toulouse, France, July.
  • 25
    • 84878401268
    • Smrž, O. (2007). Functional Arabic morphology: Formal system and implementation. Doctoral Thesis, Prague.
  • 26
    • 0036989615
    • Xu, J., Fraser, A., & Weischedel, R. (2002). Empirical studies in strategies for Arabic retrieval. SIGIR'02, Tampere, Finland.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.