메뉴 건너뛰기




Volumn 21, Issue 8, 2008, Pages 879-886

Text classification based on multi-word with support vector machine

Author keywords

Feature selection; Information gain; Multi word; Support vector machine; Text classification

Indexed keywords

DATA MINING; FEATURE EXTRACTION; GEARS; IMAGE RETRIEVAL; INFORMATION RETRIEVAL SYSTEMS; LEARNING SYSTEMS; LINGUISTICS; MINING; MULTILAYER NEURAL NETWORKS; SUPPORT VECTOR MACHINES; VECTORS;

EID: 54949149162     PISSN: 09507051     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.knosys.2008.03.044     Document Type: Article
Times cited : (241)

References (39)
  • 1
    • 84953588425 scopus 로고
    • On the specification of term values in automatic indexing
    • Salton G., and Yang C.S. On the specification of term values in automatic indexing. Journal of Documentation 29 4 (1973) 11-21
    • (1973) Journal of Documentation , vol.29 , Issue.4 , pp. 11-21
    • Salton, G.1    Yang, C.S.2
  • 2
    • 58049084561 scopus 로고    scopus 로고
    • A. Hotho et al., Ontologies Improve Text Document Clustering, in: Proceedings of the 3rd IEEE International Conference on Data Mining, 2003, pp. 541-544.
    • A. Hotho et al., Ontologies Improve Text Document Clustering, in: Proceedings of the 3rd IEEE International Conference on Data Mining, 2003, pp. 541-544.
  • 3
    • 54949121253 scopus 로고    scopus 로고
    • S. Scott, S. Matwin, Text classification using WordNet Hypernyms, in: Proceedings of the COLING/ACL Workshop on Usage of WordNet in Natural Language Processing Systems, pp. 45-52.
    • S. Scott, S. Matwin, Text classification using WordNet Hypernyms, in: Proceedings of the COLING/ACL Workshop on Usage of WordNet in Natural Language Processing Systems, pp. 45-52.
  • 4
    • 54949134742 scopus 로고    scopus 로고
    • M.B. Rodriguez et al., Using WordNet to complement training information in text categorization, in: Proceedings of 2nd International Conference on Recent Advances in Natural Language Processing II: Selected Papers from RANLP'97, vol. 189 of Current Issues in Linguistic Theory (CILT), 2000, pp. 353-364.
    • M.B. Rodriguez et al., Using WordNet to complement training information in text categorization, in: Proceedings of 2nd International Conference on Recent Advances in Natural Language Processing II: Selected Papers from RANLP'97, vol. 189 of Current Issues in Linguistic Theory (CILT), 2000, pp. 353-364.
  • 5
    • 0027001621 scopus 로고    scopus 로고
    • D.D. Lewis, An evaluation of phrasal and clustered representation on a text categorization task, in: SIGIR'92: Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1992, pp. 37-50.
    • D.D. Lewis, An evaluation of phrasal and clustered representation on a text categorization task, in: SIGIR'92: Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1992, pp. 37-50.
  • 6
    • 54949142733 scopus 로고    scopus 로고
    • R. Papka, J. Allan, Document classification using multiword features, in: Proceedings of the Seventh International Conference on Information and Knowledge Management Table of Contents, Bethesda, Maryland, United States, 1998, pp. 124-131.
    • R. Papka, J. Allan, Document classification using multiword features, in: Proceedings of the Seventh International Conference on Information and Knowledge Management Table of Contents, Bethesda, Maryland, United States, 1998, pp. 124-131.
  • 7
    • 36049012687 scopus 로고    scopus 로고
    • Text document clustering based on frequent word meaning sequences
    • Li Y.J., et al. Text document clustering based on frequent word meaning sequences. Data & Knowledge Engineering 64 (2008) 381-404
    • (2008) Data & Knowledge Engineering , vol.64 , pp. 381-404
    • Li, Y.J.1
  • 8
    • 54949100637 scopus 로고    scopus 로고
    • B.C.M. Fung et al., Hierarchical document clustering using frequent itemsets, in: Proceedings of SIAM International Conference on Data Mining, 2003, pp. 59-70.
    • B.C.M. Fung et al., Hierarchical document clustering using frequent itemsets, in: Proceedings of SIAM International Conference on Data Mining, 2003, pp. 59-70.
  • 10
    • 0036471411 scopus 로고    scopus 로고
    • Non-hierarchical document clustering based on a tolerance rough set model
    • Ho T.B., and Nguyen N.B. Non-hierarchical document clustering based on a tolerance rough set model. International Journal of Intelligent Systems 17 (2000) 199-212
    • (2000) International Journal of Intelligent Systems , vol.17 , pp. 199-212
    • Ho, T.B.1    Nguyen, N.B.2
  • 11
    • 54949145637 scopus 로고    scopus 로고
    • W.B. Cavnar, J.M. Trenkle, N-Gram based text categorization, in: Proceedings of 3rd Annual Symposium on Document Analysis and Information Retrieval, 1994, pp. 161-169.
    • W.B. Cavnar, J.M. Trenkle, N-Gram based text categorization, in: Proceedings of 3rd Annual Symposium on Document Analysis and Information Retrieval, 1994, pp. 161-169.
  • 12
    • 54949147131 scopus 로고
    • Blackwell, Oxford
    • Firth J.R. A synopsis of linguistic theory 1930-1955. Studies in linguistic analysis. Philological Society (1957), Blackwell, Oxford
    • (1957) Philological Society
    • Firth, J.R.1
  • 13
    • 54949132437 scopus 로고    scopus 로고
    • D. Hawking, P. Thistlewaite, Proximity operators - so near and yet so far, in: Proceedings of TREC-4, 1996, pp. 131-144.
    • D. Hawking, P. Thistlewaite, Proximity operators - so near and yet so far, in: Proceedings of TREC-4, 1996, pp. 131-144.
  • 14
    • 84974295346 scopus 로고
    • Technical terminology: some linguistic properties and an algorithm for identification in text
    • Justeson J.S., and Katz S.M. Technical terminology: some linguistic properties and an algorithm for identification in text. Natural Language Engineering 1 1 (1995) 9-27
    • (1995) Natural Language Engineering , vol.1 , Issue.1 , pp. 9-27
    • Justeson, J.S.1    Katz, S.M.2
  • 15
    • 54949124477 scopus 로고    scopus 로고
    • D. Bourigault, Surface grammatical analysis for the extraction of terminological noun phrases, in: Proceedings of the 14th International Conference on Computational Linguistics, Nantes, France, 1992, pp. 977-981.
    • D. Bourigault, Surface grammatical analysis for the extraction of terminological noun phrases, in: Proceedings of the 14th International Conference on Computational Linguistics, Nantes, France, 1992, pp. 977-981.
  • 16
    • 84974675567 scopus 로고
    • Retrieving collocations from text: Xtract
    • Smadja F. Retrieving collocations from text: Xtract. Computational Linguistics 19 1 (1993) 143-177
    • (1993) Computational Linguistics , vol.19 , Issue.1 , pp. 143-177
    • Smadja, F.1
  • 17
    • 54949086241 scopus 로고    scopus 로고
    • Y.J. Park, R.J. Byrd, K.B. Boguraev, Automatic glossary extraction: beyond terminology identification, in: Proceedings of the 19th International Conference on Computational linguistics, Taiwan, 2002, pp. 1-17.
    • Y.J. Park, R.J. Byrd, K.B. Boguraev, Automatic glossary extraction: beyond terminology identification, in: Proceedings of the 19th International Conference on Computational linguistics, Taiwan, 2002, pp. 1-17.
  • 18
    • 54949111209 scopus 로고    scopus 로고
    • B. Daille et al., Towards automatic extraction of monolingual and bilingual terminology, in: Proceedings of the International Conference on Computational Linguistics, Kyoto, Japan, 1994, pp. 93-98.
    • B. Daille et al., Towards automatic extraction of monolingual and bilingual terminology, in: Proceedings of the International Conference on Computational Linguistics, Kyoto, Japan, 1994, pp. 93-98.
  • 19
    • 54949131217 scopus 로고    scopus 로고
    • J. Zhang, J.F. Gao, M. Zhou, Extraction of Chinese compound words: an experiment study on a very large corpus, in: Proceedings of the Second Chinese Language Processing Workshop, HongKong, 2000, pp. 132-139.
    • J. Zhang, J.F. Gao, M. Zhou, Extraction of Chinese compound words: an experiment study on a very large corpus, in: Proceedings of the Second Chinese Language Processing Workshop, HongKong, 2000, pp. 132-139.
  • 21
    • 33846434437 scopus 로고    scopus 로고
    • An associative classification-based recommendation system for personalization in B2C e-commerce application
    • Zhang Y.Y., and Jiao J.X. An associative classification-based recommendation system for personalization in B2C e-commerce application. Expert Systems with Applications 33 1 (2007) 357-367
    • (2007) Expert Systems with Applications , vol.33 , Issue.1 , pp. 357-367
    • Zhang, Y.Y.1    Jiao, J.X.2
  • 22
    • 34548686982 scopus 로고    scopus 로고
    • Towards automated classification of intensive care nursing narratives
    • Hiissa M., et al. Towards automated classification of intensive care nursing narratives. International Journal of Medical Informatics 76 3 (2007) 362-368
    • (2007) International Journal of Medical Informatics , vol.76 , Issue.3 , pp. 362-368
    • Hiissa, M.1
  • 23
    • 85024373635 scopus 로고    scopus 로고
    • Y.M. Yang, X. Liu, A re-examination of text categorization methods, in: Proceedings on the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, CA, 1999 pp. 42-49.
    • Y.M. Yang, X. Liu, A re-examination of text categorization methods, in: Proceedings on the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, CA, 1999 pp. 42-49.
  • 24
    • 54949132015 scopus 로고    scopus 로고
    • Y.M. Yang, J.O. Pedersen, A comparative study on feature selection in text categorization, Proceedings of the Fourteenth International Conference on Machine Learning, 1997, pp. 412-420.
    • Y.M. Yang, J.O. Pedersen, A comparative study on feature selection in text categorization, Proceedings of the Fourteenth International Conference on Machine Learning, 1997, pp. 412-420.
  • 27
    • 54949128927 scopus 로고    scopus 로고
    • M.A. Aizerman et al., Theoretical Foundations of the potential function method in pattern recognition learning, Journal of Machine Learning Research (2000) 113-141. Available from: .
    • M.A. Aizerman et al., Theoretical Foundations of the potential function method in pattern recognition learning, Journal of Machine Learning Research (2000) 113-141. Available from: .
  • 28
    • 54949126248 scopus 로고    scopus 로고
    • J. Weston, C. Watkins, Multi-class Support Vector Machines, Technical Report CSD-TR-98-04, Royal Holloway, University of London, Department of Computer Science, 1998.
    • J. Weston, C. Watkins, Multi-class Support Vector Machines, Technical Report CSD-TR-98-04, Royal Holloway, University of London, Department of Computer Science, 1998.
  • 29
    • 54949124476 scopus 로고    scopus 로고
    • D. Bourigault, Surface Grammatical Analysis for the Extraction of Terminological Noun Phrases, in: Proceedings of the 14th International Conference on Computational Linguistics, Nantes, France, 1992, pp. 977-981.
    • D. Bourigault, Surface Grammatical Analysis for the Extraction of Terminological Noun Phrases, in: Proceedings of the 14th International Conference on Computational Linguistics, Nantes, France, 1992, pp. 977-981.
  • 30
    • 0001882615 scopus 로고
    • Self-organized language modeling for speech recognition
    • Waibel A., and Lee K.F. (Eds), Morgan Kaufmann Publishers
    • Jelinek F. Self-organized language modeling for speech recognition. In: Waibel A., and Lee K.F. (Eds). Readings in Speech Recognition (1990), Morgan Kaufmann Publishers 450-506
    • (1990) Readings in Speech Recognition , pp. 450-506
    • Jelinek, F.1
  • 31
    • 0001984996 scopus 로고
    • A multiple-corpus approach to recognition of proper names in chinese texts
    • Chang J.S., et al. A multiple-corpus approach to recognition of proper names in chinese texts. Computer Processing of Chinese and Oriental Languages 8 1 (1994) 75-85
    • (1994) Computer Processing of Chinese and Oriental Languages , vol.8 , Issue.1 , pp. 75-85
    • Chang, J.S.1
  • 32
    • 54949107628 scopus 로고    scopus 로고
    • I. Fahmi, C Value Method for Multi-word Term Extraction, Seminar in Statistics and Methodology, Alfa-informatica, RuG, May 23, 2005. Available from: .
    • I. Fahmi, C Value Method for Multi-word Term Extraction, Seminar in Statistics and Methodology, Alfa-informatica, RuG, May 23, 2005. Available from: .
  • 34
    • 0030092468 scopus 로고    scopus 로고
    • Distribution of content words and phrases in texts and language modeling
    • Katz S. Distribution of content words and phrases in texts and language modeling. Natural Language Engineering 2 1 (1996) 15-59
    • (1996) Natural Language Engineering , vol.2 , Issue.1 , pp. 15-59
    • Katz, S.1
  • 36
    • 0036161242 scopus 로고    scopus 로고
    • Text categorization with support vector machines. How to represent texts in input space?
    • Leopold E., and Kindermann J. Text categorization with support vector machines. How to represent texts in input space?. Machine Learning 46 (2002) 423-444
    • (2002) Machine Learning , vol.46 , pp. 423-444
    • Leopold, E.1    Kindermann, J.2
  • 37
    • 54949130065 scopus 로고    scopus 로고
    • T. Joachims, Text categorization with support vector machines: learning with many relevant features, in: Proceedings of ECML-98, 10th European Conference on Machine Learning, pp. 137-142.
    • T. Joachims, Text categorization with support vector machines: learning with many relevant features, in: Proceedings of ECML-98, 10th European Conference on Machine Learning, pp. 137-142.
  • 38
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • 32-33
    • Sebastiani F. Machine learning in automated text categorization. ACM Computing Surveys 34 1 (2002) 11-12 32-33
    • (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 11-12
    • Sebastiani, F.1
  • 39
    • 33750982419 scopus 로고    scopus 로고
    • Improving self-organization of document collection by semantic mapping
    • Correa R.F., and Ludermir T.B. Improving self-organization of document collection by semantic mapping. Neurocomputing 70 (2006) 62-69
    • (2006) Neurocomputing , vol.70 , pp. 62-69
    • Correa, R.F.1    Ludermir, T.B.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.