2008, Pages 1221-1230

Extremely fast text feature extraction for classification and indexing

Author keywords

Bag of words; Document categorization; Feature engineering; Feature extraction; Text indexing; Text mining; Text tokenization

Indexed keywords

BAG-OF-WORDS; DOCUMENT CATEGORIZATION; FEATURE ENGINEERING; TEXT INDEXING; TEXT MINING; TEXT TOKENIZATION

EID: 70349254646     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1458082.1458243     Document Type: Conference Paper
Times cited: 48

References (15)
  • 1
    • 0016518897
    • Efficient string matching: An aid to bibliographic search
    • Aho, A.V. and Corasick, M.J. 1975. Efficient string matching: an aid to bibliographic search. Communications of the ACM 18 (6): 333-340.
    • (1975) Communications of the ACM , vol.18 , Issue.6 , pp. 333-340
    • Aho, A.V.1    Corasick, M.J.2
  • 2
    • 84976228809
    • A string matching algorithm fast on the average
    • Proc. of the 6th Colloquium on Automata, Languages and Programming, July 16-20, Springer-Verlag
    • Commentz-Walter, B. 1979. A string matching algorithm fast on the average. In Proc. of the 6th Colloquium on Automata, Languages and Programming (July 16-20, 1979). Lecture Notes in Comp. Sci., v.71, Springer-Verlag, 118-132.
    • (1979) Lecture Notes in Comp. Sci , vol.71 , pp. 118-132
    • Commentz-Walter, B.1
  • 3
    • 2942731012
    • An extensive empirical study of feature selection metrics for text classification
    • Forman, G. 2003. An extensive empirical study of feature selection metrics for text classification. J. Mach. Learn. Res. 3 (Mar. 2003), 1289-1305.
    • (2003) J. Mach. Learn. Res , vol.3 , pp. 1289-1305
    • Forman, G.1
  • 9
    • 0003352252
    • The Art of Computer Programming
    • Addison-Wesley, Reading, MA
    • Knuth, D.E. 1973. The Art of Computer Programming, Volume 3: Sorting and Searching. Addison-Wesley, Reading, MA.
    • (1973) Sorting and Searching , vol.3
    • Knuth, D.E.1
  • 13
    • 84925643023
    • Salmela, L., Tarhio, J., and Kytöjoki, J. 2007. Multipattern string matching with q-grams. J. Exp. Algorithmics 11 (Feb. 2007), 1.1.
  • 14
    • 70349230202
    • The Unicode Consortium. 2006. The Unicode Standard, Version 5.0. Addison-Wesley.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.