



2007, Pages 3-12

How much noise is too much: A study in automatic text classification

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC TEXT CLASSIFICATION; BENCHMARK DATASETS; CONTACT CENTERS; DATA CLEANING; DATA-SETS; INFORMATION EXTRACTION; INTERNATIONAL CONFERENCES; MOBILE PHONES; NEWSGROUPS; PRE-PROCESSING; REAL-LIFE DATA; REUTERS-21578; RULE LEARNING; SIMULATED NOISE; TEXT CLASSIFICATION; TEXT CLASSIFIERS; TEXT-MINING

EID: 49749119987     PISSN: 15504786     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICDM.2007.21     Document Type: Conference Paper
Times cited: 85

References (27)
  • 5
    • 84941869105
    • A technique for computer detection and correction of spelling errors
    • F. J. Damerau. A technique for computer detection and correction of spelling errors. Commun. ACM, 7(3):171-176, 1964.
    • (1964) Commun. ACM , vol.7 , Issue.3 , pp. 171-176
    • Damerau, F.J.1
  • 8
    • 0141702347
    • Optimizing SVMs for complex call classification
    • P. Haffner, G. Tur, and J. Wright. Optimizing SVMs for complex call classification. In Proc. of ICASSP, 2003.
    • (2003) Proc. of ICASSP
    • Haffner, P.1    Tur, G.2    Wright, J.3
  • 9
    • 0001509519
    • Probabilistic latent semantic analysis
    • T. Hofmann. Probabilistic latent semantic analysis. In Proc. of UAI, 1999.
    • (1999) Proc. of UAI
    • Hofmann, T.1
  • 10
    • 0002714543
    • Making large-scale support vector machine learning practical
    • B. Schölkopf, C. Burges, and A. Smola, editors, MIT Press, Cambridge, MA
    • T. Joachims. Making large-scale support vector machine learning practical. In B. Schölkopf, C. Burges, and A. Smola, editors, Advances in Kernel Methods: Support Vector Machines. MIT Press, Cambridge, MA, 1998.
    • (1998) Advances in Kernel Methods: Support Vector Machines
    • Joachims, T.1
  • 11
    • 0026979939
    • Techniques for automatically correcting words in text
    • K. Kukich. Techniques for automatically correcting words in text. ACM Comput. Surv., 24(4):377-439, 1992.
    • (1992) ACM Comput. Surv. , vol.24 , Issue.4 , pp. 377-439
    • Kukich, K.1
  • 12
    • 25144437707
    • Binary codes capable of correcting deletions, insertions, and reversals
    • Technical Report 8
    • V. I. Levenshtein. Binary codes capable of correcting deletions, insertions, and reversals. Technical Report 8, 1966.
    • (1966)
    • Levenshtein, V.I.1
  • 16
    • 84880741035
    • Semantic annotation of unstructured and ungrammatical text
    • M. Michelson and C. A. Knoblock. Semantic annotation of unstructured and ungrammatical text. In Proc. of IJCAI, pages 1091-1098, 2005.
    • (2005) Proc. of IJCAI , pp. 1091-1098
    • Michelson, M.1    Knoblock, C.A.2
  • 22
    • 77950866782
    • Automatic generation of domain models for call-centers from noisy transcriptions
    • S. Roy and L. V. Subramaniam. Automatic generation of domain models for call-centers from noisy transcriptions. In Proc. of ACL-COLING, 2006.
    • (2006) Proc. of ACL-COLING
    • Roy, S.1    Subramaniam, L.V.2
  • 23
    • 84906338522
    • Context-based speech recognition error detection and correction
    • A. Sarma and D. Palmer. Context-based speech recognition error detection and correction. In Proc. of HLT-NAACL, 2004.
    • (2004) Proc. of HLT-NAACL
    • Sarma, A.1    Palmer, D.2
  • 24
    • 10044290500
    • Noisy text categorization
    • Washington, DC, USA, IEEE Computer Society
    • A. Vinciarelli. Noisy text categorization. In Proc. of ICPR'04 Volume 2, pages 554-557, Washington, DC, USA, 2004. IEEE Computer Society.
    • (2004) Proc. of ICPR'04 , vol.2 , pp. 554-557
    • Vinciarelli, A.1
  • 26
    • 1542347782
    • Robustness of regularized linear classification methods in text categorization
    • J. Zhang and Y. Yang. Robustness of regularized linear classification methods in text categorization. In Proc. of SIGIR, 2003.
    • (2003) Proc. of SIGIR
    • Zhang, J.1    Yang, Y.2
  • 27
    • 0005004572
    • A probability analysis on the value of unlabeled data for classification problems
    • T. Zhang and F. Oles. A probability analysis on the value of unlabeled data for classification problems. In Proc. of ICML, 2000.
    • (2000) Proc. of ICML
    • Zhang, T.1    Oles, F.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.