메뉴 건너뛰기




Volumn 16, Issue 8, 2004, Pages 949-964

TopCat: Data mining for topic identification in a text corpus

Author keywords

Clustering; Data mining; Topic detection

Indexed keywords

CLUSTERING; TEXT CORPUS; TOPIC DETECTION; TOPIC IDENTIFICATION;

EID: 4344680632     PISSN: 10414347     EISSN: None     Source Type: Journal    
DOI: 10.1109/TKDE.2004.32     Document Type: Article
Times cited : (84)

References (52)
  • 4
    • 0031187250 scopus 로고    scopus 로고
    • Exploiting background information in knowledge discovery from text
    • July
    • R. Feldman and H. Hirsh, "Exploiting Background Information in Knowledge Discovery from Text," J. Intelligent Information Systems, vol. 9, no. 1, pp. 83-97, July 1998.
    • (1998) J. Intelligent Information Systems , vol.9 , Issue.1 , pp. 83-97
    • Feldman, R.1    Hirsh, H.2
  • 14
    • 0028460389 scopus 로고
    • Information extraction as a basis for high-precision text classification
    • E. Riloff and W. Lehnert, "Information Extraction as a Basis for High-Precision Text Classification," ACM Trans. Information Systems, vol. 12, no. 3, pp. 296-333, 1994.
    • (1994) ACM Trans. Information Systems , vol.12 , Issue.3 , pp. 296-333
    • Riloff, E.1    Lehnert, W.2
  • 15
    • 0029777093 scopus 로고    scopus 로고
    • Natural language processing for information retrieval
    • D.D. Lewis and K.S. Jones, "Natural Language Processing for Information Retrieval," Comm. ACM, vol. 39, no. 1, pp. 92-100, 1996.
    • (1996) Comm. ACM , vol.39 , Issue.1 , pp. 92-100
    • Lewis, D.D.1    Jones, K.S.2
  • 16
    • 84989380187 scopus 로고    scopus 로고
    • Methods of automatic term recognition: A review
    • K. Kageura and B. Umino, "Methods of Automatic Term Recognition: A Review," Terminology, vol. 3, no. 2, 1996.
    • (1996) Terminology , vol.3 , Issue.2
    • Kageura, K.1    Umino, B.2
  • 21
    • 0003227299 scopus 로고    scopus 로고
    • Grouper: A dynamic clustering interface to web search results
    • May
    • O. Zamir and O. Etzioni, "Grouper: A Dynamic Clustering Interface to Web Search Results," Proc. Eighth Int'l World Wide Web Conf., May 1999, http://www8.org/w8-papers/3a-search-query/dynamic/dynamic.html.
    • (1999) Proc. Eighth Int'l World Wide Web Conf.
    • Zamir, O.1    Etzioni, O.2
  • 23
    • 0003141935 scopus 로고    scopus 로고
    • A comparative study on feature selection in text categorization
    • July
    • Y. Yang and J.P. Pedersen, "A Comparative Study on Feature Selection in Text Categorization," Proc. 14th Int'l Conf. Machine Learning (ICML '97), July 1997, http://www.cs.cmu.edu/yiming/papers.yy/ml97.ps.
    • (1997) Proc. 14th Int'l Conf. Machine Learning (ICML '97)
    • Yang, Y.1    Pedersen, J.P.2
  • 24
    • 84957069814 scopus 로고    scopus 로고
    • Text categorization with support vector machines: Learning with many relevant features
    • Apr
    • T. Joachims, "Text Categorization with Support Vector Machines: Learning with Many Relevant Features," Proc. European Conf. Machine Learning, pp. 137-142, Apr. 1998.
    • (1998) Proc. European Conf. Machine Learning , pp. 137-142
    • Joachims, T.1
  • 25
    • 84948481845 scopus 로고
    • An algorithm for suffix stripping
    • M. Porter, "An Algorithm for Suffix Stripping," Automated Library and Information Systems, vol. 14, no. 3, pp. 130-137, 1980.
    • (1980) Automated Library and Information Systems , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.1
  • 26
    • 4344662626 scopus 로고    scopus 로고
    • Classification of news stories using support vector machines
    • Aug
    • R. Cooley, "Classification of News Stories Using Support Vector Machines," IJCAI '99 Workshop Text Mining, Aug. 1999.
    • (1999) IJCAI '99 Workshop Text Mining
    • Cooley, R.1
  • 27
    • 34248833974 scopus 로고
    • Introduction to wordnet: An on-line lexical database
    • G.A. Miller, C. Fellbaum, J. Kegl, and K.J. Miller, "Introduction to Wordnet: An On-Line Lexical Database," Int'l J. Lexicography, vol. 3, no. 4, pp. 235-244, 1990, ftp://ftp.cogsci.princeton.edu/pub/wordnet/5papers.ps.
    • (1990) Int'l J. Lexicography , vol.3 , Issue.4 , pp. 235-244
    • Miller, G.A.1    Fellbaum, C.2    Kegl, J.3    Miller, K.J.4
  • 28
    • 0002565067 scopus 로고
    • Overview of the first text rEtrieval conference (TREC-1)
    • no. SN003-003-03614-5, Nat'l Inst. of Standards and Technology. Gaithersburg, Md.: Government Printing Office, Nov
    • D. Harman, "Overview of the First Text REtrieval Conference (TREC-1)," Proc. First Text REtrieval Conf. (TREC-1), no. SN003-003-03614-5, Nat'l Inst. of Standards and Technology. Gaithersburg, Md.: Government Printing Office, pp. 1-20, Nov. 1992, http://trec.nist.gov/pubs/trec7/t7_proceedings.html.
    • (1992) Proc. First Text REtrieval Conf. (TREC-1) , pp. 1-20
    • Harman, D.1
  • 29
    • 84976664060 scopus 로고
    • Automatic structuring and retrieval of large text files
    • Feb
    • G. Salton, J. Allan, and C. Buckley, "Automatic Structuring and Retrieval of Large Text Files," Comm. ACM, vol. 37, no. 2, pp. 97-108, Feb. 1994, http://www.acm.org/pubs/citations/journals/cacm/1994-37-2/p97-salton/.
    • (1994) Comm. ACM , vol.37 , Issue.2 , pp. 97-108
    • Salton, G.1    Allan, J.2    Buckley, C.3
  • 30
    • 84936824188 scopus 로고
    • Word association norms, mutual information and lexicography
    • K.W. Church and P. Hanks, "Word Association Norms, Mutual Information and Lexicography," Computational Linguistics, vol. 16, no. 1, pp. 22-29, 1991, http://www.research.att.com/kwc/published_1989_CL.ps.
    • (1991) Computational Linguistics , vol.16 , Issue.1 , pp. 22-29
    • Church, K.W.1    Hanks, P.2
  • 32
    • 11344285341 scopus 로고    scopus 로고
    • Beyond market baskets: Generalizing association rules to dependence rules
    • Jan
    • C. Silverstein, S. Brin, and R. Motwani, "Beyond Market Baskets: Generalizing Association Rules to Dependence Rules," Data Mining and Knowledge Discovery, vol. 2, no. 1, pp. 39-68, Jan. 1998.
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.1 , pp. 39-68
    • Silverstein, C.1    Brin, S.2    Motwani, R.3
  • 42
    • 0003190646 scopus 로고    scopus 로고
    • Topic detection and tracking using IDF-weightedCosineCoefficient
    • Feb
    • J.M. Shultz and M. Liberman, "Topic Detection and Tracking Using IDF-WeightedCosineCoefficient,"Proc.1999DARPABroadcastNews Workshop, Feb. 1999. http://www.nist.gov/speech/publications/darpa99/html/abstract.htm#tdt3-10.
    • (1999) Proc. 1999 DARPA Broadcast News Workshop
    • Shultz, J.M.1    Liberman, M.2
  • 47
    • 0002199862 scopus 로고    scopus 로고
    • Machine learning of event segmentation for news on demand
    • Feb
    • S. Boykin and A. Merlino, "Machine Learning of Event Segmentation for News on Demand," Comm. ACM, vol. 43, no. 2, pp. 35-41, Feb. 2000.
    • (2000) Comm. ACM , vol.43 , Issue.2 , pp. 35-41
    • Boykin, S.1    Merlino, A.2
  • 49
    • 0000776545 scopus 로고    scopus 로고
    • Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies
    • Aug
    • S. Chakrabarti, B. Dom, R. Agrawal, and P. Raghavan, "Scalable Feature Selection, Classification and Signature Generation for Organizing Large Text Databases into Hierarchical Topic Taxonomies," VLDB J., vol. 7, no. 3, pp. 163-178, Aug. 1998, http://www.almaden.ibm.com/cs/k53/irpapers/VLDB54_3.PDF.
    • (1998) VLDB J. , vol.7 , Issue.3 , pp. 163-178
    • Chakrabarti, S.1    Dom, B.2    Agrawal, R.3    Raghavan, P.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.