메뉴 건너뛰기




Volumn 34, Issue 3, 2013, Pages 563-595

A segment-based approach to clustering multi-topic documents

Author keywords

Document clustering; Interdisciplinary documents; Text segmentation; Topic identification

Indexed keywords

CHARACTER RECOGNITION; CLUSTER ANALYSIS; INFORMATION MANAGEMENT; TEXT PROCESSING;

EID: 84874188569     PISSN: 02191377     EISSN: 02193116     Source Type: Journal    
DOI: 10.1007/s10115-012-0556-z     Document Type: Article
Times cited : (52)

References (55)
  • 1
    • 4944256990 scopus 로고    scopus 로고
    • Web mining: a survey in the fuzzy framework
    • Arotaritei D, Mitra S (2004) Web mining: a survey in the fuzzy framework. Fuzzy Sets Syst 148: 5-19.
    • (2004) Fuzzy Sets Syst , vol.148 , pp. 5-19
    • Arotaritei, D.1    Mitra, S.2
  • 4
    • 0033280561 scopus 로고    scopus 로고
    • A survey of fuzzy clustering algorithms for pattern recognition. i-ii
    • Baraldi A, Blonda P (1999) A survey of fuzzy clustering algorithms for pattern recognition. i-ii. IEEE Trans Syst Man Cybern Part B 29(6): 778-801.
    • (1999) IEEE Trans Syst Man Cybern Part B , vol.29 , Issue.6 , pp. 778-801
    • Baraldi, A.1    Blonda, P.2
  • 9
    • 80052025105 scopus 로고    scopus 로고
    • An integration of fuzzy association rules and WordNet for document clustering
    • Chen CL, Tseng FSC, Liang T (2011) An integration of fuzzy association rules and WordNet for document clustering. Knowl Inf Syst 28(3): 687-708.
    • (2011) Knowl Inf Syst , vol.28 , Issue.3 , pp. 687-708
    • Chen, C.L.1    Tseng, F.S.C.2    Liang, T.3
  • 10
    • 50649091328 scopus 로고    scopus 로고
    • Efficient phrase-based document similarity for clustering
    • Chim H, Deng X (2008) Efficient phrase-based document similarity for clustering. IEEE Trans Knowl Data Eng 20(9): 1217-1229.
    • (2008) IEEE Trans Knowl Data Eng , vol.20 , Issue.9 , pp. 1217-1229
    • Chim, H.1    Deng, X.2
  • 12
    • 0034824884 scopus 로고    scopus 로고
    • Concept decompositions for large sparse text data using clustering
    • Dhillon IS, Modha DS (2001) Concept decompositions for large sparse text data using clustering. Mach Learn 42(1/2): 143-175.
    • (2001) Mach Learn , vol.42 , Issue.1-2 , pp. 143-175
    • Dhillon, I.S.1    Modha, D.S.2
  • 13
    • 77955656991 scopus 로고    scopus 로고
    • A segmented topic model based on the two-parameter Poisson-Dirichlet process
    • Du L, Buntine WL, Jin H (2010) A segmented topic model based on the two-parameter Poisson-Dirichlet process. Mach Learn 81(1): 5-19.
    • (2010) Mach Learn , vol.81 , Issue.1 , pp. 5-19
    • Du, L.1    Buntine, W.L.2    Jin, H.3
  • 14
    • 79961211393 scopus 로고    scopus 로고
    • Statistical semantics for enhancing document clustering
    • Farahat AK, Kamel MS (2011) Statistical semantics for enhancing document clustering. Knowl Inf Syst 28(2): 365-393.
    • (2011) Knowl Inf Syst , vol.28 , Issue.2 , pp. 365-393
    • Farahat, A.K.1    Kamel, M.S.2
  • 18
    • 77956061153 scopus 로고    scopus 로고
    • Keep it simple with time: a re-examination of probabilistic topic detection models
    • He Q, Chang K, Lim EP, Banerjee A (2010) Keep it simple with time: a re-examination of probabilistic topic detection models. IEEE Trans Pattern Anal Mach Intell 32(10): 1795-1808.
    • (2010) IEEE Trans Pattern Anal Mach Intell , vol.32 , Issue.10 , pp. 1795-1808
    • He, Q.1    Chang, K.2    Lim, E.P.3    Banerjee, A.4
  • 19
    • 0001819680 scopus 로고    scopus 로고
    • TextTiling: segmenting text into multi-paragraph subtopic passages
    • Hearst MA (1997) TextTiling: segmenting text into multi-paragraph subtopic passages. Comput Linguist 23(1): 33-64.
    • (1997) Comput Linguist , vol.23 , Issue.1 , pp. 33-64
    • Hearst, M.A.1
  • 23
    • 0034818212 scopus 로고    scopus 로고
    • Unsupervised Learning by Probabilistic Latent Semantic Analysis
    • Hofmann T (2001) Unsupervised Learning by Probabilistic Latent Semantic Analysis. Machine Learning 42(1-2): 177-196.
    • (2001) Machine Learning , vol.42 , Issue.1-2 , pp. 177-196
    • Hofmann, T.1
  • 25
    • 77957556167 scopus 로고    scopus 로고
    • Knowledge-based vector space model for text clustering
    • Jing L, Ng MK, Huang JZ (2010) Knowledge-based vector space model for text clustering. Knowl Inf Syst 25(1): 35-55.
    • (2010) Knowl Inf Syst , vol.25 , Issue.1 , pp. 35-55
    • Jing, L.1    Ng, M.K.2    Huang, J.Z.3
  • 28
    • 38349151563 scopus 로고    scopus 로고
    • A novelty-based clustering method for online documents
    • Khy S, Ishikawa Y, Kitagawa H (2007) A novelty-based clustering method for online documents. World Wide Web 11: 1-37.
    • (2007) World Wide Web , vol.11 , pp. 1-37
    • Khy, S.1    Ishikawa, Y.2    Kitagawa, H.3
  • 34
    • 84876811202 scopus 로고    scopus 로고
    • RCV1: a new benchmark collection for text categorization research
    • Lewis DD, Yang Y, Rose T, Li F (2004) RCV1: a new benchmark collection for text categorization research. J Mach Learn Res 5: 361-397.
    • (2004) J Mach Learn Res , vol.5 , pp. 361-397
    • Lewis, D.D.1    Yang, Y.2    Rose, T.3    Li, F.4
  • 39
    • 79955993118 scopus 로고    scopus 로고
    • Short text clustering by finding core terms
    • Ni X, Quan X, Lu Z, Wenyin L, Hua B (2011) Short text clustering by finding core terms. Knowl Inf Syst 27(3): 345-365.
    • (2011) Knowl Inf Syst , vol.27 , Issue.3 , pp. 345-365
    • Ni, X.1    Quan, X.2    Lu, Z.3    Wenyin, L.4    Hua, B.5
  • 41
    • 17044376078 scopus 로고    scopus 로고
    • Subspace clustering for high dimensional data: a review
    • Parsons L, Haque E, Liu H (2004) Subspace clustering for high dimensional data: a review. ACM SIGKDD Explor Newsl 6(1): 90-105.
    • (2004) ACM SIGKDD Explor Newsl , vol.6 , Issue.1 , pp. 90-105
    • Parsons, L.1    Haque, E.2    Liu, H.3
  • 47
    • 79953865329 scopus 로고    scopus 로고
    • D2S: document-to-sentence framework for novelty detection
    • Tsai FS, Zhang Y (2011) D2S: document-to-sentence framework for novelty detection. Knowl Inf Syst 29(2): 419-433.
    • (2011) Knowl Inf Syst , vol.29 , Issue.2 , pp. 419-433
    • Tsai, F.S.1    Zhang, Y.2
  • 49
    • 40649129226 scopus 로고    scopus 로고
    • Towards a unified approach to document similarity search using manifold-ranking of blocks
    • Wan X, Yang J, Xiao J (2008) Towards a unified approach to document similarity search using manifold-ranking of blocks. Inf Process Manag 44: 1032-1048.
    • (2008) Inf Process Manag , vol.44 , pp. 1032-1048
    • Wan, X.1    Yang, J.2    Xiao, J.3
  • 52
    • 3543085722 scopus 로고    scopus 로고
    • Empirical and theoretical comparison of selected criterion functions for document clustering
    • Zhao Y, Karypis G (2004) Empirical and theoretical comparison of selected criterion functions for document clustering. Mach Learn 55(3): 311-331.
    • (2004) Mach Learn , vol.55 , Issue.3 , pp. 311-331
    • Zhao, Y.1    Karypis, G.2
  • 54
    • 24044537630 scopus 로고    scopus 로고
    • Hierarchical clustering algorithms for document datasets
    • Zhao Y, Karypis G, Fayyad UM (2005) Hierarchical clustering algorithms for document datasets. Data Min Knowl Discov 10(2): 141-168.
    • (2005) Data Min Knowl Discov , vol.10 , Issue.2 , pp. 141-168
    • Zhao, Y.1    Karypis, G.2    Fayyad, U.M.3
  • 55
    • 24944501423 scopus 로고    scopus 로고
    • Generative model-based document clustering: a comparative study
    • Zhong S, Ghosh J (2005) Generative model-based document clustering: a comparative study. Knowl Inf Syst 8(3): 374-384.
    • (2005) Knowl Inf Syst , vol.8 , Issue.3 , pp. 374-384
    • Zhong, S.1    Ghosh, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.