Volume 25, Issue 1, 2010, Pages 35-55

Knowledge-based vector space model for text clustering

Author keywords

Knowledge based VSM; Semantic relationship; Term similarity; Text clustering

Indexed keywords

HIERARCHICAL CLUSTERING; KNOWLEDGE BASED SYSTEMS; ONTOLOGY; SEMANTICS; TEXT PROCESSING; VECTOR SPACES

EID: 77957556167     PISSN: 02191377     EISSN: 02193116     Source Type: Journal    
DOI: 10.1007/s10115-009-0256-5     Document Type: Article
Times cited: 79

References (39)
  • 1
    • 77957556265
    • Berry M (2003) Survey of text mining: clustering, classification, and retrieval. Hardcover.
  • 2
    • 33750295383
    • Ji X, Xu W, Zhu S (2006) Document clustering with prior knowledge. In: Proceedings of ACM SIGIR, Seattle, Washington, USA.
  • 3
    • 34347228671
    • Jing L, Ng M, Huang J (2007) An entropy weighting k-means algorithm for subspace clustering of high-dimensional sparse data. IEEE Trans Knowl Data Eng 19(8): 1026-1041.
  • 4
    • 41549108964
    • Wan X (2008) Beyond topic similarity: a structural similarity measure for retrieving highly similar documents. Knowl Inform Syst 15(1): 55-73.
  • 5
    • 57549085945
    • Zhang X, Hu X, Zhou X (2008) A comparative evaluation of different link types on enhancing document clustering. In: Proceedings of ACM SIGIR, Singapore, pp 555-562.
  • 6
    • 38649116960
    • Nayak R (2008) Fast and effective clustering of XML data using structural information. Knowl Inform Syst 14(2): 197-215.
  • 7
    • 67349281381
    • Carullo M, Binaghi E, Gallo I (2009) An online document clustering technique for short web contents. Pattern Recognit Lett 30(10): 870-876.
  • 9
    • 77957586892
    • Steinbach M, Karypis G, Kumar V (2000) A comparison of document clustering techniques. Technical Report #00-034 at Department of Computer Science and Engineering, University of Minnesota.
  • 10
    • 77957572015
    • Hotho A, Staab S, Stumme G (2003) Wordnet improves text document clustering. In: Proceedings of the semantic web workshop at 26th annual international ACM SIGIR conference, Toronto, Canada.
  • 11
    • 77957554228
    • Hotho A, Bloehdorn S (2004) Text classification by boosting weak learners based on terms and concepts. In: Proceedings of the 4th IEEE international conference on data mining, Brighton, UK, pp 72-79.
  • 12
    • 0036356286
    • Mao W, Chu W (2002) Free text medical document retrieval via phrase-based vector space model. In: Proceedings of American medical informatics association annual symposium, San Antonio, TX, USA.
  • 13
    • 77957570843
    • Hirst G, St-Onge D (1998) Lexical chains as representations of context for the detection and correction of malapropisms, Fellbaum, pp 305-332.
  • 14
    • 85132112110
    • Jurisica I, Mylopoulos J, Yu E (2004) Ontologies for knowledge management: an information systems perspective. Knowl Inform Syst 6(4): 380-401.
  • 17
    • 0027846505
    • Sussna M (1993) Word sense disambiguation for free-text indexing using a massive semantic network. In: Proceedings of the 2nd international conference on information and knowledge management, Arlington, Virginia.
  • 18
    • 77957587833
    • Resnik P (1995) Using information content to evaluate semantic similarity. In: Proceedings of the 14th international joint conference on artificial intelligence, pp 448-453.
  • 19
    • 84948481845
    • Porter M (1980) An algorithm for suffix stripping. Program 14(3): 130-137.
  • 20
    • 0028516390
    • Katsavounidis I, Kuo C, Zhang Z (1994) A new initialization technique for generalized Lloyd iteration. IEEE Signal Proc Lett 1(10): 144-146.
  • 21
    • 35148839490
    • Gruber T (1993) A translation approach to portable ontologies. Knowl Acquisit 5(2): 199-220.
  • 22
    • 33646760990
    • Budanitsky A, Hirst G (2006) Evaluating wordnet-based measures of lexical semantic relatedness. Comput Linguistics 32(1): 13-47.
  • 23
    • 77957572013
    • Zhao G (1996) Analogical translator: experience-guided transfer in machine translation. Ph. D. Thesis at UMIST, UK.
  • 25
    • 0035267985
    • Maedche A, Staab S (2001) Ontology learning for the Semantic Web. IEEE Trans Intell Syst 31: 72-79.
  • 26
    • 77957566545
    • Sabou M (2005) Learning web service ontologies automatic extraction method and its evaluation. In: Ontology learning from text: methods, applications and evaluation. IOS Press, Amsterdam.
  • 27
    • 31144459206
    • Cimiano P, Hotho A, Staab S (2005) Learning concept hierarchies from text corpora using formal concept analysis. J Artif Intell Res 24: 305-339.
  • 28
    • 84931059008
    • Miller G, Charles G (1991) Contextual correlates of semantic similarity. Lang Cogn Process 6(1): 1-28.
  • 29
    • 77957572330
    • Weiss S, Kulikowski C (1991) Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems. Morgan Kaufmann, Menlo Park.
  • 30
    • 77957563147
    • Edwards A (1976) The correlation coefficient. In: An introduction to linear regression and correlation. Freeman, San Francisco.
  • 31
    • 85030313899
    • Hersh W, Buckley C, Leone T, Hickam D (1994) OHSUMED: an interactive retrieval evaluation and new large test collection for research. In: Proceedings of ACM SIGIR, Dublin, Ireland, pp 192-201.
  • 32
    • 77957580881
    • Miller G (1998) Nouns in WordNet. In: WordNet: an electronic lexical database. MIT Press, Cambridge.
  • 33
    • 77957561359
    • Steinbach M, Karypis G, Kumar V (2000) A comparison of document clustering techniques. In: Proceedings of KDD workshop on text mining, Boston, MA, USA.
  • 34
    • 26944481948
    • Jing L, Ng M, Huang Z (2005) Subspace clustering of text documents with feature weighting k-means algorithm. In: Proceedings of PAKDD, Hanoi, Vietnam, pp 802-812.
  • 36
    • 77957584708
    • Zhao Y, Karypis G (2002) Comparison of agglomerative and partitional document clustering algorithms. Technical report #02-014 at University of Minnesota.
  • 37
    • 77957578678
    • Shi Z, Ghosh J (2003) A comparative study of generative models for document clustering. In: Proceedings of SDW workshop on clustering high dimensional data and its applications, San Francisco, CA, USA.
  • 38
    • 33745960921
    • Kotis K, Vouros G (2006) Human-centered ontology engineering: the HCOME methodology. Knowl Inform Syst 10(1): 109-131.
  • 39
    • 67349109407
    • Wang P, Hu J, Zeng H, Chen Z (2009) Using Wikipedia knowledge to improve text classification. Knowl Inform Syst 19(3): 265-281.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.