메뉴 건너뛰기




Volumn 2, Issue 2, 2008, Pages 1-25

Semantic Text Similarity Using Corpus-Based Word Similarity and String Similarity

Author keywords

Algorithms; corpusbased measures; Experimentation; Performance; Semantic similarity of words; similarity of short texts

Indexed keywords


EID: 84866976063     PISSN: 15564681     EISSN: 1556472X     Source Type: Journal    
DOI: 10.1145/1376815.1376819     Document Type: Article
Times cited : (127)

References (50)
  • 1
    • 0023041177 scopus 로고
    • A bit-string longest-common-subsequence algorithm
    • Allison, L. and Dix, T. 1986. A bit-string longest-common-subsequence algorithm. Inf. Proc. Lett. 23, 305-310
    • (1986) Inf. Proc. Lett. , vol.23 , pp. 305-310
    • Allison, L.1    Dix, T.2
  • 3
    • 84889310870 scopus 로고    scopus 로고
    • Explorations in context space: Words, sentences, discourse
    • Burgess, C., Livesay, K., and Lund, K. 1998. Explorations in context space: Words, sentences, discourse. Disc. Proc. 25, 2-3, 211-257
    • (1998) Disc. Proc. , vol.25 , Issue.2-3 , pp. 211-257
    • Burgess, C.1    Livesay, K.2    Lund, K.3
  • 5
    • 0000666461 scopus 로고    scopus 로고
    • Data integration using similarity joins and a word-based information representation language
    • Cohen, W. 2000. Data integration using similarity joins and a word-based information representation language. ACM Trans. Inf. Syst. 18, 3, 288-321
    • (2000) ACM Trans. Inf. Syst. , vol.18 , Issue.3 , pp. 288-321
    • Cohen, W.1
  • 8
    • 27344433526 scopus 로고    scopus 로고
    • Lexrank: Graph-based lexical centrality as salience in text summarization
    • Erkan, G. and Radev, D. 2004. Lexrank: Graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Research 22, 457-479
    • (2004) J. Artif. Intell. Research , vol.22 , pp. 457-479
    • Erkan, G.1    Radev, D.2
  • 9
    • 84937189426 scopus 로고    scopus 로고
    • The measurement of textual coherence with latent semantic analysis
    • Foltz, P., Kintsch, W., and Landauer, T. 1998. The measurement of textual coherence with latent semantic analysis. Disc. Proc. 25, 2-3, 285-307
    • (1998) Disc. Proc. , vol.25 , Issue.2-3 , pp. 285-307
    • Foltz, P.1    Kintsch, W.2    Landauer, T.3
  • 13
    • 46749114872 scopus 로고    scopus 로고
    • Applications of corpus-based semantic similarity and word segmentation to database schema matching
    • (Published online)
    • Islam, A., Inkpen, D. Z., and Kiringa, I. 2008. Applications of corpus-based semantic similarity and word segmentation to database schema matching. The VLDB Journal (Published online)
    • (2008) The VLDB Journal
    • Islam, A.1    Inkpen, D.Z.2    Kiringa, I.3
  • 14
    • 0004200363 scopus 로고
    • Semantics and Cognition
    • MIT Press, Cambridge, MA
    • Jackendoff, R. 1983. Semantics and Cognition. MIT Press, Cambridge, MA
    • (1983)
    • Jackendoff, R.1
  • 17
    • 26444559482 scopus 로고    scopus 로고
    • Classification of rss-formatted documents using full text similarity measures
    • D. Lowe andM. Gaedke, Eds. LNCS 3579. Springer
    • Katarzyna, W.-W. and Szczepaniak, P. 2005. Classification of rss-formatted documents using full text similarity measures. In Proceedings of the 5th International Conference on Web Engineering, D. Lowe andM. Gaedke, Eds. LNCS 3579. Springer, 400-405
    • (2005) Proceedings of the 5th International Conference on Web Engineering , pp. 400-405
    • Katarzyna, W.-W.1    Szczepaniak, P.2
  • 18
    • 0344927122 scopus 로고    scopus 로고
    • Improving text categorization using the importance of sentences
    • Ko, Y., Park, J., and Seo, J. 2004. Improving text categorization using the importance of sentences. Inf. Proc. Manage. 40, 65-79
    • (2004) Inf. Proc. Manage. , vol.40 , pp. 65-79
    • Ko, Y.1    Park, J.2    Seo, J.3
  • 20
    • 0000600219 scopus 로고    scopus 로고
    • A solution to platos problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge
    • Landauer, T. and Dumais, S. 1997. A solution to platos problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge. Psych. Rev. 104, 2, 211-240
    • (1997) Psych. Rev. , vol.104 , Issue.2 , pp. 211-240
    • Landauer, T.1    Dumais, S.2
  • 21
    • 80053431219 scopus 로고    scopus 로고
    • Introduction to latent semantic analysis
    • Landauer, T., Foltz, P., and Laham, D. 1998. Introduction to latent semantic analysis. Dis. Proc. 25, 2-3, 259-284
    • (1998) Dis. Proc. , vol.25 , Issue.2-3 , pp. 259-284
    • Landauer, T.1    Foltz, P.2    Laham, D.3
  • 23
    • 45449091073 scopus 로고    scopus 로고
    • Word Net:An electronic lexical database
    • MIT Press, Chapter Combining local context and Word Net similarity for word sense identification
    • Leacock, C. and Chodorow, M. 1998. Word Net:An electronic lexical database. MIT Press, Chapter Combining local context and Word Net similarity for word sense identification, 265-283
    • (1998) , pp. 265-283
    • Leacock, C.1    Chodorow, M.2
  • 24
    • 85050285247 scopus 로고
    • Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone
    • Lesk, M. 1986. Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. In Proceedings of the SIGDOC Conference
    • (1986) Proceedings of the SIGDOC Conference
    • Lesk, M.1
  • 25
    • 0042850512 scopus 로고    scopus 로고
    • An approach for measuring semantic similarity using multiple information sources
    • Li, Y., Bandar, Z., and McLean, D. 2003. An approach for measuring semantic similarity using multiple information sources. IEEE Trans. Knowl. Data Eng. 15, 4, 871-882
    • (2003) IEEE Trans. Knowl. Data Eng. , vol.15 , Issue.4 , pp. 871-882
    • Li, Y.1    Bandar, Z.2    McLean, D.3
  • 26
  • 29
    • 27144451834 scopus 로고    scopus 로고
    • Text similarity computing based on standard deviation
    • In D.-S. Huang, X.-P. Zhang, and G.-B. Huang, Eds. Lecture Notes in Computer Science, Springer-Verlag, New York
    • Liu, T. and Guo, J. 2005. Text similarity computing based on standard deviation. In Proceedings of the International Conference on Intelligent Computing, D.-S. Huang, X.-P. Zhang, and G.-B. Huang, Eds. Lecture Notes in Computer Science, vol. 3644. Springer-Verlag, New York, 456-464
    • (2005) Proceedings of the International Conference on Intelligent Computing , vol.3644 , pp. 456-464
    • Liu, T.1    Guo, J.2
  • 33
    • 0003596936 scopus 로고    scopus 로고
    • Text Information Retrieval Systems, second ed
    • Academic Press
    • Meadow, C., Boyce, B., and Kraft, D. 2000. Text Information Retrieval Systems, second ed. Academic Press
    • (2000)
    • Meadow, C.1    Boyce, B.2    Kraft, D.3
  • 34
    • 0040213339 scopus 로고    scopus 로고
    • Bitext maps and alignment via pattern recognition
    • Melamed, I. D. 1999. Bitext maps and alignment via pattern recognition. Computat. Linguist. 25, 1, 107-130
    • (1999) Computat. Linguist. , vol.25 , Issue.1 , pp. 107-130
    • Melamed, I.D.1
  • 36
    • 0004074834 scopus 로고
    • Introduction to wordnet: An on-line lexical database
    • Cognitive Science Laboratory, Princeton University, Princeton, NJ
    • Miller, G., Beckwith, R., Fellbaum, C., Gross, D., and Miller, K. 1993. Introduction to wordnet: An on-line lexical database. Tech. Rep. 43, Cognitive Science Laboratory, Princeton University, Princeton, NJ
    • (1993) Tech. Rep. , vol.43
    • Miller, G.1    Beckwith, R.2    Fellbaum, C.3    Gross, D.4    Miller, K.5
  • 37
  • 39
    • 16244377443 scopus 로고    scopus 로고
    • Techniques for improving web retrieval effectiveness
    • Park, E., Ra, D., and Jang, M. 2005. Techniques for improving web retrieval effectiveness. Inf. Processing and Management 41, 5, 1207-1223
    • (2005) Inf. Processing and Management , vol.41 , Issue.5 , pp. 1207-1223
    • Park, E.1    Ra, D.2    Jang, M.3
  • 41
    • 0037339910 scopus 로고    scopus 로고
    • Determining semantic similarity among entity classes from different ontologies
    • Rodriguez, M. A. and Egenhofer, M. J. 2003. Determining semantic similarity among entity classes from different ontologies. IEEE Trans. Knowl. Data Eng. 15, 2, 442-456
    • (2003) IEEE Trans. Knowl. Data Eng. , vol.15 , Issue.2 , pp. 442-456
    • Rodriguez, M.A.1    Egenhofer, M.J.2
  • 42
    • 84863873032 scopus 로고
    • Contextual correlates of synonymy
    • Rubenstein, H. and Goodenough, J. B. 1965. Contextual correlates of synonymy. Comm. ACM 8, 10, 627-633
    • (1965) Comm. ACM , vol.8 , Issue.10 , pp. 627-633
    • Rubenstein, H.1    Goodenough, J.B.2
  • 43
    • 33750706390 scopus 로고
    • Computer Evaluation of Indexing and Text Processing
    • Prentice Hall, Inc. Englewood Cliffs, NJ
    • Salton, G. and Lesk, M. 1971. Computer Evaluation of Indexing and Text Processing. Prentice Hall, Inc. Englewood Cliffs, NJ
    • (1971)
    • Salton, G.1    Lesk, M.2
  • 44
    • 1142288181 scopus 로고    scopus 로고
    • Efficient similarity-based operations for data integration
    • Schallehn, E., Sattler, K., and Saake, G. 2004. Efficient similarity-based operations for data integration. Data Knowl. Eng. 48, 361-387
    • (2004) Data Knowl. Eng. , vol.48 , pp. 361-387
    • Schallehn, E.1    Sattler, K.2    Saake, G.3
  • 45
    • 0347596961 scopus 로고    scopus 로고
    • Automatic word sense discrimination
    • Schutze, H. 1998. Automatic word sense discrimination. Computat. Linguist. 24, 1, 97-124
    • (1998) Computat. Linguist. , vol.24 , Issue.1 , pp. 97-124
    • Schutze, H.1
  • 46
    • 0004201792 scopus 로고    scopus 로고
    • Collins Cobuild English Dictionary for Advanced Learners, third ed
    • Harper Collins
    • Sinclair, J., Ed. 2001. Collins Cobuild English Dictionary for Advanced Learners, third ed. Harper Collins
    • (2001)
    • Sinclair, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.