



Volume 34, 2009, Pages 443-498

Wikipedia-based semantic interpretation for natural language processing

Author keywords

[No Author keywords available]

Indexed keywords

SEMANTICS; TEXT PROCESSING;

EID: 65349111942     PISSN: None     EISSN: 10769757     Source Type: Journal    
DOI: 10.1613/jair.2669     Document Type: Article
Times cited : (326)

References (131)
  • 3
    • 0032264186
    • Distributional clustering of words for text classification
    • Croft, B., Moffat, A., Van Rijsbergen, C. J., Wilkinson, R., & Zobel, J. (Eds.), Melbourne, AU. ACM Press, New York, US
    • Baker, D., & McCallum, A. K. (1998). Distributional clustering of words for text classification. In Croft, B., Moffat, A., Van Rijsbergen, C. J., Wilkinson, R., & Zobel, J. (Eds.), Proceedings of the 21st ACM International Conference on Research and Development in Information Retrieval, pp. 96-103, Melbourne, AU. ACM Press, New York, US.
    • (1998) Proceedings of the 21st ACM International Conference on Research and Development in Information Retrieval , pp. 96-103
    • Baker, D.1    McCallum, A.K.2
  • 9
    • 84867919822
    • Transformation-based error-driven learning and natural language processing: A case study in part of speech tagging
    • Brill, E. (1995). Transformation-based error-driven learning and natural language processing: A case study in part of speech tagging. Computational Linguistics, 21(4), 543-565.
    • (1995) Computational Linguistics , vol.21 , Issue.4 , pp. 543-565
    • Brill, E.1
  • 11
    • 33646760990
    • Evaluating wordnet-based measures of lexical semantic relatedness
    • Budanitsky, A., & Hirst, G. (2006). Evaluating wordnet-based measures of lexical semantic relatedness. Computational Linguistics, 32(1), 13-47.
    • (2006) Computational Linguistics , vol.32 , Issue.1 , pp. 13-47
    • Budanitsky, A.1    Hirst, G.2
  • 13
    • 0242647875
    • A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization
    • Chin, A. G. (Ed.), Idea Group Publishing, Hershey, US
    • Caropreso, M. F., Matwin, S., & Sebastiani, F. (2001). A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization. In Chin, A. G. (Ed.), Text Databases and Document Management: Theory and Practice, pp. 78-102. Idea Group Publishing, Hershey, US.
    • (2001) Text Databases and Document Management: Theory and Practice , pp. 78-102
    • Caropreso, M.F.1    Matwin, S.2    Sebastiani, F.3
  • 17
    • 0032633485
    • Similarity-based models of word cooccurrence probabilities
    • Dagan, I., Lee, L., & Pereira, F. C. N. (1999). Similarity-based models of word cooccurrence probabilities. Machine Learning, 34(1-3), 43-69.
    • (1999) Machine Learning , vol.34 , Issue.1-3 , pp. 43-69
    • Dagan, I.1    Lee, L.2    Pereira, F.C.N.3
  • 18
    • 0029290931
    • Contextual word similarity and estimation from sparse data
    • Dagan, I., Marcus, S., & Markovitch, S. (1995). Contextual word similarity and estimation from sparse data. Computer Speech and Language, 9(2), 123-152.
    • (1995) Computer Speech and Language , vol.9 , Issue.2 , pp. 123-152
    • Dagan, I.1    Marcus, S.2    Markovitch, S.3
  • 20
    • 0037998887
    • Debole, F., & Sebastiani, F. (2003). Supervised term weighting for automated text categorization. In Proceedings of SAC-03, 18th ACM Symposium on Applied Computing, pp. 784-788.
  • 22
    • 29644438050
    • Statistical comparison of classifiers over multiple data sets
    • Demsar, J. (2006). Statistical comparison of classifiers over multiple data sets. Journal of Machine Learning Research, 7, 1-30.
    • (2006) Journal of Machine Learning Research , vol.7 , pp. 1-30
    • Demsar, J.1
  • 24
    • 2942723846
    • A divisive information-theoretic feature clustering algorithm for text classification
    • Dhillon, I., Mallela, S., & Kumar, R. (2003). A divisive information-theoretic feature clustering algorithm for text classification. Journal of Machine Learning Research, 3, 1265-1287.
    • (2003) Journal of Machine Learning Research , vol.3 , pp. 1265-1287
    • Dhillon, I.1    Mallela, S.2    Kumar, R.3
  • 26
    • 84875030103
    • Concept-based feature generation and selection for information retrieval
    • Egozi, O., Gabrilovich, E., & Markovitch, S. (2008). Concept-based feature generation and selection for information retrieval. In AAAI'08.
    • (2008) AAAI'08
    • Egozi, O.1    Gabrilovich, E.2    Markovitch, S.3
  • 28
  • 32
    • 14344259210
    • Text categorization with many redundant features: Using aggressive feature selection to make SVMs competitive with C4.5
    • Gabrilovich, E., & Markovitch, S. (2004). Text categorization with many redundant features: Using aggressive feature selection to make SVMs competitive with C4.5. In Proceedings of the 21st International Conference on Machine Learning, pp. 321-328.
    • (2004) Proceedings of the 21st International Conference on Machine Learning , pp. 321-328
    • Gabrilovich, E.1    Markovitch, S.2
  • 36
    • 35748946646
    • Harnessing the expertise of 70,000 human editors: Knowledge-based feature generation for text categorization
    • Gabrilovich, E., & Markovitch, S. (2007b). Harnessing the expertise of 70,000 human editors: Knowledge-based feature generation for text categorization. Journal of Machine Learning Research, 8, 2297-2345.
    • (2007) Journal of Machine Learning Research , vol.8 , pp. 2297-2345
    • Gabrilovich, E.1    Markovitch, S.2
  • 38
    • 30744439551
    • Internet encyclopaedias go head to head
    • Giles, J. (2005). Internet encyclopaedias go head to head. Nature, 438, 900-901.
    • (2005) Nature , vol.438 , pp. 900-901
    • Giles, J.1
  • 42
    • 0012992939
    • Lexical chains as representations of context for the detection and correction of malapropisms
    • MIT Press, Cambridge, MA
    • Hirst, G., & St-Onge, D. (1998). Lexical chains as representations of context for the detection and correction of malapropisms. In WordNet: An Electronic Lexical Database, pp. 305-332. MIT Press, Cambridge, MA.
    • (1998) WordNet: An Electronic Lexical Database , pp. 305-332
    • Hirst, G.1    St-Onge, D.2
  • 45
    • 34447620746
    • Improving text retrieval for the routing problem using latent semantic indexing
    • Croft, W. B., & Van Rijsbergen, C. J. (Eds.), Dublin, Ireland. Springer Verlag, Heidelberg, Germany
    • Hull, D. A. (1994). Improving text retrieval for the routing problem using latent semantic indexing. In Croft, W. B., & Van Rijsbergen, C. J. (Eds.), Proceedings of the 17th ACM International Conference on Research and Development in Information Retrieval, pp. 282-289, Dublin, Ireland. Springer Verlag, Heidelberg, Germany.
    • (1994) Proceedings of the 17th ACM International Conference on Research and Development in Information Retrieval , pp. 282-289
    • Hull, D.A.1
  • 49
    • 33745940463
    • Neurotextcategorizer: A new model of neural network for text categorization
    • Taejon, South Korea
    • Jo, T. (2000). Neurotextcategorizer: A new model of neural network for text categorization. In Proceedings of the International Conference of Neural Information Processing, pp. 280-285, Taejon, South Korea.
    • (2000) Proceedings of the International Conference of Neural Information Processing , pp. 280-285
    • Jo, T.1
  • 52
    • 84957069814
    • Text categorization with support vector machines: Learning with many relevant features
    • Joachims, T. (1998). Text categorization with support vector machines: Learning with many relevant features. In Proceedings of the European Conference on Machine Learning, pp. 137-142.
    • (1998) Proceedings of the European Conference on Machine Learning , pp. 137-142
    • Joachims, T.1
  • 53
    • 0002714543
    • Making large-scale SVM learning practical
    • Schoelkopf, B., Burges, C., & Smola, A. (Eds.), The MIT Press
    • Joachims, T. (1999). Making large-scale SVM learning practical. In Schoelkopf, B., Burges, C., & Smola, A. (Eds.), Advances in Kernel Methods - Support Vector Learning, pp. 169-184. The MIT Press.
    • (1999) Advances in Kernel Methods - Support Vector Learning , pp. 169-184
    • Joachims, T.1
  • 57
    • 0002542095
    • Combining local context and WordNet similarity for word sense identification
    • MIT Press, Cambridge, MA
    • Leacock, C., & Chodorow, M. (1998). Combining local context and WordNet similarity for word sense identification. In WordNet: An Electronic Lexical Database, pp. 265-283. MIT Press, Cambridge, MA.
    • (1998) WordNet: An Electronic Lexical Database , pp. 265-283
    • Leacock, C.1    Chodorow, M.2
  • 60
    • 0029410109
    • CYC: A large-scale investment in knowledge infrastructure
    • Lenat, D. B. (1995). CYC: A large-scale investment in knowledge infrastructure. Communications of the ACM, 38(11).
    • (1995) Communications of the ACM , Issue.11
    • Lenat, D.B.1
  • 61
    • 33749125343
    • From 2001 to 2001: Common sense and the mind of HAL
    • The MIT Press
    • Lenat, D. B. (1997). From 2001 to 2001: Common sense and the mind of HAL. In HAL's Legacy, pp. 194-209. The MIT Press.
    • (1997) HAL's Legacy , pp. 194-209
    • Lenat, D.B.1
  • 63
    • 0036161242
    • Text categorization with support vector machines: How to represent texts in input space
    • Leopold, E., & Kindermann, J. (2002). Text categorization with support vector machines: How to represent texts in input space. Machine Learning, 46, 423-444.
    • (2002) Machine Learning , vol.46 , pp. 423-444
    • Leopold, E.1    Kindermann, J.2
  • 67
  • 70
    • 19544372770
    • Improving text classification using local latent semantic indexing
    • Liu, T., Chen, Z., Zhang, B., Ma, W.-Y., & Wu, G. (2004). Improving text classification using local latent semantic indexing. In ICDM'04, pp. 162-169.
    • (2004) ICDM'04 , pp. 162-169
    • Liu, T.1    Chen, Z.2    Zhang, B.3    Ma, W.-Y.4    Wu, G.5
  • 72
    • 0036778917
    • Feature generation using general constructor functions
    • Markovitch, S., & Rosenstein, D. (2002). Feature generation using general constructor functions. Machine Learning, 49(1), 59-98.
    • (2002) Machine Learning , vol.49 , Issue.1 , pp. 59-98
    • Markovitch, S.1    Rosenstein, D.2
  • 75
    • 0004234233
    • MeSH , MeSH
    • MeSH (2003). Medical subject headings (MeSH). National Library of Medicine, http://www.nlm.nih.gov/rmesh.
    • (2003) Medical subject headings
  • 77
    • 0141611059
    • Turning wordnet into an information retrieval resource: Systematic polysemy and conversion to hierarchical codes
    • Mihalcea, R. (2003). Turning wordnet into an information retrieval resource: Systematic polysemy and conversion to hierarchical codes. International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI), 17(1), 689-704.
    • (2003) International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI) , vol.17 , Issue.1 , pp. 689-704
    • Mihalcea, R.1
  • 78
    • 33750693384
    • Corpus-based and knowledge-based measures of text semantic similarity
    • Mihalcea, R., Corley, C., & Strapparava, C. (2006). Corpus-based and knowledge-based measures of text semantic similarity. In AAAI'06, pp. 775-780.
    • (2006) AAAI'06 , pp. 775-780
    • Mihalcea, R.1    Corley, C.2    Strapparava, C.3
  • 83
    • 0041160562
    • The proper treatment of quantification in ordinary English
    • Hintikka, J., Moravcsik, J., & Suppes, P. (Eds.), Reidel, Dordrecht
    • Montague, R. (1973). The proper treatment of quantification in ordinary English. In Hintikka, J., Moravcsik, J., & Suppes, P. (Eds.), Approaches to Natural Language, pp. 373-398. Reidel, Dordrecht.
    • (1973) Approaches to Natural Language , pp. 373-398
    • Montague, R.1
  • 84
  • 85
    • 0025389210
    • Boolean feature discovery in empirical learning
    • Pagallo, G., & Haussler, D. (1990). Boolean feature discovery in empirical learning. Machine Learning, 5(1), 71-99.
    • (1990) Machine Learning , vol.5 , Issue.1 , pp. 71-99
    • Pagallo, G.1    Haussler, D.2
  • 87
    • 3843083955
    • Augmenting naive Bayes classifiers with statistical language models
    • Peng, F., Schuurmans, D., & Wang, S. (2004). Augmenting naive Bayes classifiers with statistical language models. Information Retrieval, 7(3-4), 317-345.
    • (2004) Information Retrieval , vol.7 , Issue.3-4 , pp. 317-345
    • Peng, F.1    Schuurmans, D.2    Wang, S.3
  • 89
    • 65349142296
    • Comparison of human and latent semantic analysis (LSA) judgements of pairwise document similarities for a news corpus. Tech. rep. DSTO-RR-0278, Information Sciences Laboratory, Defence Science and Technology Organization, Department of Defense, Australian Government
    • Pincombe, B. (2004). Comparison of human and latent semantic analysis (LSA) judgements of pairwise document similarities for a news corpus. Tech. rep. DSTO-RR-0278, Information Sciences Laboratory, Defence Science and Technology Organization, Department of Defense, Australian Government.
    • (2004)
    • Pincombe, B.1
  • 90
    • 84948481845
    • An algorithm for suffix stripping
    • Porter, M. (1980). An algorithm for suffix stripping. Program, 14(3), 130-137.
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.1
  • 94
    • 84948187054
    • Second order features for maximizing text classification performance
    • De Raedt, L., & Flach, P. (Eds.), Proceedings of the European Conference on Machine Learning (ECML), Springer-Verlag
    • Raskutti, B., Ferra, H., & Kowalczyk, A. (2001). Second order features for maximizing text classification performance. In De Raedt, L., & Flach, P. (Eds.), Proceedings of the European Conference on Machine Learning (ECML), Lecture notes in Artificial Intelligence (LNAI) 2167, pp. 419-430. Springer-Verlag.
    • (2001) Lecture notes in Artificial Intelligence (LNAI) , vol.2167 , pp. 419-430
    • Raskutti, B.1    Ferra, H.2    Kowalczyk, A.3
  • 95
    • 0002016474
    • Semantic similarity in a taxonomy: An information-based measure and its application to problems of ambiguity in natural language
    • Resnik, P. (1999). Semantic similarity in a taxonomy: An information-based measure and its application to problems of ambiguity in natural language. Journal of Artificial Intelligence Research, 11, 95-130.
    • (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 95-130
    • Resnik, P.1
  • 100
    • 84880733494
    • Rose, T., Stevenson, M., & Whitehead, M. (2002). The Reuters Corpus Volume 1 - from yesterday's news to tomorrow's language resources. In Proceedings of the Third International Conference on Language Resources and Evaluation, pp. 7-13.
  • 104
    • 34250638291
    • A web-based kernel function for measuring the similarity of short text snippets
    • ACM Press
    • Sahami, M., & Heilman, T. (2006). A web-based kernel function for measuring the similarity of short text snippets. In WWW'06, pp. 377-386. ACM Press.
    • (2006) WWW'06 , pp. 377-386
    • Sahami, M.1    Heilman, T.2
  • 109
    • 0002442796
    • Machine learning in automated text categorization
    • Sebastiani, F. (2002). Machine learning in automated text categorization. ACM Computing Surveys, 34(1), 1-47.
    • (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 111
    • 84921996835
    • Cross-lingual information retrieval with explicit semantic analysis
    • Sorg, P., & Cimiano, P. (2008). Cross-lingual information retrieval with explicit semantic analysis. In Working Notes for the CLEF Workshop.
    • (2008) Working Notes for the CLEF Workshop
    • Sorg, P.1    Cimiano, P.2
  • 112
    • 33750738051
    • WikiRelate! Computing semantic relatedness using Wikipedia
    • Boston, MA
    • Strube, M., & Ponzetto, S. P. (2006). WikiRelate! Computing semantic relatedness using Wikipedia. In AAAI'06, pp. 1419-1424, Boston, MA.
    • (2006) AAAI'06 , pp. 1419-1424
    • Strube, M.1    Ponzetto, S.P.2
  • 115
    • 33748661515
    • Similarity of semantic relations
    • Turney, P. (2006). Similarity of semantic relations. Computational Linguistics, 32(3), 379-416.
    • (2006) Computational Linguistics , vol.32 , Issue.3 , pp. 379-416
    • Turney, P.1
  • 116
    • 65349116939
    • Unsupervised learning of semantic orientation from a hundred-billion-word corpus. Tech. rep. ERB-1094, National Research Council Canada
    • Turney, P., & Littman, M. L. (2002). Unsupervised learning of semantic orientation from a hundred-billion-word corpus. Tech. rep. ERB-1094, National Research Council Canada.
    • (2002)
    • Turney, P.1    Littman, M.L.2
  • 120
    • 65349153471
    • Wang, B. B., McKay, R., Abbass, H. A., & Barlow, M. (2003). A comparative study for domain ontology guided feature extraction. In Proceedings of the 26th Australian Computer Science Conference (ASCS-2003), pp. 69-78.
  • 123
    • 33750718642
    • Evaluating the utility of statistical phrases and latent semantic indexing for text classification
    • Wu, H., & Gunopulos, D. (2002). Evaluating the utility of statistical phrases and latent semantic indexing for text classification. In IEEE International Conference on Data Mining, pp. 713-716.
    • (2002) IEEE International Conference on Data Mining , pp. 713-716
    • Wu, H.1    Gunopulos, D.2
  • 127
  • 129
    • 85046835829
    • Automatically creating datasets for measures of semantic relatedness
    • Sydney, Australia
    • Zesch, T., & Gurevych, I. (2006). Automatically creating datasets for measures of semantic relatedness. In Proceedings of the ACL Workshop on Linguistic Distances, pp. 16-24, Sydney, Australia.
    • (2006) Proceedings of the ACL Workshop on Linguistic Distances , pp. 16-24
    • Zesch, T.1    Gurevych, I.2
  • 131
    • 0002848777
    • Exploring the similarity space
    • Zobel, J., & Moffat, A. (1998). Exploring the similarity space. ACM SIGIR Forum, 32(1), 18-34.
    • (1998) ACM SIGIR Forum , vol.32 , Issue.1 , pp. 18-34
    • Zobel, J.1    Moffat, A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.