메뉴 건너뛰기




Volumn 36, Issue 3, 2007, Pages 23-34

Overview and semantic issues of text mining

Author keywords

[No Author keywords available]

Indexed keywords

NATURAL LANGUAGE PROCESSING SYSTEMS; SEMANTICS; SYNTACTICS; TEXT PROCESSING;

EID: 35748948970     PISSN: 01635808     EISSN: None     Source Type: Journal    
DOI: 10.1145/1324185.1324190     Document Type: Conference Paper
Times cited : (103)

References (83)
  • 1
    • 85011081955 scopus 로고    scopus 로고
    • Scalable Semantic Web Data Management Using Vertical Partitioning
    • Austria, pp
    • Abadi, D., Marcus, A., Madden, S., and Hollenbach K. 2007. Scalable Semantic Web Data Management Using Vertical Partitioning. In Proc. of the 33rd VLDB, Austria, pp. 411-422.
    • (2007) Proc. of the 33rd VLDB , pp. 411-422
    • Abadi, D.1    Marcus, A.2    Madden, S.3    Hollenbach, K.4
  • 2
    • 35748984583 scopus 로고    scopus 로고
    • The national centre for text mining: Aims and objectives
    • Jan
    • Ananiadou, S., Chruszcz, J., Keane, J., Mcnaught, J., and Watry, P. 2005. The national centre for text mining: aims and objectives. In Ariadne 42, Jan. 2005.
    • (2005) Ariadne , vol.42
    • Ananiadou, S.1    Chruszcz, J.2    Keane, J.3    Mcnaught, J.4    Watry, P.5
  • 3
    • 84859921107 scopus 로고    scopus 로고
    • A high-performance semi-supervised learning method for text chunking
    • Ann Arbor, pp
    • rd ACL, Ann Arbor, pp 1-9.
    • (2005) rd ACL , pp. 1-9
    • Ando, R.K.1    Zhang, T.2
  • 6
    • 56549087736 scopus 로고    scopus 로고
    • Better rules, fewer features: A semantic approach to selecting features from text
    • San Jose, CA, pp
    • Blake, C., and Pratt, W. 2001. Better rules, fewer features: a semantic approach to selecting features from text. In Proc. of IEEE DM Conference (IEEE DM), San Jose, CA, pp. 59-66.
    • (2001) Proc. of IEEE DM Conference (IEEE DM) , pp. 59-66
    • Blake, C.1    Pratt, W.2
  • 9
    • 19544368439 scopus 로고    scopus 로고
    • Text classification by boosting weak learners based on terms and concepts
    • Brighton, UK, pp
    • th ICDM, Brighton, UK, pp. 331-334.
    • (2004) th ICDM , pp. 331-334
    • Bloehdorn, S.1    Hotho, A.2
  • 10
    • 0038483449 scopus 로고
    • Surface grammatical analysis for the extraction of terminological noun phrases
    • Nantes, pp
    • Bourigault D. 1992. Surface grammatical analysis for the extraction of terminological noun phrases. In Proc. of the 14th COLING-92, Nantes, pp. 977-981.
    • (1992) Proc. of the 14th COLING-92 , pp. 977-981
    • Bourigault, D.1
  • 11
    • 84874652784 scopus 로고    scopus 로고
    • Brown Corpus. http://helmer.aksis.uib.no/icame/brown/bcm.html
    • Brown Corpus
  • 14
    • 33750740871 scopus 로고    scopus 로고
    • Extracting knowledge from evaluative text
    • In the, Banff, Alberta, Canada, pp
    • Carenini, G., Ng, R.T., and Zwart, E. 2005. Extracting knowledge from evaluative text. In the 3rd KCAP, Banff, Alberta, Canada, pp. 11-18.
    • (2005) 3rd KCAP , pp. 11-18
    • Carenini, G.1    Ng, R.T.2    Zwart, E.3
  • 15
    • 0242647875 scopus 로고    scopus 로고
    • A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization
    • AMITA G. CHIN, Ed. Idea Group Publishing, Hershey, PA
    • Caropreso, M.F., Matwin, S., and Sebastiani, F. 2001. A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization. In Text Databases and Document Management: Theory and Practice, AMITA G. CHIN, Ed. Idea Group Publishing, Hershey, PA, 78-102.
    • (2001) Text Databases and Document Management: Theory and Practice , pp. 78-102
    • Caropreso, M.F.1    Matwin, S.2    Sebastiani, F.3
  • 16
    • 31144459206 scopus 로고    scopus 로고
    • Learning concept hierarchies from text corpora using formal concept analysis
    • Cimiano, P., Hotho, A., and Staab, S. 2005. Learning concept hierarchies from text corpora using formal concept analysis. Journal of Artificial Intelligence Research, 24, pp. 305-339.
    • (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 305-339
    • Cimiano, P.1    Hotho, A.2    Staab, S.3
  • 17
    • 35748966298 scopus 로고    scopus 로고
    • Cohen, K.B., and Hunter, L. 2004. Natural language processing and systems biology. In Artificial Intelligence methods and tools for systems biology, Dubitzky and Pereira, Springer Verlag.
    • Cohen, K.B., and Hunter, L. 2004. Natural language processing and systems biology. In Artificial Intelligence methods and tools for systems biology, Dubitzky and Pereira, Springer Verlag.
  • 18
    • 35048894035 scopus 로고    scopus 로고
    • Semisupervised text classification using partitioned EM
    • Jesu Island, Korea, pp
    • th DASFAA, Jesu Island, Korea, pp., 482-493.
    • (2004) th DASFAA , pp. 482-493
    • Cong, G.1    Lee, W.2    Wu, H.3    Liu, B.4
  • 22
    • 84871273579 scopus 로고    scopus 로고
    • EuroWordNet. http://www.illc.uva.nl/EuroWordNet
    • EuroWordNet
  • 24
    • 0007186571 scopus 로고
    • A synopsis of linguistic theory 1930-1955
    • Philological Society, Oxford, Reprinted in Selected papers of J.R.Firth 1952-1959, Longman, London
    • Firth, J.R. 1957. A synopsis of linguistic theory 1930-1955. In Studies in Linguistic Analysis, Philological Society, Oxford, 1-32. Reprinted in Selected papers of J.R.Firth 1952-1959, Longman, London.
    • (1957) Studies in Linguistic Analysis , pp. 1-32
    • Firth, J.R.1
  • 33
    • 85149131035 scopus 로고
    • Multi-paragraph segmentation of expository text
    • Las Cruces, NM, pp
    • Hearst, M.A. 1994. Multi-paragraph segmentation of expository text. In Proc. of the 32nd ACL, Las Cruces, NM, pp. 9-16.
    • (1994) Proc. of the 32nd ACL , pp. 9-16
    • Hearst, M.A.1
  • 34
    • 85036387096 scopus 로고    scopus 로고
    • Untangling text data mining
    • College Park, MD, pp
    • th ACL, College Park, MD, pp. 3-10.
    • (1999) th ACL , pp. 3-10
    • Hearst, M.A.1
  • 35
    • 3943106585 scopus 로고    scopus 로고
    • Accomplishments and challenges in literature data mining for biology
    • Hirschman, L., Park, J.C., Tsujii, J., Wong, L., and Wu, C. 2002. Accomplishments and challenges in literature data mining for biology. In BioInformatics, 18(12), pp. 1553-1561.
    • (2002) BioInformatics , vol.18 , Issue.12 , pp. 1553-1561
    • Hirschman, L.1    Park, J.C.2    Tsujii, J.3    Wong, L.4    Wu, C.5
  • 36
    • 28744443171 scopus 로고    scopus 로고
    • Creating the ultimate research assistant
    • Hoskinson, A. 2005. Creating the ultimate research assistant. IEEE Computer, 38(11), pp. 97-99.
    • (2005) IEEE Computer , vol.38 , Issue.11 , pp. 97-99
    • Hoskinson, A.1
  • 37
    • 33750308486 scopus 로고    scopus 로고
    • Identifying comparative sentences in text documents
    • Seattle, USA, pp
    • Jindal, N., and Bing, L. 2006. Identifying comparative sentences in text documents. In Proc. of the 29th SIGIR, Seattle, USA, pp. 244-251.
    • (2006) Proc. of the 29th SIGIR , pp. 244-251
    • Jindal, N.1    Bing, L.2
  • 38
    • 84989380187 scopus 로고    scopus 로고
    • Methods of automatic term recognition
    • Kageura, K., and Umino, B. 1996. Methods of automatic term recognition. Technology Journal, 3(2), pp. 259-289.
    • (1996) Technology Journal , vol.3 , Issue.2 , pp. 259-289
    • Kageura, K.1    Umino, B.2
  • 39
    • 85029543647 scopus 로고    scopus 로고
    • th LREC, IV, European Language Resources Association, Paris, 2004, pp. 1115-1118.
    • th LREC, vol. IV, European Language Resources Association, Paris, 2004, pp. 1115-1118.
  • 40
    • 35748952687 scopus 로고    scopus 로고
    • Report on KDD conference 2004 panel discussion - can natural language processing help text mining? SIGKDD Explorations
    • Dec
    • Kao, A., and Poteet, S. 2004. Report on KDD conference 2004 panel discussion - can natural language processing help text mining? SIGKDD Explorations 6(2), Dec. 2004, pp. 132-133.
    • (2004) , vol.6 , Issue.2 , pp. 132-133
    • Kao, A.1    Poteet, S.2
  • 41
    • 33750811268 scopus 로고    scopus 로고
    • Text mining and natural language processing - Introduction for the special issue
    • June
    • Kao, A., and Poteet S. 2006. Text mining and natural language processing - Introduction for the special issue. SIGKDD Explorations 7(1), June 2006, pp. 1-2.
    • (2006) SIGKDD Explorations , vol.7 , Issue.1 , pp. 1-2
    • Kao, A.1    Poteet, S.2
  • 42
    • 0142086575 scopus 로고    scopus 로고
    • A comparison of word- and sense-based text categorization using several classification algorithms
    • Kehagias, A., Petridis, V., Kaburlasos, V.G., and Fragkou, P. 2001. A comparison of word- and sense-based text categorization using several classification algorithms. Journal of Intelligent Information Systems, 21(3), pp. 227-247.
    • (2001) Journal of Intelligent Information Systems , vol.21 , Issue.3 , pp. 227-247
    • Kehagias, A.1    Petridis, V.2    Kaburlasos, V.G.3    Fragkou, P.4
  • 43
    • 35748942444 scopus 로고    scopus 로고
    • st ACL, Columbus, Ohio, USA, pp. 286-288.
    • st ACL, Columbus, Ohio, USA, pp. 286-288.
  • 44
    • 0142192295 scopus 로고    scopus 로고
    • Conditional random fields: Probabilistic models for segmenting and labeling sequence data
    • Williamstown, MA, pp
    • Lafferty, J., Mccallum, A., and Pereira, F. 2001. Conditional random fields: probabilistic models for segmenting and labeling sequence data. In Proc. of the 18th ICML, Williamstown, MA, pp. 282-289.
    • (2001) Proc. of the 18th ICML , pp. 282-289
    • Lafferty, J.1    Mccallum, A.2    Pereira, F.3
  • 45
    • 0027012244 scopus 로고
    • An evaluation of phrasal and clustered representations on a text categorization task
    • Copenhagen, Denmark, pp
    • Lewis, D.D. 1992. An evaluation of phrasal and clustered representations on a text categorization task. In Proc. of SIGIR, Copenhagen, Denmark, pp. 37-50.
    • (1992) Proc. of SIGIR , pp. 37-50
    • Lewis, D.D.1
  • 47
    • 33749612526 scopus 로고    scopus 로고
    • Integrating unstructured data into relational databases
    • Mansuri, I.R, and Sarawagi, S. 2006. Integrating unstructured data into relational databases. In Proc. of the 22nd ICDE, 29.
    • (2006) Proc. of the 22nd ICDE , pp. 29
    • Mansuri, I.R.1    Sarawagi, S.2
  • 48
    • 84996678707 scopus 로고    scopus 로고
    • Information Extraction: Distilling Structured Data from Unstructured Text
    • November
    • McCallum, A. 2005. Information Extraction: Distilling Structured Data from Unstructured Text. ACM Queue, 3(9), November 2005.
    • (2005) ACM Queue , vol.3 , Issue.9
    • McCallum, A.1
  • 49
    • 33745797351 scopus 로고    scopus 로고
    • Similarity measures for tracking information flow
    • Bremen, Germany, pp
    • Metzler, D., Bernstein, Y., Croft, W.B., Moffat, A., and Zobel, J. 2005. Similarity measures for tracking information flow. In Proc. of CIKM, Bremen, Germany, pp. 517-524.
    • (2005) Proc. of CIKM , pp. 517-524
    • Metzler, D.1    Bernstein, Y.2    Croft, W.B.3    Moffat, A.4    Zobel, J.5
  • 53
    • 33645690403 scopus 로고    scopus 로고
    • Mining knowledge from text using information extraction
    • June
    • Mooney, R.J., and Bunescu, R. 2005. Mining knowledge from text using information extraction. ACM SIGKDD Explorations 7(1), June 2006, pp. 3-10.
    • (2005) ACM SIGKDD Explorations , vol.7 , Issue.1 , pp. 3-10
    • Mooney, R.J.1    Bunescu, R.2
  • 55
    • 85136905861 scopus 로고    scopus 로고
    • Analyzing the effectiveness and applicability of co-training
    • In the, Kansas City, MI, pp
    • Nigam, K., and Ghani, R. 2000. Analyzing the effectiveness and applicability of co-training. In the 8th CIKM, Kansas City, MI, pp. 86-93.
    • (2000) 8th CIKM , pp. 86-93
    • Nigam, K.1    Ghani, R.2
  • 56
    • 1642404313 scopus 로고    scopus 로고
    • Linking lexicons and ontologies: Mapping WordNet to the suggested upper merged ontology
    • Las Vegas, Nevada, pp
    • Niles, I., and Pease, A. 2003. Linking lexicons and ontologies: mapping WordNet to the suggested upper merged ontology. In Proc. of the 2003 International Conference on IKE, Las Vegas, Nevada, pp. 412-416.
    • (2003) Proc. of the 2003 International Conference on IKE , pp. 412-416
    • Niles, I.1    Pease, A.2
  • 57
    • 85141803251 scopus 로고    scopus 로고
    • Thumbs up? Sentiment classification using machine learning techniques
    • Pang, B., Lee, L., and Vaithyanathan, S. 2002. Thumbs up? Sentiment classification using machine learning techniques. In Proc. of the 2002 EMNLP, pp. 79-86.
    • (2002) Proc. of the 2002 EMNLP , pp. 79-86
    • Pang, B.1    Lee, L.2    Vaithyanathan, S.3
  • 58
    • 35748986603 scopus 로고    scopus 로고
    • Penn Treebank. http://www.ois.upenn.edu/-treebank/home.html [59] Rajman, M., and Besançon, R. 1999. Stochastic distributional models for textual information retrieval. In Proc. of 9th ASMDA, Lisbon, Portugal, pp. 80-85.
    • Penn Treebank. http://www.ois.upenn.edu/-treebank/home.html [59] Rajman, M., and Besançon, R. 1999. Stochastic distributional models for textual information retrieval. In Proc. of 9th ASMDA, Lisbon, Portugal, pp. 80-85.
  • 59
    • 0003033112 scopus 로고
    • Using information content to evaluate semantic similarity in a taxonomy
    • Montreal, QC, Canada, pp
    • Resnik, P. 1995. Using information content to evaluate semantic similarity in a taxonomy. In Proc. of the 14th UCAI-95, Montreal, QC, Canada, pp. 448-453.
    • (1995) Proc. of the 14th UCAI-95 , pp. 448-453
    • Resnik, P.1
  • 60
    • 0002016474 scopus 로고    scopus 로고
    • Semantic similarity in a taxonomy: An information-based measure and its application to problems of ambiguity in natural language
    • Resnik, P. 1999. Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. Journal of Artificial Intelligence Research, 11, pp. 95-130.
    • (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 95-130
    • Resnik, P.1
  • 61
    • 0029206077 scopus 로고
    • Little words can make a big difference for text classification
    • Seattle, WA, pp
    • Riloff, E. 1995. Little words can make a big difference for text classification. In Proc. of the 18th SIGIR, Seattle, WA, pp. 130-136.
    • (1995) Proc. of the 18th SIGIR , pp. 130-136
    • Riloff, E.1
  • 62
    • 2342601888 scopus 로고
    • Syntactic approaches to automatic book indexing
    • NY
    • Salton, G. 1988. Syntactic approaches to automatic book indexing. In Proc. of the 26th ACL, NY, 120-138.
    • (1988) Proc. of the 26th ACL , pp. 120-138
    • Salton, G.1
  • 63
    • 0016572913 scopus 로고
    • A vector space model for automatic indexing
    • Salton, G., Wong, A., and Yang, C.S. 1975. A vector space model for automatic indexing. In Communications of the ACM 18(11), pp. 613-620.
    • (1975) Communications of the ACM , vol.18 , Issue.11 , pp. 613-620
    • Salton, G.1    Wong, A.2    Yang, C.S.3
  • 65
    • 84880692052 scopus 로고    scopus 로고
    • Schapire, R.E. 1999. A brief introduction to boosting. In Proc. of the 16th IJCAI, Stockholm, pp. 1401-1405.
    • Schapire, R.E. 1999. A brief introduction to boosting. In Proc. of the 16th IJCAI, Stockholm, pp. 1401-1405.
  • 66
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • Sebastiani, F. 2002. Machine learning in automated text categorization. In ACM Computing Surveys, 34(1), pp. 1-47.
    • (2002) In ACM Computing Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 67
    • 33750569051 scopus 로고    scopus 로고
    • Classification of text, automatic
    • 2nd ed, Elsevier Science Pub, pp
    • Sebastiani, F. 2006. Classification of text, automatic. In The Encyclopedia of Language and Linguistics 14, 2nd ed., Elsevier Science Pub., pp. 457-462.
    • (2006) The Encyclopedia of Language and Linguistics 14 , pp. 457-462
    • Sebastiani, F.1
  • 68
    • 85182338269 scopus 로고    scopus 로고
    • Seco, N., Veale, T., and Hayes, J. 2004. An intrinsic information content metric for semantic similarity in WordNet. In Proc. of the 16th ECAI, Valencia, Spain, pp. 1089-1090.
    • Seco, N., Veale, T., and Hayes, J. 2004. An intrinsic information content metric for semantic similarity in WordNet. In Proc. of the 16th ECAI, Valencia, Spain, pp. 1089-1090.
  • 69
    • 84856043672 scopus 로고
    • A mathematical theory of communication
    • Shannon, C.E. 1948. A mathematical theory of communication. Bell System Technical Journal, 27, pp. 379-423.
    • (1948) Bell System Technical Journal , vol.27 , pp. 379-423
    • Shannon, C.E.1
  • 70
    • 27744487007 scopus 로고    scopus 로고
    • Text mining and ontologies in biomedicine: Making sense of raw text
    • Spasic, I., Ananiadou, S., Mcnaught, J., and Kumar, A. 2005. Text mining and ontologies in biomedicine: making sense of raw text. Briefings in Bioinformatics 6(3), pp. 239-251.
    • (2005) Briefings in Bioinformatics , vol.6 , Issue.3 , pp. 239-251
    • Spasic, I.1    Ananiadou, S.2    Mcnaught, J.3    Kumar, A.4
  • 71
    • 35748951183 scopus 로고    scopus 로고
    • SUMO
    • SUMO, http://ontology.teknowledge.com/
  • 72
    • 0028109119 scopus 로고
    • Assessing a gap in the biomedical literature: Magnesium deficiency and neurologic disease
    • Swanson, D.R., and Smalheiser, N.R. 1994. Assessing a gap in the biomedical literature: magnesium deficiency and neurologic disease. Neuroscience Research Communications 15(1), pp. 1-9.
    • (1994) Neuroscience Research Communications , vol.15 , Issue.1 , pp. 1-9
    • Swanson, D.R.1    Smalheiser, N.R.2
  • 73
    • 0031125707 scopus 로고    scopus 로고
    • An interactive system for finding complementary literatures: A stimulus to scientific discovery
    • Swanson, D.R., and Smalheiser, N.R. 1997. An interactive system for finding complementary literatures: a stimulus to scientific discovery. Artificial Intelligence 91, pp. 183-203.
    • (1997) Artificial Intelligence , vol.91 , pp. 183-203
    • Swanson, D.R.1    Smalheiser, N.R.2
  • 74
    • 2442507763 scopus 로고    scopus 로고
    • Measuring praise and criticism: Inference of semantic orientation from association
    • Turney, P.D., and Littman, M.L. 2003. Measuring praise and criticism: inference of semantic orientation from association. ACM TOIS 21(4), pp. 315-346.
    • (2003) ACM TOIS , vol.21 , Issue.4 , pp. 315-346
    • Turney, P.D.1    Littman, M.L.2
  • 77
    • 0032650194 scopus 로고    scopus 로고
    • Text mining: A new frontier for lossless compression
    • Snowbird, Utah, pp
    • Witten, I.H., Bray, Z., Mahoui, M., and Teahan, B. 1999. Text mining: a new frontier for lossless compression. In Proc. of DCC, Snowbird, Utah, pp. 198-207.
    • (1999) Proc. of DCC , pp. 198-207
    • Witten, I.H.1    Bray, Z.2    Mahoui, M.3    Teahan, B.4
  • 78
    • 35748970638 scopus 로고    scopus 로고
    • WordNet. http://wordnet.princeton.edu/
    • WordNet
  • 79
    • 85024373635 scopus 로고    scopus 로고
    • A re-examination of text categorization methods
    • Berkeley, CA, pp
    • Yang, Y., and Liu, X. 1999. A re-examination of text categorization methods. In Proc. of SIGIR, Berkeley, CA, pp. 42-49.
    • (1999) Proc. of SIGIR , pp. 42-49
    • Yang, Y.1    Liu, X.2
  • 80
    • 0003141935 scopus 로고    scopus 로고
    • A comparative study on feature selection in text categorization
    • Nashville, TN, pp
    • Yang, Y., and Pedersen, J. 1997. A comparative study on feature selection in text categorization. In Proc. of the 14th ICML, Nashville, TN, pp. 412-420.
    • (1997) Proc. of the 14th ICML , pp. 412-420
    • Yang, Y.1    Pedersen, J.2
  • 81
    • 85141919230 scopus 로고
    • Unsupervised word sense disambiguation rivaling supervised methods
    • Cambridge, MA, pp
    • Yarowsky, D. 1995. Unsupervised word sense disambiguation rivaling supervised methods. In Proc. of the 33rd ACL, Cambridge, MA, pp. 189-196.
    • (1995) Proc. of the 33rd ACL , pp. 189-196
    • Yarowsky, D.1
  • 82
    • 4944235422 scopus 로고    scopus 로고
    • Evaluation of text data mining for database curation: Lessons learned from the KDD challenge cup
    • Yeh, A.S., Hirschman, L., and Morgan, A.A. 2003. Evaluation of text data mining for database curation: lessons learned from the KDD challenge cup. Bioinformatics 19 (Suppl. 1), pp. i331-i339.
    • (2003) Bioinformatics , vol.19 , Issue.SUPPL. 1
    • Yeh, A.S.1    Hirschman, L.2    Morgan, A.A.3
  • 83
    • 35248863431 scopus 로고    scopus 로고
    • From resource discovery to knowledge discovery on the internet
    • 1998-13, Simon Fraser University, August
    • Zaïane, O.R. 1998. From resource discovery to knowledge discovery on the internet. Technical Report TR 1998-13, Simon Fraser University, August, 1998.
    • (1998) Technical Report TR
    • Zaïane, O.R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.