메뉴 건너뛰기




Volumn 7, Issue 2, 2009, Pages 101-135

Unsupervised Part-of-Speech Tagging in the Large

Author keywords

Application based evaluation; Graph clustering; Induction of parts of speech; Unsupervised and knowledge free NLP

Indexed keywords


EID: 77956695088     PISSN: 15707075     EISSN: 15728706     Source Type: Journal    
DOI: 10.1007/s11168-010-9067-9     Document Type: Article
Times cited : (18)

References (52)
  • 1
    • 77956658836 scopus 로고    scopus 로고
    • Banko, M., & Brill, E. (2001). Scaling to very very large corpora for natural language disambiguation. In Proceedings of ACL-01 (pp. 26-33).
  • 2
    • 85081787936 scopus 로고    scopus 로고
    • Biemann, C. (2006). Chinese whispers-an efficient graph clustering algorithm and its application to natural language processing problems. In Proceedings of textGraphs: The second workshop on graph based methods for natural language processing (pp. 73-80). New York City, June. Association for Computational Linguistics.
  • 3
    • 77956685040 scopus 로고    scopus 로고
    • Biemann, C. (2007). Unsupervised and knoweldge-free natural language processing in the structure discovery paradigm. Ph. D. thesis, University of Leipzig.
  • 4
    • 85130961613 scopus 로고    scopus 로고
    • Biemann, C., Giuliano, C., & Gliozzo A. (2007). Unsupervised part-of-speech tagging supporting supervised methods. In Proceedings of recent advances in natural language processing (RANLP-07), Borovets, Bulgaria.
  • 5
    • 77956707772 scopus 로고    scopus 로고
    • Brants, T. (2000). TnT: A statistical part-of-speech tagger. In Proceedings of the sixth conference on applied natural language processing (ANLP-00) (pp. 224-231). San Francisco, CA, USA: Morgan Kaufmann Publishers Inc.
  • 6
    • 77956708385 scopus 로고    scopus 로고
    • Brants, T., Hendriks, R., Kramp, S., Krenn, B., Preis, C., Skut, W., et al. (1997). Das NEGRA-Annotationsschema. Negra project report, Universität des Saarlandes, Saarbrücken.
  • 7
    • 77956703597 scopus 로고    scopus 로고
    • Brill, E. (1992). A simple rule-based part of speech tagger. In Proceedings of the third conference on applied natural language processing (ANLP-92) (pp. 152-155). Morristown, NJ, USA: Association for Computational Linguistics.
  • 10
    • 0027707504 scopus 로고    scopus 로고
    • Charniak, E., Hendrickson, C., Jacobson N., & Perkowitz, M. (1993). Equations for part-of-speech tagging. In National Conference on Artificial Intelligence (pp. 784-789).
  • 11
    • 77956691840 scopus 로고    scopus 로고
    • Clark, A. (2000). Inducing syntactic categories by context distribution clustering. In Cardie, C., Daelemans, W., Nédellec, C., & Tjong Kim Sang, E., (Eds.), In Proceedings of the fourth conference on computational natural language learning and of the second learning language in logic workshop, Lisbon, 2000 (pp. 91-94). Somerset, New Jersey: Association for Computational Linguistics.
  • 12
    • 77956677138 scopus 로고    scopus 로고
    • Clark, A. (2003). Combining distributional and morphological information for part of speech induction. In Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics (EACL-03) (pp. 59-66). Morristown, NJ, USA: Association for Computational Linguistics.
  • 13
    • 77956671633 scopus 로고    scopus 로고
    • Cucerzan, S., & Yarowsky, D. (1999). Language independent named entity recognition combining morphological and contextual evidence. In Proceedings of 1999 joint SIGDAT conference on EMNLP and VLC (pp. 132-138). College Park.
  • 14
    • 77952375075 scopus 로고    scopus 로고
    • Dhillon, I. S., Mallela, S., & Modha, D. S. (2003). Information-theoretic co-clustering. In Proceedings of The ninth ACM SIGKDD international conference on knowledge discovery and data mining(KDD-2003) (pp. 89-98).
  • 15
    • 85055298348 scopus 로고
    • Accurate methods for the statistics of surprise and coincidence
    • Dunning T. E. (1993) Accurate methods for the statistics of surprise and coincidence. Computational Linguistics 19(1): 61-74.
    • (1993) Computational Linguistics , vol.19 , Issue.1 , pp. 61-74
    • Dunning, T.E.1
  • 16
    • 77956659293 scopus 로고    scopus 로고
    • Eiken, U. C., Liseth, A. T., Witschel, H. F., Richter, M., & Biemann, C. (2006). Ord i dag: Mining Norwegian daily newswire. In Proceedings of the FinTAL. Turku, Finland.
  • 17
    • 77956657874 scopus 로고    scopus 로고
    • Ertöz, L., Steinbach, M., & Kumar, V. (2002). A new shared nearest neighbor clustering algorithm and its applications. In Proceedings of workshop on clustering high dimensional data and its applications (pp. 105-115).
  • 18
    • 77956696719 scopus 로고    scopus 로고
    • Finch, S., & Chater, N. (1992). Bootstrapping syntactic categories using statistical methods. In Background and experiments in machine learning of natural language: Proceedings of the 1st SHOE Workshop (pp. 229-235). Brabant, Holland: Katholieke Universiteit.
  • 19
    • 77956671221 scopus 로고    scopus 로고
    • Freitag, D. (2004a). Toward unsupervised whole-corpus tagging. In Proceedings of the 20th international conference on Computational Linguistics (COLING-04) (p. 357). Morristown, NJ, USA: Association for Computational Linguistics.
  • 20
    • 77956697590 scopus 로고    scopus 로고
    • Freitag, D. (2004b). Trained named entity recognition using distributional clusters. In Proceedings of EMNLP-04, (pp. 262-269).
  • 22
    • 77956690434 scopus 로고    scopus 로고
    • Gauch, S., & Futrelle, R. (1994). Experiments in automatic word class and word sense identification for information retrieval. In Proceedings of the 3rd annual symposium on document analysis and information retrieval (pp. 425-434). Las Vegas, NV, April.
  • 23
    • 77956698851 scopus 로고    scopus 로고
    • Gliozzo, A. M. (2005). Semantic domains in computational linguistics. Ph. D. thesis, University of Trento, Italy.
  • 24
    • 84859893241 scopus 로고    scopus 로고
    • Gliozzo, A. M, Giuliano, C., & Strapparava, C. (2005). Domain kernels for word sense disambiguation. In Proceedings of the 43rd annual meeting of the association for computational linguistics (ACL-05) (pp. 403-410). Ann Arbor, Michigan, USA.
  • 25
    • 84860525845 scopus 로고    scopus 로고
    • Goldwater, S., & Griffiths, T. (2007). A fully bayesian approach to unsupervised part-of-speech tagging. In Proceedings of the 45th annual meeting of the association of computational linguistics (pp. 744-751). Prague, Czech Republic, June. Association for Computational Linguistics.
  • 26
    • 77956690886 scopus 로고    scopus 로고
    • Hagen, K., Johannessen J. B., & Nøklestad, A. (2000). A constraint-based tagger for Norwegian. In Lindberg, C.-E. og S. Nordahl Lund (red.) Proceedings of 17th scandinavian conference of linguistics, vol. I. Odense: Odense Working Papers in Language and Communication, I(19).
  • 27
    • 70049102734 scopus 로고    scopus 로고
    • Haghighi, A., & Klein, D. (2006). Prototype-driven learning for sequence models. In Proceedings of the human language technology conference of the North American chapter of the association of computational linguistics (HLT-NAACL-06). New York, NY, USA.
  • 31
    • 77956682954 scopus 로고    scopus 로고
    • Lafferty, J., McCallum, A., & Pereira, F. (2001). Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the 18th international conference on machine learning (ICML-01) (pp 282-289). San Francisco, CA: Morgan Kaufmann.
  • 33
    • 77956662813 scopus 로고    scopus 로고
    • Mihalcea, R., Chklovsky, T., & Kilgarriff, A. (2004). The SENSEVAL-3 english lexical sample task. In Proceedings of SENSEVAL-3: Third international workshop on the evaluation of systems for the semantic analysis of text (pp. 25-28). New Brunswick, NJ, USA.
  • 36
    • 0027929445 scopus 로고
    • On structuring probabilistic dependences in stochastic language modelling
    • Ney H., Essen U., Knese R. (1994) On structuring probabilistic dependences in stochastic language modelling. Computer Speech and Language 8(1): 1-38.
    • (1994) Computer Speech and Language , vol.8 , Issue.1 , pp. 1-38
    • Ney, H.1    Essen, U.2    Knese, R.3
  • 38
    • 77956671305 scopus 로고    scopus 로고
    • Peters, C. (Ed.). (2006). Working notes for the CLEF 2006 Workshop. Alicante, Spain.
  • 40
    • 84991789529 scopus 로고    scopus 로고
    • Quasthoff, U., Richter, M., & Biemann, C. (2006). Corpus portal for search in monolingual corpora. In Proceedings of the fifth international conference on language resources and evaluation (LREC-06) (pp. 1799-1802).
  • 41
    • 84859884440 scopus 로고    scopus 로고
    • Rapp, R. (2005). A practical solution to the problem of automatic part-of-speech induction from text. In Conference companion volume of the 43rd annual meeting of the association for computational linguistics (ACL-05). Ann Arbor, Michigan, USA.
  • 42
    • 77956663785 scopus 로고    scopus 로고
    • Roth, D., & van den Bosch, A. (Eds.). (2002). Proceedings of the sixth workshop on computational language learning (CoNLL-02). Taipei, Taiwan.
  • 43
    • 77956654784 scopus 로고    scopus 로고
    • Schmid, H. (1994). Probabilistic part-of-speech tagging using decision trees. In International conference on new methods in language processing. Manchester, UK.
  • 44
    • 77956653251 scopus 로고    scopus 로고
    • Schütze, H. (1993). Part-of-speech induction from scratch. In Proceedings of the 31st annual meeting on association for computational linguistics (ACL-93) (pp. 251-258). Morristown, NJ, USA: Association for Computational Linguistics.
  • 45
    • 77956671216 scopus 로고    scopus 로고
    • Schütze, H. (1995). Distributional part-of-speech tagging. In Proceedings of the 7th conference on European chapter of the association for Computational Linguistics (EACL-95) (pp. 141-148), San Francisco, CA, USA: Morgan Kaufmann Publishers Inc.
  • 46
    • 77956713488 scopus 로고    scopus 로고
    • Tjong Kim Sang, E., & Buchholz, S. (2000). Introduction to the conll-2000 shared task: Chunking. In Proceedings of CoNLL-2000, Lisbon, Portugal.
  • 47
    • 77956679029 scopus 로고    scopus 로고
    • Toutanova, K., Klein, D., Manning, C. D., & Singer, Y. (2003). Feature-rich part-of-speech tagging with a cyclic dependency network. In Proceedings of HLT-NAACL 2003 (pp. 252-259).
  • 48
    • 77956712851 scopus 로고    scopus 로고
    • van den Bosch, A., & Buchholz, S. (2001) Shallow parsing on the basis of words only: A case study. In ACL '02: Proceedings of the 40th annual meeting on association for computational linguistics (pp. 433-440). Morristown, NJ, USA: Association for Computational Linguistics.
  • 49
    • 0004217877 scopus 로고
    • 2nd edn., University of Glasgow: Department of Computer Science
    • van Rijsbergen C. J. (1979) Information retrieval, 2nd edition. Department of Computer Science, University of Glasgow.
    • (1979) Information Retrieval
    • van Rijsbergen, C.J.1
  • 50
    • 77956710851 scopus 로고    scopus 로고
    • Witschel, H. F., & Biemann, C. (2005). Rigorous dimensionality reduction through linguistically motivated feature selection for text categorization. In Proceedings of NODALIDA'05, Joensuu, Finland.
  • 51
    • 0027697605 scopus 로고
    • An optimal graph theoretic approach to data clustering: Theory and its application to image segmentation
    • Wu Z., Leahy R. (1993) An optimal graph theoretic approach to data clustering: Theory and its application to image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 15(11): 1101-1113.
    • (1993) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.15 , Issue.11 , pp. 1101-1113
    • Wu, Z.1    Leahy, R.2
  • 52
    • 77956665941 scopus 로고    scopus 로고
    • Yarowsky, D. (1994). Decision lists for lexical ambiguity resolution: application to accent restoration in spanish and french. In Proceedings of the 32nd annual meeting on association for computational linguistics (ACL-94) (pp. 88-95). Las Cruces, New Mexico.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.