메뉴 건너뛰기




Volumn 3, Issue , 2013, Pages 103-120

LeTs Preprocess: The multilingual LT3 linguistic preprocessing toolkit

Author keywords

[No Author keywords available]

Indexed keywords

CROSS VALIDATION; DIFFERENT DOMAINS; GOLD STANDARDS; LINGUISTIC PREPROCESSINGS; NAMED ENTITIES; PART-OF-SPEECH TAGGER; PRE-PROCESSING STEP; PREPROCESSING MODULES;

EID: 84907924798     PISSN: 22114009     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (45)

References (40)
  • 5
    • 84876795299 scopus 로고    scopus 로고
    • TIGER Morphologie-Annotationsschema
    • Universitat des Saarlandes - Computerlinguistik, Universität Stuttgart - Institut für Maschinelle Sprachverarbeitung, Universität Potsdam - Institut für Germanistik
    • Crysmann, B., S. Hansen-Schirra, G. Smith, and D. Ziegler-Eisele (2005), TIGER Morphologie-Annotationsschema, Technical report, Universitat des Saarlandes - Computerlinguistik, Universität Stuttgart - Institut für Maschinelle Sprachverarbeitung, Universität Potsdam - Institut für Germanistik.
    • (2005) Technical Report
    • Crysmann, B.1    Hansen-Schirra, S.2    Smith, G.3    Ziegler-Eisele, D.4
  • 12
    • 44949230930 scopus 로고    scopus 로고
    • Europarl: A parallel corpus for statistical machine translation
    • Phuket, Thailand
    • Koehn, P. (2005), Europarl: A parallel corpus for statistical machine translation, Proceedings of the tenth Machine Translation Summit, Phuket, Thailand, pp. 79-86.
    • (2005) Proceedings of the Tenth Machine Translation Summit , pp. 79-86
    • Koehn, P.1
  • 13
    • 0026890725 scopus 로고
    • Robust part-of-speech tagging using a hidden Markov model
    • Kupiec, J. (1992), Robust part-of-speech tagging using a hidden Markov model, Computer Speech and Language 6(3), pp. 225-242.
    • (1992) Computer Speech and Language , vol.6 , Issue.3 , pp. 225-242
    • Kupiec, J.1
  • 16
    • 84877027694 scopus 로고    scopus 로고
    • TExSIS: Bilingual terminology extraction from parallel corpora using chunk-based alignment
    • John Benjamins Publishing Company
    • Macken, L., E. Lefever, and V. Hoste (2013), TExSIS: bilingual terminology extraction from parallel corpora using chunk-based alignment, Terminology 19(1), pp. 1-30, John Benjamins Publishing Company.
    • (2013) Terminology , vol.19 , Issue.1 , pp. 1-30
    • Macken, L.1    Lefever, E.2    Hoste, V.3
  • 17
    • 34249852033 scopus 로고
    • Building a large annotated corpus of English: The Penn Treebank
    • Marcus, M. P., B. Santorini, and M. A. Marcinkiewicz (1993), Building a large annotated corpus of English: The Penn Treebank, Computational Linguistics 19(2), pp. 313-330.
    • (1993) Computational Linguistics , vol.19 , Issue.2 , pp. 313-330
    • Marcus, M.P.1    Santorini, B.2    Marcinkiewicz, M.A.3
  • 18
    • 85121365374 scopus 로고    scopus 로고
    • Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons
    • Morristown, New Jersey, USA
    • McCallum, A. and W. Li (2003), Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons, Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003, Morristown, New Jersey, USA, pp. 188-191.
    • (2003) Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003 , pp. 188-191
    • McCallum, A.1    Li, W.2
  • 23
    • 0012941680 scopus 로고
    • Part-of-speech tagging guidelines for the Penn Treebank Project
    • University of Pennsylvania, Department of Computer and Information Science, Philadelphia, Pennsylvania, USA
    • Santorini, B. (1990), Part-of-speech tagging guidelines for the Penn Treebank Project, Technical report, University of Pennsylvania, Department of Computer and Information Science, Philadelphia, Pennsylvania, USA.
    • (1990) Technical Report
    • Santorini, B.1
  • 24
    • 33646195659 scopus 로고    scopus 로고
    • Guidelines für das Tagging deutscher Textcorpora mit STTS (kleines und großes Tagset)
    • Universität Stuttgart - Institut für maschinelle Sprachverarbeitung, Universität Tübingen - Seminar für Sprachwissenschaft
    • Schiller, A., S. Teufel, C. Stöckert, and C. Thielen (1999), Guidelines für das Tagging deutscher Textcorpora mit STTS (kleines und großes Tagset), Technical report, Universität Stuttgart - Institut für maschinelle Sprachverarbeitung, Universität Tübingen - Seminar für Sprachwissenschaft.
    • (1999) Technical Report
    • Schiller, A.1    Teufel, S.2    Stöckert, C.3    Thielen, C.4
  • 26
    • 0042096783 scopus 로고
    • Improvements in part-of-speech tagging with an application to German
    • Dublin, Ireland
    • Schmid, H. (1995), Improvements in part-of-speech tagging with an application to German, Proceedings of the ACL SIGDAT-Workshop, Dublin, Ireland, pp. 47-50.
    • (1995) Proceedings of the ACL SIGDAT-workshop , pp. 47-50
    • Schmid, H.1
  • 29
    • 84907925004 scopus 로고    scopus 로고
    • A brief introduction to the TIGER treebank, version 1
    • Universität Potsdam
    • Smith, G. (2003), A brief introduction to the TIGER treebank, version 1, Technical report, Universität Potsdam.
    • (2003) Technical Report
    • Smith, G.1
  • 32
    • 9444254243 scopus 로고    scopus 로고
    • Memory-based named entity recognition
    • Taipei, Taiwan
    • Tjong Kim Sang, E. F. (2002b), Memory-based named entity recognition, Proceedings of CoNLL-2002, Taipei, Taiwan, pp. 203-206.
    • (2002) Proceedings of CoNLL-2002 , pp. 203-206
    • Tjong Kim Sang, E.F.1
  • 33
    • 85099019865 scopus 로고    scopus 로고
    • Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition
    • Edmonton, Canada
    • Tjong Kim Sang, E. F. and F. De Meulder (2003), Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition, Proceedings of CoNLL-2003, Edmonton, Canada, pp. 142-147.
    • (2003) Proceedings of CoNLL-2003 , pp. 142-147
    • Tjong Kim Sang, E.F.1    De Meulder, F.2
  • 36
    • 57349166605 scopus 로고    scopus 로고
    • Part of speech tagging en lemmatisering van het D-Coi corpus
    • Centrum voor Computerlingüistiek, KU Leuven
    • Van Eynde, F. (2005), Part of speech tagging en lemmatisering van het D-Coi corpus, Technical report, Centrum voor Computerlingüistiek, KU Leuven.
    • (2005) Technical Report
    • Van Eynde, F.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.