메뉴 건너뛰기




Volumn 12, Issue 3, 2009, Pages 275-299

On knowledge-poor methods for person name matching and lemmatization for highly inflectional languages

Author keywords

Highly inflectional languages; Lemmatization; Person name matching; String distance metrics

Indexed keywords


EID: 64749104008     PISSN: 13864564     EISSN: 15737659     Source Type: Journal    
DOI: 10.1007/s10791-008-9085-5     Document Type: Article
Times cited : (29)

References (41)
  • 2
    • 85035644599 scopus 로고    scopus 로고
    • Entity-based cross-document co-referencing using the vector space model
    • Montreal, Quebec, Canada
    • Bagga, A., & Baldwin, B. (1998). Entity-based Cross-document Co-referencing Using the Vector Space Model. In Proceedings of the ACL 1998, Montreal, Quebec, Canada (pp. 79-85).
    • (1998) Proceedings of the ACL 1998 , pp. 79-85
    • Bagga, A.1    Baldwin, B.2
  • 6
    • 64749102882 scopus 로고    scopus 로고
    • Technical report, TR-CS-06-02, Computer Science Laboratory, The Australian National University, Canberra, Australia
    • Christen, P. (2006). A comparison of personal name matching: Techniques and practical issues. Technical report, TR-CS-06-02, Computer Science Laboratory, The Australian National University, Canberra, Australia.
    • (2006) A Comparison of Personal Name Matching: Techniques and Practical Issues
    • Christen, P.1
  • 7
    • 0742293389 scopus 로고
    • The analysis and acquisition of proper names for the understanding of a free text
    • S. Coates Steohens 1992 The analysis and acquisition of proper names for the understanding of a free text Computers and the Humanities 26 441 456
    • (1992) Computers and the Humanities , vol.26 , pp. 441-456
    • Coates Steohens, S.1
  • 10
    • 80053379324 scopus 로고    scopus 로고
    • Large scale named entity disambiguation based on Wikipedia data
    • Prague, Czech Republic, ACL
    • Cucerzan, S. (2007). Large scale named entity disambiguation based on Wikipedia data. In Proceedings of the EMNLP-CoNLL Joint Conference, Prague, Czech Republic, ACL.
    • (2007) Proceedings of the EMNLP-CoNLL Joint Conference
    • Cucerzan, S.1
  • 18
    • 84950419860 scopus 로고
    • Advances in record linking methodology as applied to the 1985 census of Tampa Florida
    • M. Jaro 1989 Advances in record linking methodology as applied to the 1985 census of Tampa Florida Journal of the American Statistical Society 84 406 414 420
    • (1989) Journal of the American Statistical Society , vol.84 , Issue.406 , pp. 414-420
    • Jaro, M.1
  • 20
    • 80053348579 scopus 로고    scopus 로고
    • Weakly supervised named-entity transliteration and discovery from multilingual comparable corpora
    • ACL
    • Klementiev, A., & Roth, D. (2006). Weakly supervised named-entity transliteration and discovery from multilingual comparable corpora. In Proceedings of ACL 2006 Conference. ACL
    • (2006) Proceedings of ACL 2006 Conference
    • Klementiev, A.1    Roth, D.2
  • 21
    • 0000390142 scopus 로고
    • Binary codes for correcting deletions, insertions, and reversals
    • V. Levenshtein 1965 Binary codes for correcting deletions, insertions, and reversals Doklady Akademii Nauk SSSR 163 4 845 848
    • (1965) Doklady Akademii Nauk SSSR , vol.163 , Issue.4 , pp. 845-848
    • Levenshtein, V.1
  • 24
    • 85107157397 scopus 로고    scopus 로고
    • Unsupervised personal name disambiguation
    • Edmonton, Canada
    • Mann, G., & Yarowsky, D. (2003). Unsupervised personal name disambiguation. In Proceedings of CoNLL 2003, Edmonton, Canada (pp. 33-40).
    • (2003) Proceedings of CoNLL 2003 , pp. 33-40
    • Mann, G.1    Yarowsky, D.2
  • 25
    • 64749084969 scopus 로고    scopus 로고
    • Web document
    • Miłkowski, M. (2007). Morfologik. Web document: http://morfologik.blogspot.com.
    • (2007) Morfologik
    • Miłkowski, M.1
  • 27
    • 64749110265 scopus 로고    scopus 로고
    • Using a WWW search engine to evaluate normalization performance for a highly inflectional language
    • Companion Volume
    • Ntoulas, A., Stamou, S., & Tzagarakis, M. (2001). Using a WWW search engine to evaluate normalization performance for a highly inflectional language. In Proceedings of ACL 2001 (Companion Volume) (pp. 31-36).
    • (2001) Proceedings of ACL 2001 , pp. 31-36
    • Ntoulas, A.1    Stamou, S.2    Tzagarakis, M.3
  • 29
    • 24344469749 scopus 로고    scopus 로고
    • Name discrimination by clustering similar contexts
    • Pedersen, T., Purandare, A., & Kulkarni, A. (2005). Name discrimination by clustering similar contexts. In CICLing (pp. 226-237).
    • (2005) CICLing , pp. 226-237
    • Pedersen, T.1    Purandare, A.2    Kulkarni, A.3
  • 30
    • 79551571679 scopus 로고    scopus 로고
    • Named-entity recognition for Polish with SproUT
    • L. Bolc, Z. Michalewicz, & T. Nishida (Eds.) Warsaw, Poland
    • Piskorski, J. (2005). Named-entity recognition for Polish with SProUT. In L. Bolc, Z. Michalewicz, & T. Nishida (Eds.), LNCS Vol 3490: Proceedings of IMTCI 2004, Warsaw, Poland.
    • (2005) LNCS Vol 3490: Proceedings of IMTCI 2004
    • Piskorski, J.1
  • 33
    • 64749100773 scopus 로고    scopus 로고
    • Automatic construction of multilingual name dictionaries
    • C. Goutte, N. Cancedda, M. Dymetman, & G. Foster (Eds.) MIT Press - Advances in Neural Information Processing (NIPS) Series
    • Pouliquen, B., & Steinberger, R. (2009). Automatic construction of multilingual name dictionaries. In: C. Goutte, N. Cancedda, M. Dymetman, & G. Foster (Eds.), Learning machine translation (pp. 59-78). MIT Press - Advances in Neural Information Processing (NIPS) Series.
    • (2009) Learning Machine Translation , pp. 59-78
    • Pouliquen, B.1    Steinberger, R.2
  • 35
    • 0019887799 scopus 로고
    • Identification of common molecular subsequences
    • T. Smith M. Waterman 1981 Identification of common molecular subsequences Journal of Molecular Biology 147 195 197
    • (1981) Journal of Molecular Biology , vol.147 , pp. 195-197
    • Smith, T.1    Waterman, M.2
  • 37
    • 0027113212 scopus 로고
    • Approximate string matching with q-grams and maximal matches
    • E. Ukkonen 1992 Approximate string matching with q-grams and maximal matches Theoretical Computer Science 92 1 191 211
    • (1992) Theoretical Computer Science , vol.92 , Issue.1 , pp. 191-211
    • Ukkonen, E.1
  • 38
    • 33645997866 scopus 로고    scopus 로고
    • Morphological and syntactic processing for text retrieval
    • Vilares, J., Alonso, M., & Vilares Ferro, M. (2004). Morphological and syntactic processing for text retrieval. In DEXA (pp. 371-380).
    • (2004) DEXA , pp. 371-380
    • Vilares, J.1    Alonso, M.2    Vilares Ferro, M.3
  • 39
    • 33750696453 scopus 로고    scopus 로고
    • A survey of freely available Polish stemmers and evaluation of their applicability in information retrieval
    • Poznań, Poland, 2005
    • Weiss, D. (2005). A survey of freely available Polish stemmers and evaluation of their applicability in information retrieval. In Proceedings of the 2nd Language and Technology Conference (LTC'2005), Poznań, Poland, 2005 (pp. 216-221).
    • (2005) Proceedings of the 2nd Language and Technology Conference (LTC'2005) , pp. 216-221
    • Weiss, D.1
  • 41
    • 0012866045 scopus 로고    scopus 로고
    • Technical report, Statistical Research Division, U.S. Bureau of the Census, Washington, DC
    • Winkler, W. (1999). The state of record linkage and current research problems. Technical report, Statistical Research Division, U.S. Bureau of the Census, Washington, DC.
    • (1999) The State of Record Linkage and Current Research Problems
    • Winkler, W.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.