



Volume 62, Issue 3, 2006, Pages 328-349

An evaluation of conflation accuracy using finite-state transducers

Author keywords

Accuracy; Linguistics; Programming and algorithm theory; Semantics

Indexed keywords


EID: 33646360230     PISSN: 00220418     EISSN: None     Source Type: Journal    
DOI: 10.1108/00220410610666493     Document Type: Article
Times cited: 4

References (64)
  • 1
    • 0016083316
    • The use of an association measure based on character structure to identify semantically related pairs of words and document titles
    • Adamson, G.W. and Boreham, J. (1974), "The use of an association measure based on character structure to identify semantically related pairs of words and document titles", Information Storage and Retrieval, Vol. 10 No. 1, pp. 253-60.
    • (1974) Information Storage and Retrieval , vol.10 , Issue.1 , pp. 253-60
    • Adamson, G.W.1    Boreham, J.2
  • 2
    • 33646337879
    • Morfología del verbo español
    • Martin Vide, C. Publicaciones de la Universidad Barcelona
    • Alcoba, S. (1991), "Morfología del verbo español", in Martin Vide, C. (Ed.), Lenguajes Naturales y Lenguajes Formales, Publicaciones de la Universidad, Barcelona.
    • (1991) Lenguajes Naturales y Lenguajes Formales
    • Alcoba, S.1
  • 5
    • 33646368641
    • Contribución al estudio del verbo español: Un análisis morfosemántico
    • Ambadiang, T. (1990), "Contribución al estudio del verbo español: un análisis morfosemántico", Anuario de Lingüística Hispánica, Vol. 6, pp. 29-63.
    • (1990) Anuario de Lingüística Hispánica , vol.6 , pp. 29-63
    • Ambadiang, T.1
  • 7
    • 0020685593
    • Automatic spelling correction using a trigram similarity measure
    • Angell, R.C., Freund, G.E. and Willett, P. (1983), "Automatic spelling correction using a trigram similarity measure", Information Processing & Management, Vol. 19 No. 4, pp. 255-61.
    • (1983) Information Processing & Management , vol.19 , Issue.4 , pp. 255-61
    • Angell, R.C.1    Freund, G.E.2    Willett, P.3
  • 9
    • 0002784843
    • Document retrieval and routing using the INQUERY system
    • Harman, D.K. National Institute of Standards and Technology Special Publication 500-225 Gaithersburg, MD
    • Broglio, J., Callan, J.P., Croft, W.B. and Nachbar, D.W. (1994), "Document retrieval and routing using the INQUERY system", in Harman, D.K. (Ed.), Proceedings of the Third Text REtrieval Conference (TREC-3), National Institute of Standards and Technology Special Publication 500-225, Gaithersburg, MD, pp. 29-38.
    • (1994) Proceedings of the Third Text REtrieval Conference (TREC-3) , pp. 29-38
    • Broglio, J.1    Callan, J.P.2    Croft, W.B.3    Nachbar, D.W.4
  • 10
    • 0002039278
    • Automatic query expansion using SMART: TREC 3
    • Harman, D.K. National Institute of Standards and Technology Special Publication 500-225 Gaithersburg, MD
    • Buckley, C., Salton, G., Allan, J. and Singhal, A. (1994), "Automatic query expansion using SMART: TREC 3", in Harman, D.K. (Ed.), Proceedings of the Third Text REtrieval Conference (TREC-3), National Institute of Standards and Technology Special Publication 500-225, Gaithersburg, MD, pp. 69-80.
    • (1994) Proceedings of the Third Text REtrieval Conference (TREC-3) , pp. 69-80
    • Buckley, C.1    Salton, G.2    Allan, J.3    Singhal, A.4
  • 11
    • 0040260430
    • Using query zoning and correlation within SMART: TREC 5
    • Harman, D.K. National Institute of Standards and Technology Special Publication 500-238 Gaithersburg, MD
    • Buckley, C., Singhal, A. and Mitra, M. (1996), "Using query zoning and correlation within SMART: TREC 5", in Harman, D.K. (Ed.), Proceedings of the Fifth Text REtrieval Conference (TREC-5), National Institute of Standards and Technology Special Publication 500-238, Gaithersburg, MD, pp. 105-18.
    • (1996) Proceedings of the Fifth Text REtrieval Conference (TREC-5) , pp. 105-18
    • Buckley, C.1    Singhal, A.2    Mitra, M.3
  • 12
    • 0002529215
    • New retrieval approaches using SMART: TREC 4
    • Harman, D.K. National Institute of Standards and Technology Special Publication 500-236 Gaithersburg, MD
    • Buckley, C., Singhal, A., Mitra, M. and Salton, G. (1995), "New retrieval approaches using SMART: TREC 4", in Harman, D.K. (Ed.), Proceedings of the Fourth Text REtrieval Conference (TREC-4), National Institute of Standards and Technology Special Publication 500-236, Gaithersburg, MD, pp. 25-48.
    • (1995) Proceedings of the Fourth Text REtrieval Conference (TREC-4) , pp. 25-48
    • Buckley, C.1    Singhal, A.2    Mitra, M.3    Salton, G.4
  • 14
    • 0142093775
    • Using an n-gram based document representation with a vector processing retrieval model
    • Harman, D.K. National Institute of Standards and Technology Gaithersburg, MD
    • Cavnar, W.B. (1994), "Using an n-gram based document representation with a vector processing retrieval model", in Harman, D.K. (Ed.), Proceedings of the Third Text REtrieval Conference (TREC-3), National Institute of Standards and Technology, Gaithersburg, MD, pp. 269-78.
    • (1994) Proceedings of the Third Text REtrieval Conference (TREC-3) , pp. 269-78
    • Cavnar, W.B.1
  • 18
    • 0028911698
    • Gauging similarity with n-grams: Language independent categorization of text
    • Damashek, M. (1995), "Gauging similarity with n-grams: language independent categorization of text", Science, Vol. 267, pp. 843-8.
    • (1995) Science , vol.267 , pp. 843-8
    • Damashek, M.1
  • 21
    • 0001918328
    • Stemming algorithms
    • Frakes, W.B. Baeza-Yates, R. Prentice-Hall Englewood Cliffs, NJ
    • Frakes, W.B. (1992), "Stemming algorithms", in Frakes, W.B. and Baeza-Yates, R. (Eds), Information Retrieval: Data Structures and Algorithms, Prentice-Hall, Englewood Cliffs, NJ.
    • (1992) Information Retrieval: Data Structures and Algorithms
    • Frakes, W.B.1
  • 22
    • 33646350476
    • Strength and similarity of affix removal stemming algorithms
    • Frakes, W.B. and Fox, C.J. (2003), "Strength and similarity of affix removal stemming algorithms", ACM SIGIR Forum, Vol. 37 No. 1, pp. 26-30.
    • (2003) ACM SIGIR Forum , vol.37 , Issue.1 , pp. 26-30
    • Frakes, W.B.1    Fox, C.J.2
  • 23
    • 23744486846
    • Term conflation methods in information retrieval: Non-linguistic and linguistic approaches
    • Galvez, C., Moya-Anegón, F. and Solana, V.H. (2005), "Term conflation methods in information retrieval: non-linguistic and linguistic approaches", Journal of Documentation, Vol. 61 No. 4, pp. 520-47.
    • (2005) Journal of Documentation , vol.61 , Issue.4 , pp. 520-47
    • Galvez, C.1    Moya-Anegón, F.2    Solana, V.H.3
  • 24
    • 33646351244
    • Logistic regression at TREC 4: Probabilistic retrieval from full text document collections
    • Harman, D.K. National Institute of Standards and Technology Special Publication 500-236 Gaithersburg, MD
    • Gey, F.C., Chen, J.A., He, M. and Jason, M. (1995), "Logistic regression at TREC 4: probabilistic retrieval from full text document collections", in Harman, D.K. (Ed.), Proceedings of the Fourth Text REtrieval Conference (TREC-4), National Institute of Standards and Technology Special Publication 500-236, Gaithersburg, MD, pp. 65-72.
    • (1995) Proceedings of the Fourth Text REtrieval Conference (TREC-4) , pp. 65-72
    • Gey, F.C.1    Chen, J.A.2    He, M.3    Jason, M.4
  • 27
    • 21644479790
    • The accentual patterns of verb paradigms in Spanish
    • Harris, J.W. (1987), "The accentual patterns of verb paradigms in Spanish", Natural Language and Linguistic Theory, Vol. 5, pp. 61-90.
    • (1987) Natural Language and Linguistic Theory , vol.5 , pp. 61-90
    • Harris, J.W.1
  • 29
    • 0002910412
    • Stemming algorithms: A case study for detailed evaluation
    • Hull, D.A. (1996), "Stemming algorithms: a case study for detailed evaluation", Journal of the American Society for Information Science, Vol. 47 No. 1, pp. 70-84.
    • (1996) Journal of the American Society for Information Science , vol.47 , Issue.1 , pp. 70-84
    • Hull, D.A.1
  • 30
    • 0342621305
    • Xerox TREC-5 site report: Routing filtering, NLP and Spanish tracks
    • Voorhees, E.M. Harman, D.K. National Institute of Standards and Technology Special Publication 500-238 Gaithersburg, MD
    • Hull, D.A., Grefenstette, G., Schulze, B.M., Gaussier, E., Schütze, H. and Pedersen, J.O. (1996), "Xerox TREC-5 site report: routing filtering, NLP and Spanish tracks", in Voorhees, E.M. and Harman, D.K. (Eds), The Fifth Text REtrieval Conference (TREC-5), National Institute of Standards and Technology Special Publication 500-238, Gaithersburg, MD, pp. 167-80.
    • (1996) The Fifth Text REtrieval Conference (TREC-5) , pp. 167-80
    • Hull, D.A.1    Grefenstette, G.2    Schulze, B.M.3    Gaussier, E.4    Schütze, H.5    Pedersen, J.O.6
  • 31
    • 0003110740
    • NLP for term variant extraction: Synergy between morphology, lexicon, and syntax
    • Strzalkowski, T. Kluwer Dordrecht
    • Jacquemin, C. and Tzoukermann, E. (1999), "NLP for term variant extraction: synergy between morphology, lexicon, and syntax", in Strzalkowski, T. (Ed.), Natural Language Information Retrieval, Kluwer, Dordrecht.
    • (1999) Natural Language Information Retrieval
    • Jacquemin, C.1    Tzoukermann, E.2
  • 34
    • 23744498084
    • KIMMO: A general morphological processor
    • Karttunen, L. (1983), "KIMMO: a general morphological processor", Texas Linguistics Forum, Vol. 22, pp. 217-28.
    • (1983) Texas Linguistics Forum , vol.22 , pp. 217-28
    • Karttunen, L.1
  • 37
    • 84945184398
    • TREC-5 experiments at Dublin City University: Query space reduction, Spanish stemming and character shape encoding
    • Voorhees, E.M. Harman, D.K. National Institute of Standards and Technology Special Publication 500-238 Gaithersburg, MD
    • Kelledy, F. and Smeaton, A.F. (1996), "TREC-5 experiments at Dublin City University: query space reduction, Spanish stemming and character shape encoding", in Voorhees, E.M. and Harman, D.K. (Eds), Proceedings of the Fifth Text REtrieval Conference (TREC-5), National Institute of Standards and Technology Special Publication 500-238, Gaithersburg, MD, pp. 57-64.
    • (1996) Proceedings of the Fifth Text REtrieval Conference (TREC-5) , pp. 57-64
    • Kelledy, F.1    Smeaton, A.F.2
  • 41
    • 0040402828
    • Evaluation of a Dutch stemming algorithm
    • Rowley, R. Taylor Graham London
    • Kraaij, W. and Pohlmann, R. (1995), "Evaluation of a Dutch stemming algorithm", in Rowley, R. (Ed.), The New Review of Document and Text Management, Vol. 1, Taylor Graham, London.
    • (1995) The New Review of Document and Text Management , vol.1
    • Kraaij, W.1    Pohlmann, R.2
  • 43
    • 0019613156
    • An evaluation of some conflation algorithms for information retrieval
    • Lennon, M., Pierce, D.S., Tarry, B.D. and Willett, P. (1981), "An evaluation of some conflation algorithms for information retrieval", Journal of Information Science, Vol. 3 No. 4, pp. 177-83.
    • (1981) Journal of Information Science , vol.3 , Issue.4 , pp. 177-83
    • Lennon, M.1    Pierce, D.S.2    Tarry, B.D.3    Willett, P.4
  • 45
    • 25444470633
    • The inflection component of a word-and-paradigm grammar
    • Matthews, P.H. (1965), "The inflection component of a word-and-paradigm grammar", Journal of Linguistics, Vol. 1, pp. 139-71.
    • (1965) Journal of Linguistics , vol.1 , pp. 139-71
    • Matthews, P.H.1
  • 47
    • 33646361028
    • Notas sobre la noción de aspecto en un marco de clasificación de verbos (Vb) y sustantivos verbales (Sv)
    • Mighetto, D. (1992), "Notas sobre la noción de aspecto en un marco de clasificación de verbos (Vb) y sustantivos verbales (Sv)", Voz y Letra, Vol. 3 No. 1, pp. 69-100.
    • (1992) Voz y Letra , vol.3 , Issue.1 , pp. 69-100
    • Mighetto, D.1
  • 49
    • 84976754255
    • Another stemmer
    • Paice, C.D. (1990), "Another stemmer", ACM SIGIR Forum, Vol. 24 No. 3, pp. 56-61.
    • (1990) ACM SIGIR Forum , vol.24 , Issue.3 , pp. 56-61
    • Paice, C.D.1
  • 50
    • 0030216658
    • A method for evaluation of stemming algorithms based on error counting
    • Paice, C.D. (1996), "A method for evaluation of stemming algorithms based on error counting", Journal of the American Society for Information Science, Vol. 47 No. 8, pp. 632-49.
    • (1996) Journal of the American Society for Information Science , vol.47 , Issue.8 , pp. 632-49
    • Paice, C.D.1
  • 51
    • 84993016582
    • Morphological typology of languages for IR
    • Pirkola, A. (2001), "Morphological typology of languages for IR", Journal of Documentation, Vol. 57 No. 3, pp. 330-48.
    • (2001) Journal of Documentation , vol.57 , Issue.3 , pp. 330-48
    • Pirkola, A.1
  • 52
    • 84989549444
    • The effectiveness of stemming for natural-language access to Slovene textual data
    • Popovic, M. and Willett, P. (1992), "The effectiveness of stemming for natural-language access to Slovene textual data", Journal of the American Society for Information Science, Vol. 43 No. 5, pp. 384-90.
    • (1992) Journal of the American Society for Information Science , vol.43 , Issue.5 , pp. 384-90
    • Popovic, M.1    Willett, P.2
  • 53
    • 84948481845
    • An algorithm for suffix stripping
    • Porter, M.F. (1980), "An algorithm for suffix stripping", Program, Vol. 14, pp. 130-7.
    • (1980) Program , vol.14 , pp. 130-7
    • Porter, M.F.1
  • 54
    • 0032405150
    • Applications of n-grams in textual information systems
    • Robertson, A.M. and Willett, P. (1998), "Applications of n-grams in textual information systems", Journal of Documentation, Vol. 54 No. 1, pp. 48-69.
    • (1998) Journal of Documentation , vol.54 , Issue.1 , pp. 48-69
    • Robertson, A.M.1    Willett, P.2
  • 56
    • 0342986731
    • Stemming of French words based on grammatical categories
    • Savoy, J. (1993), "Stemming of French words based on grammatical categories", Journal of the American Society for Information Science, Vol. 44 No. 1, pp. 1-9.
    • (1993) Journal of the American Society for Information Science , vol.44 , Issue.1 , pp. 1-9
    • Savoy, J.1
  • 58
    • 0347368390
    • INTEX: An FST toolbox
    • Silberztein, M. (2000), "INTEX: an FST toolbox", Theoretical Computer Science, Vol. 231 No. 1, pp. 33-46.
    • (2000) Theoretical Computer Science , vol.231 , Issue.1 , pp. 33-46
    • Silberztein, M.1
  • 59
    • 0021393414
    • Automatic search term variant generation
    • Sparck Jones, K. and Tait, J.I. (1984), "Automatic search term variant generation", Journal of Documentation, Vol. 40 No. 1, pp. 50-66.
    • (1984) Journal of Documentation , vol.40 , Issue.1 , pp. 50-66
    • Sparck Jones, K.1    Tait, J.I.2
  • 62
    • 0142214474
    • COLE experiments at CLEF 2002 Spanish monolingual track
    • Peters, C. Braschler, M. Gonzalo, J. Kluck, M. Springer-Verlag Berlin (Lecture Notes in Computer Science, Vol. 2785), pp. 265-78
    • Vilares, J., Alonso, M.A., Ribadas, F.J. and Vilares, M. (2003), "COLE experiments at CLEF 2002 Spanish monolingual track", in Peters, C., Braschler, M., Gonzalo, J. and Kluck, M. (Eds), Advances in Cross-language Information Retrieval, Springer-Verlag, Berlin, (Lecture Notes in Computer Science, Vol. 2785), pp. 265-78.
    • (2003) Advances in Cross-language Information Retrieval
    • Vilares, J.1    Alonso, M.A.2    Ribadas, F.J.3    Vilares, M.4
  • 63
    • 33646362996
    • Morphological disambiguation
    • Karlsson, F. Voutilainen, A. Heikkilä, J. Anttila, A. Mouton de Gruyter Berlin
    • Voutilainen, A. (1995), "Morphological disambiguation", in Karlsson, F., Voutilainen, A., Heikkilä, J. and Anttila, A. (Eds), Constraint Grammar: A Language-independent System for Parsing Unrestricted Text, Mouton de Gruyter, Berlin, pp. 165-284.
    • (1995) Constraint Grammar: A Language-independent System for Parsing Unrestricted Text , pp. 165-284
    • Voutilainen, A.1
  • 64
    • 0031599183
    • Corpus-based stemming using co-occurrence of word variants
    • Xu, J. and Croft, B. (1998), "Corpus-based stemming using co-occurrence of word variants", ACM Transactions on Information Systems, Vol. 16 No. 1, pp. 61-81.
    • (1998) ACM Transactions on Information Systems , vol.16 , Issue.1 , pp. 61-81
    • Xu, J.1    Croft, B.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.