메뉴 건너뛰기




Volumn 51, Issue 8, 2000, Pages 691-706

An evaluation of retrieval effectiveness using spelling-correction and string-similarity matching methods on malay texts

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; CHARACTER RECOGNITION; COMPUTATIONAL METHODS; DYNAMIC PROGRAMMING; PATTERN MATCHING; PATTERN RECOGNITION SYSTEMS;

EID: 0033721946     PISSN: 15322882     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (7)

References (72)
  • 2
    • 0016083316 scopus 로고
    • The use of an association measure based on character structure to identify semantically related pairs of words and document titles
    • Adamson, G.W., & Boreham, J. (1974). The use of an association measure based on character structure to identify semantically related pairs of words and document titles. Information Storage and Retrieval, 10, 253-260.
    • (1974) Information Storage and Retrieval , vol.10 , pp. 253-260
    • Adamson, G.W.1    Boreham, J.2
  • 4
    • 0013454827 scopus 로고
    • String similarity and misspellings
    • Alberga, C.N. (1967). String similarity and misspellings. Communications of the ACM, 10(5), 302-313.
    • (1967) Communications of the ACM , vol.10 , Issue.5 , pp. 302-313
    • Alberga, C.N.1
  • 5
    • 0023041177 scopus 로고
    • A bit longest-common subsequence algorithm
    • Allison, L., & Dix, T.I. (1986). A bit longest-common subsequence algorithm. Information Processing Letters, 23, 305-310.
    • (1986) Information Processing Letters , vol.23 , pp. 305-310
    • Allison, L.1    Dix, T.I.2
  • 6
    • 84982481712 scopus 로고
    • Comparing words, stems and roots as index terms in an Arabic information retrieval system
    • Al-Kharashi, I.A., & Evens, M.W. (1994). Comparing words, stems and roots as index terms in an Arabic information retrieval system. Journal of the American Society for Information Science, 45(8), 548-560.
    • (1994) Journal of the American Society for Information Science , vol.45 , Issue.8 , pp. 548-560
    • Al-Kharashi, I.A.1    Evens, M.W.2
  • 8
    • 34447640446 scopus 로고
    • Alfred P. Sloan School of Management, WP # CIS-90-15. Cambridge, MA: Massachusetts Institute of Technology
    • Arcelus, A. (1990). Name management strategy for database integration. Alfred P. Sloan School of Management, WP # CIS-90-15. Cambridge, MA: Massachusetts Institute of Technology.
    • (1990) Name Management Strategy for Database Integration
    • Arcelus, A.1
  • 10
    • 0023480460 scopus 로고
    • A logical framework for the correction of spelling errors in electronic journals
    • Berghel, H.L. (1987). A logical framework for the correction of spelling errors in electronic journals. Information Processing and Management, 23, 477-494.
    • (1987) Information Processing and Management , vol.23 , pp. 477-494
    • Berghel, H.L.1
  • 11
    • 0023313308 scopus 로고
    • Automatic correction to misspelled names: A fourth-generation language approach
    • Bickel, M.A. (1987). Automatic correction to misspelled names: A fourth-generation language approach. Communications of the ACM, 30(3), 224-228.
    • (1987) Communications of the ACM , vol.30 , Issue.3 , pp. 224-228
    • Bickel, M.A.1
  • 13
    • 0017547820 scopus 로고
    • A fast string searching algorithm
    • Boyer, R.S., & Moore, J.S. (1977). A fast string searching algorithm. Communications of ACM 20(10), 762-772.
    • (1977) Communications of ACM , vol.20 , Issue.10 , pp. 762-772
    • Boyer, R.S.1    Moore, J.S.2
  • 14
    • 0028911698 scopus 로고
    • Gauging similarity with n-grams: Language-independent categorization of text
    • Damashek, M. (1995). Gauging similarity with n-grams: Language-independent categorization of text. Science, 267, 843-848.
    • (1995) Science , vol.267 , pp. 843-848
    • Damashek, M.1
  • 15
    • 84941869105 scopus 로고
    • A technique for computer detection and correction of spelling errors
    • Damerau, F.J. (1964). A technique for computer detection and correction of spelling errors. Communications of the ACM, 7, 171-176.
    • (1964) Communications of the ACM , vol.7 , pp. 171-176
    • Damerau, F.J.1
  • 16
    • 4644243617 scopus 로고
    • Retrieval of misspelled names in an airline's passenger record system
    • Davidson, L. (1962). Retrieval of misspelled names in an airline's passenger record system. Communications of the ACM, 5, 169-171.
    • (1962) Communications of the ACM , vol.5 , pp. 169-171
    • Davidson, L.1
  • 17
    • 84861604521 scopus 로고
    • Multiple comparisons using rank sums
    • Dunn, O.J. 1964. Multiple comparisons using rank sums. Technometrics, 6, 241-252.
    • (1964) Technometrics , vol.6 , pp. 241-252
    • Dunn, O.J.1
  • 18
    • 0024050355 scopus 로고
    • A new distance metric on strings computable in linear time
    • Ehrenfeucht, A., & Haussler, D. (1988). A new distance metric on strings computable in linear time. Discrete Applied Mathematics, 20, 191-203.
    • (1988) Discrete Applied Mathematics , vol.20 , pp. 191-203
    • Ehrenfeucht, A.1    Haussler, D.2
  • 20
    • 73549092375 scopus 로고
    • An inductive approach to language translation
    • Faulk, R.D. (1964). An inductive approach to language translation. Communications of the ACM, 7, 647-653.
    • (1964) Communications of the ACM , vol.7 , pp. 647-653
    • Faulk, R.D.1
  • 21
    • 0001918328 scopus 로고
    • Stemming algorithm
    • W.B. Frakes, & R. BaezaYates (Eds.), Englewood Cliffs, NJ: Prentice Hall
    • Frakes, W.B. 1992. Stemming algorithm. In W.B. Frakes, & R. BaezaYates (Eds.), Information retrieval: Data structures and algorithms (pp. 131-160), Englewood Cliffs, NJ: Prentice Hall.
    • (1992) Information Retrieval: Data Structures and Algorithms , pp. 131-160
    • Frakes, W.B.1
  • 22
    • 0013234285 scopus 로고
    • Online identification of word variants and arbitrary truncation searching using a string similarity measure
    • Freund, G.E., Willett, P. (1982).Online identification of word variants and arbitrary truncation searching using a string similarity measure. Information Technology: Research and Development, 1, 177-187.
    • (1982) Information Technology: Research and Development , vol.1 , pp. 177-187
    • Freund, G.E.1    Willett, P.2
  • 23
    • 1542701882 scopus 로고
    • Fisching fore werds: Phonetic retrieval of written text in information systems
    • Gadd, T.N. (1988). Fisching fore werds: Phonetic retrieval of written text in information systems. Program, 22, 222-237.
    • (1988) Program , vol.22 , pp. 222-237
    • Gadd, T.N.1
  • 24
    • 0141572720 scopus 로고
    • PHONIX: The algorithm
    • Gadd, T.N. (1990). PHONIX: The algorithm. Program, 24, 363-366.
    • (1990) Program , vol.24 , pp. 363-366
    • Gadd, T.N.1
  • 29
    • 0016518550 scopus 로고
    • A linear space algorithm for computing maximal common subsequences
    • Hirschberg, D.S. (1975). A linear space algorithm for computing maximal common subsequences. Journal of the ACM 18(6), 341-343.
    • (1975) Journal of the ACM , vol.18 , Issue.6 , pp. 341-343
    • Hirschberg, D.S.1
  • 30
    • 84976825385 scopus 로고
    • Algorithm for the longest common subsequence problem
    • Hirschberg, D.S. (1977). Algorithm for the longest common subsequence problem. Journal of the ACM, 24, 664-675.
    • (1977) Journal of the ACM , vol.24 , pp. 664-675
    • Hirschberg, D.S.1
  • 31
    • 34447647190 scopus 로고
    • The effect of stemming strength on the effectiveness of output ranking
    • Keen, E.M. 1992. The effect of stemming strength on the effectiveness of output ranking. Proceedings of Informatics, 11.
    • (1992) Proceedings of Informatics , vol.11
    • Keen, E.M.1
  • 35
    • 0001116877 scopus 로고
    • Binary codes capable of correcting deletions, insertions, and reversals
    • Levenshtein, V.I. (1966). Binary codes capable of correcting deletions, insertions, and reversals. Cybernetics and Control Theory, 10(8), 707-710.
    • (1966) Cybernetics and Control Theory , vol.10 , Issue.8 , pp. 707-710
    • Levenshtein, V.I.1
  • 37
    • 0023491747 scopus 로고
    • Spelling checkers, spelling correctors and the misspellings of poor spellers
    • Mitton, R. (1987). Spelling checkers, spelling correctors and the misspellings of poor spellers. Information Processing and Management, 23, 495-505.
    • (1987) Information Processing and Management , vol.23 , pp. 495-505
    • Mitton, R.1
  • 38
    • 0014757386 scopus 로고
    • A general method applicable to the search for similarities in the amino acid sequence of two proteins
    • Needleman, S.B., Wunsch, C.D. (1970). A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology, 48, 443-453.
    • (1970) Journal of Molecular Biology , vol.48 , pp. 443-453
    • Needleman, S.B.1    Wunsch, C.D.2
  • 39
    • 84976659272 scopus 로고
    • Computer programs for detecting and correcting spelling errors
    • Peterson, J.L. (1980). Computer programs for detecting and correcting spelling errors. Communications of the ACM, 23, 676-687.
    • (1980) Communications of the ACM , vol.23 , pp. 676-687
    • Peterson, J.L.1
  • 43
    • 0020290089 scopus 로고
    • Spelling error detection and correction by computer: Some notes and a bibliography
    • Pollock, J.J. (1982). Spelling error detection and correction by computer: some notes and a bibliography. Journal of Documentation, 38, 282-291.
    • (1982) Journal of Documentation , vol.38 , pp. 282-291
    • Pollock, J.J.1
  • 44
  • 45
    • 84976776121 scopus 로고
    • Automatic spelling correction in scientific and scholarly text
    • Pollock, J.J., Zamora, A. (1984). Automatic spelling correction in scientific and scholarly text. Communications of the ACM, 27,358-368.
    • (1984) Communications of the ACM , vol.27 , pp. 358-368
    • Pollock, J.J.1    Zamora, A.2
  • 46
    • 84989549444 scopus 로고
    • The effectiveness of stemming for natural-language access to Slovene textual data
    • Popovic, M., Willett, P. (1992). The effectiveness of stemming for natural-language access to Slovene textual data. Journal of the American Society for Information Science, 43(5), 384-390.
    • (1992) Journal of the American Society for Information Science , vol.43 , Issue.5 , pp. 384-390
    • Popovic, M.1    Willett, P.2
  • 47
    • 84948481845 scopus 로고
    • An algorithm for suffix stripping
    • Porter, M.F. (1980). An algorithm for suffix stripping. Program, 14(3), 130-137.
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.F.1
  • 49
    • 0039218397 scopus 로고
    • A comparison of spelling-correction methods for the identification of word forms in historical text databases
    • Robertson, A.M., Willett, P. (1993). A comparison of spelling-correction methods for the identification of word forms in historical text databases. Literary and Linguistic Computing, 8(3), 143-152.
    • (1993) Literary and Linguistic Computing , vol.8 , Issue.3 , pp. 143-152
    • Robertson, A.M.1    Willett, P.2
  • 51
    • 0642291949 scopus 로고
    • Searching for historical word forms in text databases using spelling correction methods: Reverse error and phonetic coding methods
    • Rogers, H.J., Willett, P. (1991). Searching for historical word forms in text databases using spelling correction methods: Reverse error and phonetic coding methods. Journal of Documentation, 47, 333-353.
    • (1991) Journal of Documentation , vol.47 , pp. 333-353
    • Rogers, H.J.1    Willett, P.2
  • 52
    • 34447646647 scopus 로고
    • United States patent 1261167. Washington, DC: United States Patent Office
    • Russell, R.C. (1918). United States patent 1261167. Washington, DC: United States Patent Office.
    • (1918)
    • Russell, R.C.1
  • 53
    • 34447634971 scopus 로고
    • United States patent 1435663. Washington, DC: United States Patent Office
    • Russell, R.C. (1922). United States patent 1435663. Washington, DC: United States Patent Office.
    • (1922)
    • Russell, R.C.1
  • 57
    • 34447638589 scopus 로고
    • Chapter 42, Reading, MA: Addison-Wesley
    • Sedgewick, R. (1988). Algorithms (2nd ed., Chapter 42, pp. 595-605). Reading, MA: Addison-Wesley.
    • (1988) Algorithms 2nd ed , pp. 595-605
    • Sedgewick, R.1
  • 59
    • 34447625809 scopus 로고
    • Experiments with n-gram string-similarity measure on Malay texts
    • Universiti Kebangssan Malaysia
    • Sembok, T.M.T., Willett, P. (1995). Experiments with n-gram string-similarity measure on Malay texts. Technical Report, Universiti Kebangssan Malaysia.
    • (1995) Technical Report
    • Sembok, T.M.T.1    Willett, P.2
  • 60
    • 0018493409 scopus 로고
    • Analysis of word similarity in spelling correction systems
    • Sidorov, A.A. (1979). Analysis of word similarity in spelling correction systems. Program and Computer Software, 5, 274-277.
    • (1979) Program and Computer Software , vol.5 , pp. 274-277
    • Sidorov, A.A.1
  • 61
    • 34447645392 scopus 로고
    • A study of existing sound indexing algorithm and a proposed enhancement
    • Dublin City University, School of Computer Applications
    • Smeaton, A., Geary, M. (1988). A study of existing sound indexing algorithm and a proposed enhancement. Dublin City University, School of Computer Applications, Working Paper Ca-0788.
    • (1988) Working Paper Ca-0788
    • Smeaton, A.1    Geary, M.2
  • 62
    • 0003203872 scopus 로고
    • String searching algorithm
    • Singapore: World Scientific Publishing
    • Stephen, G.A. (1994). String searching algorithm. Lecture Notes Series on Computing (Volume 3). Singapore: World Scientific Publishing.
    • (1994) Lecture Notes Series on Computing , vol.3
    • Stephen, G.A.1
  • 63
    • 0027113212 scopus 로고
    • Approximate string-matching with q-grams and maximal matches
    • Ukkonen, E. (1992). Approximate string-matching with q-grams and maximal matches. Theoretical Computer Science, 92, 191-211.
    • (1992) Theoretical Computer Science , vol.92 , pp. 191-211
    • Ukkonen, E.1
  • 66
    • 0344005042 scopus 로고
    • Speech discrimination by dynamic programming
    • Vintsyuk, T.K. (1968). Speech discrimination by dynamic programming. Cybernetics, 4(1), 52-57.
    • (1968) Cybernetics , vol.4 , Issue.1 , pp. 52-57
    • Vintsyuk, T.K.1
  • 68
    • 0003203872 scopus 로고
    • String searching algorithm
    • Singapore: World Scientific Publishing
    • Also in Stephen, G.A. (1994). String searching algorithm. Lecture Notes Series on Computing (Volume 3, pp. 53-57), Singapore: World Scientific Publishing.
    • (1994) Lecture Notes Series on Computing , vol.3 , pp. 53-57
    • Stephen, G.A.1
  • 69
    • 27844461760 scopus 로고
    • Improving subject retrieval in online catalogues. 1. Stemming, automatic spelling correction and cross reference tables. London
    • Walker, S., Jones, R.M. (1987). Improving subject retrieval in online catalogues. 1. Stemming, automatic spelling correction and cross reference tables. London, British Library Research Paper 24.
    • (1987) British Library Research Paper , vol.24
    • Walker, S.1    Jones, R.M.2
  • 70
    • 0016881995 scopus 로고
    • Bounds for the string editing problem
    • Wong, C.K., Chandra, A.K. (1976). Bounds for the string editing problem. Journal of the ACM, 23(1), 13-16.
    • (1976) Journal of the ACM , vol.23 , Issue.1 , pp. 13-16
    • Wong, C.K.1    Chandra, A.K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.