Volume 15, Issue 3, 1997, Pages 291-320

Recursive hashing functions for n-grams

Author keywords

E.2 Data: Data Storage Representations - hash table representations; G.2.1 Discrete Mathematics: Combinatorics - recurrences and difference equations; G.3 Mathematics of Computing: Probability and Statistics - probabilistic algorithms

Indexed keywords

COMPUTATIONAL COMPLEXITY; DATA STRUCTURES; INDEXING (OF INFORMATION); INFORMATION RETRIEVAL

EID: 0031177375     PISSN: 10468188     EISSN: None     Source Type: Journal    
DOI: 10.1145/256163.256168     Document Type: Article
Times cited : (42)

References (54)
  • 1
    • 0020685593
    • Automatic spelling correction using trigram similarity measure
    • ANGELL, R. C., FREUND, G. E., AND WILLETT, P. 1983. Automatic spelling correction using trigram similarity measure. Inf. Process. Manage. 19, 4, 255-261.
    • (1983) Inf. Process. Manage , vol.19 , Issue.4 , pp. 255-261
    • Angell, R.C.1    Freund, G.E.2    Willett, P.3
  • 2
    • 0016069406
    • An information-theoretic approach to text searching in direct access systems
    • BARTON, I. J., CREASY, S. E., LYNCH, M. F., AND SNELL, M. J. 1974. An information-theoretic approach to text searching in direct access systems. Commun. ACM 17, 6 (June), 345-350.
    • (1974) Commun. ACM , vol.17 , Issue.6 JUNE , pp. 345-350
    • Barton, I.J.1    Creasy, S.E.2    Lynch, M.F.3    Snell, M.J.4
  • 6
    • 0018522315
    • Document retrieval experiments using indexing vocabularies of varying size. I. Variety generation symbols assigned to the fronts of index terms
    • BURNETT, J. E., COOPER, D., LYNCH, M. F., WILLETT, P., AND WYCHERLEY, M. 1979. Document retrieval experiments using indexing vocabularies of varying size. I. Variety generation symbols assigned to the fronts of index terms. J. Doc. 35, 3 (Sept.), 197-206.
    • (1979) J. Doc. , vol.35 , Issue.3 SEPT. , pp. 197-206
    • Burnett, J.E.1    Cooper, D.2    Lynch, M.F.3    Willett, P.4    Wycherley, M.5
  • 7
    • 0346407230
    • N-gram-based text filtering for TREC-2
    • D. Harman, Ed. National Bureau of Standards, Gaithersburg, Md.
    • CAVNAR, W. B. 1993. N-gram-based text filtering for TREC-2. In Proceedings of TREC-2: Text Retrieval Conference 2, D. Harman, Ed. National Bureau of Standards, Gaithersburg, Md.
    • (1993) Proceedings of TREC-2: Text Retrieval Conference , vol.2
    • Cavnar, W.B.1
  • 8
    • 84982395055
    • Highlights: Language- and domain-independent automatic indexing terms for abstracting
    • See vol. 47, issue 3, p. 260 for erratum
    • COHEN, J. D. 1995. Highlights: Language- and domain-independent automatic indexing terms for abstracting. J. Am. Soc. Inf. Sci. 46 (Apr.), 162-174. See vol. 47, issue 3, p. 260 for erratum.
    • (1995) J. Am. Soc. Inf. Sci. , vol.46 , Issue.APR. , pp. 162-174
    • Cohen, J.D.1
  • 10
    • 0039218401
    • A statistical method of spelling correction
    • CORNEW, R. W. 1968. A statistical method of spelling correction. Inf. Control 12, 79-93.
    • (1968) Inf. Control , vol.12 , pp. 79-93
    • Cornew, R.W.1
  • 11
    • 0028911698
    • Gauging similarity with n-grams: Language-independent categorization of text
    • DAMASHEK, M. 1995. Gauging similarity with n-grams: Language-independent categorization of text. Science 267, 10 (Feb.), 843-848.
    • (1995) Science , vol.267 , Issue.10 FEB. , pp. 843-848
    • Damashek, M.1
  • 13
    • 0025517138
    • The reactive keyboard: A predictive typing aid
    • DARRAGH, J. J., WITTEN, I. H., AND JAMES, M. L. 1990. The reactive keyboard: A predictive typing aid. Computer 23, 11 (Nov.), 41-49.
    • (1990) Computer , vol.23 , Issue.11 NOV. , pp. 41-49
    • Darragh, J.J.1    Witten, I.H.2    James, M.L.3
  • 14
    • 0020248305
    • The application of the concept of homeosemy to natural language information retrieval
    • DE HEER, T. 1982. The application of the concept of homeosemy to natural language information retrieval. Inf. Process. Manage. 18, 5, 229-236.
    • (1982) Inf. Process. Manage , vol.18 , Issue.5 , pp. 229-236
    • De Heer, T.1
  • 15
    • 0001852294
    • Indexing for rapid random access memory systems
    • DUMEY, A. I. 1956. Indexing for rapid random access memory systems. Comput. Autom. 5, 12 (Dec.), 6-9.
    • (1956) Comput. Autom. , vol.5 , Issue.12 DEC. , pp. 6-9
    • Dumey, A.I.1
  • 16
    • 0025702286
    • An analysis of the Karp-Rabin string matching algorithm
    • GONNET, G. H. AND BAEZA-YATES, R. A. 1990. An analysis of the Karp-Rabin string matching algorithm. Inf. Process. Lett. 34, 271-274.
    • (1990) Inf. Process. Lett. , vol.34 , pp. 271-274
    • Gonnet, G.H.1    Baeza-Yates, R.A.2
  • 18
    • 18044381055
    • An application of coding theory to a file address problem
    • HANAN, M. AND PALERMO, F. P. 1963. An application of coding theory to a file address problem. IBM J. Res. Devel. 7 (Apr.), 127-129.
    • (1963) IBM J. Res. Devel. , vol.7 , Issue.APR. , pp. 127-129
    • Hanan, M.1    Palermo, F.P.2
  • 20
    • 0015201444
    • Implementation of the substring test for hashing
    • HARRISON, M. C. 1971. Implementation of the substring test for hashing. Commun. ACM 14, 12 (Dec.), 777-779.
    • (1971) Commun. ACM , vol.14 , Issue.12 DEC. , pp. 777-779
    • Harrison, M.C.1
  • 21
    • 0020189609
    • Experiments in text recognition with binary n-gram and Viterbi algorithms
    • HULL, J. J. AND SRIHARI, S. N. 1982. Experiments in text recognition with binary n-gram and Viterbi algorithms. IEEE Trans. Pattern Anal. Mach. Intell. PAMI-4, 5 (Sept.), 520-530.
    • (1982) IEEE Trans. Pattern Anal. Mach. Intell. , vol.PAMI-4 , Issue.5 SEPT. , pp. 520-530
    • Hull, J.J.1    Srihari, S.N.2
  • 22
    • 0016069448
    • Suboptimal sequential decision schemes with on-line feature ordering
    • HUSSAIN, A. B. S. AND DONALDSON, R. W. 1974. Suboptimal sequential decision schemes with on-line feature ordering. IEEE Trans. Comput. C-23, 6, 582-590.
    • (1974) IEEE Trans. Comput. , vol.C-23 , Issue.6 , pp. 582-590
    • Hussain, A.B.S.1    Donaldson, R.W.2
  • 23
    • 0022030599
    • Efficient randomized pattern-matching algorithms
    • KARP, R. M. AND RABIN, M. O. 1987. Efficient randomized pattern-matching algorithms. IBM J. Res. Devel. 31, 2 (Mar.), 249-260.
    • (1987) IBM J. Res. Devel. , vol.31 , Issue.2 MAR. , pp. 249-260
    • Karp, R.M.1    Rabin, M.O.2
  • 24
    • 0027113242
    • An approximate string-matching algorithm
    • KIM, J. Y. AND SHAWE-TAYLOR, J. 1992. An approximate string-matching algorithm. Theoret. Comput. Sci. 92, 107-117.
    • (1992) Theoret. Comput. Sci. , vol.92 , pp. 107-117
    • Kim, J.Y.1    Shawe-Taylor, J.2
  • 26
    • 0025401073
    • Accelerated text searching through signature trees
    • KOTAMARTI, U. AND THARP, A. L. 1990. Accelerated text searching through signature trees. J. Am. Soc. Inf. Sci. 41, 2 (Mar.), 79-86.
    • (1990) J. Am. Soc. Inf. Sci. , vol.41 , Issue.2 MAR. , pp. 79-86
    • Kotamarti, U.1    Tharp, A.L.2
  • 27
    • 0026979939
    • Techniques for automatically correcting words in text
    • KUKICH, K. 1992. Techniques for automatically correcting words in text. ACM Comput. Surv. 24, 4 (Dec.), 377-439.
    • (1992) ACM Comput. Surv. , vol.24 , Issue.4 DEC. , pp. 377-439
    • Kukich, K.1
  • 28
    • 0015039128
    • Key-to-address transform techniques: A fundamental performance study on large existing formatted files
    • LUM, V. Y., YUEN, P. S. T., AND DODD, M. 1971. Key-to-address transform techniques: A fundamental performance study on large existing formatted files. Commun. ACM 14, 4 (Apr.), 228-239.
    • (1971) Commun. ACM , vol.14 , Issue.4 APR. , pp. 228-239
    • Lum, V.Y.1    Yuen, P.S.T.2    Dodd, M.3
  • 29
    • 0017392187
    • Variety generation - A reinterpretation of Shannon's mathematical theory of communication, and its implications for information science
    • LYNCH, M. F. 1977. Variety generation - A reinterpretation of Shannon's mathematical theory of communication, and its implications for information science. J. Am. Soc. Inf. Sci. 28, 1 (Jan.), 19-25.
    • (1977) J. Am. Soc. Inf. Sci. , vol.28 , Issue.1 JAN. , pp. 19-25
    • Lynch, M.F.1
  • 30
    • 18044381437
    • The degarbler - A program for correcting machine-read Morse code
    • MCELWAIN, C. K. AND EVENS, M. B. 1962. The degarbler - A program for correcting machine-read Morse code. Inf. Control 5, 368-384.
    • (1962) Inf. Control , vol.5 , pp. 368-384
    • McElwain, C.K.1    Evens, M.B.2
  • 32
    • 0016477023
    • Computer detection of typographical errors
    • MORRIS, R. AND CHERRY, L. L. 1975. Computer detection of typographical errors. IEEE Trans. Prof. Commun. PC-18 (Mar.), 54-64.
    • (1975) IEEE Trans. Prof. Commun. , vol.PC-18 , Issue.MAR. , pp. 54-64
    • Morris, R.1    Cherry, L.L.2
  • 34
    • 0016486735
    • The Viterbi algorithm as an aid in text recognition
    • NEUHOFF, D. L. 1975. The Viterbi algorithm as an aid in text recognition. IEEE Trans. Inf. Theory IT-21 (Mar.), 222-226.
    • (1975) IEEE Trans. Inf. Theory , vol.IT-21 , Issue.MAR. , pp. 222-226
    • Neuhoff, D.L.1
  • 37
    • 0009861027
    • Addressing for random-access storage
    • PETERSON, W. W. 1957. Addressing for random-access storage. IBM J. Res. Devel. 1 (Apr.), 130-146.
    • (1957) IBM J. Res. Devel. , vol.1 , Issue.APR. , pp. 130-146
    • Peterson, W.W.1
  • 41
    • 18044379842
    • A method for key-to-address transformation
    • SCHAY, G. AND RAVER, N. 1963. A method for key-to-address transformation. IBM J. Res. Devel. 7 (Apr.), 121-126.
    • (1963) IBM J. Res. Devel. , vol.7 , Issue.APR. , pp. 121-126
    • Schay, G.1    Raver, N.2
  • 42
    • 18044396444
    • Trigram-based method of language identification
    • SCHMITT, J. C. 1990. Trigram-based method of language identification. U.S. Patent No. 5,062,143. U.S. Patent Office, Washington, D.C.
    • Schmitt, J.C.1
  • 43
    • 0015725242
    • Selection of equifrequent word fragments for information retrieval
    • SCHUEGRAF, E. J. AND HEAPS, H. S. 1973. Selection of equifrequent word fragments for information retrieval. Inf. Storage Retrieval. 9, 697-711.
    • (1973) Inf. Storage Retrieval , vol.9 , pp. 697-711
    • Schuegraf, E.J.1    Heaps, H.S.2
  • 44
    • 84944486544
    • Prediction and entropy of printed English
    • SHANNON, C. E. 1951. Prediction and entropy of printed English. Bell System Tech. J. 30 (Jan.), 50-64.
    • (1951) Bell System Tech. J. , vol.30 , Issue.JAN. , pp. 50-64
    • Shannon, C.E.1
  • 45
    • 0017969061
    • A simplified heuristic version of a recursive Bayes algorithm for using context in text recognition
    • SHINGHAL, R., ROSENBERG, D., AND TOUSSAINT, G. T. 1978. A simplified heuristic version of a recursive Bayes algorithm for using context in text recognition. IEEE Trans. Syst. Man Cyber. SMC-8, 5 (May), 412-414.
    • (1978) IEEE Trans. Syst. Man Cyber. , vol.SMC-8 , Issue.5 MAY , pp. 412-414
    • Shinghal, R.1    Rosenberg, D.2    Toussaint, G.T.3
  • 46
    • 0017952955
    • n-gram statistics for natural language understanding and text processing
    • SUEN, C. Y. 1979. n-gram statistics for natural language understanding and text processing. IEEE Trans. Patt. Anal. Mach. Intell. PAMI-1, 2 (Apr.), 164-172.
    • (1979) IEEE Trans. Patt. Anal. Mach. Intell. , vol.PAMI-1 , Issue.2 APR. , pp. 164-172
    • Suen, C.Y.1
  • 47
    • 18044372732
    • Character recognition in context
    • THOMAS, R. B. AND KASSLER, M. 1967. Character recognition in context. Inf. Control 10, 43-64.
    • (1967) Inf. Control , vol.10 , pp. 43-64
    • Thomas, R.B.1    Kassler, M.2
  • 48
    • 0027113212
    • Approximate string-matching with q-grams and maximal matches
    • UKKONEN, E. 1992. Approximate string-matching with q-grams and maximal matches. Theor. Comput. Sci. 92, 191-211.
    • (1992) Theor. Comput. Sci. , vol.92 , pp. 191-211
    • Ukkonen, E.1
  • 50
    • 0018736472
    • Document retrieval experiments using indexing vocabularies of varying size. II. Hashing, truncation, digram and trigram encoding of index terms
    • WILLETT, P. 1979. Document retrieval experiments using indexing vocabularies of varying size. II. Hashing, truncation, digram and trigram encoding of index terms. J. Doc. 35, 4 (Dec.), 296-305.
    • (1979) J. Doc. , vol.35 , Issue.4 DEC. , pp. 296-305
    • Willett, P.1
  • 51
    • 0023255153
    • Effective text compression with simultaneous digram and trigram encoding
    • WISNIEWSKI, J. L. 1987. Effective text compression with simultaneous digram and trigram encoding. J. Inf. Sci. 13, 159-164.
    • (1987) J. Inf. Sci. , vol.13 , pp. 159-164
    • Wisniewski, J.L.1
  • 53
    • 0026980718
    • An assessment of n-phoneme statistics in phoneme guessing algorithms which aim to incorporate phonotactic constraints
    • YANNAKOUDAKIS, E. J. AND HUTTON, P. J. 1992. An assessment of n-phoneme statistics in phoneme guessing algorithms which aim to incorporate phonotactic constraints. Speech Commun. 11, 6 (Dec.), 581-602.
    • (1992) Speech Commun. , vol.11 , Issue.6 DEC. , pp. 581-602
    • Yannakoudakis, E.J.1    Hutton, P.J.2
  • 54
    • 0038602946
    • The use of trigram analysis for spelling error detection
    • ZAMORA, E. M., POLLOCK, J. J., AND ZAMORA, A. 1981. The use of trigram analysis for spelling error detection. Inf. Process. Manage. 17, 6, 305-316.
    • (1981) Inf. Process. Manage , vol.17 , Issue.6 , pp. 305-316
    • Zamora, E.M.1    Pollock, J.J.2    Zamora, A.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.