-
2
-
-
25844528603
-
A fast algorithm for finding the nearest neighbor of a word in a dictionary
-
In, Tsukuba Science City, Japan, October
-
H. Bunke. A fast algorithm for finding the nearest neighbor of a word in a dictionary. In: Proc. 2nd Int. Conf. on Doc. Anal. and Recognition, pp. 632-637, Tsukuba Science City, Japan, October 1993
-
(1993)
Proc. 2nd Int. Conf. on Doc. Anal. and Recognition
, pp. 632-637
-
-
Bunke, H.1
-
3
-
-
0030702642
-
The detection of duplicates in document image databases
-
In, Ulm, Germany, August
-
D. Doermann, H. Li, O. Kia. The detection of duplicates in document image databases. In: Proc. 4th Int. Conf. on Doc. Anal. and Recognition, pp. 314-318, Ulm, Germany, August 1997
-
(1997)
Proc. 4th Int. Conf. on Doc. Anal. and Recognition
, pp. 314-318
-
-
Doermann, D.1
Li, H.2
Kia, O.3
-
4
-
-
0028485378
-
An approach to designing very fast approximate string matching algorithms
-
M.-W. Du, S. C. Chang. An approach to designing very fast approximate string matching algorithms. IEEE Trans. on Knowl. and Data Eng. 6(4): 620-633 (1994)
-
(1994)
IEEE Trans. on Knowl. and Data Eng
, vol.6
, Issue.4
, pp. 620-633
-
-
Du, M.-W.1
Chang, S.C.2
-
5
-
-
0028750709
-
Classification and distribution of optical character recognition errors
-
In, San Jose, CA, February
-
J. Esakov, D. P. Lopresti, J. S. Sandberg. Classification and distribution of optical character recognition errors. In: Proc. Doc. Recognition I (IS&T/SPIE Electronic Imaging), pp. 204-216, San Jose, CA, February 1994
-
(1994)
Proc. Doc. Recognition I (IS&T/SPIE Electronic Imaging)
, pp. 204-216
-
-
Esakov, J.1
Lopresti, D.P.2
Sandberg, J.S.3
-
6
-
-
0021760002
-
Fast optimal alignment
-
J. W. Fickett. Fast optimal alignment. Nucleic Acids Research 12(1): 175-179 (1984)
-
(1984)
Nucleic Acids Research
, vol.12
, Issue.1
, pp. 175-179
-
-
Fickett, J.W.1
-
8
-
-
0025807368
-
Building and using a highly parallel programmable logic array
-
M. Gokhale, W. Holmes, A. Kopser, D. Lopresti, S. Lucas, R. Minnich, D. Sweely. Building and using a highly parallel programmable logic array. Computer, 24(1): 81-89, 1991
-
(1991)
Computer
, vol.24
, Issue.1
, pp. 81-89
-
-
Gokhale, M.1
Holmes, W.2
Kopser, A.3
Lopresti, D.4
Lucas, S.5
Minnich, R.6
Sweely, D.7
-
9
-
-
84892743207
-
-
GulfLink
-
GulfLink. http://www.gulflink.osd.mil/
-
-
-
-
10
-
-
0004137004
-
-
Cambridge University Press, Cambridge, UK
-
D. Gusfield. Algorithms on Strings, Trees, and Sequences. Cambridge University Press, Cambridge, UK, 1997
-
(1997)
Algorithms on Strings, Trees, and Sequences
-
-
Gusfield, D.1
-
11
-
-
0042258316
-
Document image similarity and equivalence detection
-
J. J. Hull. Document image similarity and equivalence detection. Int. J. Doc. Anal. and Recognition 1(1): 37-42 (1998)
-
(1998)
Int. J. Doc. Anal. and Recognition
, vol.1
, Issue.1
, pp. 37-42
-
-
Hull, J.J.1
-
12
-
-
0041757622
-
Document image matching and retrieval techniques
-
In, Annapolis, MD, April-May
-
J. J. Hull, J. Cullen, M. Peairs. Document image matching and retrieval techniques. In: Proc. Symp. on Doc. Image Understanding Technol., pp. 31-35, Annapolis, MD, April-May 1997
-
(1997)
Proc. Symp. on Doc. Image Understanding Technol
, pp. 31-35
-
-
Hull, J.J.1
Cullen, J.2
Peairs, M.3
-
13
-
-
0006513656
-
Duplicate detection for symbolically compressed documents
-
In, Bangalore, India, September
-
D.-S. Lee, J. J. Hull. Duplicate detection for symbolically compressed documents. In: Proc. 5th Int. Conf. on Doc. Anal. and Recognition, pp. 305-308, Bangalore, India, September 1999
-
(1999)
Proc. 5th Int. Conf. on Doc. Anal. and Recognition
, pp. 305-308
-
-
Lee, D.-S.1
Hull, J.J.2
-
14
-
-
0001116877
-
Binary codes capable of correcting deletions, insertions, and reversals
-
V. I. Levenshtein. Binary codes capable of correcting deletions, insertions, and reversals. Cybernetics and Control Theory 10(8): 707-710 (1966)
-
(1966)
Cybernetics and Control Theory
, vol.10
, Issue.8
, pp. 707-710
-
-
Levenshtein, V.I.1
-
16
-
-
0041757621
-
Models and algorithms for duplicate document detection
-
In, Bangalore, India, September
-
D. P. Lopresti. Models and algorithms for duplicate document detection. In: Proc. 5th Int. Conf. on Doc. Anal. and Recognition, pp. 297-300, Bangalore, India, September 1999
-
(1999)
Proc. 5th Int. Conf. on Doc. Anal. and Recognition
, pp. 297-300
-
-
Lopresti, D.P.1
-
17
-
-
0043260683
-
String techniques for duplicate document detection
-
In, Annapolis, MD, April
-
D. P. Lopresti. String techniques for duplicate document detection. In: Proc. Symp. on Doc. Image Understanding Technol., pp. 101-112, Annapolis, MD, April 1999
-
(1999)
Proc. Symp. on Doc. Image Understanding Technol
, pp. 101-112
-
-
Lopresti, D.P.1
-
18
-
-
0033908952
-
A comparison of text-based methods for detecting duplication in document image databases
-
In, January, CA, San Jose
-
D. P. Lopresti. A comparison of text-based methods for detecting duplication in document image databases. In: Proc. Doc. Recognition and Retrieval VII (IS&T/SPIE Electronic Imaging), 3967: 210-221, San Jose, CA, January 2000
-
(2000)
Proc. Doc. Recognition and Retrieval VII (IS&T/SPIE Electronic Imaging)
, vol.3967
, pp. 210-221
-
-
Lopresti, D.P.1
-
19
-
-
0031187745
-
Block edit models for approximate string matching
-
D. P. Lopresti, A. Tomkins. Block edit models for approximate string matching. Theoretical Computer Science (181): 159-179 (1997)
-
(1997)
Theoretical Computer Science
, Issue.181
, pp. 159-179
-
-
Lopresti, D.P.1
Tomkins, A.2
-
20
-
-
85043988965
-
Finding similar files in a large file system
-
In, San Francisco, CA, January
-
U. Manber. Finding similar files in a large file system. In: Proc. USENIX, pp. 1-10, San Francisco, CA, January 1994
-
(1994)
Proc. USENIX
, pp. 1-10
-
-
Manber, U.1
-
21
-
-
0014757386
-
A general method applicable to the search for similarities in the amino-acid sequences of two proteins
-
S. B. Needleman, C. D. Wunsch. A general method applicable to the search for similarities in the amino-acid sequences of two proteins. J. Mol. Biol. 48:443-453 (1970)
-
(1970)
J. Mol. Biol
, vol.48
, pp. 443-453
-
-
Needleman, S.B.1
Wunsch, C.D.2
-
22
-
-
0042758967
-
Database partitioning and duplicate document detection based on optical correlation
-
In, Annapolis, MD, April
-
F. Prokoski. Database partitioning and duplicate document detection based on optical correlation. In: Proc. Symp. on Doc. Image Understanding Technol., pp. 86-97, Annapolis, MD, April 1999
-
(1999)
Proc. Symp. on Doc. Image Understanding Technol
, pp. 86-97
-
-
Prokoski, F.1
-
23
-
-
0018106375
-
Considerations in dynamic time warping algorithms for discrete word recognition
-
L. R. Rabiner, A. E. Rosenberg, S. E. Levinson. Considerations in dynamic time warping algorithms for discrete word recognition. IEEE Trans. on Acoust., Speech, and Signal Process. ASSP-26(6): 575-582 (1978)
-
(1978)
IEEE Trans. on Acoust., Speech, and Signal Process
, vol.ASSP-26
, Issue.6
, pp. 575-582
-
-
Rabiner, L.R.1
Rosenberg, A.E.2
Levinson, S.E.3
-
24
-
-
0042258313
-
Duplicate document detection in DocBrowse
-
In, Annapolis, MD, April
-
R. Rogers, V. Chalana, G. Marchisio, T. Nguyen, A. Bruce. Duplicate document detection in DocBrowse. In: Proc. Symp. on Doc. Image Understanding Technol., pp. 119-127, Annapolis, MD, April 1999
-
(1999)
Proc. Symp. on Doc. Image Understanding Technol
, pp. 119-127
-
-
Rogers, R.1
Chalana, V.2
Marchisio, G.3
Nguyen, T.4
Bruce, A.5
-
25
-
-
0003725141
-
-
(eds.), Addison-Wesley, Reading, MA
-
D. Sankoff, J. B. Kruskal (eds.). Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison. Addison-Wesley, Reading, MA, 1983
-
(1983)
Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison
-
-
Sankoff, D.1
Kruskal, J.B.2
-
26
-
-
49149141669
-
The theory and computation of evolutionary distances: Pattern recognition
-
P. H. Sellers. The theory and computation of evolutionary distances: pattern recognition. J. Algorithms 1: 359-373 (1980)
-
(1980)
J. Algorithms
, vol.1
, pp. 359-373
-
-
Sellers, P.H.1
-
29
-
-
0019887799
-
identification of common molecular sequences
-
T. F. Smith, M. S. Waterman. identification of common molecular sequences. J. Mol. Biol. 147: 195-197 (1981)
-
(1981)
J. Mol. Biol
, vol.147
, pp. 195-197
-
-
Smith, T.F.1
Waterman, M.S.2
-
30
-
-
0031354094
-
Duplicate document detection
-
In:, San Jose, CA, February
-
A. L. Spitz. Duplicate document detection. In: Proc. Doc. Recognition IV (IS&T/SPIE Electronic Imaging), 3027: 88-94, San Jose, CA, February 1997
-
(1997)
Proc. Doc. Recognition IV (IS&T/SPIE Electronic Imaging)
, vol.3027
, pp. 88-94
-
-
Spitz, A.L.1
-
32
-
-
84892685950
-
-
In: Annual Report of UNLV Information Science Research Institute, Las Vegas, NV
-
K. Taghva, J. Borsack, A. Condit, P. Inaparthy. Effects of OCR errors on short documents. In: Annual Report of UNLV Information Science Research Institute, pp. 99-105, Las Vegas, NV, 1995
-
(1995)
Effects of OCR errors on short documents
, pp. 99-105
-
-
Taghva, K.1
Borsack, J.2
Condit, A.3
Inaparthy, P.4
-
33
-
-
84983986619
-
On approximate string matching
-
In, LNCS 158, Springer, Berlin Heidelberg New York
-
E. Ukkonen. On approximate string matching. In: Proc. Int. Conf. on Foundations of Comput. Theory, LNCS 158, pp. 487-493. Springer, Berlin Heidelberg New York, 1983
-
(1983)
Proc. Int. Conf. on Foundations of Comput. Theory
, pp. 487-493
-
-
Ukkonen, E.1
-
34
-
-
0015960104
-
The string-to-string correction problem
-
R. A. Wagner, M. J. Fischer. The string-to-string correction problem. J. ACM 21: 168-173 (1974)
-
(1974)
J. ACM
, vol.21
, pp. 168-173
-
-
Wagner, R.A.1
Fischer, M.J.2
|