-
1
-
-
0000417994
-
Developments in automatic text retrieval
-
G. Salton, "Developments in automatic text retrieval," Science, vol. 253, pp. 974-980, 1991.
-
(1991)
Science
, vol.253
, pp. 974-980
-
-
Salton, G.1
-
2
-
-
0001626792
-
Global text matching for information retrieval
-
G. Salton and C. Buckley, "Global text matching for information retrieval," Science, vol. 253, pp. 1012-1015, 1991.
-
(1991)
Science
, vol.253
, pp. 1012-1015
-
-
Salton, G.1
Buckley, C.2
-
3
-
-
0028304946
-
Automatic analysis, theme generation, and summarization of machine-readable text
-
G. Salton, J. Allan, C. Buckley, and A. Singhal, "Automatic analysis, theme generation, and summarization of machine-readable text," Science, vol. 264, pp. 1421-1426, 1994.
-
(1994)
Science
, vol.264
, pp. 1421-1426
-
-
Salton, G.1
Allan, J.2
Buckley, C.3
Singhal, A.4
-
5
-
-
0017952955
-
N-gram statistics for natural language understanding and text processing
-
C.Y. Suen, "N-gram statistics for natural language understanding and text processing," IEEE Trans. on Pattern Analysis & Machine Intelligence, PAMI, vol. 1, no. 2, pp. 164-172, 1979.
-
(1979)
IEEE Trans. on Pattern Analysis & Machine Intelligence, PAMI
, vol.1
, Issue.2
, pp. 164-172
-
-
Suen, C.Y.1
-
6
-
-
0018922491
-
Automatic detection and correcting of spelling errors in a large data base
-
A. Zamora, "Automatic detection and correcting of spelling errors in a large data base," Journal of the American Society for Information Science, vol. 31, p. 51, 1980.
-
(1980)
Journal of the American Society for Information Science
, vol.31
, pp. 51
-
-
Zamora, A.1
-
7
-
-
84976659272
-
Computer programs for detecting and correcting spelling errors
-
J.L. Peterson, "Computer programs for detecting and correcting spelling errors," Comm. ACM, vol. 23, p. 676, 1980.
-
(1980)
Comm. ACM
, vol.23
, pp. 676
-
-
Peterson, J.L.1
-
8
-
-
0038602946
-
The use of trigram analysis for spelling error detection
-
E.M. Zamora, J.J. Pollock, and A. Zamora, "The use of trigram analysis for spelling error detection," Inf. Proc. Mgt. vol. 17, p. 305, 1981.
-
(1981)
Inf. Proc. Mgt.
, vol.17
, pp. 305
-
-
Zamora, E.M.1
Pollock, J.J.2
Zamora, A.3
-
10
-
-
0020290089
-
Spelling error detection and correction by computer: Some notes and a bibliography
-
J.J. Pollock, "Spelling error detection and correction by computer: Some notes and a bibliography," J. Doc. vol. 38, p. 282, 1982.
-
(1982)
J. Doc.
, vol.38
, pp. 282
-
-
Pollock, J.J.1
-
11
-
-
0020685593
-
Automatic spelling correction using trigram similarity measure
-
R.C. Angell, G.E. Freund, and P. Willett, "Automatic spelling correction using trigram similarity measure," Inf. Proc. Mgt. vol. 18, p. 255, 1983.
-
(1983)
Inf. Proc. Mgt.
, vol.18
, pp. 255
-
-
Angell, R.C.1
Freund, G.E.2
Willett, P.3
-
12
-
-
0020250756
-
The generation and use of text fragments for data compression
-
E.J. Yannakoudakis, P. Goyal, and J.A. Huggill, "The generation and use of text fragments for data compression," Inf. Proc. Mgt. vol. 18, p. 15, 1982.
-
(1982)
Inf. Proc. Mgt.
, vol.18
, pp. 15
-
-
Yannakoudakis, E.J.1
Goyal, P.2
Huggill, J.A.3
-
13
-
-
4243269926
-
Trigram-based method of language identification
-
U.S. Patent No. 5,062,143
-
J.C. Schmitt, "Trigram-based method of language identification," U.S. Patent No. 5,062,143, 1990.
-
(1990)
-
-
Schmitt, J.C.1
-
14
-
-
35248819638
-
N-gram-based text categorization
-
W.B. Cavnar and J.M. Trenkle, "N-gram-based text categorization," in Proceedings of the Symposium on Document Analysis and Information Retrieval, University of Nevada, Las Vegas, 1994.
-
Proceedings of the Symposium on Document Analysis and Information Retrieval, University of Nevada, Las Vegas, 1994
-
-
Cavnar, W.B.1
Trenkle, J.M.2
-
15
-
-
0018736472
-
Document retrieval experiments using indexing vocabularies of varying size. II. Hashing, truncation, digram and trigram encoding of index terms
-
P. Willett, "Document retrieval experiments using indexing vocabularies of varying size. II. Hashing, truncation, digram and trigram encoding of index terms," J. Doc. vol. 35, p. 296, 1979.
-
(1979)
J. Doc.
, vol.35
, pp. 296
-
-
Willett, P.1
-
16
-
-
0037926598
-
N-gram-based text filtering for TREC-2
-
NIST Special Publication 500-215, National Institute of Standards and Technology, Gaithersburg, Maryland
-
W.B. Cavnar, "N-gram-based text filtering for TREC-2," The Second Text Retrieval Conference (TREC-2), NIST Special Publication 500-215, National Institute of Standards and Technology, Gaithersburg, Maryland, 1994.
-
(1994)
The Second Text Retrieval Conference (TREC-2)
-
-
Cavnar, W.B.1
-
17
-
-
0028911698
-
Gauging similarity via N-grams: Language-independent sorting, categorization, and retrieval of text
-
Marc Damashek, "Gauging similarity via N-grams: Language-independent sorting, categorization, and retrieval of text," Science, vol. 267, pp. 843-848, 1995.
-
(1995)
Science
, vol.267
, pp. 843-848
-
-
Damashek, M.1
-
18
-
-
84957545678
-
Probabilistic retrieval of OCR degraded text using N-grams
-
S.M. Harding, W.B. Croft, and C. Weir, "Probabilistic retrieval of OCR degraded text using N-grams," in European Conference on Digital Libraries, pp. 345-359, 1997.
-
(1997)
European Conference on Digital Libraries
, pp. 345-359
-
-
Harding, S.M.1
Croft, W.B.2
Weir, C.3
-
19
-
-
0003081765
-
An evaluation of information retrieval accuracy with simulated OCR output
-
W.B. Croft, S.M. Harding, K. Taghva, and J. Borsack, "An evaluation of information retrieval accuracy with simulated OCR output," in Symposium of Document Analysis and Information Retrieval, pp. 115-126, 1994.
-
(1994)
Symposium of Document Analysis and Information Retrieval
, pp. 115-126
-
-
Croft, W.B.1
Harding, S.M.2
Taghva, K.3
Borsack, J.4
-
23
-
-
84995297946
-
Content-based indexing and retrieval method of Chinese document images
-
Y. He, Z. Jiang, B. Liu, and H. Zhao, "Content-based indexing and retrieval method of Chinese document images," in Proceedings of the Fifth International Conference on Document Analysis and Recognition (ICDAR'99), pp. 685-688, 1999.
-
(1999)
Proceedings of the Fifth International Conference on Document Analysis and Recognition (ICDAR'99)
, pp. 685-688
-
-
He, Y.1
Jiang, Z.2
Liu, B.3
Zhao, H.4
-
25
-
-
0031098394
-
Determination of the script and language content of document images
-
A.L. Spitz, "Determination of the script and language content of document images," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 19, no. 3, pp. 235-245, 1997.
-
(1997)
IEEE Trans. on Pattern Analysis and Machine Intelligence
, vol.19
, Issue.3
, pp. 235-245
-
-
Spitz, A.L.1
-
26
-
-
0005936103
-
Categorizing document images into script and language classes
-
23-25 Nov
-
C.Y. Suen, S. Bergler, N. Nobile, B. Waked, C.P. Nadal, and A. Bloch, "Categorizing document images into script and language classes," in Proceedings of the International Conference on Advances in Pattern Recognition, Plymouth, UK, pp. 297-306, 23-25 Nov 1998.
-
(1998)
Proceedings of the International Conference on Advances in Pattern Recognition, Plymouth, UK
, pp. 297-306
-
-
Suen, C.Y.1
Bergler, S.2
Nobile, N.3
Waked, B.4
Nadal, C.P.5
Bloch, A.6
-
29
-
-
0031083861
-
Word shape analysis for a hybrid recognition system
-
R.K. Powalka, N. Sherkat, and R.J. Whitrow, "Word shape analysis for a hybrid recognition system," Pattern Recognition, vol. 30, no. 3, pp. 421-445, 1997.
-
(1997)
Pattern Recognition
, vol.30
, Issue.3
, pp. 421-445
-
-
Powalka, R.K.1
Sherkat, N.2
Whitrow, R.J.3
-
30
-
-
0029453012
-
Measuring document image skew and orientation
-
Document Recognition II, San Jose, CA
-
D.S. Bloomberg, G.E. Kopec, and L. Dasari, "Measuring document image skew and orientation," SPIE Conf. 2422, Document Recognition II, San Jose, CA, pp. 302-316, 1995.
-
(1995)
SPIE Conf. 2422
, pp. 302-316
-
-
Bloomberg, D.S.1
Kopec, G.E.2
Dasari, L.3