-
4
-
-
0030151440
-
Effects of OCR errors on ranking and feedback using the vector space model
-
K. Taghva, J. Borsack, and A. Condit, "Effects of OCR errors on ranking and feedback using the vector space model," Information Processing and Management 32(3), pp. 317-327, 1996.
-
(1996)
Information Processing and Management
, vol.32
, Issue.3
, pp. 317-327
-
-
Taghva, K.1
Borsack, J.2
Condit, A.3
-
5
-
-
0002849652
-
Evaluation of model-based retrieval effectiveness with OCR text
-
January
-
K. Taghva, J. Borsack, and A. Condit, "Evaluation of model-based retrieval effectiveness with OCR text," ACM Transactions on Information Systems 14, pp. 64-93, January 1996.
-
(1996)
ACM Transactions on Information Systems
, vol.14
, pp. 64-93
-
-
Taghva, K.1
Borsack, J.2
Condit, A.3
-
6
-
-
30144433682
-
Named entity extraction from noisy input: Speech and OCR
-
Seattle, WA
-
D. Miller, S. Boisen, R. Schwartz, R. Stone, and R. Weischedel, "Named entity extraction from noisy input: Speech and OCR," in Proceedings of the 6th Applied Natural Language Processing Conference, pp. 316-324, (Seattle, WA), 2000.
-
(2000)
Proceedings of the 6th Applied Natural Language Processing Conference
, pp. 316-324
-
-
Miller, D.1
Boisen, S.2
Schwartz, R.3
Stone, R.4
Weischedel, R.5
-
7
-
-
20444490177
-
Summarizing noisy documents
-
April
-
H. Jing, D. Lopresti, and C. Shih, "Summarizing noisy documents," in Proceedings of the Symposium on Document Image Understanding Technology, pp. 111-119, April 2003.
-
(2003)
Proceedings of the Symposium on Document Image Understanding Technology
, pp. 111-119
-
-
Jing, H.1
Lopresti, D.2
Shih, C.3
-
8
-
-
33644553071
-
Performance evaluation for text processing of noisy inputs
-
Santa Fe, NM, March
-
D. Lopresti, "Performance evaluation for text processing of noisy inputs," in Proceedings of the 20th Annual ACM Symposium on Applied Computing (Document Engineering Track), pp. 759-763, (Santa Fe, NM), March 2005.
-
(2005)
Proceedings of the 20th Annual ACM Symposium on Applied Computing (Document Engineering Track)
, pp. 759-763
-
-
Lopresti, D.1
-
9
-
-
41149113389
-
-
Tesseract open source OCR engine, November 2007
-
"Tesseract open source OCR engine," November 2007. http://sourceforge.net/projects/tesseract-ocr.
-
-
-
-
13
-
-
0028750709
-
Classification and distribution of optical character recognition errors
-
San Jose, CA, February
-
J. Esakov, D. P. Lopresti, and J. S. Sandberg, "Classification and distribution of optical character recognition errors," in Proceedings of Document Recognition I (IS&T/SPIE Electronic Imaging), 2181, pp. 204-216, (San Jose, CA), February 1994.
-
(1994)
Proceedings of Document Recognition I (IS&T/SPIE Electronic Imaging)
, vol.2181
, pp. 204-216
-
-
Esakov, J.1
Lopresti, D.P.2
Sandberg, J.S.3
-
14
-
-
0043260095
-
Issues in automatic OCR error classification
-
April
-
J. Esakov, D. P. Lopresti, J. S. Sandberg, and J. Zhou, "Issues in automatic OCR error classification," in Proceedings of the Third Annual Symposium on Document Analysis and Information Retrieval, pp. 401-412, April 1994.
-
(1994)
Proceedings of the Third Annual Symposium on Document Analysis and Information Retrieval
, pp. 401-412
-
-
Esakov, J.1
Lopresti, D.P.2
Sandberg, J.S.3
Zhou, J.4
-
16
-
-
41149174659
-
-
Project Gutenberg, November 2007
-
"Project Gutenberg," November 2007. http://www.gutenberg.net/.
-
-
-
-
18
-
-
84886884351
-
Treebanks gone bad: Generating a treebank of ungrammatical English
-
Hyderabad, India, January 2007
-
J. Foster, "Treebanks gone bad: Generating a treebank of ungrammatical English," in Proceedings of the Workshop on Analytics for Noisy Unstructured Text Data, (Hyderabad, India), January 2007. http://research.ihost.com/and2007/cd/Proceedings.files/p39.pdf.
-
Proceedings of the Workshop on Analytics for Noisy Unstructured Text Data
-
-
Foster, J.1
-
19
-
-
85014862282
-
Prediction of OCR accuracy using simple image features
-
Montréal, Canada, August
-
L. R. Blando, J. Kanai, and T. A. Nartker, "Prediction of OCR accuracy using simple image features," in Proceedings of the Third International Conference on Document Analysis and Recognition, pp. 319-322, (Montréal, Canada), August 1995.
-
(1995)
Proceedings of the Third International Conference on Document Analysis and Recognition
, pp. 319-322
-
-
Blando, L.R.1
Kanai, J.2
Nartker, T.A.3
-
20
-
-
33644548762
-
Quality assessment and restoration of typewritten document images
-
Los Alamos National Laboratory
-
M. Cannon, J. Hochberg, and P. Kelly, "Quality assessment and restoration of typewritten document images," Tech. Rep. LA-UR 99-1233, Los Alamos National Laboratory, 1999.
-
(1999)
Tech. Rep. LA-UR 99-1233
-
-
Cannon, M.1
Hochberg, J.2
Kelly, P.3
-
21
-
-
0029754083
-
Assessment of image quality to predict readability of documents
-
San Jose, CA, January
-
V. Govindaraju and S. N. Srihari, "Assessment of image quality to predict readability of documents," in Proceedings of Document Recognition III (IS&T/SPIE Electronic Imaging), 2660, pp. 333-342, (San Jose, CA), January 1996.
-
(1996)
Proceedings of Document Recognition III (IS&T/SPIE Electronic Imaging)
, vol.2660
, pp. 333-342
-
-
Govindaraju, V.1
Srihari, S.N.2