-
1
-
-
84906500060
-
-
http://algoval.essex.ac.uk/icdar/datasets.html
-
-
-
-
2
-
-
84906490584
-
-
https://code.google.com/p/tesseract-ocr/
-
-
-
-
3
-
-
84906490586
-
-
http://www.flickr.com/
-
-
-
-
4
-
-
84906500058
-
-
http://www.flickr.com/groups/type/
-
-
-
-
5
-
-
84906509767
-
-
http://www.iapr-tc11.org/mediawiki/index.php/kaist-scene-text-database
-
-
-
-
6
-
-
85083953799
-
End-to-End Text Recognition with Hybrid HMM Maxout Models
-
Alsharif, O., Pineau, J.: End-to-End Text Recognition with Hybrid HMM Maxout Models. In: ICLR (2014)
-
(2014)
ICLR
-
-
Alsharif, O.1
Pineau, J.2
-
7
-
-
84880615968
-
Detection of artificial and scene text in images and video frames
-
Anthimopoulos, M., Gatos, B., Pratikakis, I.: Detection of artificial and scene text in images and video frames. Pattern Analysis and Applications, 1-16 (2011)
-
(2011)
Pattern Analysis and Applications
, pp. 1-16
-
-
Anthimopoulos, M.1
Gatos, B.2
Pratikakis, I.3
-
8
-
-
84898778744
-
PhotoOCR: Reading text in uncontrolled conditions
-
Bissacco, A., Cummins, M., Netzer, Y., Neven, H.: PhotoOCR: Reading text in uncontrolled conditions. In: ICCV (2013)
-
(2013)
ICCV
-
-
Bissacco, A.1
Cummins, M.2
Netzer, Y.3
Neven, H.4
-
9
-
-
0034844730
-
Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images
-
Boykov, Y., Jolly, M.P.: Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images. In: Proc. ICCV, vol. 2, pp. 105-112 (2001)
-
(2001)
Proc. ICCV
, vol.2
, pp. 105-112
-
-
Boykov, Y.1
Jolly, M.P.2
-
10
-
-
84865813192
-
-
de Campos, T., Babu, B.R., Varma, M.: Character recognition in natural images, pp. 591-604 (2009)
-
(2009)
Character Recognition in Natural Images
, pp. 591-604
-
-
De Campos, T.1
Babu, B.R.2
Varma, M.3
-
11
-
-
84863052045
-
Robust text detection in natural images with edge-enhanced maximally stable extremal regions
-
Chen, H., Tsai, S., Schroth, G., Chen, D., Grzeszczuk, R., Girod, B.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: Proc. International Conference on Image Processing (ICIP), pp. 2609-2612 (2011)
-
(2011)
Proc. International Conference on Image Processing (ICIP)
, pp. 2609-2612
-
-
Chen, H.1
Tsai, S.2
Schroth, G.3
Chen, D.4
Grzeszczuk, R.5
Girod, B.6
-
12
-
-
5044227851
-
Detecting and reading text in natural scenes
-
IEEE
-
Chen, X., Yuille, A.L.: Detecting and reading text in natural scenes. In: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004, vol. 2, p. II-366. IEEE (2004)
-
(2004)
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004
, vol.2
-
-
Chen, X.1
Yuille, A.L.2
-
13
-
-
82355160847
-
Text detection and character recognition in scene images with unsupervised feature learning
-
IEEE
-
Coates, A., Carpenter, B., Case, C., Satheesh, S., Suresh, B., Wang, T., Wu, D.J., Ng, A.Y.: Text detection and character recognition in scene images with unsupervised feature learning. In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 440-445. IEEE (2011)
-
(2011)
2011 International Conference on Document Analysis and Recognition (ICDAR)
, pp. 440-445
-
-
Coates, A.1
Carpenter, B.2
Case, C.3
Satheesh, S.4
Suresh, B.5
Wang, T.6
Wu, D.J.7
Ng, A.Y.8
-
14
-
-
84904482223
-
-
arXiv preprint arXiv:1310.1531
-
Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: Decaf: A deep convolutional activation feature for generic visual recognition. arXiv preprint arXiv:1310.1531 (2013)
-
(2013)
Decaf: A Deep Convolutional Activation Feature for Generic Visual Recognition
-
-
Donahue, J.1
Jia, Y.2
Vinyals, O.3
Hoffman, J.4
Zhang, N.5
Tzeng, E.6
Darrell, T.7
-
15
-
-
84862061986
-
Robust recognition of degraded documents using character n-grams
-
IEEE
-
Dutta, S., Sankaran, N., Sankar, K., Jawahar, C.: Robust recognition of degraded documents using character n-grams. In: International Workshop on Document Analysis Systems (DAS), pp. 130-134. IEEE (2012)
-
(2012)
International Workshop on Document Analysis Systems (DAS)
, pp. 130-134
-
-
Dutta, S.1
Sankaran, N.2
Sankar, K.3
Jawahar, C.4
-
16
-
-
77955991043
-
Detecting text in natural scenes with stroke width transform
-
IEEE
-
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proc. CVPR, pp. 2963-2970. IEEE (2010)
-
(2010)
Proc. CVPR
, pp. 2963-2970
-
-
Epshtein, B.1
Ofek, E.2
Wexler, Y.3
-
17
-
-
77949524387
-
-
Tech. rep. University of Montreal
-
Erhan, D., Bengio, Y., Courville, A., Vincent, P.: Visualizing higher-layer features of a deep network. Tech. rep. University of Montreal (2009)
-
(2009)
Visualizing Higher-layer Features of a Deep Network
-
-
Erhan, D.1
Bengio, Y.2
Courville, A.3
Vincent, P.4
-
18
-
-
4644354464
-
Pictorial structures for object recognition
-
Felzenszwalb, P., Huttenlocher, D.: Pictorial structures for object recognition. IJCV 61(1) (2005)
-
(2005)
IJCV
, vol.61
, Issue.1
-
-
Felzenszwalb, P.1
Huttenlocher, D.2
-
19
-
-
84889587871
-
Whole is greater than sum of parts: Recognizing scene text words
-
IEEE
-
Goel, V., Mishra, A., Alahari, K., Jawahar, C.: Whole is greater than sum of parts: Recognizing scene text words. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 398-402. IEEE (2013)
-
(2013)
2013 12th International Conference on Document Analysis and Recognition (ICDAR)
, pp. 398-402
-
-
Goel, V.1
Mishra, A.2
Alahari, K.3
Jawahar, C.4
-
20
-
-
85083953281
-
Multi-digit number recognition from street view imagery using deep convolutional neural networks
-
Goodfellow, I.J., Bulatov, Y., Ibarz, J., Arnoud, S., Shet, V.: Multi-digit number recognition from street view imagery using deep convolutional neural networks. In: ICLR (2014)
-
(2014)
ICLR
-
-
Goodfellow, I.J.1
Bulatov, Y.2
Ibarz, J.3
Arnoud, S.4
Shet, V.5
-
21
-
-
84892421248
-
-
arXiv preprint arXiv:1302.4389
-
Goodfellow, I.J., Warde-Farley, D., Mirza, M., Courville, A., Bengio, Y.: Maxout networks. arXiv preprint arXiv:1302.4389 (2013)
-
(2013)
Maxout Networks
-
-
Goodfellow, I.J.1
Warde-Farley, D.2
Mirza, M.3
Courville, A.4
Bengio, Y.5
-
22
-
-
84867720412
-
-
arXiv preprint arXiv:1207.0580
-
Hinton, G.E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.R.: Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580 (2012)
-
(2012)
Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
-
-
Hinton, G.E.1
Srivastava, N.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.R.5
-
24
-
-
84889582459
-
Icdar 2013 robust reading competition
-
IEEE
-
Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., Mestre, S.R., Mas, J., Mota, D.F., Almazan, J.A., de las Heras, L.P., et al.: Icdar 2013 robust reading competition. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 1484-1493. IEEE (2013)
-
(2013)
2013 12th International Conference on Document Analysis and Recognition (ICDAR)
, pp. 1484-1493
-
-
Karatzas, D.1
Shafait, F.2
Uchida, S.3
Iwamura, M.4
Mestre, S.R.5
Mas, J.6
Mota, D.F.7
Almazan, J.A.8
De Las Heras, L.P.9
-
25
-
-
84878919540
-
Imagenet classification with deep convolutional neural networks
-
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: NIPS, vol. 1, p. 4 (2012)
-
(2012)
NIPS
, vol.1
, pp. 4
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
26
-
-
33645693855
-
Key-text spotting in documentary videos using adaboost
-
International Society for Optics and Photonics
-
Lalonde, M., Gagnon, L.: Key-text spotting in documentary videos using adaboost. In: Electronic Imaging 2006, p. 60641N. International Society for Optics and Photonics (2006)
-
(2006)
Electronic Imaging 2006
-
-
Lalonde, M.1
Gagnon, L.2
-
27
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proceedings of the IEEE 86(11), 2278-2324 (1998)
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
29
-
-
0041416425
-
Robust wide baseline stereo from maximally stable extremal regions
-
Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. In: Proc. BMVC, pp. 384-393 (2002)
-
(2002)
Proc. BMVC
, pp. 384-393
-
-
Matas, J.1
Chum, O.2
Urban, M.3
Pajdla, T.4
-
30
-
-
84906509764
-
Fast training of convolutional networks through FFTs
-
abs/1312.5851
-
Mathieu, M., Henaff, M., LeCun, Y.: Fast training of convolutional networks through FFTs. CoRR abs/1312.5851 (2013)
-
(2013)
CoRR
-
-
Mathieu, M.1
Henaff, M.2
LeCun, Y.3
-
32
-
-
79952525611
-
A method for text localization and recognition in realworld images
-
Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III Springer, Heidelberg
-
Neumann, L., Matas, J.: A method for text localization and recognition in realworld images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 770-783. Springer, Heidelberg (2011)
-
(2011)
LNCS
, vol.6494
, pp. 770-783
-
-
Neumann, L.1
Matas, J.2
-
33
-
-
82455203972
-
Text localization in real-world images using efficiently pruned exhaustive search
-
IEEE
-
Neumann, L., Matas, J.: Text localization in real-world images using efficiently pruned exhaustive search. In: Proc. ICDAR, pp. 687-691. IEEE (2011)
-
(2011)
Proc. ICDAR
, pp. 687-691
-
-
Neumann, L.1
Matas, J.2
-
34
-
-
84881132045
-
Real-time scene text localization and recognition
-
IEEE
-
Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: Proc. CVPR, vol. 3, pp. 1187-1190. IEEE (2012)
-
(2012)
Proc. CVPR
, vol.3
, pp. 1187-1190
-
-
Neumann, L.1
Matas, J.2
-
36
-
-
84867865679
-
Large-lexicon attributeconsistent text recognition in natural images
-
Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI Springer, Heidelberg
-
Novikova, T., Barinova, O., Kohli, P., Lempitsky, V.: Large-lexicon attributeconsistent text recognition in natural images. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 752-765. Springer, Heidelberg (2012)
-
(2012)
LNCS
, vol.7577
, pp. 752-765
-
-
Novikova, T.1
Barinova, O.2
Kohli, P.3
Lempitsky, V.4
-
37
-
-
0018306059
-
A threshold selection method from gray-level histograms
-
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Transactions on Systems, Man, and Cybernetics 9(1), 62-66 (1979)
-
(1979)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.9
, Issue.1
, pp. 62-66
-
-
Otsu, N.1
-
39
-
-
78651477245
-
Using text-spotting to query the world
-
Posner, I., Corke, P., Newman, P.: Using text-spotting to query the world. In: Proc. of the IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, IROS (2010)
-
Proc. of the IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, IROS (2010)
-
-
Posner, I.1
Corke, P.2
Newman, P.3
-
41
-
-
34147194609
-
Word spotting for historical documents
-
Rath, T., Manmatha, R.: Word spotting for historical documents. IJDAR 9(2-4), 139-152 (2007)
-
(2007)
IJDAR
, vol.9
, Issue.2-4
, pp. 139-152
-
-
Rath, T.1
Manmatha, R.2
-
42
-
-
82355175563
-
Icdar 2011 robust reading competition challenge 2: Reading text in scene images
-
IEEE
-
Shahab, A., Shafait, F., Dengel, A.: Icdar 2011 robust reading competition challenge 2: Reading text in scene images. In: Proc. ICDAR, pp. 1491-1496. IEEE (2011)
-
(2011)
Proc. ICDAR
, pp. 1491-1496
-
-
Shahab, A.1
Shafait, F.2
Dengel, A.3
-
44
-
-
5044224293
-
Sharing features: Efficient boosting procedures for multiclass object detection
-
Torralba, A., Murphy, K.P., Freeman, W.T.: Sharing features: efficient boosting procedures for multiclass object detection. In: Proc. CVPR, pp. 762-769 (2004)
-
(2004)
Proc. CVPR
, pp. 762-769
-
-
Torralba, A.1
Murphy, K.P.2
Freeman, W.T.3
-
45
-
-
84863057818
-
End-to-end scene text recognition
-
IEEE
-
Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: Proc. ICCV, pp. 1457-1464. IEEE (2011)
-
(2011)
Proc. ICCV
, pp. 1457-1464
-
-
Wang, K.1
Babenko, B.2
Belongie, S.3
-
46
-
-
78149313522
-
Word spotting in the wild
-
Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I Springer, Heidelberg
-
Wang, K., Belongie, S.: Word spotting in the wild. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 591-604. Springer, Heidelberg (2010)
-
(2010)
LNCS
, vol.6311
, pp. 591-604
-
-
Wang, K.1
Belongie, S.2
-
47
-
-
84874562673
-
End-to-end text recognition with convolutional neural networks
-
IEEE
-
Wang, T., Wu, D.J., Coates, A., Ng, A.Y.: End-to-end text recognition with convolutional neural networks. In: 2012 21st International Conference on Pattern Recognition (ICPR), pp. 3304-3308. IEEE (2012)
-
(2012)
2012 21st International Conference on Pattern Recognition (ICPR)
, pp. 3304-3308
-
-
Wang, T.1
Wu, D.J.2
Coates, A.3
Ng, A.Y.4
-
48
-
-
84891621153
-
Toward integrated scene text reading
-
Weinman, J.J., Butler, Z., Knoll, D., Feild, J.: Toward integrated scene text reading. IEEE Trans. Pattern Anal. Mach. Intell. 36(2), 375-387 (2014)
-
(2014)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.36
, Issue.2
, pp. 375-387
-
-
Weinman, J.J.1
Butler, Z.2
Knoll, D.3
Feild, J.4
-
49
-
-
84894625033
-
A framework for improved video text detection and recognition
-
Yang, H., Quehl, B., Sack, H.: A framework for improved video text detection and recognition. Int. Journal of Multimedia Tools and Applications, MTAP (2012)
-
(2012)
Int. Journal of Multimedia Tools and Applications, MTAP
-
-
Yang, H.1
Quehl, B.2
Sack, H.3
-
50
-
-
82355182431
-
Text string detection from natural scenes by structure-based partition and grouping
-
Yi, C., Tian, Y.: Text string detection from natural scenes by structure-based partition and grouping. IEEE Transactions on Image Processing 20(9), 2594-2605 (2011)
-
(2011)
IEEE Transactions on Image Processing
, vol.20
, Issue.9
, pp. 2594-2605
-
-
Yi, C.1
Tian, Y.2
-
51
-
-
84906509755
-
Robust text detection in natural scene images
-
abs/1301.2628
-
Yin, X.C., Yin, X., Huang, K.: Robust text detection in natural scene images. CoRR abs/1301.2628 (2013)
-
(2013)
CoRR
-
-
Yin, X.C.1
Yin, X.2
Huang, K.3
|