-
1
-
-
84904559086
-
Combined script and page orientation estimation using the tesseract ocr engine
-
ACM
-
[1] Unnikrishnan, R., Smith, R., Combined script and page orientation estimation using the tesseract ocr engine. Proceedings of the International Workshop on Multilingual OCR, 2009, ACM, 6.
-
(2009)
Proceedings of the International Workshop on Multilingual OCR
, pp. 6
-
-
Unnikrishnan, R.1
Smith, R.2
-
2
-
-
78049529924
-
Script recognition: a review
-
[2] Ghosh, D., Dube, T., Shivaprasad, A.P., Script recognition: a review. Pattern Anal Mach Intell, IEEE Trans 32:12 (2010), 2142–2161.
-
(2010)
Pattern Anal Mach Intell, IEEE Trans
, vol.32
, Issue.12
, pp. 2142-2161
-
-
Ghosh, D.1
Dube, T.2
Shivaprasad, A.P.3
-
3
-
-
3042681884
-
Indian script character recognition: a survey
-
[3] Pal, U., Chaudhuri, B., Indian script character recognition: a survey. Pattern Recogn. 37:9 (2004), 1887–1899.
-
(2004)
Pattern Recogn.
, vol.37
, Issue.9
, pp. 1887-1899
-
-
Pal, U.1
Chaudhuri, B.2
-
4
-
-
84898778744
-
Photoocr: reading text in uncontrolled conditions
-
[4] Bissacco, A., Cummins, M., Netzer, Y., Neven, H., Photoocr: reading text in uncontrolled conditions. Proceedings of the IEEE International Conference on Computer Vision, 2013, 785–792.
-
(2013)
Proceedings of the IEEE International Conference on Computer Vision
, pp. 785-792
-
-
Bissacco, A.1
Cummins, M.2
Netzer, Y.3
Neven, H.4
-
5
-
-
84906517083
-
Deep features for text spotting
-
Springer
-
[5] Jaderberg, M., Vedaldi, A., Zisserman, A., Deep features for text spotting. Computer Vision–ECCV 2014, 2014, Springer, 512–528.
-
(2014)
Computer Vision–ECCV 2014
, pp. 512-528
-
-
Jaderberg, M.1
Vedaldi, A.2
Zisserman, A.3
-
6
-
-
84981285560
-
Real-time lexicon-free scene text localization and recognition
-
[6] Neumann, L., Matas, J., Real-time lexicon-free scene text localization and recognition. IEEE Trans. Pattern Anal. Mach. Intell. 38:9 (2016), 1872–1885.
-
(2016)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.38
, Issue.9
, pp. 1872-1885
-
-
Neumann, L.1
Matas, J.2
-
7
-
-
84962517309
-
Automatic script identification in the wild
-
IEEE
-
[7] Shi, B., Yao, C., Zhang, C., Guo, X., Huang, F., Bai, X., Automatic script identification in the wild. Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, 2015, IEEE, 531–535.
-
(2015)
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
, pp. 531-535
-
-
Shi, B.1
Yao, C.2
Zhang, C.3
Guo, X.4
Huang, F.5
Bai, X.6
-
8
-
-
84949254659
-
Script identification in the wild via discriminative convolutional neural network
-
[8] Shi, B., Bai, X., Yao, C., Script identification in the wild via discriminative convolutional neural network. Pattern Recogn. 52 (2016), 448–458.
-
(2016)
Pattern Recogn.
, vol.52
, pp. 448-458
-
-
Shi, B.1
Bai, X.2
Yao, C.3
-
9
-
-
85016165764
-
Visual script and language recognition
-
[9] Nicolaou, A., Bagdanov, A.D., Gomez-Bigorda, L., Karatzas, D., Visual script and language recognition. DAS, 2016.
-
(2016)
DAS
-
-
Nicolaou, A.1
Bagdanov, A.D.2
Gomez-Bigorda, L.3
Karatzas, D.4
-
10
-
-
84979530220
-
A fine-grained approach to scene text script identification
-
[10] Gomez-Bigorda, L., Karatzas, D., A fine-grained approach to scene text script identification. DAS, 2016.
-
(2016)
DAS
-
-
Gomez-Bigorda, L.1
Karatzas, D.2
-
11
-
-
84955683269
-
Multilingual scene character recognition with co-occurrence of histogram of oriented gradients
-
[11] Tian, S., Bhattacharya, U., Lu, S., Su, B., Wang, Q., Wei, X., Lu, Y., Tan, C.L., Multilingual scene character recognition with co-occurrence of histogram of oriented gradients. Pattern Recogn. 51 (2016), 125–134.
-
(2016)
Pattern Recogn.
, vol.51
, pp. 125-134
-
-
Tian, S.1
Bhattacharya, U.2
Lu, S.3
Su, B.4
Wang, Q.5
Wei, X.6
Lu, Y.7
Tan, C.L.8
-
12
-
-
33846942265
-
Script recognition in images with complex backgrounds
-
IEEE
-
[12] Gllavata, J., Freisleben, B., Script recognition in images with complex backgrounds. Signal Processing and Information Technology, 2005. Proceedings of the Fifth IEEE International Symposium on, 2005, IEEE, 589–594.
-
(2005)
Signal Processing and Information Technology, 2005. Proceedings of the Fifth IEEE International Symposium on
, pp. 589-594
-
-
Gllavata, J.1
Freisleben, B.2
-
13
-
-
84912012167
-
New gradient-spatial-structural features for video script identification
-
[13] Shivakumara, P., Yuan, Z., Zhao, D., Lu, T., Tan, C.L., New gradient-spatial-structural features for video script identification. Comput. Vision Image Understanding 130 (2015), 35–53.
-
(2015)
Comput. Vision Image Understanding
, vol.130
, pp. 35-53
-
-
Shivakumara, P.1
Yuan, Z.2
Zhao, D.3
Lu, T.4
Tan, C.L.5
-
14
-
-
84862283411
-
An analysis of single-layer networks in unsupervised feature learning
-
[14] Coates, A., Ng, A.Y., Lee, H., An analysis of single-layer networks in unsupervised feature learning. International Conference on Artificial Intelligence and Statistics, 2011, 215–223.
-
(2011)
International Conference on Artificial Intelligence and Statistics
, pp. 215-223
-
-
Coates, A.1
Ng, A.Y.2
Lee, H.3
-
15
-
-
51949090223
-
In defense of nearest-neighbor based image classification
-
IEEE
-
[15] Boiman, O., Shechtman, E., Irani, M., In defense of nearest-neighbor based image classification. Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, 2008, IEEE, 1–8.
-
(2008)
Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
, pp. 1-8
-
-
Boiman, O.1
Shechtman, E.2
Irani, M.3
-
16
-
-
28044462712
-
Palace: a multilingual document recognition system
-
World Scientific Singapore
-
[16] Spitz, A.L., Ozaki, M., Palace: a multilingual document recognition system. Document Analysis Systems, vol. 1, 1995, World Scientific, Singapore, 16–37.
-
(1995)
Document Analysis Systems
, vol.1
, pp. 16-37
-
-
Spitz, A.L.1
Ozaki, M.2
-
17
-
-
0031098394
-
Determination of the script and language content of document images
-
[17] Spitz, A.L., Determination of the script and language content of document images. Pattern Anal. Mach. Intell., IEEE Trans. 19:3 (1997), 235–245.
-
(1997)
Pattern Anal. Mach. Intell., IEEE Trans.
, vol.19
, Issue.3
, pp. 235-245
-
-
Spitz, A.L.1
-
18
-
-
0002231472
-
Language identification in complex, unoriented, and degraded document images
-
[18] Lee, D., Nohl, C.R., Baird, H.S., Language identification in complex, unoriented, and degraded document images. Ser. Mach. Percept. Artif. Intell. 29 (1998), 17–39.
-
(1998)
Ser. Mach. Percept. Artif. Intell.
, vol.29
, pp. 17-39
-
-
Lee, D.1
Nohl, C.R.2
Baird, H.S.3
-
19
-
-
0032316550
-
Skew detection, page segmentation, and script classification of printed document images
-
IEEE
-
[19] Waked, B., Bergler, S., Suen, C., Khoury, S., Skew detection, page segmentation, and script classification of printed document images. Systems, Man, and Cybernetics, 1998. 1998 IEEE International Conference on, vol. 5, 1998, IEEE, 4470–4475.
-
(1998)
Systems, Man, and Cybernetics, 1998. 1998 IEEE International Conference on
, vol.5
, pp. 4470-4475
-
-
Waked, B.1
Bergler, S.2
Suen, C.3
Khoury, S.4
-
20
-
-
85038084590
-
Trainable script identification strategies for indian languages
-
IEEE
-
[20] Chaudhury, S., Sheth, R., Trainable script identification strategies for indian languages. Document Analysis and Recognition, 1999. ICDAR’99. Proceedings of the Fifth International Conference on, 1999, IEEE, 657–660.
-
(1999)
Document Analysis and Recognition, 1999. ICDAR’99. Proceedings of the Fifth International Conference on
, pp. 657-660
-
-
Chaudhury, S.1
Sheth, R.2
-
21
-
-
85020199436
-
Automatic script identification from images using cluster-based templates
-
IEEE
-
[21] Hochberg, J., Kerns, L., Kelly, P., Thomas, T., Automatic script identification from images using cluster-based templates. Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on, vol. 1, 1995, IEEE, 378–381.
-
(1995)
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
, vol.1
, pp. 378-381
-
-
Hochberg, J.1
Kerns, L.2
Kelly, P.3
Thomas, T.4
-
22
-
-
0029547702
-
Language identification for printed text independent of segmentation
-
IEEE
-
[22] Wood, S.L., Yao, X., Krishnamurthi, K., Dang, L., Language identification for printed text independent of segmentation. Image Processing, 1995. Proceedings., International Conference on, vol. 3, 1995, IEEE, 428–431.
-
(1995)
Image Processing, 1995. Proceedings., International Conference on
, vol.3
, pp. 428-431
-
-
Wood, S.L.1
Yao, X.2
Krishnamurthi, K.3
Dang, L.4
-
23
-
-
0032122663
-
Rotation invariant texture features and their use in automatic script identification
-
[23] Tan, T., Rotation invariant texture features and their use in automatic script identification. Pattern Anal. Mach. Intell., IEEE Trans. 20:7 (1998), 751–756.
-
(1998)
Pattern Anal. Mach. Intell., IEEE Trans.
, vol.20
, Issue.7
, pp. 751-756
-
-
Tan, T.1
-
24
-
-
0035546419
-
Text analysis using local energy
-
[24] Chan, W., Coghill, G., Text analysis using local energy. Pattern Recogn. 34:12 (2001), 2523–2532.
-
(2001)
Pattern Recogn.
, vol.34
, Issue.12
, pp. 2523-2532
-
-
Chan, W.1
Coghill, G.2
-
25
-
-
33751299170
-
Script identification using steerable gabor filters
-
IEEE
-
[25] Pan, W., Suen, C.Y., Bui, T.D., Script identification using steerable gabor filters. Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on, 2005, IEEE, 883–887.
-
(2005)
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
, pp. 883-887
-
-
Pan, W.1
Suen, C.Y.2
Bui, T.D.3
-
26
-
-
84889574417
-
Lbp based line-wise script identification
-
IEEE
-
[26] Ferrer, M.A., Morales, A., Pal, U., Lbp based line-wise script identification. Document Analysis and Recognition (ICDAR), 2013 12th International Conference on, 2013, IEEE, 369–373.
-
(2013)
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
, pp. 369-373
-
-
Ferrer, M.A.1
Morales, A.2
Pal, U.3
-
27
-
-
0030151620
-
Page segmentation using texture analysis
-
[27] Jain, A.K., Zhong, Y., Page segmentation using texture analysis. Pattern Recogn. 29:5 (1996), 743–770.
-
(1996)
Pattern Recogn.
, vol.29
, Issue.5
, pp. 743-770
-
-
Jain, A.K.1
Zhong, Y.2
-
28
-
-
0141863195
-
Hierarchical content classification and script determination for automatic document image processing
-
[28] Chi, Z., Wang, Q., Siu, W.-C., Hierarchical content classification and script determination for automatic document image processing. Pattern Recogn. 36:11 (2003), 2483–2500.
-
(2003)
Pattern Recogn.
, vol.36
, Issue.11
, pp. 2483-2500
-
-
Chi, Z.1
Wang, Q.2
Siu, W.-C.3
-
29
-
-
0036466961
-
Exploiting zoning based on approximating splines in cursive script recognition
-
[29] Hennig, A., Sherkat, N., Exploiting zoning based on approximating splines in cursive script recognition. Pattern Recogn. 35:2 (2002), 445–454.
-
(2002)
Pattern Recogn.
, vol.35
, Issue.2
, pp. 445-454
-
-
Hennig, A.1
Sherkat, N.2
-
30
-
-
68249112410
-
Novel script line identification method for script normalization and feature extraction in on-line handwritten whiteboard note recognition
-
[30] Schenk, J., Lenz, J., Rigoll, G., Novel script line identification method for script normalization and feature extraction in on-line handwritten whiteboard note recognition. Pattern Recogn. 42:12 (2009), 3383–3393.
-
(2009)
Pattern Recogn.
, vol.42
, Issue.12
, pp. 3383-3393
-
-
Schenk, J.1
Lenz, J.2
Rigoll, G.3
-
31
-
-
68249091393
-
Language identification for handwritten document images using a shape codebook
-
[31] Zhu, G., Yu, X., Li, Y., Doermann, D., Language identification for handwritten document images using a shape codebook. Pattern Recogn. 42:12 (2009), 3184–3191.
-
(2009)
Pattern Recogn.
, vol.42
, Issue.12
, pp. 3184-3191
-
-
Zhu, G.1
Yu, X.2
Li, Y.3
Doermann, D.4
-
32
-
-
77953613023
-
A novel framework for automatic sorting of postal documents with multi-script address blocks
-
[32] Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K., A novel framework for automatic sorting of postal documents with multi-script address blocks. Pattern Recogn. 43:10 (2010), 3507–3521.
-
(2010)
Pattern Recogn.
, vol.43
, Issue.10
, pp. 3507-3521
-
-
Basu, S.1
Das, N.2
Sarkar, R.3
Kundu, M.4
Nasipuri, M.5
Basu, D.K.6
-
33
-
-
84920654331
-
Tensor representation learning based image patch analysis for text identification and recognition
-
[33] Zhong, G., Cheriet, M., Tensor representation learning based image patch analysis for text identification and recognition. Pattern Recogn. 48:4 (2015), 1211–1224.
-
(2015)
Pattern Recogn.
, vol.48
, Issue.4
, pp. 1211-1224
-
-
Zhong, G.1
Cheriet, M.2
-
34
-
-
84889609370
-
Word-wise script identification from video frames
-
IEEE
-
[34] Sharma, N., Chanda, S., Pal, U., Blumenstein, M., Word-wise script identification from video frames. Document Analysis and Recognition (ICDAR), 2013 12th International Conference on, 2013, IEEE, 867–871.
-
(2013)
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
, pp. 867-871
-
-
Sharma, N.1
Chanda, S.2
Pal, U.3
Blumenstein, M.4
-
35
-
-
82355175685
-
Video script identification based on text lines
-
IEEE
-
[35] Phan, T.Q., Shivakumara, P., Ding, Z., Lu, S., Tan, C.L., Video script identification based on text lines. Document Analysis and Recognition (ICDAR), 2011 International Conference on, 2011, IEEE, 1240–1244.
-
(2011)
Document Analysis and Recognition (ICDAR), 2011 International Conference on
, pp. 1240-1244
-
-
Phan, T.Q.1
Shivakumara, P.2
Ding, Z.3
Lu, S.4
Tan, C.L.5
-
36
-
-
84912030519
-
Gradient-angular-features for word-wise video script identification
-
IEEE
-
[36] Shivakumara, P., Sharma, N., Pal, U., Blumenstein, M., Tan, C.L., Gradient-angular-features for word-wise video script identification. 2014 22nd International Conference on Pattern Recognition (ICPR), 2014, IEEE, 3098–3103.
-
(2014)
2014 22nd International Conference on Pattern Recognition (ICPR)
, pp. 3098-3103
-
-
Shivakumara, P.1
Sharma, N.2
Pal, U.3
Blumenstein, M.4
Tan, C.L.5
-
37
-
-
84951169810
-
Bag-of-visual words for word-wise video script identification: A study
-
IEEE
-
[37] Sharma, N., Mandal, R., Sharma, R., Pal, U., Blumenstein, M., Bag-of-visual words for word-wise video script identification: A study. Neural Networks (IJCNN), 2015 International Joint Conference on, 2015, IEEE, 1–7.
-
(2015)
Neural Networks (IJCNN), 2015 International Joint Conference on
, pp. 1-7
-
-
Sharma, N.1
Mandal, R.2
Sharma, R.3
Pal, U.4
Blumenstein, M.5
-
38
-
-
84962579361
-
Icdar2015 competition on video script identification (cvsi 2015)
-
IEEE
-
[38] Sharma, N., Mandal, R., Sharma, R., Pal, U., Blumenstein, M., Icdar2015 competition on video script identification (cvsi 2015). Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, 2015, IEEE, 1196–1200.
-
(2015)
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
, pp. 1196-1200
-
-
Sharma, N.1
Mandal, R.2
Sharma, R.3
Pal, U.4
Blumenstein, M.5
-
39
-
-
81855221241
-
Sequential deep learning for human action recognition
-
Springer
-
[39] Baccouche, M., Mamalet, F., Wolf, C., Garcia, C., Baskurt, A., Sequential deep learning for human action recognition. International Workshop on Human Behavior Understanding, 2011, Springer, 29–39.
-
(2011)
International Workshop on Human Behavior Understanding
, pp. 29-39
-
-
Baccouche, M.1
Mamalet, F.2
Wolf, C.3
Garcia, C.4
Baskurt, A.5
-
40
-
-
84870183903
-
3d convolutional neural networks for human action recognition
-
[40] Ji, S., Xu, W., Yang, M., Yu, K., 3d convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35:1 (2013), 221–231.
-
(2013)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.35
, Issue.1
, pp. 221-231
-
-
Ji, S.1
Xu, W.2
Yang, M.3
Yu, K.4
-
41
-
-
84911364368
-
Large-scale video classification with convolutional neural networks
-
[41] Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., Fei-Fei, L., Large-scale video classification with convolutional neural networks. Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 2014, 1725–1732.
-
(2014)
Proceedings of the IEEE conference on Computer Vision and Pattern Recognition
, pp. 1725-1732
-
-
Karpathy, A.1
Toderici, G.2
Shetty, S.3
Leung, T.4
Sukthankar, R.5
Fei-Fei, L.6
-
42
-
-
84876231242
-
Imagenet classification with deep convolutional neural networks
-
[42] Krizhevsky, A., Sutskever, I., Hinton, G.E., Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems, 2012, 1097–1105.
-
(2012)
Advances in Neural Information Processing Systems
, pp. 1097-1105
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
43
-
-
0005594495
-
Signature verification using a "siamese" time delay neural network
-
[43] Bromley, J., Bentz, J.W., Bottou, L., Guyon, I., LeCun, Y., Moore, C., Säckinger, E., Shah, R., Signature verification using a "siamese" time delay neural network. Int. J. Pattern Recogn. Artif. Intell. 7:04 (1993), 669–688.
-
(1993)
Int. J. Pattern Recogn. Artif. Intell.
, vol.7
, Issue.4
, pp. 669-688
-
-
Bromley, J.1
Bentz, J.W.2
Bottou, L.3
Guyon, I.4
LeCun, Y.5
Moore, C.6
Säckinger, E.7
Shah, R.8
-
44
-
-
71249118724
-
Text localization in natural scene images based on conditional random field
-
IEEE
-
[44] Pan, Y.-F., Hou, X., Liu, C.-L., Text localization in natural scene images based on conditional random field. Document Analysis and Recognition, 2009. ICDAR’09. 10th International Conference on, 2009, IEEE, 6–10.
-
(2009)
Document Analysis and Recognition, 2009. ICDAR’09. 10th International Conference on
, pp. 6-10
-
-
Pan, Y.-F.1
Hou, X.2
Liu, C.-L.3
-
45
-
-
84866640582
-
Detecting texts of arbitrary orientations in natural images
-
IEEE
-
[45] Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z., Detecting texts of arbitrary orientations in natural images. Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, 2012, IEEE, 1083–1090.
-
(2012)
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
, pp. 1083-1090
-
-
Yao, C.1
Bai, X.2
Liu, W.3
Ma, Y.4
Tu, Z.5
-
46
-
-
70349280061
-
Character recognition in natural images
-
[46] de Campos, T.E., Babu, B.R., Varma, M., Character recognition in natural images. VISAPP (2), 2009, 273–280.
-
(2009)
VISAPP (2)
, pp. 273-280
-
-
de Campos, T.E.1
Babu, B.R.2
Varma, M.3
-
47
-
-
84999558635
-
Multi-script robust reading competition in icdar 2013
-
ACM
-
[47] Kumar, D., Prasad, M., Ramakrishnan, A., Multi-script robust reading competition in icdar 2013. Proceedings of the 4th International Workshop on Multilingual OCR, 2013, ACM, 14.
-
(2013)
Proceedings of the 4th International Workshop on Multilingual OCR
, pp. 14
-
-
Kumar, D.1
Prasad, M.2
Ramakrishnan, A.3
-
48
-
-
78149484901
-
Scene text extraction with edge constraint and text collinearity
-
IEEE
-
[48] Lee, S., Cho, M.S., Jung, K., Kim, J.H., Scene text extraction with edge constraint and text collinearity. Pattern Recognition (ICPR), 2010 20th International Conference on, 2010, IEEE, 3983–3986.
-
(2010)
Pattern Recognition (ICPR), 2010 20th International Conference on
, pp. 3983-3986
-
-
Lee, S.1
Cho, M.S.2
Jung, K.3
Kim, J.H.4
-
49
-
-
84903692134
-
An on-line platform for ground truthing and performance evaluation of text extraction systems
-
IEEE
-
[49] Karatzas, D., Robles, S., Gomez, L., An on-line platform for ground truthing and performance evaluation of text extraction systems. Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on, 2014, IEEE, 242–246.
-
(2014)
Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
, pp. 242-246
-
-
Karatzas, D.1
Robles, S.2
Gomez, L.3
-
50
-
-
84913580146
-
Caffe: Convolutional architecture for fast feature embedding
-
ACM
-
[50] Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T., Caffe: Convolutional architecture for fast feature embedding. Proceedings of the ACM International Conference on Multimedia, 2014, ACM, 675–678.
-
(2014)
Proceedings of the ACM International Conference on Multimedia
, pp. 675-678
-
-
Jia, Y.1
Shelhamer, E.2
Donahue, J.3
Karayev, S.4
Long, J.5
Girshick, R.6
Guadarrama, S.7
Darrell, T.8
-
51
-
-
77953183471
-
What is the best multi-stage architecture for object recognition?
-
IEEE
-
[51] Jarrett, K., Kavukcuoglu, K., Ranzato, M., LeCun, Y., What is the best multi-stage architecture for object recognition? Computer Vision, 2009 IEEE 12th International Conference on, 2009, IEEE, 2146–2153.
-
(2009)
Computer Vision, 2009 IEEE 12th International Conference on
, pp. 2146-2153
-
-
Jarrett, K.1
Kavukcuoglu, K.2
Ranzato, M.3
LeCun, Y.4
-
52
-
-
84904163933
-
Dropout: a simple way to prevent neural networks from overfitting
-
[52] Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R., Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15:1 (2014), 1929–1958.
-
(2014)
J. Mach. Learn. Res.
, vol.15
, Issue.1
, pp. 1929-1958
-
-
Srivastava, N.1
Hinton, G.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
53
-
-
0033284915
-
Object recognition from local scale-invariant features
-
IEEE
-
[53] Lowe, D.G., Object recognition from local scale-invariant features. Computer vision, 1999. The Proceedings of the Seventh IEEE International Conference on, vol. 2, 1999, IEEE, 1150–1157.
-
(1999)
Computer vision, 1999. The Proceedings of the Seventh IEEE International Conference on
, vol.2
, pp. 1150-1157
-
-
Lowe, D.G.1
-
54
-
-
70349362313
-
VLFeat: an open and portable library of computer vision algorithms
-
[54] Vedaldi, A., Fulkerson, B., VLFeat: an open and portable library of computer vision algorithms. 2008. (http://www.vlfeat.org/).
-
(2008)
-
-
Vedaldi, A.1
Fulkerson, B.2
-
55
-
-
50949133669
-
LIBLINEAR: a library for large linear classification
-
[55] Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., Lin, C.-J., LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9 (2008), 1871–1874.
-
(2008)
J. Mach. Learn. Res.
, vol.9
, pp. 1871-1874
-
-
Fan, R.-E.1
Chang, K.-W.2
Hsieh, C.-J.3
Wang, X.-R.4
Lin, C.-J.5
-
56
-
-
84962593507
-
Sparse radial sampling lbp for writer identification
-
IEEE
-
[56] Nicolaou, A., Bagdanov, A.D., Liwicki, M., Karatzas, D., Sparse radial sampling lbp for writer identification. Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, 2015, IEEE, 716–720.
-
(2015)
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
, pp. 716-720
-
-
Nicolaou, A.1
Bagdanov, A.D.2
Liwicki, M.3
Karatzas, D.4
-
57
-
-
0000596361
-
Note on the sampling error of the difference between correlated proportions or percentages
-
[57] McNemar, Q., Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12:2 (1947), 153–157.
-
(1947)
Psychometrika
, vol.12
, Issue.2
, pp. 153-157
-
-
McNemar, Q.1
-
58
-
-
85016010144
-
A fast hierarchical method for multi-script and arbitrary oriented scene text extraction
-
[58] Gomez, L., Karatzas, D., A fast hierarchical method for multi-script and arbitrary oriented scene text extraction. arXiv preprint arXiv:1407.7504, 2014.
-
(2014)
arXiv preprint arXiv:1407.7504
-
-
Gomez, L.1
Karatzas, D.2
-
59
-
-
3142736062
-
Robust wide-baseline stereo from maximally stable extremal regions
-
[59] Matas, J., Chum, O., Urban, M., Pajdla, T., Robust wide-baseline stereo from maximally stable extremal regions. Image Vision Comput. 22:10 (2004), 761–767.
-
(2004)
Image Vision Comput.
, vol.22
, Issue.10
, pp. 761-767
-
-
Matas, J.1
Chum, O.2
Urban, M.3
Pajdla, T.4
-
60
-
-
84921069139
-
The pascal visual object classes challenge: a retrospective
-
[60] Everingham, M., Eslami, S.A., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A., The pascal visual object classes challenge: a retrospective. Int. J. Comput. Vision 111:1 (2015), 98–136.
-
(2015)
Int. J. Comput. Vision
, vol.111
, Issue.1
, pp. 98-136
-
-
Everingham, M.1
Eslami, S.A.2
Van Gool, L.3
Williams, C.K.4
Winn, J.5
Zisserman, A.6
-
61
-
-
84962622810
-
Icdar 2015 competition on robust reading
-
IEEE
-
[61] Karatzas, D., Gomez-Bigorda, L., Nicolaou, A., Ghosh, S., Bagdanov, A., Iwamura, M., Matas, J., Neumann, L., Chandrasekhar, V.R., Lu, S., et al. Icdar 2015 competition on robust reading. Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, 2015, IEEE, 1156–1160.
-
(2015)
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
, pp. 1156-1160
-
-
Karatzas, D.1
Gomez-Bigorda, L.2
Nicolaou, A.3
Ghosh, S.4
Bagdanov, A.5
Iwamura, M.6
Matas, J.7
Neumann, L.8
Chandrasekhar, V.R.9
Lu, S.10
-
62
-
-
51149098551
-
An overview of the tesseract ocr engine
-
IEEE
-
[62] Smith, R., An overview of the tesseract ocr engine. ICDAR, 2007, IEEE, 629–633.
-
(2007)
ICDAR
, pp. 629-633
-
-
Smith, R.1
-
63
-
-
84889606097
-
Image binarization for end-to-end text understanding in natural images
-
IEEE
-
[63] Milyaev, S., Barinova, O., Novikova, T., Kohli, P., Lempitsky, V., Image binarization for end-to-end text understanding in natural images. Document Analysis and Recognition (ICDAR), 2013 12th International Conference on, 2013, IEEE, 128–132.
-
(2013)
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
, pp. 128-132
-
-
Milyaev, S.1
Barinova, O.2
Novikova, T.3
Kohli, P.4
Lempitsky, V.5
-
64
-
-
84942517592
-
Scene text recognition: No country for old men?
-
Springer
-
[64] Gómez, L., Karatzas, D., Scene text recognition: No country for old men? Computer Vision-ACCV 2014 Workshops, 2014, Springer, 157–168.
-
(2014)
Computer Vision-ACCV 2014 Workshops
, pp. 157-168
-
-
Gómez, L.1
Karatzas, D.2
-
65
-
-
84939960007
-
Fast and accurate scene text understanding with image binarization and off-the-shelf ocr
-
[65] Milyaev, S., Barinova, O., Novikova, T., Kohli, P., Lempitsky, V., Fast and accurate scene text understanding with image binarization and off-the-shelf ocr. Int. J. Doc. Anal. Recogn. (IJDAR) 18:2 (2015), 169–182.
-
(2015)
Int. J. Doc. Anal. Recogn. (IJDAR)
, vol.18
, Issue.2
, pp. 169-182
-
-
Milyaev, S.1
Barinova, O.2
Novikova, T.3
Kohli, P.4
Lempitsky, V.5
-
66
-
-
84863057818
-
End-to-end scene text recognition
-
IEEE
-
[66] Wang, K., Babenko, B., Belongie, S., End-to-end scene text recognition. Computer Vision (ICCV), 2011 IEEE International Conference on, 2011, IEEE, 1457–1464.
-
(2011)
Computer Vision (ICCV), 2011 IEEE International Conference on
, pp. 1457-1464
-
-
Wang, K.1
Babenko, B.2
Belongie, S.3
-
67
-
-
84889582459
-
Icdar 2013 robust reading competition
-
IEEE
-
[67] Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., Gomez i Bigorda, L., Robles Mestre, S., Mas, J., Fernandez Mota, D., Almazan Almazan, J., de las Heras, L.-P., Icdar 2013 robust reading competition. Document Analysis and Recognition (ICDAR), 2013 12th International Conference on, 2013, IEEE, 1484–1493.
-
(2013)
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
, pp. 1484-1493
-
-
Karatzas, D.1
Shafait, F.2
Uchida, S.3
Iwamura, M.4
Gomez i Bigorda, L.5
Robles Mestre, S.6
Mas, J.7
Fernandez Mota, D.8
Almazan Almazan, J.9
de las Heras, L.-P.10
-
68
-
-
84962556250
-
Alif: A dataset for arabic embedded text recognition in tv broadcast
-
IEEE
-
[68] Yousfi, S., Berrani, S.-A., Garcia, C., Alif: A dataset for arabic embedded text recognition in tv broadcast. Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, 2015, IEEE, 1221–1225.
-
(2015)
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
, pp. 1221-1225
-
-
Yousfi, S.1
Berrani, S.-A.2
Garcia, C.3