SCOPUS 정보 검색 플랫폼

Volumn 67, Issue , 2017, Pages 85-96

Improving patch-based scene text script identification with ensembles of conjoined networks

(3) Gomez, Lluis a Nicolaou, Anguelos a Karatzas, Dimosthenis a

a Barcelona Perception Computing Lab (Spain)

Author keywords

Convolutional neural networks; Ensemble of conjoined networks; Multi language OCR; Scene text understanding; Script identification

Indexed keywords

NEURAL NETWORKS;

CLASSIFICATION FRAMEWORK; CLASSIFICATION SCHEME; CONVOLUTIONAL NEURAL NETWORK; KEY CHARACTERISTICS; LEARNING PROCEDURES; MULTI LANGUAGES; SCENE TEXT; SCRIPT IDENTIFICATION;

ASPECT RATIO;

EID: 85015987251 PISSN: 00313203 EISSN: None Source Type: Journal
DOI: 10.1016/j.patcog.2017.01.032 Document Type: Article

Times cited : (78)

References (68)

1
- 84904559086
- Combined script and page orientation estimation using the tesseract ocr engine
- ACM
- [1] Unnikrishnan, R., Smith, R., Combined script and page orientation estimation using the tesseract ocr engine. Proceedings of the International Workshop on Multilingual OCR, 2009, ACM, 6.
- (2009) Proceedings of the International Workshop on Multilingual OCR , pp. 6
- Unnikrishnan, R.¹ Smith, R.²

2
- 78049529924
- Script recognitiona review
- [2] Ghosh, D., Dube, T., Shivaprasad, A.P., Script recognitiona review. Pattern Anal Mach Intell, IEEE Trans 32:12 (2010), 2142–2161.
- (2010) Pattern Anal Mach Intell, IEEE Trans , vol.32 , Issue.12 , pp. 2142-2161
- Ghosh, D.¹ Dube, T.² Shivaprasad, A.P.³

3
- 3042681884
- Indian script character recognition: a survey
- [3] Pal, U., Chaudhuri, B., Indian script character recognition: a survey. Pattern Recogn. 37:9 (2004), 1887–1899.
- (2004) Pattern Recogn. , vol.37 , Issue.9 , pp. 1887-1899
- Pal, U.¹ Chaudhuri, B.²

4
- 84898778744
- Photoocr: reading text in uncontrolled conditions
- [4] Bissacco, A., Cummins, M., Netzer, Y., Neven, H., Photoocr: reading text in uncontrolled conditions. Proceedings of the IEEE International Conference on Computer Vision, 2013, 785–792.
- (2013) Proceedings of the IEEE International Conference on Computer Vision , pp. 785-792
- Bissacco, A.¹ Cummins, M.² Netzer, Y.³ Neven, H.⁴

5
- 84906517083
- Deep features for text spotting
- Springer
- [5] Jaderberg, M., Vedaldi, A., Zisserman, A., Deep features for text spotting. Computer Vision–ECCV 2014, 2014, Springer, 512–528.
- (2014) Computer Vision–ECCV 2014 , pp. 512-528
- Jaderberg, M.¹ Vedaldi, A.² Zisserman, A.³

6
- 84981285560
- Real-time lexicon-free scene text localization and recognition
- [6] Neumann, L., Matas, J., Real-time lexicon-free scene text localization and recognition. IEEE Trans. Pattern Anal. Mach. Intell. 38:9 (2016), 1872–1885.
- (2016) IEEE Trans. Pattern Anal. Mach. Intell. , vol.38 , Issue.9 , pp. 1872-1885
- Neumann, L.¹ Matas, J.²

7
- 84962517309
- Automatic script identification in the wild
- IEEE
- [7] Shi, B., Yao, C., Zhang, C., Guo, X., Huang, F., Bai, X., Automatic script identification in the wild. Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, 2015, IEEE, 531–535.
- (2015) Document Analysis and Recognition (ICDAR), 2015 13th International Conference on , pp. 531-535
- Shi, B.¹ Yao, C.² Zhang, C.³ Guo, X.⁴ Huang, F.⁵ Bai, X.⁶

8
- 84949254659
- Script identification in the wild via discriminative convolutional neural network
- [8] Shi, B., Bai, X., Yao, C., Script identification in the wild via discriminative convolutional neural network. Pattern Recogn. 52 (2016), 448–458.
- (2016) Pattern Recogn. , vol.52 , pp. 448-458
- Shi, B.¹ Bai, X.² Yao, C.³

9
- 85016165764
- Visual script and language recognition
- [9] Nicolaou, A., Bagdanov, A.D., Gomez-Bigorda, L., Karatzas, D., Visual script and language recognition. DAS, 2016.
- (2016) DAS
- Nicolaou, A.¹ Bagdanov, A.D.² Gomez-Bigorda, L.³ Karatzas, D.⁴

10
- 84979530220
- A fine-grained approach to scene text script identification
- [10] Gomez-Bigorda, L., Karatzas, D., A fine-grained approach to scene text script identification. DAS, 2016.
- (2016) DAS
- Gomez-Bigorda, L.¹ Karatzas, D.²

11
- 84955683269
- Multilingual scene character recognition with co-occurrence of histogram of oriented gradients
- [11] Tian, S., Bhattacharya, U., Lu, S., Su, B., Wang, Q., Wei, X., Lu, Y., Tan, C.L., Multilingual scene character recognition with co-occurrence of histogram of oriented gradients. Pattern Recogn. 51 (2016), 125–134.
- (2016) Pattern Recogn. , vol.51 , pp. 125-134
- Tian, S.¹ Bhattacharya, U.² Lu, S.³ Su, B.⁴ Wang, Q.⁵ Wei, X.⁶ Lu, Y.⁷ Tan, C.L.⁸

12
- 33846942265
- Script recognition in images with complex backgrounds
- IEEE
- [12] Gllavata, J., Freisleben, B., Script recognition in images with complex backgrounds. Signal Processing and Information Technology, 2005. Proceedings of the Fifth IEEE International Symposium on, 2005, IEEE, 589–594.
- (2005) Signal Processing and Information Technology, 2005. Proceedings of the Fifth IEEE International Symposium on , pp. 589-594
- Gllavata, J.¹ Freisleben, B.²

13
- 84912012167
- New gradient-spatial-structural features for video script identification
- [13] Shivakumara, P., Yuan, Z., Zhao, D., Lu, T., Tan, C.L., New gradient-spatial-structural features for video script identification. Comput. Vision Image Understanding 130 (2015), 35–53.
- (2015) Comput. Vision Image Understanding , vol.130 , pp. 35-53
- Shivakumara, P.¹ Yuan, Z.² Zhao, D.³ Lu, T.⁴ Tan, C.L.⁵

14
- 84862283411
- An analysis of single-layer networks in unsupervised feature learning
- [14] Coates, A., Ng, A.Y., Lee, H., An analysis of single-layer networks in unsupervised feature learning. International Conference on Artificial Intelligence and Statistics, 2011, 215–223.
- (2011) International Conference on Artificial Intelligence and Statistics , pp. 215-223
- Coates, A.¹ Ng, A.Y.² Lee, H.³

15
- 51949090223
- In defense of nearest-neighbor based image classification
- IEEE
- [15] Boiman, O., Shechtman, E., Irani, M., In defense of nearest-neighbor based image classification. Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, 2008, IEEE, 1–8.
- (2008) Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on , pp. 1-8
- Boiman, O.¹ Shechtman, E.² Irani, M.³

16
- 28044462712
- Palace: a multilingual document recognition system
- World Scientific Singapore
- [16] Spitz, A.L., Ozaki, M., Palace: a multilingual document recognition system. Document Analysis Systems, vol. 1, 1995, World Scientific, Singapore, 16–37.
- (1995) Document Analysis Systems , vol.1 , pp. 16-37
- Spitz, A.L.¹ Ozaki, M.²

17
- 0031098394
- Determination of the script and language content of document images
- [17] Spitz, A.L., Determination of the script and language content of document images. Pattern Anal. Mach. Intell., IEEE Trans. 19:3 (1997), 235–245.
- (1997) Pattern Anal. Mach. Intell., IEEE Trans. , vol.19 , Issue.3 , pp. 235-245
- Spitz, A.L.¹

18
- 0002231472
- Language identification in complex, unoriented, and degraded document images
- [18] Lee, D., Nohl, C.R., Baird, H.S., Language identification in complex, unoriented, and degraded document images. Ser. Mach. Percept. Artif. Intell. 29 (1998), 17–39.
- (1998) Ser. Mach. Percept. Artif. Intell. , vol.29 , pp. 17-39
- Lee, D.¹ Nohl, C.R.² Baird, H.S.³

19
- 0032316550
- Skew detection, page segmentation, and script classification of printed document images
- IEEE
- [19] Waked, B., Bergler, S., Suen, C., Khoury, S., Skew detection, page segmentation, and script classification of printed document images. Systems, Man, and Cybernetics, 1998. 1998 IEEE International Conference on, vol. 5, 1998, IEEE, 4470–4475.
- (1998) Systems, Man, and Cybernetics, 1998. 1998 IEEE International Conference on , vol.5 , pp. 4470-4475
- Waked, B.¹ Bergler, S.² Suen, C.³ Khoury, S.⁴

20
- 85038084590
- Trainable script identification strategies for indian languages
- IEEE
- [20] Chaudhury, S., Sheth, R., Trainable script identification strategies for indian languages. Document Analysis and Recognition, 1999. ICDAR’99. Proceedings of the Fifth International Conference on, 1999, IEEE, 657–660.
- (1999) Document Analysis and Recognition, 1999. ICDAR’99. Proceedings of the Fifth International Conference on , pp. 657-660
- Chaudhury, S.¹ Sheth, R.²

21
- 85020199436
- Automatic script identification from images using cluster-based templates
- IEEE
- [21] Hochberg, J., Kerns, L., Kelly, P., Thomas, T., Automatic script identification from images using cluster-based templates. Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on, vol. 1, 1995, IEEE, 378–381.
- (1995) Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on , vol.1 , pp. 378-381
- Hochberg, J.¹ Kerns, L.² Kelly, P.³ Thomas, T.⁴

22
- 0029547702
- Language identification for printed text independent of segmentation
- IEEE
- [22] Wood, S.L., Yao, X., Krishnamurthi, K., Dang, L., Language identification for printed text independent of segmentation. Image Processing, 1995. Proceedings., International Conference on, vol. 3, 1995, IEEE, 428–431.
- (1995) Image Processing, 1995. Proceedings., International Conference on , vol.3 , pp. 428-431
- Wood, S.L.¹ Yao, X.² Krishnamurthi, K.³ Dang, L.⁴

23
- 0032122663
- Rotation invariant texture features and their use in automatic script identification
- [23] Tan, T., Rotation invariant texture features and their use in automatic script identification. Pattern Anal. Mach. Intell., IEEE Trans. 20:7 (1998), 751–756.
- (1998) Pattern Anal. Mach. Intell., IEEE Trans. , vol.20 , Issue.7 , pp. 751-756
- Tan, T.¹

24
- 0035546419
- Text analysis using local energy
- [24] Chan, W., Coghill, G., Text analysis using local energy. Pattern Recogn. 34:12 (2001), 2523–2532.
- (2001) Pattern Recogn. , vol.34 , Issue.12 , pp. 2523-2532
- Chan, W.¹ Coghill, G.²

25
- 33751299170
- Script identification using steerable gabor filters
- IEEE
- [25] Pan, W., Suen, C.Y., Bui, T.D., Script identification using steerable gabor filters. Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on, 2005, IEEE, 883–887.
- (2005) Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on , pp. 883-887
- Pan, W.¹ Suen, C.Y.² Bui, T.D.³

26
- 84889574417
- Lbp based line-wise script identification
- IEEE
- [26] Ferrer, M.A., Morales, A., Pal, U., Lbp based line-wise script identification. Document Analysis and Recognition (ICDAR), 2013 12th International Conference on, 2013, IEEE, 369–373.
- (2013) Document Analysis and Recognition (ICDAR), 2013 12th International Conference on , pp. 369-373
- Ferrer, M.A.¹ Morales, A.² Pal, U.³

27
- 0030151620
- Page segmentation using texture analysis
- [27] Jain, A.K., Zhong, Y., Page segmentation using texture analysis. Pattern Recogn. 29:5 (1996), 743–770.
- (1996) Pattern Recogn. , vol.29 , Issue.5 , pp. 743-770
- Jain, A.K.¹ Zhong, Y.²

28
- 0141863195
- Hierarchical content classification and script determination for automatic document image processing
- [28] Chi, Z., Wang, Q., Siu, W.-C., Hierarchical content classification and script determination for automatic document image processing. Pattern Recogn. 36:11 (2003), 2483–2500.
- (2003) Pattern Recogn. , vol.36 , Issue.11 , pp. 2483-2500
- Chi, Z.¹ Wang, Q.² Siu, W.-C.³

29
- 0036466961
- Exploiting zoning based on approximating splines in cursive script recognition
- [29] Hennig, A., Sherkat, N., Exploiting zoning based on approximating splines in cursive script recognition. Pattern Recogn. 35:2 (2002), 445–454.
- (2002) Pattern Recogn. , vol.35 , Issue.2 , pp. 445-454
- Hennig, A.¹ Sherkat, N.²

30
- 68249112410
- Novel script line identification method for script normalization and feature extraction in on-line handwritten whiteboard note recognition
- [30] Schenk, J., Lenz, J., Rigoll, G., Novel script line identification method for script normalization and feature extraction in on-line handwritten whiteboard note recognition. Pattern Recogn. 42:12 (2009), 3383–3393.
- (2009) Pattern Recogn. , vol.42 , Issue.12 , pp. 3383-3393
- Schenk, J.¹ Lenz, J.² Rigoll, G.³

31
- 68249091393
- Language identification for handwritten document images using a shape codebook
- [31] Zhu, G., Yu, X., Li, Y., Doermann, D., Language identification for handwritten document images using a shape codebook. Pattern Recogn. 42:12 (2009), 3184–3191.
- (2009) Pattern Recogn. , vol.42 , Issue.12 , pp. 3184-3191
- Zhu, G.¹ Yu, X.² Li, Y.³ Doermann, D.⁴

32
- 77953613023
- A novel framework for automatic sorting of postal documents with multi-script address blocks
- [32] Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K., A novel framework for automatic sorting of postal documents with multi-script address blocks. Pattern Recogn. 43:10 (2010), 3507–3521.
- (2010) Pattern Recogn. , vol.43 , Issue.10 , pp. 3507-3521
- Basu, S.¹ Das, N.² Sarkar, R.³ Kundu, M.⁴ Nasipuri, M.⁵ Basu, D.K.⁶

33
- 84920654331
- Tensor representation learning based image patch analysis for text identification and recognition
- [33] Zhong, G., Cheriet, M., Tensor representation learning based image patch analysis for text identification and recognition. Pattern Recogn. 48:4 (2015), 1211–1224.
- (2015) Pattern Recogn. , vol.48 , Issue.4 , pp. 1211-1224
- Zhong, G.¹ Cheriet, M.²

34
- 84889609370
- Word-wise script identification from video frames
- IEEE
- [34] Sharma, N., Chanda, S., Pal, U., Blumenstein, M., Word-wise script identification from video frames. Document Analysis and Recognition (ICDAR), 2013 12th International Conference on, 2013, IEEE, 867–871.
- (2013) Document Analysis and Recognition (ICDAR), 2013 12th International Conference on , pp. 867-871
- Sharma, N.¹ Chanda, S.² Pal, U.³ Blumenstein, M.⁴

35
- 82355175685
- Video script identification based on text lines
- IEEE
- [35] Phan, T.Q., Shivakumara, P., Ding, Z., Lu, S., Tan, C.L., Video script identification based on text lines. Document Analysis and Recognition (ICDAR), 2011 International Conference on, 2011, IEEE, 1240–1244.
- (2011) Document Analysis and Recognition (ICDAR), 2011 International Conference on , pp. 1240-1244
- Phan, T.Q.¹ Shivakumara, P.² Ding, Z.³ Lu, S.⁴ Tan, C.L.⁵

36
- 84912030519
- Gradient-angular-features for word-wise video script identification
- IEEE
- [36] Shivakumara, P., Sharma, N., Pal, U., Blumenstein, M., Tan, C.L., Gradient-angular-features for word-wise video script identification. 2014 22nd International Conference on Pattern Recognition (ICPR), 2014, IEEE, 3098–3103.
- (2014) 2014 22nd International Conference on Pattern Recognition (ICPR) , pp. 3098-3103
- Shivakumara, P.¹ Sharma, N.² Pal, U.³ Blumenstein, M.⁴ Tan, C.L.⁵

37
- 84951169810
- Bag-of-visual words for word-wise video script identification: A study
- IEEE
- [37] Sharma, N., Mandal, R., Sharma, R., Pal, U., Blumenstein, M., Bag-of-visual words for word-wise video script identification: A study. Neural Networks (IJCNN), 2015 International Joint Conference on, 2015, IEEE, 1–7.
- (2015) Neural Networks (IJCNN), 2015 International Joint Conference on , pp. 1-7
- Sharma, N.¹ Mandal, R.² Sharma, R.³ Pal, U.⁴ Blumenstein, M.⁵

38
- 84962579361
- Icdar2015 competition on video script identification (cvsi 2015)
- IEEE
- [38] Sharma, N., Mandal, R., Sharma, R., Pal, U., Blumenstein, M., Icdar2015 competition on video script identification (cvsi 2015). Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, 2015, IEEE, 1196–1200.
- (2015) Document Analysis and Recognition (ICDAR), 2015 13th International Conference on , pp. 1196-1200
- Sharma, N.¹ Mandal, R.² Sharma, R.³ Pal, U.⁴ Blumenstein, M.⁵

39
- 81855221241
- Sequential deep learning for human action recognition
- Springer
- [39] Baccouche, M., Mamalet, F., Wolf, C., Garcia, C., Baskurt, A., Sequential deep learning for human action recognition. International Workshop on Human Behavior Understanding, 2011, Springer, 29–39.
- (2011) International Workshop on Human Behavior Understanding , pp. 29-39
- Baccouche, M.¹ Mamalet, F.² Wolf, C.³ Garcia, C.⁴ Baskurt, A.⁵

40
- 84870183903
- 3d convolutional neural networks for human action recognition
- [40] Ji, S., Xu, W., Yang, M., Yu, K., 3d convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35:1 (2013), 221–231.
- (2013) IEEE Trans. Pattern Anal. Mach. Intell. , vol.35 , Issue.1 , pp. 221-231
- Ji, S.¹ Xu, W.² Yang, M.³ Yu, K.⁴

41
- 84911364368
- Large-scale video classification with convolutional neural networks
- [41] Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., Fei-Fei, L., Large-scale video classification with convolutional neural networks. Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 2014, 1725–1732.
- (2014) Proceedings of the IEEE conference on Computer Vision and Pattern Recognition , pp. 1725-1732
- Karpathy, A.¹ Toderici, G.² Shetty, S.³ Leung, T.⁴ Sukthankar, R.⁵ Fei-Fei, L.⁶

42
- 84876231242
- Imagenet classification with deep convolutional neural networks
- [42] Krizhevsky, A., Sutskever, I., Hinton, G.E., Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems, 2012, 1097–1105.
- (2012) Advances in Neural Information Processing Systems , pp. 1097-1105
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

43
- 0005594495
- Signature verification using a ǣsiameseǥ time delay neural network
- [43] Bromley, J., Bentz, J.W., Bottou, L., Guyon, I., LeCun, Y., Moore, C., Säckinger, E., Shah, R., Signature verification using a ǣsiameseǥ time delay neural network. Int. J. Pattern Recogn. Artif. Intell. 7:04 (1993), 669–688.
- (1993) Int. J. Pattern Recogn. Artif. Intell. , vol.7 , Issue.4 , pp. 669-688
- Bromley, J.¹ Bentz, J.W.² Bottou, L.³ Guyon, I.⁴ LeCun, Y.⁵ Moore, C.⁶ Säckinger, E.⁷ Shah, R.⁸

44
- 71249118724
- Text localization in natural scene images based on conditional random field
- IEEE
- [44] Pan, Y.-F., Hou, X., Liu, C.-L., Text localization in natural scene images based on conditional random field. Document Analysis and Recognition, 2009. ICDAR’09. 10th International Conference on, 2009, IEEE, 6–10.
- (2009) Document Analysis and Recognition, 2009. ICDAR’09. 10th International Conference on , pp. 6-10
- Pan, Y.-F.¹ Hou, X.² Liu, C.-L.³

45
- 84866640582
- Detecting texts of arbitrary orientations in natural images
- IEEE
- [45] Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z., Detecting texts of arbitrary orientations in natural images. Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, 2012, IEEE, 1083–1090.
- (2012) Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on , pp. 1083-1090
- Yao, C.¹ Bai, X.² Liu, W.³ Ma, Y.⁴ Tu, Z.⁵

46
- 70349280061
- Character recognition in natural images.
- [46] de Campos, T.E., Babu, B.R., Varma, M., Character recognition in natural images. VISAPP (2), 2009, 273–280.
- (2009) VISAPP (2) , pp. 273-280
- de Campos, T.E.¹ Babu, B.R.² Varma, M.³

47
- 84999558635
- Multi-script robust reading competition in icdar 2013
- ACM
- [47] Kumar, D., Prasad, M., Ramakrishnan, A., Multi-script robust reading competition in icdar 2013. Proceedings of the 4th International Workshop on Multilingual OCR, 2013, ACM, 14.
- (2013) Proceedings of the 4th International Workshop on Multilingual OCR , pp. 14
- Kumar, D.¹ Prasad, M.² Ramakrishnan, A.³

48
- 78149484901
- Scene text extraction with edge constraint and text collinearity
- IEEE
- [48] Lee, S., Cho, M.S., Jung, K., Kim, J.H., Scene text extraction with edge constraint and text collinearity. Pattern Recognition (ICPR), 2010 20th International Conference on, 2010, IEEE, 3983–3986.
- (2010) Pattern Recognition (ICPR), 2010 20th International Conference on , pp. 3983-3986
- Lee, S.¹ Cho, M.S.² Jung, K.³ Kim, J.H.⁴

49
- 84903692134
- An on-line platform for ground truthing and performance evaluation of text extraction systems
- IEEE
- [49] Karatzas, D., Robles, S., Gomez, L., An on-line platform for ground truthing and performance evaluation of text extraction systems. Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on, 2014, IEEE, 242–246.
- (2014) Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on , pp. 242-246
- Karatzas, D.¹ Robles, S.² Gomez, L.³

50
- 84913580146
- Caffe: Convolutional architecture for fast feature embedding
- ACM
- [50] Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T., Caffe: Convolutional architecture for fast feature embedding. Proceedings of the ACM International Conference on Multimedia, 2014, ACM, 675–678.
- (2014) Proceedings of the ACM International Conference on Multimedia , pp. 675-678
- Jia, Y.¹ Shelhamer, E.² Donahue, J.³ Karayev, S.⁴ Long, J.⁵ Girshick, R.⁶ Guadarrama, S.⁷ Darrell, T.⁸

51
- 77953183471
- What is the best multi-stage architecture for object recognition?
- IEEE
- [51] Jarrett, K., Kavukcuoglu, K., Ranzato, M., LeCun, Y., What is the best multi-stage architecture for object recognition?. Computer Vision, 2009 IEEE 12th International Conference on, 2009, IEEE, 2146–2153.
- (2009) Computer Vision, 2009 IEEE 12th International Conference on , pp. 2146-2153
- Jarrett, K.¹ Kavukcuoglu, K.² Ranzato, M.³ LeCun, Y.⁴

52
- 84904163933
- Dropout: a simple way to prevent neural networks from overfitting
- [52] Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R., Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15:1 (2014), 1929–1958.
- (2014) J. Mach. Learn. Res. , vol.15 , Issue.1 , pp. 1929-1958
- Srivastava, N.¹ Hinton, G.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

53
- 0033284915
- Object recognition from local scale-invariant features
- IEEE
- [53] Lowe, D.G., Object recognition from local scale-invariant features. Computer vision, 1999. The Proceedings of the Seventh IEEE International Conference on, vol. 2, 1999, IEEE, 1150–1157.
- (1999) Computer vision, 1999. The Proceedings of the Seventh IEEE International Conference on , vol.2 , pp. 1150-1157
- Lowe, D.G.¹

54
- 70349362313
- VLFeat: an open and portable library of computer vision algorithms
- ()
- [54] Vedaldi, A., Fulkerson, B., VLFeat: an open and portable library of computer vision algorithms. 2008. ( http://www.vlfeat.org/).
- (2008)
- Vedaldi, A.¹ Fulkerson, B.²

55
- 50949133669
- LIBLINEAR: a library for large linear classification
- [55] Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., Lin, C.-J., LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9 (2008), 1871–1874.
- (2008) J. Mach. Learn. Res. , vol.9 , pp. 1871-1874
- Fan, R.-E.¹ Chang, K.-W.² Hsieh, C.-J.³ Wang, X.-R.⁴ Lin, C.-J.⁵

56
- 84962593507
- Sparse radial sampling lbp for writer identification
- IEEE
- [56] Nicolaou, A., Bagdanov, A.D., Liwicki, M., Karatzas, D., Sparse radial sampling lbp for writer identification. Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, 2015, IEEE, 716–720.
- (2015) Document Analysis and Recognition (ICDAR), 2015 13th International Conference on , pp. 716-720
- Nicolaou, A.¹ Bagdanov, A.D.² Liwicki, M.³ Karatzas, D.⁴

57
- 0000596361
- Note on the sampling error of the difference between correlated proportions or percentages
- [57] McNemar, Q., Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12:2 (1947), 153–157.
- (1947) Psychometrika , vol.12 , Issue.2 , pp. 153-157
- McNemar, Q.¹

58
- 85016010144
- A fast hierarchical method for multi-script and arbitrary oriented scene text extraction
- [58] Gomez, L., Karatzas, D., A fast hierarchical method for multi-script and arbitrary oriented scene text extraction. arXiv preprint arXiv:1407.7504, 2014.
- (2014) arXiv preprint arXiv:1407.7504
- Gomez, L.¹ Karatzas, D.²

59
- 3142736062
- Robust wide-baseline stereo from maximally stable extremal regions
- [59] Matas, J., Chum, O., Urban, M., Pajdla, T., Robust wide-baseline stereo from maximally stable extremal regions. Image Vision Comput. 22:10 (2004), 761–767.
- (2004) Image Vision Comput. , vol.22 , Issue.10 , pp. 761-767
- Matas, J.¹ Chum, O.² Urban, M.³ Pajdla, T.⁴

60
- 84921069139
- The pascal visual object classes challenge: a retrospective
- [60] Everingham, M., Eslami, S.A., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A., The pascal visual object classes challenge: a retrospective. Int. J. Comput. Vision 111:1 (2015), 98–136.
- (2015) Int. J. Comput. Vision , vol.111 , Issue.1 , pp. 98-136
- Everingham, M.¹ Eslami, S.A.² Van Gool, L.³ Williams, C.K.⁴ Winn, J.⁵ Zisserman, A.⁶

61
- 84962622810
- Icdar 2015 competition on robust reading
- IEEE
- [61] Karatzas, D., Gomez-Bigorda, L., Nicolaou, A., Ghosh, S., Bagdanov, A., Iwamura, M., Matas, J., Neumann, L., Chandrasekhar, V.R., Lu, S., et al. Icdar 2015 competition on robust reading. Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, 2015, IEEE, 1156–1160.
- (2015) Document Analysis and Recognition (ICDAR), 2015 13th International Conference on , pp. 1156-1160
- Karatzas, D.¹ Gomez-Bigorda, L.² Nicolaou, A.³ Ghosh, S.⁴ Bagdanov, A.⁵ Iwamura, M.⁶ Matas, J.⁷ Neumann, L.⁸ Chandrasekhar, V.R.⁹ Lu, S.¹⁰

62
- 51149098551
- An overview of the tesseract ocr engine
- IEEE
- [62] Smith, R., An overview of the tesseract ocr engine. ICDAR, 2007, IEEE, 629–633.
- (2007) ICDAR , pp. 629-633
- Smith, R.¹

63
- 84889606097
- Image binarization for end-to-end text understanding in natural images
- IEEE
- [63] Milyaev, S., Barinova, O., Novikova, T., Kohli, P., Lempitsky, V., Image binarization for end-to-end text understanding in natural images. Document Analysis and Recognition (ICDAR), 2013 12th International Conference on, 2013, IEEE, 128–132.
- (2013) Document Analysis and Recognition (ICDAR), 2013 12th International Conference on , pp. 128-132
- Milyaev, S.¹ Barinova, O.² Novikova, T.³ Kohli, P.⁴ Lempitsky, V.⁵

64
- 84942517592
- Scene text recognition: No country for old men?
- Springer
- [64] Gómez, L., Karatzas, D., Scene text recognition: No country for old men?. Computer Vision-ACCV 2014 Workshops, 2014, Springer, 157–168.
- (2014) Computer Vision-ACCV 2014 Workshops , pp. 157-168
- Gómez, L.¹ Karatzas, D.²

65
- 84939960007
- Fast and accurate scene text understanding with image binarization and off-the-shelf ocr
- [65] Milyaev, S., Barinova, O., Novikova, T., Kohli, P., Lempitsky, V., Fast and accurate scene text understanding with image binarization and off-the-shelf ocr. Int. J. Doc. Anal. Recogn. (IJDAR) 18:2 (2015), 169–182.
- (2015) Int. J. Doc. Anal. Recogn. (IJDAR) , vol.18 , Issue.2 , pp. 169-182
- Milyaev, S.¹ Barinova, O.² Novikova, T.³ Kohli, P.⁴ Lempitsky, V.⁵

66
- 84863057818
- End-to-end scene text recognition
- IEEE
- [66] Wang, K., Babenko, B., Belongie, S., End-to-end scene text recognition. Computer Vision (ICCV), 2011 IEEE International Conference on, 2011, IEEE, 1457–1464.
- (2011) Computer Vision (ICCV), 2011 IEEE International Conference on , pp. 1457-1464
- Wang, K.¹ Babenko, B.² Belongie, S.³

67
- 84889582459
- Icdar 2013 robust reading competition
- IEEE
- [67] Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., Gomez i Bigorda, L., Robles Mestre, S., Mas, J., Fernandez Mota, D., Almazan Almazan, J., de las Heras, L.-P., Icdar 2013 robust reading competition. Document Analysis and Recognition (ICDAR), 2013 12th International Conference on, 2013, IEEE, 1484–1493.
- (2013) Document Analysis and Recognition (ICDAR), 2013 12th International Conference on , pp. 1484-1493
- Karatzas, D.¹ Shafait, F.² Uchida, S.³ Iwamura, M.⁴ Gomez i Bigorda, L.⁵ Robles Mestre, S.⁶ Mas, J.⁷ Fernandez Mota, D.⁸ Almazan Almazan, J.⁹ de las Heras, L.-P.¹⁰

68
- 84962556250
- Alif: A dataset for arabic embedded text recognition in tv broadcast
- IEEE
- [68] Yousfi, S., Berrani, S.-A., Garcia, C., Alif: A dataset for arabic embedded text recognition in tv broadcast. Document Analysis and Recognition (ICDAR), 2015 13th International Conference on, 2015, IEEE, 1221–1225.
- (2015) Document Analysis and Recognition (ICDAR), 2015 13th International Conference on , pp. 1221-1225
- Yousfi, S.¹ Berrani, S.-A.² Garcia, C.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.