SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 8692 LNCS, Issue PART 4, 2014, Pages 512-528

Deep features for text spotting

(3) Jaderberg, Max a Vedaldi, Andrea a Zisserman, Andrew a

a UNIVERSITY OF OXFORD (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

NEURAL NETWORKS; COMPUTER VISION; DATA MINING; NETWORK ARCHITECTURE;

AUTOMATED DATA MINING; CONVOLUTIONAL NEURAL NETWORK; NOVEL ARCHITECTURE; NUMBER OF LAYERS; SEQUENTIAL TASK; STATE-OF-THE-ART PERFORMANCE; TECHNICAL CHANGE; WORD AND CHARACTERS;

TEXT PROCESSING;

EID: 84906517083 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-319-10593-2_34 Document Type: Conference Paper

Times cited : (517)

References (51)

1
- 84906500060
- http://algoval.essex.ac.uk/icdar/datasets.html

2
- 84906490584
- https://code.google.com/p/tesseract-ocr/

3
- 84906490586
- http://www.flickr.com/

4
- 84906500058
- http://www.flickr.com/groups/type/

5
- 84906509767
- http://www.iapr-tc11.org/mediawiki/index.php/kaist-scene-text-database

6
- 85083953799
- End-to-End Text Recognition with Hybrid HMM Maxout Models
- Alsharif, O., Pineau, J.: End-to-End Text Recognition with Hybrid HMM Maxout Models. In: ICLR (2014)
- (2014) ICLR
- Alsharif, O.¹ Pineau, J.²

7
- 84880615968
- Detection of artificial and scene text in images and video frames
- Anthimopoulos, M., Gatos, B., Pratikakis, I.: Detection of artificial and scene text in images and video frames. Pattern Analysis and Applications, 1-16 (2011)
- (2011) Pattern Analysis and Applications , pp. 1-16
- Anthimopoulos, M.¹ Gatos, B.² Pratikakis, I.³

8
- 84898778744
- PhotoOCR: Reading text in uncontrolled conditions
- Bissacco, A., Cummins, M., Netzer, Y., Neven, H.: PhotoOCR: Reading text in uncontrolled conditions. In: ICCV (2013)
- (2013) ICCV
- Bissacco, A.¹ Cummins, M.² Netzer, Y.³ Neven, H.⁴

9
- 0034844730
- Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images
- Boykov, Y., Jolly, M.P.: Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images. In: Proc. ICCV, vol. 2, pp. 105-112 (2001)
- (2001) Proc. ICCV , vol.2 , pp. 105-112
- Boykov, Y.¹ Jolly, M.P.²

10
- 84865813192
- de Campos, T., Babu, B.R., Varma, M.: Character recognition in natural images, pp. 591-604 (2009)
- (2009) Character Recognition in Natural Images , pp. 591-604
- De Campos, T.¹ Babu, B.R.² Varma, M.³

11
- 84863052045
- Robust text detection in natural images with edge-enhanced maximally stable extremal regions
- Chen, H., Tsai, S., Schroth, G., Chen, D., Grzeszczuk, R., Girod, B.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: Proc. International Conference on Image Processing (ICIP), pp. 2609-2612 (2011)
- (2011) Proc. International Conference on Image Processing (ICIP) , pp. 2609-2612
- Chen, H.¹ Tsai, S.² Schroth, G.³ Chen, D.⁴ Grzeszczuk, R.⁵ Girod, B.⁶

12
- 5044227851
- Detecting and reading text in natural scenes
- IEEE
- Chen, X., Yuille, A.L.: Detecting and reading text in natural scenes. In: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004, vol. 2, p. II-366. IEEE (2004)
- (2004) Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004 , vol.2
- Chen, X.¹ Yuille, A.L.²

13
- 82355160847
- Text detection and character recognition in scene images with unsupervised feature learning
- IEEE
- Coates, A., Carpenter, B., Case, C., Satheesh, S., Suresh, B., Wang, T., Wu, D.J., Ng, A.Y.: Text detection and character recognition in scene images with unsupervised feature learning. In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 440-445. IEEE (2011)
- (2011) 2011 International Conference on Document Analysis and Recognition (ICDAR) , pp. 440-445
- Coates, A.¹ Carpenter, B.² Case, C.³ Satheesh, S.⁴ Suresh, B.⁵ Wang, T.⁶ Wu, D.J.⁷ Ng, A.Y.⁸

14
- 84904482223
- arXiv preprint arXiv:1310.1531
- Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: Decaf: A deep convolutional activation feature for generic visual recognition. arXiv preprint arXiv:1310.1531 (2013)
- (2013) Decaf: A Deep Convolutional Activation Feature for Generic Visual Recognition
- Donahue, J.¹ Jia, Y.² Vinyals, O.³ Hoffman, J.⁴ Zhang, N.⁵ Tzeng, E.⁶ Darrell, T.⁷

15
- 84862061986
- Robust recognition of degraded documents using character n-grams
- IEEE
- Dutta, S., Sankaran, N., Sankar, K., Jawahar, C.: Robust recognition of degraded documents using character n-grams. In: International Workshop on Document Analysis Systems (DAS), pp. 130-134. IEEE (2012)
- (2012) International Workshop on Document Analysis Systems (DAS) , pp. 130-134
- Dutta, S.¹ Sankaran, N.² Sankar, K.³ Jawahar, C.⁴

16
- 77955991043
- Detecting text in natural scenes with stroke width transform
- IEEE
- Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proc. CVPR, pp. 2963-2970. IEEE (2010)
- (2010) Proc. CVPR , pp. 2963-2970
- Epshtein, B.¹ Ofek, E.² Wexler, Y.³

17
- 77949524387
- Tech. rep. University of Montreal
- Erhan, D., Bengio, Y., Courville, A., Vincent, P.: Visualizing higher-layer features of a deep network. Tech. rep. University of Montreal (2009)
- (2009) Visualizing Higher-layer Features of a Deep Network
- Erhan, D.¹ Bengio, Y.² Courville, A.³ Vincent, P.⁴

18
- 4644354464
- Pictorial structures for object recognition
- Felzenszwalb, P., Huttenlocher, D.: Pictorial structures for object recognition. IJCV 61(1) (2005)
- (2005) IJCV , vol.61 , Issue.1
- Felzenszwalb, P.¹ Huttenlocher, D.²

19
- 84889587871
- Whole is greater than sum of parts: Recognizing scene text words
- IEEE
- Goel, V., Mishra, A., Alahari, K., Jawahar, C.: Whole is greater than sum of parts: Recognizing scene text words. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 398-402. IEEE (2013)
- (2013) 2013 12th International Conference on Document Analysis and Recognition (ICDAR) , pp. 398-402
- Goel, V.¹ Mishra, A.² Alahari, K.³ Jawahar, C.⁴

20
- 85083953281
- Multi-digit number recognition from street view imagery using deep convolutional neural networks
- Goodfellow, I.J., Bulatov, Y., Ibarz, J., Arnoud, S., Shet, V.: Multi-digit number recognition from street view imagery using deep convolutional neural networks. In: ICLR (2014)
- (2014) ICLR
- Goodfellow, I.J.¹ Bulatov, Y.² Ibarz, J.³ Arnoud, S.⁴ Shet, V.⁵

21
- 84892421248
- arXiv preprint arXiv:1302.4389
- Goodfellow, I.J., Warde-Farley, D., Mirza, M., Courville, A., Bengio, Y.: Maxout networks. arXiv preprint arXiv:1302.4389 (2013)
- (2013) Maxout Networks
- Goodfellow, I.J.¹ Warde-Farley, D.² Mirza, M.³ Courville, A.⁴ Bengio, Y.⁵

22
- 84867720412
- arXiv preprint arXiv:1207.0580
- Hinton, G.E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.R.: Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580 (2012)
- (2012) Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
- Hinton, G.E.¹ Srivastava, N.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.R.⁵

23
- 85062833929
- arXiv preprint arXiv:1405.3866
- Jaderberg, M., Vedaldi, A., Zisserman, A.: Speeding up convolutional neural networks with low rank expansions. arXiv preprint arXiv:1405.3866 (2014)
- (2014) Speeding Up Convolutional Neural Networks with Low Rank Expansions
- Jaderberg, M.¹ Vedaldi, A.² Zisserman, A.³

24
- 84889582459
- Icdar 2013 robust reading competition
- IEEE
- Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., Mestre, S.R., Mas, J., Mota, D.F., Almazan, J.A., de las Heras, L.P., et al.: Icdar 2013 robust reading competition. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 1484-1493. IEEE (2013)
- (2013) 2013 12th International Conference on Document Analysis and Recognition (ICDAR) , pp. 1484-1493
- Karatzas, D.¹ Shafait, F.² Uchida, S.³ Iwamura, M.⁴ Mestre, S.R.⁵ Mas, J.⁶ Mota, D.F.⁷ Almazan, J.A.⁸ De Las Heras, L.P.⁹

25
- 84878919540
- Imagenet classification with deep convolutional neural networks
- Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: NIPS, vol. 1, p. 4 (2012)
- (2012) NIPS , vol.1 , pp. 4
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

26
- 33645693855
- Key-text spotting in documentary videos using adaboost
- International Society for Optics and Photonics
- Lalonde, M., Gagnon, L.: Key-text spotting in documentary videos using adaboost. In: Electronic Imaging 2006, p. 60641N. International Society for Optics and Photonics (2006)
- (2006) Electronic Imaging 2006
- Lalonde, M.¹ Gagnon, L.²

27
- 0032203257
- Gradient-based learning applied to document recognition
- LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proceedings of the IEEE 86(11), 2278-2324 (1998)
- (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

28
- 33947430146
- Icdar 2005 text locating competition results
- IEEE
- Lucas, S.M.: Icdar 2005 text locating competition results. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition 2005, pp. 80-84. IEEE (2005)
- (2005) Proceedings of the Eighth International Conference on Document Analysis and Recognition 2005 , pp. 80-84
- Lucas, S.M.¹

29
- 0041416425
- Robust wide baseline stereo from maximally stable extremal regions
- Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. In: Proc. BMVC, pp. 384-393 (2002)
- (2002) Proc. BMVC , pp. 384-393
- Matas, J.¹ Chum, O.² Urban, M.³ Pajdla, T.⁴

30
- 84906509764
- Fast training of convolutional networks through FFTs
- abs/1312.5851
- Mathieu, M., Henaff, M., LeCun, Y.: Fast training of convolutional networks through FFTs. CoRR abs/1312.5851 (2013)
- (2013) CoRR
- Mathieu, M.¹ Henaff, M.² LeCun, Y.³

31
- 84898404913
- Scene text recognition using higher order language priors
- Mishra, A., Alahari, K., Jawahar, C., et al.: Scene text recognition using higher order language priors. In: 23rd British Machine Vision Conference on BMVC 2012 (2012)
- (2012) 23rd British Machine Vision Conference on BMVC 2012
- Mishra, A.¹ Alahari, K.² Jawahar, C.³

32
- 79952525611
- A method for text localization and recognition in realworld images
- Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III Springer, Heidelberg
- Neumann, L., Matas, J.: A method for text localization and recognition in realworld images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 770-783. Springer, Heidelberg (2011)
- (2011) LNCS , vol.6494 , pp. 770-783
- Neumann, L.¹ Matas, J.²

33
- 82455203972
- Text localization in real-world images using efficiently pruned exhaustive search
- IEEE
- Neumann, L., Matas, J.: Text localization in real-world images using efficiently pruned exhaustive search. In: Proc. ICDAR, pp. 687-691. IEEE (2011)
- (2011) Proc. ICDAR , pp. 687-691
- Neumann, L.¹ Matas, J.²

34
- 84881132045
- Real-time scene text localization and recognition
- IEEE
- Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: Proc. CVPR, vol. 3, pp. 1187-1190. IEEE (2012)
- (2012) Proc. CVPR , vol.3 , pp. 1187-1190
- Neumann, L.¹ Matas, J.²

35
- 84898792558
- Scene text localization and recognition with oriented stroke detection
- IEEE, California
- Neumann, L., Matas, J.: Scene text localization and recognition with oriented stroke detection. In: 2013 IEEE International Conference on Computer Vision (ICCV 2013), pp. 97-104. IEEE, California (2013)
- (2013) 2013 IEEE International Conference on Computer Vision (ICCV 2013) , pp. 97-104
- Neumann, L.¹ Matas, J.²

36
- 84867865679
- Large-lexicon attributeconsistent text recognition in natural images
- Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI Springer, Heidelberg
- Novikova, T., Barinova, O., Kohli, P., Lempitsky, V.: Large-lexicon attributeconsistent text recognition in natural images. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 752-765. Springer, Heidelberg (2012)
- (2012) LNCS , vol.7577 , pp. 752-765
- Novikova, T.¹ Barinova, O.² Kohli, P.³ Lempitsky, V.⁴

37
- 0018306059
- A threshold selection method from gray-level histograms
- Otsu, N.: A threshold selection method from gray-level histograms. IEEE Transactions on Systems, Man, and Cybernetics 9(1), 62-66 (1979)
- (1979) IEEE Transactions on Systems, Man, and Cybernetics , vol.9 , Issue.1 , pp. 62-66
- Otsu, N.¹

38
- 34948862825
- Fast keypoint recognition in ten lines of code
- Ozuysal, M., Fua, P., Lepetit, V.: Fast keypoint recognition in ten lines of code. In: Proc. CVPR (2007)
- Proc. CVPR (2007)
- Ozuysal, M.¹ Fua, P.² Lepetit, V.³

39
- 78651477245
- Using text-spotting to query the world
- Posner, I., Corke, P., Newman, P.: Using text-spotting to query the world. In: Proc. of the IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, IROS (2010)
- Proc. of the IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, IROS (2010)
- Posner, I.¹ Corke, P.² Newman, P.³

40
- 84906492459
- Ph.D. thesis, ETH Zurich
- Quack, T.: Large scale mining and retrieval of visual data in a multimodal context. Ph.D. thesis, ETH Zurich (2009)
- (2009) Large Scale Mining and Retrieval of Visual Data in a Multimodal Context
- Quack, T.¹

41
- 34147194609
- Word spotting for historical documents
- Rath, T., Manmatha, R.: Word spotting for historical documents. IJDAR 9(2-4), 139-152 (2007)
- (2007) IJDAR , vol.9 , Issue.2-4 , pp. 139-152
- Rath, T.¹ Manmatha, R.²

42
- 82355175563
- Icdar 2011 robust reading competition challenge 2: Reading text in scene images
- IEEE
- Shahab, A., Shafait, F., Dengel, A.: Icdar 2011 robust reading competition challenge 2: Reading text in scene images. In: Proc. ICDAR, pp. 1491-1496. IEEE (2011)
- (2011) Proc. ICDAR , pp. 1491-1496
- Shahab, A.¹ Shafait, F.² Dengel, A.³

43
- 85083953896
- Deep inside convolutional networks: Visualising image classification models and saliency maps
- Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: Visualising image classification models and saliency maps. In: Workshop at International Conference on Learning Representations (2014)
- Workshop at International Conference on Learning Representations (2014)
- Simonyan, K.¹ Vedaldi, A.² Zisserman, A.³

44
- 5044224293
- Sharing features: Efficient boosting procedures for multiclass object detection
- Torralba, A., Murphy, K.P., Freeman, W.T.: Sharing features: efficient boosting procedures for multiclass object detection. In: Proc. CVPR, pp. 762-769 (2004)
- (2004) Proc. CVPR , pp. 762-769
- Torralba, A.¹ Murphy, K.P.² Freeman, W.T.³

45
- 84863057818
- End-to-end scene text recognition
- IEEE
- Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: Proc. ICCV, pp. 1457-1464. IEEE (2011)
- (2011) Proc. ICCV , pp. 1457-1464
- Wang, K.¹ Babenko, B.² Belongie, S.³

46
- 78149313522
- Word spotting in the wild
- Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I Springer, Heidelberg
- Wang, K., Belongie, S.: Word spotting in the wild. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 591-604. Springer, Heidelberg (2010)
- (2010) LNCS , vol.6311 , pp. 591-604
- Wang, K.¹ Belongie, S.²

47
- 84874562673
- End-to-end text recognition with convolutional neural networks
- IEEE
- Wang, T., Wu, D.J., Coates, A., Ng, A.Y.: End-to-end text recognition with convolutional neural networks. In: 2012 21st International Conference on Pattern Recognition (ICPR), pp. 3304-3308. IEEE (2012)
- (2012) 2012 21st International Conference on Pattern Recognition (ICPR) , pp. 3304-3308
- Wang, T.¹ Wu, D.J.² Coates, A.³ Ng, A.Y.⁴

48
- 84891621153
- Toward integrated scene text reading
- Weinman, J.J., Butler, Z., Knoll, D., Feild, J.: Toward integrated scene text reading. IEEE Trans. Pattern Anal. Mach. Intell. 36(2), 375-387 (2014)
- (2014) IEEE Trans. Pattern Anal. Mach. Intell. , vol.36 , Issue.2 , pp. 375-387
- Weinman, J.J.¹ Butler, Z.² Knoll, D.³ Feild, J.⁴

49
- 84894625033
- A framework for improved video text detection and recognition
- Yang, H., Quehl, B., Sack, H.: A framework for improved video text detection and recognition. Int. Journal of Multimedia Tools and Applications, MTAP (2012)
- (2012) Int. Journal of Multimedia Tools and Applications, MTAP
- Yang, H.¹ Quehl, B.² Sack, H.³

50
- 82355182431
- Text string detection from natural scenes by structure-based partition and grouping
- Yi, C., Tian, Y.: Text string detection from natural scenes by structure-based partition and grouping. IEEE Transactions on Image Processing 20(9), 2594-2605 (2011)
- (2011) IEEE Transactions on Image Processing , vol.20 , Issue.9 , pp. 2594-2605
- Yi, C.¹ Tian, Y.²

51
- 84906509755
- Robust text detection in natural scene images
- abs/1301.2628
- Yin, X.C., Yin, X., Huang, K.: Robust text detection in natural scene images. CoRR abs/1301.2628 (2013)
- (2013) CoRR
- Yin, X.C.¹ Yin, X.² Huang, K.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.