Volume 67, 2017, Pages 85-96

Improving patch-based scene text script identification with ensembles of conjoined networks

Author keywords

Convolutional neural networks; Ensemble of conjoined networks; Multi language OCR; Scene text understanding; Script identification

Indexed keywords

Neural networks

EID: 85015987251     PISSN: 00313203     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patcog.2017.01.032     Document Type: Article
Times cited: 78

References (68)
  • 3
    • 3042681884
    • Indian script character recognition: a survey
    • [3] Pal, U., Chaudhuri, B., Indian script character recognition: a survey. Pattern Recogn. 37:9 (2004), 1887–1899.
    • (2004) Pattern Recogn. , vol.37 , Issue.9 , pp. 1887-1899
    • Pal, U.1    Chaudhuri, B.2
  • 6
    • 84981285560
    • Real-time lexicon-free scene text localization and recognition
    • [6] Neumann, L., Matas, J., Real-time lexicon-free scene text localization and recognition. IEEE Trans. Pattern Anal. Mach. Intell. 38:9 (2016), 1872–1885.
    • (2016) IEEE Trans. Pattern Anal. Mach. Intell. , vol.38 , Issue.9 , pp. 1872-1885
    • Neumann, L.1    Matas, J.2
  • 8
    • 84949254659
    • Script identification in the wild via discriminative convolutional neural network
    • [8] Shi, B., Bai, X., Yao, C., Script identification in the wild via discriminative convolutional neural network. Pattern Recogn. 52 (2016), 448–458.
    • (2016) Pattern Recogn. , vol.52 , pp. 448-458
    • Shi, B.1    Bai, X.2    Yao, C.3
  • 10
    • 84979530220
    • A fine-grained approach to scene text script identification
    • [10] Gomez-Bigorda, L., Karatzas, D., A fine-grained approach to scene text script identification. DAS, 2016.
    • (2016) DAS
    • Gomez-Bigorda, L.1    Karatzas, D.2
  • 11
    • 84955683269
    • Multilingual scene character recognition with co-occurrence of histogram of oriented gradients
    • [11] Tian, S., Bhattacharya, U., Lu, S., Su, B., Wang, Q., Wei, X., Lu, Y., Tan, C.L., Multilingual scene character recognition with co-occurrence of histogram of oriented gradients. Pattern Recogn. 51 (2016), 125–134.
    • (2016) Pattern Recogn. , vol.51 , pp. 125-134
    • Tian, S.1    Bhattacharya, U.2    Lu, S.3    Su, B.4    Wang, Q.5    Wei, X.6    Lu, Y.7    Tan, C.L.8
  • 16
    • 28044462712
    • Palace: a multilingual document recognition system
    • World Scientific Singapore
    • [16] Spitz, A.L., Ozaki, M., Palace: a multilingual document recognition system. Document Analysis Systems, vol. 1, 1995, World Scientific, Singapore, 16–37.
    • (1995) Document Analysis Systems , vol.1 , pp. 16-37
    • Spitz, A.L.1    Ozaki, M.2
  • 17
    • 0031098394
    • Determination of the script and language content of document images
    • [17] Spitz, A.L., Determination of the script and language content of document images. Pattern Anal. Mach. Intell., IEEE Trans. 19:3 (1997), 235–245.
    • (1997) Pattern Anal. Mach. Intell., IEEE Trans. , vol.19 , Issue.3 , pp. 235-245
    • Spitz, A.L.1
  • 18
    • 0002231472
    • Language identification in complex, unoriented, and degraded document images
    • [18] Lee, D., Nohl, C.R., Baird, H.S., Language identification in complex, unoriented, and degraded document images. Ser. Mach. Percept. Artif. Intell. 29 (1998), 17–39.
    • (1998) Ser. Mach. Percept. Artif. Intell. , vol.29 , pp. 17-39
    • Lee, D.1    Nohl, C.R.2    Baird, H.S.3
  • 23
    • 0032122663
    • Rotation invariant texture features and their use in automatic script identification
    • [23] Tan, T., Rotation invariant texture features and their use in automatic script identification. Pattern Anal. Mach. Intell., IEEE Trans. 20:7 (1998), 751–756.
    • (1998) Pattern Anal. Mach. Intell., IEEE Trans. , vol.20 , Issue.7 , pp. 751-756
    • Tan, T.1
  • 24
    • 0035546419
    • Text analysis using local energy
    • [24] Chan, W., Coghill, G., Text analysis using local energy. Pattern Recogn. 34:12 (2001), 2523–2532.
    • (2001) Pattern Recogn. , vol.34 , Issue.12 , pp. 2523-2532
    • Chan, W.1    Coghill, G.2
  • 27
    • 0030151620
    • Page segmentation using texture analysis
    • [27] Jain, A.K., Zhong, Y., Page segmentation using texture analysis. Pattern Recogn. 29:5 (1996), 743–770.
    • (1996) Pattern Recogn. , vol.29 , Issue.5 , pp. 743-770
    • Jain, A.K.1    Zhong, Y.2
  • 28
    • 0141863195
    • Hierarchical content classification and script determination for automatic document image processing
    • [28] Chi, Z., Wang, Q., Siu, W.-C., Hierarchical content classification and script determination for automatic document image processing. Pattern Recogn. 36:11 (2003), 2483–2500.
    • (2003) Pattern Recogn. , vol.36 , Issue.11 , pp. 2483-2500
    • Chi, Z.1    Wang, Q.2    Siu, W.-C.3
  • 29
    • 0036466961
    • Exploiting zoning based on approximating splines in cursive script recognition
    • [29] Hennig, A., Sherkat, N., Exploiting zoning based on approximating splines in cursive script recognition. Pattern Recogn. 35:2 (2002), 445–454.
    • (2002) Pattern Recogn. , vol.35 , Issue.2 , pp. 445-454
    • Hennig, A.1    Sherkat, N.2
  • 30
    • 68249112410
    • Novel script line identification method for script normalization and feature extraction in on-line handwritten whiteboard note recognition
    • [30] Schenk, J., Lenz, J., Rigoll, G., Novel script line identification method for script normalization and feature extraction in on-line handwritten whiteboard note recognition. Pattern Recogn. 42:12 (2009), 3383–3393.
    • (2009) Pattern Recogn. , vol.42 , Issue.12 , pp. 3383-3393
    • Schenk, J.1    Lenz, J.2    Rigoll, G.3
  • 31
    • 68249091393
    • Language identification for handwritten document images using a shape codebook
    • [31] Zhu, G., Yu, X., Li, Y., Doermann, D., Language identification for handwritten document images using a shape codebook. Pattern Recogn. 42:12 (2009), 3184–3191.
    • (2009) Pattern Recogn. , vol.42 , Issue.12 , pp. 3184-3191
    • Zhu, G.1    Yu, X.2    Li, Y.3    Doermann, D.4
  • 32
    • 77953613023
    • A novel framework for automatic sorting of postal documents with multi-script address blocks
    • [32] Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K., A novel framework for automatic sorting of postal documents with multi-script address blocks. Pattern Recogn. 43:10 (2010), 3507–3521.
    • (2010) Pattern Recogn. , vol.43 , Issue.10 , pp. 3507-3521
    • Basu, S.1    Das, N.2    Sarkar, R.3    Kundu, M.4    Nasipuri, M.5    Basu, D.K.6
  • 33
    • 84920654331
    • Tensor representation learning based image patch analysis for text identification and recognition
    • [33] Zhong, G., Cheriet, M., Tensor representation learning based image patch analysis for text identification and recognition. Pattern Recogn. 48:4 (2015), 1211–1224.
    • (2015) Pattern Recogn. , vol.48 , Issue.4 , pp. 1211-1224
    • Zhong, G.1    Cheriet, M.2
  • 40
    • 84870183903
    • 3D convolutional neural networks for human action recognition
    • [40] Ji, S., Xu, W., Yang, M., Yu, K., 3D convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35:1 (2013), 221–231.
    • (2013) IEEE Trans. Pattern Anal. Mach. Intell. , vol.35 , Issue.1 , pp. 221-231
    • Ji, S.1    Xu, W.2    Yang, M.3    Yu, K.4
  • 46
    • 70349280061
    • Character recognition in natural images
    • [46] de Campos, T.E., Babu, B.R., Varma, M., Character recognition in natural images. VISAPP (2), 2009, 273–280.
    • (2009) VISAPP (2) , pp. 273-280
    • de Campos, T.E.1    Babu, B.R.2    Varma, M.3
  • 54
    • 70349362313
    • VLFeat: an open and portable library of computer vision algorithms
    • [54] Vedaldi, A., Fulkerson, B., VLFeat: an open and portable library of computer vision algorithms. 2008. ( http://www.vlfeat.org/).
    • (2008)
    • Vedaldi, A.1    Fulkerson, B.2
  • 57
    • 0000596361
    • Note on the sampling error of the difference between correlated proportions or percentages
    • [57] McNemar, Q., Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12:2 (1947), 153–157.
    • (1947) Psychometrika , vol.12 , Issue.2 , pp. 153-157
    • McNemar, Q.1
  • 58
    • 85016010144
    • A fast hierarchical method for multi-script and arbitrary oriented scene text extraction
    • [58] Gomez, L., Karatzas, D., A fast hierarchical method for multi-script and arbitrary oriented scene text extraction. arXiv preprint arXiv:1407.7504, 2014.
    • (2014) arXiv preprint arXiv:1407.7504
    • Gomez, L.1    Karatzas, D.2
  • 59
    • 3142736062
    • Robust wide-baseline stereo from maximally stable extremal regions
    • [59] Matas, J., Chum, O., Urban, M., Pajdla, T., Robust wide-baseline stereo from maximally stable extremal regions. Image Vision Comput. 22:10 (2004), 761–767.
    • (2004) Image Vision Comput. , vol.22 , Issue.10 , pp. 761-767
    • Matas, J.1    Chum, O.2    Urban, M.3    Pajdla, T.4
  • 62
    • 51149098551
    • An overview of the Tesseract OCR engine
    • IEEE
    • [62] Smith, R., An overview of the Tesseract OCR engine. ICDAR, 2007, IEEE, 629–633.
    • (2007) ICDAR , pp. 629-633
    • Smith, R.1
  • 64
    • 84942517592
    • Scene text recognition: No country for old men?
    • Springer
    • [64] Gómez, L., Karatzas, D., Scene text recognition: No country for old men? Computer Vision-ACCV 2014 Workshops, 2014, Springer, 157–168.
    • (2014) Computer Vision-ACCV 2014 Workshops , pp. 157-168
    • Gómez, L.1    Karatzas, D.2
  • 65
    • 84939960007
    • Fast and accurate scene text understanding with image binarization and off-the-shelf OCR
    • [65] Milyaev, S., Barinova, O., Novikova, T., Kohli, P., Lempitsky, V., Fast and accurate scene text understanding with image binarization and off-the-shelf OCR. Int. J. Doc. Anal. Recogn. (IJDAR) 18:2 (2015), 169–182.
    • (2015) Int. J. Doc. Anal. Recogn. (IJDAR) , vol.18 , Issue.2 , pp. 169-182
    • Milyaev, S.1    Barinova, O.2    Novikova, T.3    Kohli, P.4    Lempitsky, V.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.