



Volume 48, Issue 4, 2015, Pages 1211-1224

Tensor representation learning based image patch analysis for text identification and recognition

Author keywords

Ancient document understanding; Convergence; Tensor representation learning; Text identification; Text recognition

Indexed keywords

DECISION TREES; IMAGE ANALYSIS; TENSORS;

EID: 84920654331     PISSN: 00313203     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patcog.2014.09.025     Document Type: Article
Times cited : (17)

References (69)
  • 1
    • 0020330528
    • Block segmentation and text extraction in mixed text/image documents
    • F.M. Wahl, K.Y. Wong, R.G. Casey, Block segmentation and text extraction in mixed text/image documents, Comput. Graph. Image Process. 20 (4) (1982) 375-390.
    • (1982) Comput. Graph. Image Process. , vol.20 , Issue.4 , pp. 375-390
    • Wahl, F.M.1    Wong, K.Y.2    Casey, R.G.3
  • 2
    • 0024855667
    • Classification of newspaper image blocks using texture analysis
    • D. Wang, S.N. Srihari, Classification of newspaper image blocks using texture analysis, Comput. Vis. Graph. Image Process. 47 (3) (1989) 327-352.
    • (1989) Comput. Vis. Graph. Image Process. , vol.47 , Issue.3 , pp. 327-352
    • Wang, D.1    Srihari, S.N.2
  • 3
    • 0001248329
    • Text segmentation using Gabor filters for automatic document processing
    • A.K. Jain, S.K. Bhattacharjee, Text segmentation using Gabor filters for automatic document processing, Mach. Vis. Appl. 5 (3) (1992) 169-184.
    • (1992) Mach. Vis. Appl. , vol.5 , Issue.3 , pp. 169-184
    • Jain, A.K.1    Bhattacharjee, S.K.2
  • 5
    • 33747423476
    • A new approach to document analysis based on modified fractal signature
    • Y.Y. Tang, H. Ma, X. Mao, D. Liu, C.Y. Suen, A new approach to document analysis based on modified fractal signature, in: ICDAR, 1995, pp. 567-570.
    • (1995) ICDAR , pp. 567-570
    • Tang, Y.Y.1    Ma, H.2    Mao, X.3    Liu, D.4    Suen, C.Y.5
  • 6
    • 77954584354
    • Representation and classification of complex-shaped printed regions using white tiles
    • A. Antonacopoulos, R.T. Ritchings, Representation and classification of complex-shaped printed regions using white tiles, in: ICDAR, 1995, pp. 1132-1135.
    • (1995) ICDAR , pp. 1132-1135
    • Antonacopoulos, A.1    Ritchings, R.T.2
  • 7
    • 0030151620
    • Page segmentation using texture analysis
    • A.K. Jain, Y. Zhong, Page segmentation using texture analysis, Pattern Recognit. 29 (5) (1996) 743-770.
    • (1996) Pattern Recognit. , vol.29 , Issue.5 , pp. 743-770
    • Jain, A.K.1    Zhong, Y.2
  • 9
    • 0031208642
    • Chinese document layout analysis based on adaptive split-and-merge and qualitative spatial reasoning
    • J. Liu, Y.Y. Tang, C.Y. Suen, Chinese document layout analysis based on adaptive split-and-merge and qualitative spatial reasoning, Pattern Recognit. 30 (7) (1997) 1265-1278.
    • (1997) Pattern Recognit. , vol.30 , Issue.7 , pp. 1265-1278
    • Liu, J.1    Tang, Y.Y.2    Suen, C.Y.3
  • 10
    • 0031248049
    • A simplified approach to the HMM based texture analysis and its application to document segmentation
    • J.-L. Chen, A simplified approach to the HMM based texture analysis and its application to document segmentation, Pattern Recognit. Lett. 18 (10) (1997) 993-1007.
    • (1997) Pattern Recognit. Lett. , vol.18 , Issue.10 , pp. 993-1007
    • Chen, J.-L.1
  • 12
    • 0035546419
    • Text analysis using local energy
    • W. Chan, G. Coghill, Text analysis using local energy, Pattern Recognit. 34 (12) (2001) 2523-2532.
    • (2001) Pattern Recognit. , vol.34 , Issue.12 , pp. 2523-2532
    • Chan, W.1    Coghill, G.2
  • 13
    • 0042199020
    • Character location in scene images from digital camera
    • K. Wang, J. Kangas, Character location in scene images from digital camera, Pattern Recognit. 36 (10) (2003) 2287-2299.
    • (2003) Pattern Recognit. , vol.36 , Issue.10 , pp. 2287-2299
    • Wang, K.1    Kangas, J.2
  • 14
    • 38149046441
    • A prototype document image analysis system for technical journals
    • G. Nagy, S.C. Seth, M. Viswanathan, A prototype document image analysis system for technical journals, IEEE Comput. 25 (7) (1992) 10-22.
    • (1992) IEEE Comput. , vol.25 , Issue.7 , pp. 10-22
    • Nagy, G.1    Seth, S.C.2    Viswanathan, M.3
  • 16
    • 0002562070
    • Language-free layout analysis
    • D.J. Ittner, H.S. Baird, Language-free layout analysis, in: ICDAR, 1993, pp. 336-340.
    • (1993) ICDAR , pp. 336-340
    • Ittner, D.J.1    Baird, H.S.2
  • 17
    • 0027698791
    • The document spectrum for page layout analysis
    • L. O'Gorman, The document spectrum for page layout analysis, IEEE Trans. Pattern Anal. Mach. Intell. 15 (11) (1993) 1162-1173.
    • (1993) IEEE Trans. Pattern Anal. Mach. Intell. , vol.15 , Issue.11 , pp. 1162-1173
    • O'Gorman, L.1
  • 18
    • 0001502831
    • Major components of a complete text reading system
    • S. Tsujimoto, H. Asada, Major components of a complete text reading system, Proc. IEEE 80 (7) (1992) 1133-1149.
    • (1992) Proc. IEEE , vol.80 , Issue.7 , pp. 1133-1149
    • Tsujimoto, S.1    Asada, H.2
  • 20
    • 0028706648
    • Segmentation and classification of mixed text/graphics/image documents
    • K.-C. Fan, C.-H. Liu, Y.-K. Wang, Segmentation and classification of mixed text/graphics/image documents, Pattern Recognit. Lett. 15 (12) (1994) 1201-1209.
    • (1994) Pattern Recognit. Lett. , vol.15 , Issue.12 , pp. 1201-1209
    • Fan, K.-C.1    Liu, C.-H.2    Wang, Y.-K.3
  • 22
    • 0030871179
    • Multiscale segmentation of unstructured document pages using soft decision integration
    • K. Etemad, D.S. Doermann, R. Chellappa, Multiscale segmentation of unstructured document pages using soft decision integration, IEEE Trans. Pattern Anal. Mach. Intell. 19 (1) (1997) 92-96.
    • (1997) IEEE Trans. Pattern Anal. Mach. Intell. , vol.19 , Issue.1 , pp. 92-96
    • Etemad, K.1    Doermann, D.S.2    Chellappa, R.3
  • 23
    • 0035691466
    • Text identification in complex background using SVM
    • D. Chen, H. Bourlard, J.-P. Thiran, Text identification in complex background using SVM, in: CVPR, 2001, pp. 621-627.
    • (2001) CVPR , pp. 621-627
    • Chen, D.1    Bourlard, H.2    Thiran, J.-P.3
  • 25
    • 0035510433
    • Parameter-free geometric document layout analysis
    • S.-W. Lee, D.-S. Ryu, Parameter-free geometric document layout analysis, IEEE Trans. Pattern Anal. Mach. Intell. 23 (11) (2001) 1240-1256.
    • (2001) IEEE Trans. Pattern Anal. Mach. Intell. , vol.23 , Issue.11 , pp. 1240-1256
    • Lee, S.-W.1    Ryu, D.-S.2
  • 27
    • 42749094384
    • Forty years of research in character and document recognition - an industrial perspective
    • H. Fujisawa, Forty years of research in character and document recognition - an industrial perspective, Pattern Recognit. 41 (8) (2008) 2435-2446.
    • (2008) Pattern Recognit. , vol.41 , Issue.8 , pp. 2435-2446
    • Fujisawa, H.1
  • 28
    • 68749084616
    • Handwriting recognition research: Twenty years of achievement... and beyond
    • M. Cheriet, M.A. El-Yacoubi, H. Fujisawa, D.P. Lopresti, G. Lorette, Handwriting recognition research: twenty years of achievement... and beyond, Pattern Recognit. 42 (12) (2009) 3131-3135.
    • (2009) Pattern Recognit. , vol.42 , Issue.12 , pp. 3131-3135
    • Cheriet, M.1    El-Yacoubi, M.A.2    Fujisawa, H.3    Lopresti, D.P.4    Lorette, G.5
  • 29
    • 33746600649
    • Reducing the dimensionality of data with neural networks
    • G.E. Hinton, R.R. Salakhutdinov, Reducing the dimensionality of data with neural networks, Science 313 (5786) (2006) 504-507.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 31
    • 0742268833
    • Two-dimensional PCA: A new approach to appearance-based face representation and recognition
    • J. Yang, D. Zhang, A.F. Frangi, J.-Y. Yang, Two-dimensional PCA: a new approach to appearance-based face representation and recognition, IEEE Trans. Pattern Anal. Mach. Intell. 26 (1) (2004) 131-137.
    • (2004) IEEE Trans. Pattern Anal. Mach. Intell. , vol.26 , Issue.1 , pp. 131-137
    • Yang, J.1    Zhang, D.2    Frangi, A.F.3    Yang, J.-Y.4
  • 32
    • 27244450132
    • Two-dimensional linear discriminant analysis
    • J. Ye, R. Janardan, Q. Li, Two-dimensional linear discriminant analysis, in: NIPS, 2004, pp. 1569-1576.
    • (2004) NIPS , pp. 1569-1576
    • Ye, J.1    Janardan, R.2    Li, Q.3
  • 33
    • 0000764772
    • The use of multiple measurements in taxonomic problems
    • R.A. Fisher, The use of multiple measurements in taxonomic problems, Ann. Eugen. 7 (2) (1936) 179-188.
    • (1936) Ann. Eugen. , vol.7 , Issue.2 , pp. 179-188
    • Fisher, R.A.1
  • 34
    • 84864026346
    • Tensor subspace analysis
    • X. He, D. Cai, P. Niyogi, Tensor subspace analysis, in: NIPS, 2005, pp. 499-506.
    • (2005) NIPS , pp. 499-506
    • He, X.1    Cai, D.2    Niyogi, P.3
  • 36
    • 39549087054
    • Image classification using correlation tensor analysis
    • Y. Fu, T.S. Huang, Image classification using correlation tensor analysis, IEEE Trans. Image Process. 17 (2) (2008) 226-234.
    • (2008) IEEE Trans. Image Process. , vol.17 , Issue.2 , pp. 226-234
    • Fu, Y.1    Huang, T.S.2
  • 37
    • 84896845961
    • Large margin low rank tensor analysis
    • G. Zhong, M. Cheriet, Large margin low rank tensor analysis, Neural Comput. 26 (4) (2014) 761-780.
    • (2014) Neural Comput. , vol.26 , Issue.4 , pp. 761-780
    • Zhong, G.1    Cheriet, M.2
  • 38
    • 84908209908
    • Low-rank tensor learning with discriminant analysis for action classification and image recovery
    • C. Jia, G. Zhong, Y. Fu, Low-rank tensor learning with discriminant analysis for action classification and image recovery, in: AAAI, 2014, pp. 1228-1234.
    • (2014) AAAI , pp. 1228-1234
    • Jia, C.1    Zhong, G.2    Fu, Y.3
  • 39
    • 84913530319
    • Latent tensor transfer learning for RGB-D action recognition
    • C. Jia, Y. Kong, Z. Ding, Y. Fu, Latent tensor transfer learning for RGB-D action recognition, in: ACM Multimedia (MM), 2014.
    • (2014) ACM Multimedia (MM)
    • Jia, C.1    Kong, Y.2    Ding, Z.3    Fu, Y.4
  • 40
    • 84880903188
    • A convergent solution to tensor subspace learning
    • H. Wang, S. Yan, T.S. Huang, X. Tang, A convergent solution to tensor subspace learning, in: IJCAI, 2007, pp. 629-634.
    • (2007) IJCAI , pp. 629-634
    • Wang, H.1    Yan, S.2    Huang, T.S.3    Tang, X.4
  • 41
    • 68649096448
    • Tensor decompositions and applications
    • T.G. Kolda, B.W. Bader, Tensor decompositions and applications, SIAM Rev. 51 (3) (2009) 455-500.
    • (2009) SIAM Rev. , vol.51 , Issue.3 , pp. 455-500
    • Kolda, T.G.1    Bader, B.W.2
  • 42
    • 33750741417
    • Tensor embedding methods
    • G. Dai, D.-Y. Yeung, Tensor embedding methods, in: AAAI, 2006, pp. 330-335.
    • (2006) AAAI , pp. 330-335
    • Dai, G.1    Yeung, D.-Y.2
  • 43
    • 34249086815
    • Dimensionality reduction of multimodal labeled data by local Fisher discriminant analysis
    • M. Sugiyama, Dimensionality reduction of multimodal labeled data by local Fisher discriminant analysis, J. Mach. Learn. Res. 8 (2007) 1027-1061.
    • (2007) J. Mach. Learn. Res. , vol.8 , pp. 1027-1061
    • Sugiyama, M.1
  • 44
    • 0034704229
    • A global geometric framework for nonlinear dimensionality reduction
    • J.B. Tenenbaum, V. de Silva, J.C. Langford, A global geometric framework for nonlinear dimensionality reduction, Science 290 (5500) (2000) 2319-2323.
    • (2000) Science , vol.290 , Issue.5500 , pp. 2319-2323
    • Tenenbaum, J.B.1    De Silva, V.2    Langford, J.C.3
  • 45
    • 0034704222
    • Nonlinear dimensionality reduction by locally linear embedding
    • S.T. Roweis, L.K. Saul, Nonlinear dimensionality reduction by locally linear embedding, Science 290 (5500) (2000) 2323-2326.
    • (2000) Science , vol.290 , Issue.5500 , pp. 2323-2326
    • Roweis, S.T.1    Saul, L.K.2
  • 46
    • 35148823228
    • Trace ratio vs. ratio trace for dimensionality reduction
    • H. Wang, S. Yan, D. Xu, X. Tang, T.S. Huang, Trace ratio vs. ratio trace for dimensionality reduction, in: CVPR, 2007, pp. 1-8.
    • (2007) CVPR , pp. 1-8
    • Wang, H.1    Yan, S.2    Xu, D.3    Tang, X.4    Huang, T.S.5
  • 47
    • 0347243182
    • Nonlinear component analysis as a kernel eigenvalue problem
    • B. Schölkopf, A.J. Smola, K.-R. Müller, Nonlinear component analysis as a kernel eigenvalue problem, Neural Comput. 10 (5) (1998) 1299-1319.
    • (1998) Neural Comput. , vol.10 , Issue.5 , pp. 1299-1319
    • Schölkopf, B.1    Smola, A.J.2    Müller, K.-R.3
  • 48
    • 0034296402
    • Generalized discriminant analysis using a kernel approach
    • G. Baudat, F. Anouar, Generalized discriminant analysis using a kernel approach, Neural Comput. 12 (10) (2000) 2385-2404.
    • (2000) Neural Comput. , vol.12 , Issue.10 , pp. 2385-2404
    • Baudat, G.1    Anouar, F.2
  • 49
    • 57749182885
    • Trace ratio criterion for feature selection
    • F. Nie, S. Xiang, Y. Jia, C. Zhang, S. Yan, Trace ratio criterion for feature selection, in: AAAI, 2008, pp. 671-676.
    • (2008) AAAI , pp. 671-676
    • Nie, F.1    Xiang, S.2    Jia, Y.3    Zhang, C.4    Yan, S.5
  • 50
    • 70349967917
    • A convex method for locating regions of interest with multi-instance learning
    • Y.-F. Li, J.T. Kwok, I.W. Tsang, Z.-H. Zhou, A convex method for locating regions of interest with multi-instance learning, in: ECML/PKDD (2), 2009, pp. 15-30.
    • (2009) ECML/PKDD , vol.2 , pp. 15-30
    • Li, Y.-F.1    Kwok, J.T.2    Tsang, I.W.3    Zhou, Z.-H.4
  • 51
    • 0019083307
    • Mathematical description of the responses of simple cortical cells
    • S. Marčelja, Mathematical description of the responses of simple cortical cells, J. Opt. Soc. Am. 70 (1980) 1297-1300.
    • (1980) J. Opt. Soc. Am. , vol.70 , pp. 1297-1300
    • Marčelja, S.1
  • 52
    • 0022098435
    • Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters
    • J.G. Daugman, Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters, J. Opt. Soc. Am. A: Opt. Image Sci. Vis. 2 (7) (1985) 1160-1169.
    • (1985) J. Opt. Soc. Am. A: Opt. Image Sci. Vis. , vol.2 , Issue.7 , pp. 1160-1169
    • Daugman, J.G.1
  • 53
    • 0023583669
    • An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex
    • J. Jones, L. Palmer, An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex, J. Neurophysiol. 58 (1987) 1233-1258.
    • (1987) J. Neurophysiol. , vol.58 , pp. 1233-1258
    • Jones, J.1    Palmer, L.2
  • 54
    • 0024053929
    • Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression
    • J.G. Daugman, Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression, IEEE Trans. Acoust. Speech Signal Process. 36 (7) (1988) 1169-1179.
    • (1988) IEEE Trans. Acoust. Speech Signal Process. , vol.36 , Issue.7 , pp. 1169-1179
    • Daugman, J.G.1
  • 55
    • 0000293183
    • Theory of communication
    • D. Gabor, Theory of communication, J. Inst. Electr. Eng. 93 (1946) 429-457.
    • (1946) J. Inst. Electr. Eng. , vol.93 , pp. 429-457
    • Gabor, D.1
  • 56
    • 1942451863
    • Gabor filter based block energy analysis for text extraction from digital document images
    • S.S. Raju, P.B. Pati, A.G. Ramakrishnan, Gabor filter based block energy analysis for text extraction from digital document images, in: DIAL, 2004, pp. 233-243.
    • (2004) DIAL , pp. 233-243
    • Raju, S.S.1    Pati, P.B.2    Ramakrishnan, A.G.3
  • 57
    • 0042665432
    • Contour detection based on nonclassical receptive field inhibition
    • C. Grigorescu, N. Petkov, M.A. Westenberg, Contour detection based on nonclassical receptive field inhibition, IEEE Trans. Image Process. 12 (7) (2003) 729-739.
    • (2003) IEEE Trans. Image Process. , vol.12 , Issue.7 , pp. 729-739
    • Grigorescu, C.1    Petkov, N.2    Westenberg, M.A.3
  • 58
  • 60
    • 0035478854
    • Random forests
    • L. Breiman, Random forests, Mach. Learn. 45 (1) (2001) 5-32.
    • (2001) Mach. Learn. , vol.45 , Issue.1 , pp. 5-32
    • Breiman, L.1
  • 61
    • 0030211964
    • Bagging predictors
    • L. Breiman, Bagging predictors, Mach. Learn. 24 (2) (1996) 123-140.
    • (1996) Mach. Learn. , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 62
    • 0032280519
    • Boosting the margin: A new explanation for the effectiveness of voting methods
    • R.E. Schapire, Y. Freund, P. Bartlett, W.S. Lee, Boosting the margin: a new explanation for the effectiveness of voting methods, Ann. Stat. 26 (5) (1998) 1651-1686.
    • (1998) Ann. Stat. , vol.26 , Issue.5 , pp. 1651-1686
    • Schapire, R.E.1    Freund, Y.2    Bartlett, P.3    Lee, W.S.4
  • 63
    • 0345040873
    • Classification and regression by randomForest
    • A. Liaw, M. Wiener, Classification and regression by randomForest, R News 2 (3) (2002) 18-22.
    • (2002) R News , vol.2 , Issue.3 , pp. 18-22
    • Liaw, A.1    Wiener, M.2
  • 64
    • 29644438050
    • Statistical comparisons of classifiers over multiple data sets
    • J. Demšar, Statistical comparisons of classifiers over multiple data sets, J. Mach. Learn. Res. 7 (2006) 1-30.
    • (2006) J. Mach. Learn. Res. , vol.7 , pp. 1-30
    • Demšar, J.1
  • 65
    • 1342282160
    • Machine printed text and handwriting identification in noisy document images
    • Y. Zheng, H. Li, D.S. Doermann, Machine printed text and handwriting identification in noisy document images, IEEE Trans. Pattern Anal. Mach. Intell. 26 (3) (2003) 337-353.
    • (2003) IEEE Trans. Pattern Anal. Mach. Intell. , vol.26 , Issue.3 , pp. 337-353
    • Zheng, Y.1    Li, H.2    Doermann, D.S.3
  • 67
    • 79551480483
    • Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
    • P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, P.-A. Manzagol, Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion, J. Mach. Learn. Res. 11 (2010) 3371-3408.
    • (2010) J. Mach. Learn. Res. , vol.11 , pp. 3371-3408
    • Vincent, P.1    Larochelle, H.2    Lajoie, I.3    Bengio, Y.4    Manzagol, P.-A.5
  • 68
    • 34547971961
    • Self-taught learning: Transfer learning from unlabeled data
    • R. Raina, A. Battle, H. Lee, B. Packer, A.Y. Ng, Self-taught learning: transfer learning from unlabeled data, in: ICML, 2007, pp. 759-766.
    • (2007) ICML , pp. 759-766
    • Raina, R.1    Battle, A.2    Lee, H.3    Packer, B.4    Ng, A.Y.5
  • 69
    • 77956031473
    • A survey on transfer learning
    • S.J. Pan, Q. Yang, A survey on transfer learning, IEEE Trans. Knowl. Data Eng. 22 (10) (2010) 1345-1359.
    • (2010) IEEE Trans. Knowl. Data Eng. , vol.22 , Issue.10 , pp. 1345-1359
    • Pan, S.J.1    Yang, Q.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.