메뉴 건너뛰기




Volumn 10, Issue 1, 2007, Pages 1-16

A survey of document image classification: Problem statement, classifier architecture and performance evaluation

Author keywords

Class models; Classification algorithms; Document categorization; Document classification; Document classifiers; Document features; Document image classification; Feature representations; Learning mechanisms; Performance evaluation

Indexed keywords

DIGITAL LIBRARIES; FUZZY SETS; IMAGE CLASSIFICATION; LEARNING ALGORITHMS; LEARNING SYSTEMS;

EID: 34249795774     PISSN: 14332833     EISSN: 14332825     Source Type: Journal    
DOI: 10.1007/s10032-006-0020-2     Document Type: Article
Times cited : (151)

References (64)
  • 2
    • 0037410628 scopus 로고    scopus 로고
    • First order Gaussian graphs for efficient structure classification
    • Bagdanov A.D., Worring M. (2003). First order Gaussian graphs for efficient structure classification. Pattern Recognit. 36(6): 1311-1324
    • (2003) Pattern Recognit. , vol.36 , Issue.6 , pp. 1311-1324
    • Bagdanov, A.D.1    Worring, M.2
  • 6
    • 85153946439 scopus 로고
    • An input output HMM architecture
    • In: Tesauro G., Touretzky D., Leen T. (eds) MIT, Cambridge
    • Bengio Y., Frasconi P. (1995). An input output HMM architecture. In: Tesauro G., Touretzky D., Leen T. (eds) Advances in Neural Information Processing Systems, vol. 7. MIT, Cambridge, pp. 427-434
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 427-434
    • Bengio, Y.1    Frasconi, P.2
  • 7
    • 84958964452 scopus 로고    scopus 로고
    • On-line algorithms in machine learning
    • In: Fiat, A., Woeginger, G. (eds.) Springer, Berlin Heidelberg New York
    • Blum, A.: On-line algorithms in machine learning. In: Fiat, A., Woeginger, G. (eds.) Online algorithms: The state of the art, vol. 1442, pp. 306-325. Springer, Berlin Heidelberg New York (1998)
    • (1998) Online Algorithms: The State of the Art , vol.1442 , pp. 306-325
    • Blum, A.1
  • 15
    • 0031359679 scopus 로고    scopus 로고
    • Bridging the media gap from the Guthenberg's world to electronic document management systems
    • In: Orlando, Florida, USA, October (1997)
    • Dengel, A.: Bridging the media gap from the Guthenberg's world to electronic document management systems. In: Proceedings of 1997 IEEE International Conference on Systems, Man, and Cybernetics, Orlando, Florida, USA, October 1997, pp. 3540-3554 (1997)
    • (1997) Proceedings of 1997 IEEE International Conference on Systems, Man, and Cybernetics , pp. 3540-3554
    • Dengel, A.1
  • 19
    • 33748861966 scopus 로고    scopus 로고
    • Document page similarity based on layout visual saliency: Application to query by example and document classification
    • In: Edinburgh, Scotland, 3-6 August 2003
    • Eglin, V., Bres, S.: Document page similarity based on layout visual saliency: Application to query by example and document classification. In: Proceedings of the 7th International Conference on Document Analysis and Recognition, Edinburgh, Scotland, 3-6 August 2003, pp. 1208-1212 (2003)
    • (2003) Proceedings of the 7th International Conference on Document Analysis and Recognition , pp. 1208-1212
    • Eglin, V.1    Bres, S.2
  • 20
    • 25144501683 scopus 로고    scopus 로고
    • Analysis and interpretation of visual saliency for document functional labeling
    • Eglin V., Bres S. (2004). Analysis and interpretation of visual saliency for document functional labeling. Int. J. Doc. Anal. Recognit. 7(1): 28-43
    • (2004) Int. J. Doc. Anal. Recognit. , vol.7 , Issue.1 , pp. 28-43
    • Eglin, V.1    Bres, S.2
  • 21
    • 0034156954 scopus 로고    scopus 로고
    • Machine learning for intelligent processing of printed documents
    • Esposito F., Malerba D., Lisi F.A. (2000). Machine learning for intelligent processing of printed documents. J. Intell. Inf. Syst. 14(2-3): 175-198
    • (2000) J. Intell. Inf. Syst. , vol.14 , Issue.2-3 , pp. 175-198
    • Esposito, F.1    Malerba, D.2    Lisi, F.A.3
  • 25
    • 0011187879 scopus 로고    scopus 로고
    • Multiple classifier combination: Lessons and next steps
    • In: Kandel A., Bunke H. (eds) World Scientific, Singapore
    • Ho T.K. (2002). Multiple classifier combination: Lessons and next steps. In: Kandel A., Bunke H. (eds) Hybrid Methods in Pattern Recognition. World Scientific, Singapore, pp. 171-198
    • (2002) Hybrid Methods in Pattern Recognition , pp. 171-198
    • Ho, T.K.1
  • 31
    • 33947366519 scopus 로고    scopus 로고
    • User-defined template for identifying document type and extracting information from documents
    • In: Bangalore, India, 20-22 September 1999
    • Kochi, T., Saitoh, T.: User-defined template for identifying document type and extracting information from documents. In: Proceedings of the 5th International Conference on Document Analysis and Recognition, Bangalore, India, 20-22 September 1999, pp. 127-130 (1999)
    • (1999) Proceedings of the 5th International Conference on Document Analysis and Recognition , pp. 127-130
    • Kochi, T.1    Saitoh, T.2
  • 35
    • 34250091945 scopus 로고
    • Learning quickly when irrelevant attributes abound: A new linear threshold algorithm
    • Littlestone N. (1988). Learning quickly when irrelevant attributes abound: A new linear threshold algorithm. Mach. Learn. 2(4): 285-318
    • (1988) Mach. Learn. , vol.2 , Issue.4 , pp. 285-318
    • Littlestone, N.1
  • 36
    • 0031274436 scopus 로고    scopus 로고
    • Classification of documents by form and content
    • Maderlechner G., Suda P., Bräckner T. (1997). Classification of documents by form and content. Pattern Recognit. Lett. 18(11-13): 1225-1231
    • (1997) Pattern Recognit. Lett. , vol.18 , Issue.11-13 , pp. 1225-1231
    • Maderlechner, G.1    Suda, P.2    Bräckner, T.3
  • 37
    • 0038056109 scopus 로고    scopus 로고
    • Document structure analysis algorithms: A literature survey
    • In: (IS&T/SPIE electronic imaging), Santa Clara, California, USA, 20-24 January 2003, SPIE Proceedings Series 5010
    • Mao, S., Rosenfeld, A., Kanungo, T.: Document structure analysis algorithms: A literature survey. In: Proceedings of Document Recognition and Retrieval X (IS&T/SPIE electronic imaging), Santa Clara, California, USA, 20-24 January 2003, SPIE Proceedings Series 5010, 197-207 (2003)
    • (2003) Proceedings of Document Recognition and Retrieval X , pp. 197-207
    • Mao, S.1    Rosenfeld, A.2    Kanungo, T.3
  • 38
    • 0033640628 scopus 로고    scopus 로고
    • Twenty years of document image analysis in PAMI
    • Nagy G. (2000). Twenty years of document image analysis in PAMI. IEEE Tran. Pattern Anal. Mach. Intell. 22(1): 38-62
    • (2000) IEEE Tran. Pattern Anal. Mach. Intell. , vol.22 , Issue.1 , pp. 38-62
    • Nagy, G.1
  • 40
    • 34249795926 scopus 로고    scopus 로고
    • Geometric method for document understanding and classification using on-line machine learning
    • In: Seattle, USA, 10-13 September 2001
    • Nattee, C., Numao, M.: Geometric method for document understanding and classification using on-line machine learning. In: Proceedings of the 6th International Conference on Document Analysis and Recognition, Seattle, USA, 10-13 September 2001, pp. 602-606 (2001)
    • (2001) Proceedings of the 6th International Conference on Document Analysis and Recognition , pp. 602-606
    • Nattee, C.1    Numao, M.2
  • 41
    • 0038056101 scopus 로고    scopus 로고
    • Form type identification for banking applications and its implementation issues
    • In: (IS&T/SPIE electronic imaging), Santa Clara, California, 20-24 January 2003, SPIE Proceedings Series 5010
    • Ogata, H., Watanabe, S., Imaizumi, A., Yasue, T., Furukawa, N., Sako, H., Fujisawa, H.: Form type identification for banking applications and its implementation issues. In: Proceedings of Document Recognition and Retrieval X (IS&T/SPIE electronic imaging), Santa Clara, California, 20-24 January 2003, SPIE Proceedings Series 5010, 208-218 (2003)
    • (2003) Proceedings of Document Recognition and Retrieval X , pp. 208-218
    • Ogata, H.1    Watanabe, S.2    Imaizumi, A.3    Yasue, T.4    Furukawa, N.5    Sako, H.6    Fujisawa, H.7
  • 42
    • 0010706653 scopus 로고    scopus 로고
    • Page segmentation and zone classification: The state of the art
    • Technical report, LAMP-TR-036, University of Maryland, College Park
    • Okun, O., Doermann, D., Pietikäinen, M.: Page segmentation and zone classification: The state of the art. Technical report, LAMP-TR-036, University of Maryland, College Park (1999)
    • (1999)
    • Okun, O.1    Doermann, D.2    Pietikäinen, M.3
  • 49
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • Sebastiani F. (2002). Machine learning in automated text categorization. ACM Comput. Surveys 34(1): 1-47
    • (2002) ACM Comput. Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 51
    • 0038217205 scopus 로고    scopus 로고
    • Classification of document pages using structure-based features
    • Shin C., Doermann D., Rosenfeld A. (2001). Classification of document pages using structure-based features. Int. J. Doc. Anal. Recognit. 3(4): 232-247
    • (2001) Int. J. Doc. Anal. Recognit. , vol.3 , Issue.4 , pp. 232-247
    • Shin, C.1    Doermann, D.2    Rosenfeld, A.3
  • 53
    • 0033883970 scopus 로고    scopus 로고
    • Text categorization using character shape codes
    • In: (IS&T/SPIE electronic imaging), San Jose, California, 23-28 January 2000, SPIE Proceedings Series 3967
    • Spitz, A.L., Maghbouleh, A.: Text categorization using character shape codes. In: Proceedings of Document Recognition and Retrieval VII (IS&T/SPIE electronic imaging), San Jose, California, 23-28 January 2000, SPIE Proceedings Series 3967, 174-181 (2000)
    • (2000) Proceedings of Document Recognition and Retrieval VII , pp. 174-181
    • Spitz, A.L.1    Maghbouleh, A.2
  • 57
    • 0030128299 scopus 로고    scopus 로고
    • Feature extraction methods for character recognition - A survey
    • Trier D., Jain A.K., Taxt T. (1996). Feature extraction methods for character recognition - a survey. Pattern Recognit. 29(4): 641-662
    • (1996) Pattern Recognit. , vol.29 , Issue.4 , pp. 641-662
    • Trier, D.1    Jain, A.K.2    Taxt, T.3
  • 60
    • 0032629301 scopus 로고    scopus 로고
    • A guideline for specifying layout knowledge
    • In: (IS&T/SPIE electronic imaging), San Jose, CA, 27 January 1999, SPIE Proceedings Series 3651
    • Watanabe, T.: A guideline for specifying layout knowledge. In: Proceedings of Document Recognition and Retrieval VI (IS&T/SPIE electronic imaging), San Jose, CA, 27 January 1999, SPIE Proceedings Series 3651, 162-172 (1999)
    • (1999) Proceedings of Document Recognition and Retrieval VI , pp. 162-172
    • Watanabe, T.1
  • 62
    • 0032641742 scopus 로고    scopus 로고
    • Learning to identify hundreds of flex-form documents
    • In: (IS&T/SPIE electronic imaging), San Jose, CA, 27 January SPIE Proceedings Series 3651 (1999)
    • Wnek, J.: Learning to identify hundreds of flex-form documents. In: Proceedings of Document Recognition and Retrieval VI (IS&T/SPIE electronic imaging), San Jose, CA, 27 January 1999, SPIE Proceedings Series 3651, 173-182 (1999)
    • (1999) Proceedings of Document Recognition and Retrieval VI , pp. 173-182
    • Wnek, J.1
  • 64
    • 0024889169 scopus 로고
    • Simple fast algorithms for the editing distance between trees and related problems
    • Zhang K., Shasha D. (1989). Simple fast algorithms for the editing distance between trees and related problems. SIAM J. Comput. 18(6): 1245-1262
    • (1989) SIAM J. Comput. , vol.18 , Issue.6 , pp. 1245-1262
    • Zhang, K.1    Shasha, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.