-
1
-
-
0002442796
-
Machine learning in automated text categorization
-
Sebastiani, F., "Machine learning in automated text categorization," ACM Computing Surveys 34 (1), 1-47 (2002).
-
(2002)
ACM Computing Surveys
, vol.34
, Issue.1
, pp. 1-47
-
-
Sebastiani, F.1
-
2
-
-
33745557026
-
The impact of OCR accuracy and feature transformation on automatic text classification
-
February
-
Murata, M., Busagala, L. S. P., Ohyama, W., Wakabayashi, T., and Kimura, F., "The impact of OCR accuracy and feature transformation on automatic text classification," in [Proceedings of the Seventh IAPR Workshop on Document Analysis Systems (DAS'06)], 506-517 (February 2006).
-
(2006)
[Proceedings of the Seventh IAPR Workshop on Document Analysis Systems (DAS'06)]
, pp. 506-517
-
-
Murata, M.1
Busagala, L.S.P.2
Ohyama, W.3
Wakabayashi, T.4
Kimura, F.5
-
3
-
-
0000414149
-
Text categorization of low quality images
-
April
-
Ittner, D. J., Lewis, D. D., and Ahn, D. D., "Text categorization of low quality images," in [Proceedings of the Fourth Annual Symposium on Document Analysis and Information Retrieval (SDAIR'95)], 301-315 (April 1995).
-
(1995)
[Proceedings of the Fourth Annual Symposium on Document Analysis and Information Retrieval (SDAIR'95)]
, pp. 301-315
-
-
Ittner, D.J.1
Lewis, D.D.2
Ahn, D.D.3
-
4
-
-
0008372212
-
An experimental evaluation of OCR text representations for learning document classifiers
-
Junker, M. and Hoch, R., "An experimental evaluation of OCR text representations for learning document classifiers," International Journal on Document Analysis and Recognition, IJDAR 1 (2), 116-122 (1998).
-
(1998)
International Journal on Document Analysis and Recognition, IJDAR
, vol.1
, Issue.2
, pp. 116-122
-
-
Junker, M.1
Hoch, R.2
-
5
-
-
34249795774
-
A survey of document image classification: Problem statement, classifier architecture and performance evaluation
-
Chen, N. and Blostein, D., "A survey of document image classification: Problem statement, classifier architecture and performance evaluation," International Journal on Document Analysis and Recognition, IJDAR 10 (1), 1-16 (2007).
-
(2007)
International Journal on Document Analysis and Recognition, IJDAR
, vol.10
, Issue.1
, pp. 1-16
-
-
Chen, N.1
Blostein, D.2
-
8
-
-
0016572913
-
A vector space model for automatic indexing
-
Salton, G., Wong, A., and Wang, C. S., "A vector space model for automatic indexing," Communications of the ACM 18 (11), 613-620 (1975).
-
(1975)
Communications of the ACM
, vol.18
, Issue.11
, pp. 613-620
-
-
Salton, G.1
Wong, A.2
Wang, C.S.3
-
9
-
-
84948481845
-
An algorithm for suffix stripping
-
Porter, M. F., "An algorithm for suffix stripping," Program 14 (3), 130-137 (1980).
-
(1980)
Program
, vol.14
, Issue.3
, pp. 130-137
-
-
Porter, M.F.1
-
10
-
-
14344263325
-
-
Forman, G., A pitfall and solution in multi-class feature selection for text classification, in [Proceedings of the Twenty-First International Conference on Machine Learning (ICML'04)], (July 2004).
-
Forman, G., "A pitfall and solution in multi-class feature selection for text classification," in [Proceedings of the Twenty-First International Conference on Machine Learning (ICML'04)], (July 2004).
-
-
-
-
12
-
-
0018699862
-
Experiments in relevance weighting of search terms
-
Spärck Jones, K., "Experiments in relevance weighting of search terms," Information Processing and Management 15, 133-144 (1979).
-
(1979)
Information Processing and Management
, vol.15
, pp. 133-144
-
-
Spärck Jones, K.1
-
13
-
-
85024373635
-
A re-examination of text categorization methods
-
Yang, Y. and Liu, X., "A re-examination of text categorization methods," in [Proceedings of the Twenty-Second Annual International ACM SIGIR Conference on Research and Development in Information Retrieval], 22nd Annual International SIGIR, 42-49 (1999).
-
(1999)
Proceedings of the Twenty-Second Annual International ACM SIGIR Conference on Research and Development in Information Retrieval], 22nd Annual International SIGIR
, pp. 42-49
-
-
Yang, Y.1
Liu, X.2
-
15
-
-
17644390231
-
An analysis of the relative hardness of reuters-21578 subsets
-
Debole, F. and Sebastiani, F., "An analysis of the relative hardness of reuters-21578 subsets," Journal of the American Society for Information Science and Technology, JASIST 56 (6), 584-596 (2005).
-
(2005)
Journal of the American Society for Information Science and Technology, JASIST
, vol.56
, Issue.6
, pp. 584-596
-
-
Debole, F.1
Sebastiani, F.2
-
16
-
-
0009878181
-
Text categorisation: A survey,
-
Norwegian Computing Center
-
Aas, K. and Eikvil, L., "Text categorisation: A survey," tech. rep., Norwegian Computing Center, http://www.nr.no/files/samba/bamg/tm-survey.ps (1999).
-
(1999)
tech. rep
-
-
Aas, K.1
Eikvil, L.2
-
18
-
-
0005540823
-
Modern Information Retrieval], ch
-
Addison-Wesley
-
Baeza-Yates, R. and Ribeiro-Neto, B., [Modern Information Retrieval], ch. Retrieval Evaluation, 73-99, Addison-Wesley (1999).
-
(1999)
Retrieval Evaluation
, pp. 73-99
-
-
Baeza-Yates, R.1
Ribeiro-Neto, B.2
-
20
-
-
84962683765
-
Towards language independent automated learning of text categorization models
-
Apté, C., Damerau, F., and Weiss, S. M., "Towards language independent automated learning of text categorization models," in [Research and Development in Information Retrieval], (1994).
-
(1994)
Research and Development in Information Retrieval
-
-
Apté, C.1
Damerau, F.2
Weiss, S.M.3
-
22
-
-
0001001098
-
Feature selection for svms
-
Weston, J., Mukherjee, S., Chapelle, O., Pontil, M., Poggio, T., and Vapnik, V., "Feature selection for svms," Advances in Neural Information Processing Systems 13, 668-674 (2000).
-
(2000)
Advances in Neural Information Processing Systems
, vol.13
, pp. 668-674
-
-
Weston, J.1
Mukherjee, S.2
Chapelle, O.3
Pontil, M.4
Poggio, T.5
Vapnik, V.6
-
23
-
-
57649243824
-
Categorization of on-line handwritten documents
-
Peña Saldarriaga, S., Morin, E., and Viard-Gaudin, C., "Categorization of on-line handwritten documents," in [Proceedings of the Eight IAPR International Workshop on Document Analysis Systems (DAS'08)], (2008).
-
(2008)
[Proceedings of the Eight IAPR International Workshop on Document Analysis Systems (DAS'08)]
-
-
Peña Saldarriaga, S.1
Morin, E.2
Viard-Gaudin, C.3
|