-
1
-
-
84871059997
-
Evaluation of Web Page Representations by Content through Clustering
-
A. Casillas, V. Fresno, M. González de Lena and R. Martínez. "Evaluation of Web Page Representations by Content through Clustering". String Processing and Information Retrieval. LNCS series of Springer-Verlag, 129-130, 2004.
-
(2004)
LNCS series of Springer-Verlag
, pp. 129-130
-
-
Casillas, A.
Fresno, V.
González de Lena, M.
Martínez, R.
-
2
-
-
0034791059
-
Enhanced topic distillation using text, markup tags, and hyperlinks
-
New Orleans, Louisiana, United States
-
S. Chakrabarti, M. Joshi and V. Tawde. "Enhanced topic distillation using text, markup tags, and hyperlinks". SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, New Orleans, Louisiana, United States, 208-216, 2001.
-
(2001)
SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
, pp. 208-216
-
-
Chakrabarti, S.
Joshi, M.
Tawde, V.
-
3
-
-
3543096195
-
An Analytical Approach to Concept Extraction in HTML Environments
-
Kluwer Academic Publishers
-
V. Fresno and A. Ribeiro. "An Analytical Approach to Concept Extraction in HTML Environments". Journal of Intelligent Information Systems - JIIS. Kluwer Academic Publishers, 215-235, 2004.
-
(2004)
Journal of Intelligent Information Systems - JIIS
, pp. 215-235
-
-
Fresno, V.
Ribeiro, A.
-
5
-
-
0345566259
-
THESUS: Organizing Web Document Collections Based on Link Semantics
-
M. Halkidi, B. Nguyen, I. Varlamis and M. Vazirgiannis. "THESUS: Organizing Web Document Collections Based on Link Semantics". In VLDB Journal, special issue on Semantic Web, 2003.
-
(2003)
VLDB Journal
, Special issue on Semantic Web
-
-
Halkidi, M.
Nguyen, B.
Varlamis, I.
Vazirgiannis, M.
-
6
-
-
0031710353
-
WebACE: A Web Agent for Document Categorization and Exploration
-
E. Han, D. Boley, M. Gini, R. Gross, K. Hastings, G. Karypis, V. Kumar, B. Mobasher and J. Moore. "WebACE: A Web Agent for Document Categorization and Exploration". Proceedings of the 2nd International Conference on Autonomous Agents (Agents'98), 1998.
-
(1998)
Proceedings of the 2nd International Conference on Autonomous Agents (Agents'98)
-
-
Han, E.
Boley, D.
Gini, M.
Gross, R.
Hastings, K.
Karypis, G.
Kumar, V.
Mobasher, B.
Moore, J.
-
9
-
-
84953744816
-
A statistical interpretation of term specificity and its application in retrieval
-
K. Sparck Jones. "A statistical interpretation of term specificity and its application in retrieval". Journal of Documentation, Vol. 28, N. 1, 11-21, 1972.
-
(1972)
Journal of Documentation
, vol.28
, Issue.1
, pp. 11-21
-
-
Sparck Jones, K.
-
10
-
-
0038163983
-
CLUTO: A Clustering Toolkit
-
Technical Report: 02-017. University of Minnesota, Department of Computer Science, Minneapolis, MN 55455
-
G. Karypis. "CLUTO: A Clustering Toolkit". Technical Report: 02-017. University of Minnesota, Department of Computer Science, Minneapolis, MN 55455.
-
-
-
Karypis, G.
-
11
-
-
51849095425
-
-
Improving interactive retrieval by combining ranked lists and clustering
-
A. Leuski and J. Allan. "Improving interactive retrieval by combining ranked lists and clustering". Proceedings of RIAO2000, 665-681, 2000.
-
-
-
-
12
-
-
0000159640
-
A statistical approach to mechanized encoding and searching of literary information
-
H. P. Luhn. "A statistical approach to mechanized encoding and searching of literary information". IBM Journal of Research and Development, Vol. 1, N. 4, 307-319, 1957.
-
(1957)
IBM Journal of Research and Development
, vol.1
, Issue.4
, pp. 307-319
-
-
Luhn, H.P.
-
15
-
-
0037660997
-
-
An indexing model of HTML documents
-
A. Molinari, G. Pasi and R. A. Marques Pereira. "An indexing model of HTML documents". SAC '03: Proceedings of the 2003 ACM symposium on Applied computing, Melbourne, Florida, 834-840, 2003.
-
-
-
-
16
-
-
0013115137
-
Web Page Categorization and Feature Selection Using Association Rule and Principal Component Clustering
-
J. Moore, E. Han, D. Boley, M. Gini, R. Gross, K. Hastings, G. Karypis, V. Kumar and B. Mobasher. "Web Page Categorization and Feature Selection Using Association Rule and Principal Component Clustering". Workshop on Information Technologies and Systems, 1997.
-
(1997)
Workshop on Information Technologies and Systems
-
-
Moore, J.
Han, E.
Boley, D.
Gini, M.
Gross, R.
Hastings, K.
Karypis, G.
Kumar, V.
Mobasher, B.
-
18
-
-
0013362290
-
-
Reprinted in Sparck Jones, Karen, and Peter Willett, Readings in Information Retrieval, San Francisco: Morgan Kaufmann, 1997
-
M.F. Porter. "An algorithm for suffix stripping". Reprinted in Sparck Jones, Karen, and Peter Willett, Readings in Information Retrieval, San Francisco: Morgan Kaufmann, 1997.
-
An algorithm for suffix stripping
-
-
Porter, M.F.
-
19
-
-
51849158288
-
-
A Fuzzy System for the Web Page Representation
-
A. Ribeiro, V. Fresno, M. García-Alegre and D. Guinea. "A Fuzzy System for the Web Page Representation". Intelligent Exploration of the Web, Springer-Verlag Group, 19-38, 2002.
-
-
-
-
20
-
-
84953588425
-
On the specification of term values in automatic indexing
-
G. Salton, C. S. Yang. "On the specification of term values in automatic indexing". Journal of Documentation, Vol. 29, N. 4, 351-372, 1973.
-
(1973)
Journal of Documentation
, vol.29
, Issue.4
, pp. 351-372
-
-
Salton, G.
Yang, C.S.
-
23
-
-
0002442796
-
Machine Learning in Automated Text Categorization
-
F. Sebastiani. "Machine Learning in Automated Text Categorization". ACM Computing Surveys, Vol. 34, N. 1, 1-47, 2002.
-
(2002)
ACM Computing Surveys
, vol.34
, Issue.1
, pp. 1-47
-
-
Sebastiani, F.
-
24
-
-
0242602366
-
A Large Benchmark Dataset for Web Document Clustering
-
M. P. Sinka and D. W. Corne. "A Large Benchmark Dataset for Web Document Clustering". Soft Computing Systems: Design, Management and Applications, Frontiers in Artificial Intelligence and Applications, Vol. 87, 881-890, 2002.
-
(2002)
Frontiers in Artificial Intelligence and Applications
, vol.87
, pp. 881-890
-
-
Sinka, M.P.
Corne, D.W.
-
26
-
-
26844550931
-
Text categorization based on Weighted Inverse Document Frequency
-
Technical Report 94 TR0001, Department of Computer Science, Tokyo Institute of Technology
-
T. Tokunaga and M. Iwayama. "Text categorization based on Weighted Inverse Document Frequency". Technical Report 94 TR0001, Department of Computer Science, Tokyo Institute of Technology, 1994.
-
(1994)
-
-
Tokunaga, T.
Iwayama, M.