SCOPUS 정보 검색 플랫폼

2006 1st International Conference on Digital Information Management, ICDIM

Volumn , Issue , 2006, Pages 511-518

Improving web page clustering through selecting appropiate term weighting functions

(3) Fresno, Víctor a Martínez, Raquel b Montalvo, Soto a

a UNIVERSIDAD REY JUAN CARLOS (Spain)

b UNED

Author keywords

[No Author keywords available]

Indexed keywords

INFORMATION EXTRACTIONS; REDUCTION METHODS; SIMILARITY SEARCHES; TERM WEIGHTING; WEB DOCUMENTS; WEB INFORMATION EXTRACTIONS; WEB MININGS; WEB PAGE CLUSTERING; WEB PAGES;

FUNCTION EVALUATION; INFORMATION ANALYSIS; INFORMATION MANAGEMENT; SEARCH ENGINES; TAXONOMIES; WEBSITES;

CLUSTERING ALGORITHMS;

EID: 51849128804 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICDIM.2007.369244 Document Type: Conference Paper

Times cited : (3)

References (30)

1
- 84871059997
- Evaluation of Web Page Representations by Content through Clustering. String Processing and Information Retrieval
- A. Casillas, V. Fresno, M. González de Lena and R. Martínez. "Evaluation of Web Page Representations by Content through Clustering". String Processing and Information Retrieval. LNCS series of Springer-Verlag, 129-130, 2004.
- (2004) LNCS series of Springer-Verlag , vol.129-130
- Casillas, A.¹ Fresno, V.² González de Lena, M.³ Martínez, R.⁴

2
- 0034791059
- Enhanced topic distillation using text, markup tags, and hyperlinks
- New Orleans, Louisiana, United States
- S. Chakrabarti, M. Joshi and V. Tawde. "Enhanced topic distillation using text, markup tags, and hyperlinks". SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, New Orleans, Louisiana, United States, 208-216, 2001.
- (2001) SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval , pp. 208-216
- Chakrabarti, S.¹ Joshi, M.² Tawde, V.³

3
- 3543096195
- An Analytical Approach to Concept Extraction in HTML Environments
- Kluwer Academic Publishers
- V. Fresno and A. Ribeiro. "An Analytical Approach to Concept Extraction in HTML Environments". Journal of Intelligent Information Systems - JIIS. Kluwer Academic Publishers, 215-235, 2004.
- (2004) Journal of Intelligent Information Systems - JIIS , pp. 215-235
- Fresno, V.¹ Ribeiro, A.²

4
- 51849109429
- Thesis Faculty of Computer Science Dalhousie University, Canada
- J. Guo. Integrating Automatic Document Clustering into Web Log Association Rule Mining. Thesis Faculty of Computer Science Dalhousie University, Canada, 2004.
- (2004) Integrating Automatic Document Clustering into Web Log Association Rule Mining
- Guo, J.¹

5
- 0345566259
- THESUS: Organizing Web Document Collections Based on Link Semantics
- M. Halkidi, B. Nguyen, I. Varlamis and M. Vazirgiannis. "THESUS: Organizing Web Document Collections Based on Link Semantics". In VLDB Journal, special issue on Semantic Web, 2003.
- (2003) VLDB Journal , Issue.SPEC. ISSUE ON SEMANTIC WEB
- Halkidi, M.¹ Nguyen, B.² Varlamis, I.³ Vazirgiannis, M.⁴

6
- 0031710353
- WebACE: A Web Agent for Document Categorization and Exploration
- 98
- E. Han, D. Boley, M. Gini, R. Gross, K. Hastings, G. Karypis, V. Kumar, B. Mobasher and J. Moore. "WebACE: A Web Agent for Document Categorization and Exploration". Proceedings of the 2nd International Conference on Autonomous Agents (Agents'98), 1998.
- (1998) Proceedings of the 2nd International Conference on Autonomous Agents (Agents
- Han, E.¹ Boley, D.² Gini, M.³ Gross, R.⁴ Hastings, K.⁵ Karypis, G.⁶ Kumar, V.⁷ Mobasher, B.⁸ Moore, J.⁹

7
- 0030656850
- Images retrieval by hypertext links
- V. Harmadas, M. Sanderson and M. D. Dunlop. "Images retrieval by hypertext links". Proceeding of SIGIR-97, 20th ACM International Conference on Research and Development in Information Retrieval, 296-303, 1997.
- (1997) Proceeding of SIGIR-97, 20th ACM International Conference on Research and Development in Information Retrieval , pp. 296-303
- Harmadas, V.¹ Sanderson, M.² Dunlop, M.D.³

8
- 51849084769
- F. Iavernaro. "Web Usage Mining Using Artificial Ant Colony Clustering and Genetic Programming". http://www.knowledgeboard.com/cgibin/ item.cgi?id=129238&d=pnd, 2004.
- (2004) Web Usage Mining Using Artificial Ant Colony Clustering and Genetic Programming
- Iavernaro, F.¹

9
- 84953744816
- A statistical interpretation of term specificity and its application in retrieval
- S. Jones. "A statistical interpretation of term specificity and its application in retrieval". Journal of Documentation, Vol. 28, N. 1, 11-21, 1972.
- (1972) Journal of Documentation , vol.28 , Issue.1 , pp. 11-21
- Jones, S.¹

10
- 0038163983
- CLUTO: "A Clustering Toolkit
- Technical Report: 02-017. University of Minnesota, Department of Computer Science, Minneapolis, MN 55455
- G. Karypis. CLUTO: "A Clustering Toolkit". Technical Report: 02-017. University of Minnesota, Department of Computer Science, Minneapolis, MN 55455.
- Karypis, G.¹

11
- 51849095425
- A. Leuski and J., Allan. Improving interactive retrieval by combining ranked lists and clustering. Proceedings of RIAO2000, 665-681, 2000.
- A. Leuski and J., Allan. "Improving interactive retrieval by combining ranked lists and clustering". Proceedings of RIAO2000, 665-681, 2000.

12
- 0000159640
- A statistical approach to mechanized encoding and searching of literaty information
- H. P. Luhn. "A statistical approach to mechanized encoding and searching of literaty information". IBM Journal of Research and Development, Vol. 1, N. 4, 307-319, 1957.
- (1957) IBM Journal of Research and Development , vol.1 , Issue.4 , pp. 307-319
- Luhn, H.P.¹

13
- 51849106473
- Text data mining
- R. Dale, H. Moisl and H. Sommer Eds, New York: Marcel Dekker
- D. Merkl. "Text data mining". A handbook of Natural Languages Processing Techniques and Applications for the Processing of Languages as Text. R. Dale, H. Moisl and H. Sommer (Eds). New York: Marcel Dekker, 1998.
- (1998) A handbook of Natural Languages Processing Techniques and Applications for the Processing of Languages as Text
- Merkl, D.¹

14
- 0030377628
- A Fuzzy representation of HTML documents for Information Retrieval Systems
- New Orleans
- A. Molinari and G. Passi. "A Fuzzy representation of HTML documents for Information Retrieval Systems". Proceedings of the IEEE International Conference on Fuzzy Systems, New Orleans. Vol. 1, 107-112, 1996.
- (1996) Proceedings of the IEEE International Conference on Fuzzy Systems , vol.1 , pp. 107-112
- Molinari, A.¹ Passi, G.²

15
- 0037660997
- A. Molinari, G. Passi and R. A. Marques Pereira. An indexing model of HTML documents. SAC '03: Proceedings of the 2003 ACM symposium on Applied computing, Melbourne, Florida, 834-840, 2003.
- A. Molinari, G. Passi and R. A. Marques Pereira. "An indexing model of HTML documents". SAC '03: Proceedings of the 2003 ACM symposium on Applied computing, Melbourne, Florida, 834-840, 2003.

16
- 0013115137
- Web Page Categorization and Feature Selection Using Association Rule and Principal Component Clustering
- J. Moore, E. Han, D. Boley, M. Gini, R. Gross, K. Hastings, G. Karypis, V. Kumar and B. Mobasher. "Web Page Categorization and Feature Selection Using Association Rule and Principal Component Clustering". Workshop on Information Technologies and Systems, 1997.
- (1997) Workshop on Information Technologies and Systems
- Moore, J.¹ Han, E.² Boley, D.³ Gini, M.⁴ Gross, R.⁵ Hastings, K.⁶ Karypis, G.⁷ Kumar, V.⁸ Mobasher, B.⁹

17
- 0003365044
- On the Automated Classification of Web Sites
- J. M. Pierre. "On the Automated Classification of Web Sites". Linking Electronic Articles in Computer and Information Science. Vol. 6, 2001.
- (2001) Linking Electronic Articles in Computer and Information Science , vol.6
- Pierre, J.M.¹

18
- 0013362290
- Reprinted in Sparck Jones, Karen, and Peter Willet, Readings in Information Retrieval, San Francisco: Morgan Kaufmann, 1997
- M.F. Porter. "An algorithm for suffix stripping". Reprinted in Sparck Jones, Karen, and Peter Willet, Readings in Information Retrieval, San Francisco: Morgan Kaufmann, 1997.
- An algorithm for suffix stripping
- Porter, M.F.¹

19
- 51849158288
- A. Ribeiro, V. Fresno, M. García-Alegre and D. Guinea. A Fuzzy System for the Web Page Representation. Intelligent Exploration of the Web, Springer-Verlag Group, 19-38, 2002.
- A. Ribeiro, V. Fresno, M. García-Alegre and D. Guinea. "A Fuzzy System for the Web Page Representation". Intelligent Exploration of the Web, Springer-Verlag Group, 19-38, 2002.

20
- 84953588425
- On the specification of term values in automatic indexing
- G. Salton, C. S. Yang. "On the specification of term values in automatic indexing". Journal of Documentation, Vol. 29, N. 4, 351-372, 1973.
- (1973) Journal of Documentation , vol.29 , Issue.4 , pp. 351-372
- Salton, G.¹ Yang, C.S.²

21
- 51849090100
- McGraw Hill, New York
- G. Salton and M. McGill. Introduction to Modern information Retrieval. McGraw Hill, New York, 1983. Vol. 30, 365-373, 1974.
- (1974) Introduction to Modern information Retrieval , vol.30 , pp. 365-373
- Salton, G.¹ McGill, M.²

22
- 0003882234
- Addison-Wesley
- G. Salton. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley, 1988.
- (1988) Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer
- Salton, G.¹

23
- 0002442796
- Machine Learning in Automated Text Categorization
- F. Sebastiani. "Machine Learning in Automated Text Categorization". ACM Computing Surveys, Vol. 34, N. 1, 1-47, 2002.
- (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 1-47
- Sebastiani, F.¹

24
- 0242602366
- A Large Benchmark Dataset for Web Document Clustering. Soft Computing Systems: Design, Management and Applications
- M. P. Sinka and D. W. Corne. "A Large Benchmark Dataset for Web Document Clustering". Soft Computing Systems: Design, Management and Applications, Frontiers in Artificial Intelligence and Applications, Vol. 87, 881-890, 2002.
- (2002) Frontiers in Artificial Intelligence and Applications , vol.87 , pp. 881-890
- Sinka, M.P.¹ Corne, D.W.²

25
- 84955768333
- Foundations of evaluation
- C. J. van Rijsbergen. "Foundations of evaluation". Journal of Documentation, Vol. 30, 365-373, 1974.
- (1974) Journal of Documentation , vol.30 , pp. 365-373
- van Rijsbergen, C.J.¹

26
- 26844550931
- Text categorization based on Weighted Inverse Document Frequency
- Technical Report 94 TR0001, Department of Computer Science, Tokyo Institute of Technology
- T. Tokunaga and M. Iwayama. "Text categorization based on Weighted Inverse Document Frequency". Technical Report 94 TR0001, Department of Computer Science, Tokyo Institute of Technology, 1994.
- (1994)
- Tokunaga, T.¹ Iwayama, M.²

27
- 0036498398
- A Study of Approaches to Hypertext Categorization
- Kluwer Academic Publishers
- Y. Yang, S. Slattery and R. Ghani. "A Study of Approaches to Hypertext Categorization". Journal of Intelligent Information Systems - JIIS. Kluwer Academic Publishers, Vol 18, 1-25, 2002.
- (2002) Journal of Intelligent Information Systems - JIIS , vol.18 , pp. 1-25
- Yang, Y.¹ Slattery, S.² Ghani, R.³

28
- 51849103729
- L. Yi, B. Liu. "Web Page Cleaning for Web Mining through Feature Weighting". www.cs.uic.edu/liub/publications/ijcai03-webClean.pdf, 2003.
- (2003) Web Page Cleaning for Web Mining through Feature Weighting
- Yi, L.¹ Liu, B.²

29
- 0003227299
- Grouper: A dynamic clustering interface to web search results
- O. Zamir and O. Etzioni. "Grouper: A dynamic clustering interface to web search results". Proceedings of the WWW8 Conference, 1999.
- (1999) Proceedings of the WWW8 Conference
- Zamir, O.¹ Etzioni, O.²

30
- 84983095992
- Bidirectional Hierarchical Clustering for Web Mining
- Y. Zhongmei and B. Choi. "Bidirectional Hierarchical Clustering for Web Mining". IEEE/WIC International Conference on Web Intelligence (WT03), 2003.
- (2003) IEEE/WIC International Conference on Web Intelligence (WT03)
- Zhongmei, Y.¹ Choi, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.