-
1
-
-
63349100334
-
-
20 Newsgroups DataSet (accessed 22 December 2004)
-
20 Newsgroups DataSet (1998), The 4 Universities Data Set, available at: www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-20/www/data/news20.html (accessed 22 December 2004).
-
(1998)
The 4 Universities Data Set
-
-
-
2
-
-
33646340641
-
-
Dewey Services, available at: www.oclc.org/dewey/about/research/ (accessed 8 August 2005)
-
DDC (2005), "About DDC: research: a vital part of ongoing development", Dewey Services, available at: www.oclc.org/dewey/about/ research/ (accessed 8 August 2005).
-
(2005)
About DDC: Research: A Vital Part of Ongoing Development
-
-
-
3
-
-
33646351243
-
Improving resource discovery and retrieval on the internet: The Nordic WAIS/world wide web project summary report
-
Ardö, A. et al., (1994), "Improving resource discovery and retrieval on the internet: the Nordic WAIS/world wide web project summary report", NORDINFO Nytt, Vol. 17 No. 4, pp. 13-28.
-
(1994)
NORDINFO Nytt
, vol.17
, Issue.4
, pp. 13-28
-
-
Ardö, A.1
-
4
-
-
0012346217
-
Automatic web page categorization by link and context analysis
-
Hutchison, C. Lanzarone, G.
-
Attardi, G., Gullì, A. and Sebastiani, F. (1999), "Automatic web page categorization by link and context analysis", in Hutchison, C. and Lanzarone, G. (Eds), Proceedings of THAI-99, European Symposium on Telematics, Hypermedia and Artificial Intelligence, pp. 105-19.
-
(1999)
Proceedings of THAI-99, European Symposium on Telematics, Hypermedia and Artificial Intelligence
, pp. 105-19
-
-
Attardi, G.1
Gullì, A.2
Sebastiani, F.3
-
5
-
-
77951430107
-
Distributional word clusters vs words for text categorization
-
Bekkerman, R. et al., (2003), "Distributional word clusters vs words for text categorization", Journal of Machine Learning Research, Vol. 3, pp. 1183-208.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 1183-208
-
-
Bekkerman, R.1
-
6
-
-
33646367618
-
-
HLTCentral, available at: www.hltcentral.org/projects/print.php?acronym= BINDEX (accessed 22 December 2004)
-
BINDEX (2001), "HLT Project Factsheet: BINDEX", HLTCentral, available at: www.hltcentral.org/projects/print.php?acronym=BINDEX (accessed 22 December 2004).
-
(2001)
HLT Project Factsheet: BINDEX
-
-
-
7
-
-
0031620208
-
Combining labeled and unlabeled data with co-training
-
Morgan Kaufmann Publishers San Mateo, CA
-
Blum, A. and Mitchell, T. (1998), "Combining labeled and unlabeled data with co-training", COLT: Proceedings of the Workshop on Computational Learning Theory, Morgan Kaufmann Publishers, San Mateo, CA.
-
(1998)
COLT: Proceedings of the Workshop on Computational Learning Theory
-
-
Blum, A.1
Mitchell, T.2
-
8
-
-
1542377542
-
Text categorization by boosting automatically extracted concepts
-
Callan, J.
-
Cai, L. and Hofmann, T. (2003), "Text categorization by boosting automatically extracted concepts", in Callan, J. et al. (Eds), Proceedings of SIGIR-03, 26th ACM International Conference on Research and Development in Information Retrieval, pp. 182-9.
-
(2003)
Proceedings of SIGIR-03, 26th ACM International Conference on Research and Development in Information Retrieval
, pp. 182-9
-
-
Cai, L.1
Hofmann, T.2
-
9
-
-
33646357863
-
CERES thesaurus effort
-
available at: http://ceres.ca.gov/thesaurus/ (accessed 22 December 2004)
-
CERES (2003), "CERES thesaurus effort", CERES The California Environmental Resources Evaluation System, available at: http://ceres.ca.gov/ thesaurus/ (accessed 22 December 2004).
-
(2003)
CERES The California Environmental Resources Evaluation System
-
-
-
10
-
-
20444410138
-
Automatic resource compilation by analyzing hyperlink structure and associated text
-
Chakrabarti, S. et al. (1998a), "Automatic resource compilation by analyzing hyperlink structure and associated text", Proceedings of the Seventh International Conference on World Wide Web 7, Brisbane, Australia, pp. 65-74.
-
(1998)
Proceedings of the Seventh International Conference on World Wide Web 7, Brisbane, Australia
, pp. 65-74
-
-
Chakrabarti, S.1
-
11
-
-
0000776545
-
Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies
-
Chakrabarti, S., Dom, B. and Indyk, P. (1998b), "Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies", Journal of Very Large Data Bases, Vol. 7 No. 3, pp. 163-78.
-
(1998)
Journal of Very Large Data Bases
, vol.7
, Issue.3
, pp. 163-78
-
-
Chakrabarti, S.1
Dom, B.2
Indyk, P.3
-
13
-
-
0033726424
-
Bringing order to the web: Automatically categorizing search results
-
Chen, H. and Dumais, S.T. (2000), "Bringing order to the web: automatically categorizing search results", Proceedings of CHI-00, ACM International Conference on Human Factors in Computing Systems, Den Haag, pp. 145-52.
-
(2000)
Proceedings of CHI-00, ACM International Conference on Human Factors in Computing Systems, Den Haag
, pp. 145-52
-
-
Chen, H.1
Dumais, S.T.2
-
14
-
-
0038494833
-
Categorizing information objects from user access patterns
-
Chen, M., LaPaugh, A. and Singh, J.P. (2002), "Categorizing information objects from user access patterns", Proceedings of the Eleventh International Conference on Information and Knowledge Management, 4-9 November, pp. 365-72.
-
(2002)
Proceedings of the Eleventh International Conference on Information and Knowledge Management, 4-9 November
, pp. 365-72
-
-
Chen, M.1
Lapaugh, A.2
Singh, J.P.3
-
15
-
-
33646351005
-
-
Vivsimo, available at: www.clusty.com (accessed 22 December 2004)
-
Clusty (2004), "Clusty the clustering engine", Vivsimo, available at: www.clusty.com (accessed 22 December 2004).
-
(2004)
Clusty the Clustering Engine
-
-
-
18
-
-
33646361534
-
-
Lunds Universitets Bibliotek, available at: www.lub.lu.se/desire (accessed 22 December 2004)
-
DESIRE Project (1999), Lunds Universitets Bibliotek, available at: www.lub.lu.se/desire (accessed 22 December 2004).
-
(1999)
-
-
Project, D.1
-
19
-
-
33646339363
-
Improving domain ontologies by mining semantics from text
-
Dittenbach, M., Berger, H. and Merkl, D. (2004), "Improving domain ontologies by mining semantics from text", Proceedings of the first Asian-Pacific Conference on Conceptual Modeling, Dunedin, New Zealand, Vol. 31, pp. 91-100.
-
(2004)
Proceedings of the First Asian-Pacific Conference on Conceptual Modeling, Dunedin, New Zealand
, vol.31
, pp. 91-100
-
-
Dittenbach, M.1
Berger, H.2
Merkl, D.3
-
20
-
-
0033656184
-
Hierarchical classification of web content
-
Dumais, S.T. and Chen, H. (2000), "Hierarchical classification of web content", Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 24-28 July, Athens, Greece, pp. 256-63.
-
(2000)
Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 24-28 July, Athens, Greece
, pp. 256-63
-
-
Dumais, S.T.1
Chen, H.2
-
21
-
-
33646381839
-
Report on the workshop on operational text classification systems (OTC-02)
-
Dumais, S.T., Lewis, D.D. and Sebastiani, F. (2002), "Report on the workshop on operational text classification systems (OTC-02)", ACM SIGIR Forum, Vol. 35 No. 2, pp. 8-11.
-
(2002)
ACM SIGIR Forum
, vol.35
, Issue.2
, pp. 8-11
-
-
Dumais, S.T.1
Lewis, D.D.2
Sebastiani, F.3
-
22
-
-
33645980702
-
-
EELS, Engineering E-Library, Sweden, available at: http://eels.lub.lu.se/ ae/ (accessed 22 December 2004)
-
EELS (2003), "'All' Engineering resources on the internet: a companion service to EELS", EELS, Engineering E-Library, Sweden, available at: http://eels.lub.lu.se/ae/ (accessed 22 December 2004).
-
(2003)
'All' Engineering Resources on the Internet: A Companion Service to EELS
-
-
-
23
-
-
33646344924
-
-
Lund University Libraries, available at: http://engine-e.lub.lu.se/ (accessed 22 December)
-
Engine-e (2004), Lund University Libraries, available at: http://engine-e.lub.lu.se/ (accessed 22 December).
-
(2004)
-
-
-
24
-
-
33646366363
-
-
Lund University Libraries, available at: http://eels.lub.lu.se/ (accessed 22 December 2004)
-
Engineering Electronic Library (2003), Lund University Libraries, available at: http://eels.lub.lu.se/ (accessed 22 December 2004).
-
(2003)
-
-
-
25
-
-
33646350475
-
-
OCLC projects, available at: www.oclc.org/research/projects/fastac/ (accessed 7 August 2005)
-
FAST (2003), "FAST as a knowledge base for automated classification", OCLC projects, available at: www.oclc.org/research/ projects/fastac/ (accessed 7 August 2005).
-
(2003)
FAST as a Knowledge Base for Automated Classification
-
-
-
26
-
-
33646384111
-
-
OCLC projects, available at: www.oclc.org/research/projects/fast/ (accessed 22 December 2004)
-
FAST (2004), "FAST: faceted application of subject terminology", OCLC projects, available at: www.oclc.org/research/projects/ fast/ (accessed 22 December 2004).
-
(2004)
FAST: Faceted Application of Subject Terminology
-
-
-
27
-
-
0004140078
-
-
University of Washington, available at: http://citeseer.nj.nec.com/ fasulo99analysi.html (accessed 22 December 2004)
-
Fasulo, D. (1999), "An analysis of recent work on clustering algorithms: technical report", University of Washington, available at: http://citeseer.nj.nec.com/fasulo99analysi.html (accessed 22 December 2004).
-
(1999)
An Analysis of Recent Work on Clustering Algorithms: Technical Report
-
-
Fasulo, D.1
-
28
-
-
35248878741
-
When are links useful? Experiments in text classification
-
Fisher, M. and Everson, R. (2003), "When are links useful? Experiments in text classification", Proceedings of ECIR-03, 25th European Conference on Information Retrieval, Pisa, IT, pp. 41-56.
-
(2003)
Proceedings of ECIR-03, 25th European Conference on Information Retrieval, Pisa, IT
, pp. 41-56
-
-
Fisher, M.1
Everson, R.2
-
29
-
-
0842300463
-
Predicting library of congress classifications from library of congress subject headings
-
Frank, E. and Paynter, G.W. (2004), "Predicting library of congress classifications from library of congress subject headings", Journal of the American Society for Information Science and Technology, Vol. 55 No. 3, pp. 214-27.
-
(2004)
Journal of the American Society for Information Science and Technology
, vol.55
, Issue.3
, pp. 214-27
-
-
Frank, E.1
Paynter, G.W.2
-
31
-
-
0036895475
-
Hyperlink ensembles: A case study in hypertext classification
-
Fürnkranz, J. (2002), "Hyperlink ensembles: a case study in hypertext classification", Information Fusion, Vol. 3 No. 4, pp. 299-312.
-
(2002)
Information Fusion
, vol.3
, Issue.4
, pp. 299-312
-
-
Fürnkranz, J.1
-
32
-
-
33645019702
-
A system for automatic classification of scientific literature
-
(Reprinted in: Essays of an Information Scientist, Vol. 2, pp. 356-65)
-
Garfield, E., Malin, M.V. and Small, H. (1975), "A system for automatic classification of scientific literature", Journal of the Indian Institute of Science, Vol. 57 No. 2, pp. 61-74, (Reprinted in: Essays of an Information Scientist, Vol. 2, pp. 356-65).
-
(1975)
Journal of the Indian Institute of Science
, vol.57
, Issue.2
, pp. 61-74
-
-
Garfield, E.1
Malin, M.V.2
Small, H.3
-
33
-
-
33646362995
-
-
GERHARD, available at: www.gerhard.de/ (accessed 22 December 2004)
-
GERHARD (1998), "GERHARD: German harvest automated retrieval and directory", GERHARD, available at: www.gerhard.de/ (accessed 22 December 2004).
-
(1998)
GERHARD: German Harvest Automated Retrieval and Directory
-
-
-
34
-
-
33646340116
-
-
GERHARD, available at: www.gerhard.de/info/dokumente/vortraege/ecdl99/ html/index.htm (accessed 22 December 2004)
-
GERHARD (1999), "GERHARD - navigating the web with the universal decimal classification system", GERHARD, available at: www.gerhard.de/info/ dokumente/vortraege/ecdl99/html/index.htm (accessed 22 December 2004).
-
(1999)
GERHARD - Navigating the Web with the Universal Decimal Classification System
-
-
-
35
-
-
0003194541
-
Hypertext categorization using hyperlink patterns and metadata
-
Ghani, R., Slattery, S. and Yang, Y. (2001), "Hypertext categorization using hyperlink patterns and metadata", Proceedings of ICML-01, 18th International Conference on Machine Learning, pp. 178-85.
-
(2001)
Proceedings of ICML-01, 18th International Conference on Machine Learning
, pp. 178-85
-
-
Ghani, R.1
Slattery, S.2
Yang, Y.3
-
36
-
-
77953065956
-
Using web structure for classifying and describing web pages
-
Glover, E.J. et al. (2002), "Using web structure for classifying and describing web pages", Proceedings of the Eleventh International Conference on World Wide Web Honolulu, Hawaii, USA, pp. 562-9.
-
(2002)
Proceedings of the Eleventh International Conference on World Wide Web Honolulu, Hawaii, USA
, pp. 562-9
-
-
Glover, E.J.1
-
37
-
-
0037480901
-
Inferring hierarchical descriptions
-
Glover, E.J. et al. (2003), "Inferring hierarchical descriptions", Proceedings of the Eleventh International Conference on Information and Knowledge Management, CIKM 2002, November 4-9, pp. 507-14.
-
(2003)
Proceedings of the Eleventh International Conference on Information and Knowledge Management, CIKM 2002, November 4-9
, pp. 507-14
-
-
Glover, E.J.1
-
38
-
-
33646358183
-
-
OCLC Digital Archive, available at: http://digitalarchive.oclc.org/da/ ViewObject.jsp?fileid=0000003487: 000000090408&reqid=33836 (accessed 22 December 2004)
-
Godby, J. and Reighart, R. (1998), "The WordSmith indexing system", OCLC Digital Archive, available at: http://digitalarchive.oclc. org/da/ViewObject.jsp?fileid=0000003487: 000000090408&reqid=33836 (accessed 22 December 2004).
-
(1998)
The WordSmith Indexing System
-
-
Godby, J.1
Reighart, R.2
-
39
-
-
33646343161
-
Different approaches to automated classification: Is there an exchange of ideas?
-
Ingwersen, P. Larsen, B. Karolinska University Press Stockholm
-
Golub, K. and Larsen, B. (2005), "Different approaches to automated classification: is there an exchange of ideas?", in Ingwersen, P. and Larsen, B. (Eds), Proceedings of ISSI 2005 - the 10th International Conference of the International Society for Scientometrics and Informetrics, Stockholm, Sweden, 24-28 July, Vol. 1, Karolinska University Press, Stockholm, pp. 270-4.
-
(2005)
Proceedings of ISSI 2005 - The 10th International Conference of the International Society for Scientometrics and Informetrics, Stockholm, Sweden, 24-28 July
, vol.1
, pp. 270-4
-
-
Golub, K.1
Larsen, B.2
-
40
-
-
22944445334
-
Supervised learning for automatic classification of documents using self-organizing maps
-
Goren-Bar, D. et al. (2000), "Supervised learning for automatic classification of documents using self-organizing maps", Proceedings of the First DELOS Network of Excellence Workshop on Information Seeking, Searching and Querying in Digital Libraries, Zrich, Switzerland, Vol. 11-12, p. 2000.
-
(2000)
Proceedings of the First DELOS Network of Excellence Workshop on Information Seeking, Searching and Querying in Digital Libraries, Zrich, Switzerland
, vol.1112
, pp. 2000
-
-
Goren-Bar, D.1
-
41
-
-
0033279309
-
A probabilistic description-oriented approach for categorising web documents
-
Gövert, N., Lalmas, M. and Fuhr, N. (1999), "A probabilistic description-oriented approach for categorising web documents", Proceedings of the Eighth International Conference on Information and Knowledge Management, pp. 475-82.
-
(1999)
Proceedings of the Eighth International Conference on Information and Knowledge Management
, pp. 475-82
-
-
Gövert, N.1
Lalmas, M.2
Fuhr, N.3
-
42
-
-
33646381311
-
Introduction
-
Hubert, L. De Soete, G. World Scientific Singapore
-
Hartigan, J.A. (1996), "Introduction", in Hubert, L. and De Soete, G. (Eds), Clustering and Classification Arabie, World Scientific, Singapore.
-
(1996)
Clustering and Classification Arabie
-
-
Hartigan, J.A.1
-
43
-
-
0033661291
-
An investigation of linguistic features and clustering algorithms for topical document clustering
-
Hatzivassiloglou, V., Gravano, L. and Maganti, A. (2000), "An investigation of linguistic features and clustering algorithms for topical document clustering", Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Athens, Greece, pp. 224-31.
-
(2000)
Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Athens, Greece
, pp. 224-31
-
-
Hatzivassiloglou, V.1
Gravano, L.2
Maganti, A.3
-
44
-
-
0003067623
-
Scalable techniques for clustering the web
-
Haveliwala, T.H., Gionis, A. and Indyk, P. (2000), "Scalable techniques for clustering the web", Third International Workshop on the Web and Databases, May, pp. 129-34.
-
(2000)
Third International Workshop on the Web and Databases, May
, pp. 129-34
-
-
Haveliwala, T.H.1
Gionis, A.2
Indyk, P.3
-
46
-
-
33646381066
-
HTML documents classification using (non-linear) principal component analysis and self-organizing maps
-
Heuser, U., Babanine, A. and Rosenstiel, W. (1998), "HTML documents classification using (non-linear) principal component analysis and self-organizing maps", Proceedings of the Fourth International Conference on Neural Networks and their Applications (Neurap'98), 11-13 March 1998, Marseilles, France, pp. 291-5.
-
(1998)
Proceedings of the Fourth International Conference on Neural Networks and Their Applications (Neurap'98), 11-13 March 1998, Marseilles, France
, pp. 291-5
-
-
Heuser, U.1
Babanine, A.2
Rosenstiel, W.3
-
47
-
-
84860961515
-
-
accessed 22 December 2004
-
INitiative for the Evaluation of XML Retrieval (2004), DELOS Network of Excellence for Digital Libraries, available at: http://inex.is.informatik.uni- duisburg.de/ (accessed 22 December 2004).
-
(2004)
DELOS Network of Excellence for Digital Libraries
-
-
-
48
-
-
84893405732
-
Data clustering: A review
-
Jain, A.K., Murty, M.N. and Flynn, P.J. (1999), "Data clustering: a review", ACM Computing Surveys, Vol. 31 No. 3, pp. 264-323.
-
(1999)
ACM Computing Surveys
, vol.31
, Issue.3
, pp. 264-323
-
-
Jain, A.K.1
Murty, M.N.2
Flynn, P.J.3
-
49
-
-
0000645505
-
Automatic classification of web resources using Java and Dewey decimal classification
-
Jenkins, C. et al., (1998), "Automatic classification of web resources using Java and Dewey decimal classification", Computer Networks & ISDN Systems, Vol. 30, pp. 646-8.
-
(1998)
Computer Networks & ISDN Systems
, vol.30
, pp. 646-8
-
-
Jenkins, C.1
-
51
-
-
33646375915
-
Experiments with automatic classification of WAIS databases and indexing of WWW
-
Koch, T. (1994), "Experiments with automatic classification of WAIS databases and indexing of WWW", Internet World & Document Delivery World International 94, London, May, pp. 112-5.
-
(1994)
Internet World & Document Delivery World International 94, London, May
, pp. 112-5
-
-
Koch, T.1
-
52
-
-
33646345312
-
-
DESIRE II D3.6a, Overview of Results, available at: www.lub.lu.se/desire/ DESIRE36a-overview.html (accessed 22 December 2004)
-
Koch, T. and Ardö, A. (2000), "Automatic classification", DESIRE II D3.6a, Overview of Results, available at: www.lub.lu.se/desire/ DESIRE36a-overview.html (accessed 22 December 2004).
-
(2000)
Automatic Classification
-
-
Koch, T.1
Ardö, A.2
-
53
-
-
0003468869
-
-
EU Project DESIRE, Deliverable D3.2.3, available at: www.lub.lu.se/ desire/radar/reports/D3.2.3/ (accessed 22 December 2004)
-
Koch, T. and Day, M. (1997), "The role of classification schemes in internet resource description and discovery", EU Project DESIRE, Deliverable D3.2.3, available at: www.lub.lu.se/desire/radar/reports/D3.2.3/ (accessed 22 December 2004).
-
(1997)
The Role of Classification Schemes in Internet Resource Description and Discovery
-
-
Koch, T.1
Day, M.2
-
56
-
-
0002346866
-
Hierarchically classifying documents using very few words
-
Koller, D. and Sahami, M. (1997), "Hierarchically classifying documents using very few words", Proceedings of ICML-97, 14th International Conference on Machine Learning, pp. 170-8.
-
(1997)
Proceedings of ICML-97, 14th International Conference on Machine Learning
, pp. 170-8
-
-
Koller, D.1
Sahami, M.2
-
57
-
-
0033279035
-
Yahoo As an ontology: Using Yahoo Categories to describe documents
-
Labrou, Y. and Finin, T. (1999), "Yahoo As an ontology: using Yahoo Categories to describe documents", Proceedings of CIKM-99, 8th ACM International Conference on Information and Knowledge Management, pp. 180-7.
-
(1999)
Proceedings of CIKM-99, 8th ACM International Conference on Information and Knowledge Management
, pp. 180-7
-
-
Labrou, Y.1
Finin, T.2
-
58
-
-
84989528918
-
Experiments in automatic library of congress classification
-
Larson, R.R. (1992), "Experiments in automatic library of congress classification", Journal of the American Society for Information Science, Vol. 43 No. 2, pp. 130-48.
-
(1992)
Journal of the American Society for Information Science
, vol.43
, Issue.2
, pp. 130-48
-
-
Larson, R.R.1
-
59
-
-
26944485923
-
Classification of text documents
-
Li, Y.H. and Jain, A.K. (1998), "Classification of text documents", The Computer Journal, Vol. 41 No. 8, pp. 537-46.
-
(1998)
The Computer Journal
, vol.41
, Issue.8
, pp. 537-46
-
-
Li, Y.H.1
Jain, A.K.2
-
61
-
-
33646353892
-
Experiences of harvesting web resources in engineering using automatic classification
-
available at: www.ariadne.ac.uk/issue37/lindholm/
-
Lindholm, J., Schönthal, T. and Jansson, K. (2003), "Experiences of harvesting web resources in engineering using automatic classification", Ariadne, No. 37, available at: www.ariadne.ac.uk/issue37/ lindholm/.
-
(2003)
Ariadne
, Issue.37
-
-
Lindholm, J.1
Schönthal, T.2
Jansson, K.3
-
62
-
-
0036993117
-
Document clustering with cluster refinement and model selection capabilities
-
Liu, X. et al. (2002), "Document clustering with cluster refinement and model selection capabilities", Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland, pp. 191-8.
-
(2002)
Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland
, pp. 191-8
-
-
Liu, X.1
-
63
-
-
0002332781
-
Improving text classification by shrinkage in a hierarchy of classes
-
paper presented at
-
McCallum, A. et al. (1998), "Improving text classification by shrinkage in a hierarchy of classes", paper presented at ICML-98, 15th International Conference on Machine Learning, pp. 359-67.
-
(1998)
ICML-98, 15th International Conference on Machine Learning
, pp. 359-67
-
-
McCallum, A.1
-
64
-
-
0038548015
-
Building domain-specific search engines with machine learning techniques
-
paper presented at
-
McCallum, A. et al. (1999), "Building domain-specific search engines with machine learning techniques", paper presented at AAAI-99 Spring Symposium on Intelligent Agents in Cyberspace.
-
(1999)
AAAI-99 Spring Symposium on Intelligent Agents in Cyberspace
-
-
McCallum, A.1
-
65
-
-
0000806922
-
Automating the construction of internet portals with machine learning
-
McCallum, A. et al., (2000), "Automating the construction of internet portals with machine learning", Information Retrieval Journal, Vol. 3, pp. 127-63.
-
(2000)
Information Retrieval Journal
, vol.3
, pp. 127-63
-
-
McCallum, A.1
-
66
-
-
84880454471
-
A matrix density based algorithm to hierarchically co-cluster documents and words
-
Mandhani, B., Joshi, S. and Kummamuru, K. (2003), "A matrix density based algorithm to hierarchically co-cluster documents and words", Proceedings of the Twelfth International Conference on World Wide Web, Budapest, Hungary, pp. 511-8.
-
(2003)
Proceedings of the Twelfth International Conference on World Wide Web, Budapest, Hungary
, pp. 511-8
-
-
Mandhani, B.1
Joshi, S.2
Kummamuru, K.3
-
68
-
-
0032262815
-
The WebCluster project: Using clustering for mediating access to the world wide web
-
Merchkour, M., Harper, D.J. and Muresan, G. (1998), "The WebCluster project: using clustering for mediating access to the world wide web", Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, pp. 357-8.
-
(1998)
Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia
, pp. 357-8
-
-
Merchkour, M.1
Harper, D.J.2
Muresan, G.3
-
69
-
-
33646369112
-
-
available at: http://metacrawler.com (accessed 5 August 2005)
-
MetaCrawler Web Search (2005), available at: http://metacrawler.com (accessed 5 August 2005).
-
(2005)
-
-
-
72
-
-
0037375142
-
Feature selection on hierarchy of web documents
-
Mladenic, D. and Grobelnik, M. (2003), "Feature selection on hierarchy of web documents", Decision Support Systems, Vol. 35 No. 1, pp. 45-87.
-
(2003)
Decision Support Systems
, vol.35
, Issue.1
, pp. 45-87
-
-
Mladenic, D.1
Grobelnik, M.2
-
73
-
-
33646358447
-
Automatic classification of the WWW using the universal decimal classification
-
McKenna, B.
-
Möller, G. et al. (1999), "Automatic classification of the WWW using the universal decimal classification", in McKenna, B. (Ed.), Proceedings of the 23rd International Online Information Meeting, London, 7-9 December, pp. 231-8.
-
(1999)
Proceedings of the 23rd International Online Information Meeting, London, 7-9 December
, pp. 231-8
-
-
Möller, G.1
-
74
-
-
33646378111
-
-
Lund University Libraries accessed 22 December 2004
-
Nordic WAIS/World Wide Web Project (1995), Lund University Libraries, available at: www.lub.lu.se/W4/ (accessed 22 December 2004).
-
(1995)
-
-
-
75
-
-
33646365312
-
Bilingual indexing for information retrieval with AUTINDEX
-
Nübel, R. et al. (2002), "Bilingual indexing for information retrieval with AUTINDEX", LREC Proceedings, Las Palmas.
-
(2002)
LREC Proceedings, Las Palmas
-
-
Nübel, R.1
-
76
-
-
18944385580
-
-
2nd ed. Libraries Unlimited Englewood, CO
-
Olson, H.A. and Boll, J.J. (2001), Subject Analysis in Online Catalogs, 2nd ed., Libraries Unlimited, Englewood, CO.
-
(2001)
Subject Analysis in Online Catalogs
-
-
Olson, H.A.1
Boll, J.J.2
-
77
-
-
0035785721
-
Demonstration of hierarchical document clustering of digital library retrieval results
-
Palmer, C.R. et al. (2001), "Demonstration of hierarchical document clustering of digital library retrieval results", Proceedings of the 1st ACM/IEEE-CS Joint Conference on Digital Libraries, Roanoke, Virginia, p. 451.
-
(2001)
Proceedings of the 1st ACM/IEEE-CS Joint Conference on Digital Libraries, Roanoke, Virginia
, pp. 451
-
-
Palmer, C.R.1
-
79
-
-
0000699588
-
A spatial user interface to the astronomical literature
-
2 May
-
Poincot, P., Lesteven, P.S. and Murtagh, F. (1998), "A spatial user interface to the astronomical literature", Astronomy & Astrophysics, 2 May, pp. 183-91.
-
(1998)
Astronomy & Astrophysics
, pp. 183-91
-
-
Poincot, P.1
Lesteven, P.S.2
Murtagh, F.3
-
81
-
-
33646352675
-
Clustering algoritms
-
Frakes, W.B. Baeza-Yates, R. Prentice-Hall Engelwood Cliffs, NJ
-
Rasmussen, E. (1992), "Clustering algoritms", in Frakes, W.B. and Baeza-Yates, R. (Eds), Information Retrieval: Data Structures and Algorithms, Prentice-Hall, Engelwood Cliffs, NJ.
-
(1992)
Information Retrieval: Data Structures and Algorithms
-
-
Rasmussen, E.1
-
82
-
-
0001909076
-
SOMLib: A digital library system based on neural networks
-
Rauber, A. and Merkl, D. (1999), "SOMLib: a digital library system based on neural networks", Proceedings of the Fourth ACM Conference on Digital Libraries, Berkeley, California, United States, pp. 240-1.
-
(1999)
Proceedings of the Fourth ACM Conference on Digital Libraries, Berkeley, California, United States
, pp. 240-1
-
-
Rauber, A.1
Merkl, D.2
-
83
-
-
33646351004
-
-
Reuters-21578, available at: www.daviddlewis.com/resources/ testcollections/reuters21578/ (accessed 3 August 2005)
-
Reuters-21578 (2004), available at: www.daviddlewis.com/resources/ testcollections/reuters21578/ (accessed 3 August 2005).
-
(2004)
-
-
-
84
-
-
0001560952
-
Relevance feedback in information retrieval
-
Salton, G. Prentice-Hall Englewood Cliffs, NJ
-
Rocchio, J.J. (1971), "Relevance feedback in information retrieval", in Salton, G. (Ed.), The SMART Retrieval System: Experiments in Automatic Document Processing, Prentice-Hall, Englewood Cliffs, NJ, pp. 313-23.
-
(1971)
The SMART Retrieval System: Experiments in Automatic Document Processing
, pp. 313-23
-
-
Rocchio, J.J.1
-
86
-
-
0031635140
-
SONIA: A service for organizing networked information autonomously
-
paper presented at
-
Sahami, M., Yusufali, M. and Baldonado, M.Q. (1998), "SONIA: a service for organizing networked information autonomously", paper presented at 3rd ACM Conference on digital libraries, Pittsburgh, pp. 200-9.
-
(1998)
3rd ACM Conference on Digital Libraries Pittsburgh
, pp. 200-9
-
-
Sahami, M.1
Yusufali, M.2
Baldonado, M.Q.3
-
87
-
-
0000417994
-
Developments in automatic text retrieval
-
Salton, G. (1991), "Developments in automatic text retrieval", Science, Vol. 253, pp. 974-9.
-
(1991)
Science
, vol.253
, pp. 974-9
-
-
Salton, G.1
-
88
-
-
0029206376
-
A comparison of classifiers and document representations for the routing problem
-
Schütze, H., Hull, D.A. and Pedersen, J.O. (1995), "A comparison of classifiers and document representations for the routing problem", Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, pp. 229-37.
-
(1995)
Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle
, pp. 229-37
-
-
Schütze, H.1
Hull, D.A.2
Pedersen, J.O.3
-
90
-
-
33749351398
-
Automatic text representation, classification and labeling in European law
-
Schweighofer, E., Rauber, A. and Dittenbach, M. (2001), "Automatic text representation, classification and labeling in European law", ICAIL 2001, pp. 78-87.
-
(2001)
ICAIL 2001
, pp. 78-87
-
-
Schweighofer, E.1
Rauber, A.2
Dittenbach, M.3
-
91
-
-
33646376172
-
-
OCLC software, available at: www.oclc.org/research/software/scorpion/ default.htm (accessed 22 December)
-
Scorpion (2004), OCLC software, available at: www.oclc.org/research/ software/scorpion/default.htm (accessed 22 December).
-
(2004)
-
-
-
92
-
-
0002442796
-
Machine learning in automated text categorization
-
Sebastiani, F. (2002), "Machine learning in automated text categorization", ACM Computing Surveys, Vol. 34 No. 1, pp. 1-47.
-
(2002)
ACM Computing Surveys
, vol.34
, Issue.1
, pp. 1-47
-
-
Sebastiani, F.1
-
93
-
-
0003228830
-
Discovering test set regularities in relational domains
-
Slattery, S. and Craven, M. (2000), "Discovering test set regularities in relational domains", Proceedings of ICML-00, 17th International Conference on Machine Learning, pp. 895-902.
-
(2000)
Proceedings of ICML-00, 17th International Conference on Machine Learning
, pp. 895-902
-
-
Slattery, S.1
Craven, M.2
-
94
-
-
0036993190
-
Unsupervised document classification using sequential information maximization
-
Slonim, N., Friedman, N. and Tishby, N. (2003), "Unsupervised document classification using sequential information maximization", Proceedings of SIGIR'02, 25th ACM International Conference on Research and Development of Information Retireval, Tampere, Finland, 2002.
-
(2003)
Proceedings of SIGIR'02, 25th ACM International Conference on Research and Development of Information Retireval, Tampere, Finland, 2002
-
-
Slonim, N.1
Friedman, N.2
Tishby, N.3
-
95
-
-
2942755807
-
Reengineering thesauri for new applications: The AGROVOC example
-
Article No. 257, available at: http://jodi.ecs.soton.ac.uk/Articles/v04/ i04/Soergel/
-
Soergel, D. et al., (2004), "Reengineering thesauri for new applications: the AGROVOC example", Journal of Digital Information, Vol. 4 No. 4, Article No. 257, available at: http://jodi.ecs.soton.ac.uk/Articles/v04/ i04/Soergel/.
-
(2004)
Journal of Digital Information
, vol.4
, Issue.4
-
-
Soergel, D.1
-
96
-
-
2442439674
-
A comparison of document clustering techniques
-
Steinbach, M., Karypis, G. and Kumar, V. (2000), "A comparison of document clustering techniques", KDD Workshop on Text Mining, Boston, MA, 20-23 August.
-
(2000)
KDD Workshop on Text Mining, Boston, MA, 20-23 August
-
-
Steinbach, M.1
Karypis, G.2
Kumar, V.3
-
97
-
-
35048839996
-
Correlation-based document clustering using web logs
-
Su, Z. et al. (2001), "Correlation-based document clustering using web logs", Proceedings of the 34th Annual Hawaii International Conference on System Sciences (HICSS-34), 3-6 January, Vol. 5, p. 5022.
-
(2001)
Proceedings of the 34th Annual Hawaii International Conference on System Sciences (HICSS-34), 3-6 January
, vol.5
, pp. 5022
-
-
Su, Z.1
-
98
-
-
33746857882
-
-
OCLC Publications, available at: http://digitalarchive.oclc.org/da/ ViewObject.jsp?objid=0000003409 (accessed 22 December 2004)
-
Subramanian, S. and Shafer, K.E. (1998), "Clustering", OCLC Publications, available at: http://digitalarchive.oclc.org/da/ViewObject.jsp? objid=0000003409 (accessed 22 December 2004).
-
(1998)
Clustering
-
-
Subramanian, S.1
Shafer, K.E.2
-
99
-
-
1542310201
-
Hierarchical text classification and evaluation
-
Sun, A., Lim, E-P. and Ng, W-K. (2001), "Hierarchical text classification and evaluation", ICDM 2001, IEEE International Conference on Data Mining.
-
(2001)
ICDM 2001, IEEE International Conference on Data Mining
-
-
Sun, A.1
Lim, E.-P.2
Ng, W.-K.3
-
101
-
-
33646368097
-
-
available at: http://search.thunderstone.com/texis/websearch (accessed 4 August 2005)
-
Thunderstone (2005), Thunderstone's Web Site Catalog, available at: http://search.thunderstone.com/texis/websearch (accessed 4 August 2005).
-
(2005)
Thunderstone's Web Site Catalog
-
-
-
102
-
-
0035754575
-
Query-sensitive similarity measures for the calculation of interdocument relationships
-
Tombros, A. and van Rijsbergen, C.J. (2001), "Query-sensitive similarity measures for the calculation of interdocument relationships", Proceedings of the Tenth International Conference on Information and Knowledge Management, Atlanta, Georgia, USA, pp. 17-24.
-
(2001)
Proceedings of the Tenth International Conference on Information and Knowledge Management, Atlanta, Georgia, USA
, pp. 17-24
-
-
Tombros, A.1
Van Rijsbergen, C.J.2
-
103
-
-
2442471873
-
Innovative solutions in automatic classification: A brief summary
-
Toth, E. (2002), "Innovative solutions in automatic classification: a brief summary", Libri, Vol. 25 No. 1, pp. 48-53.
-
(2002)
Libri
, vol.25
, Issue.1
, pp. 48-53
-
-
Toth, E.1
-
104
-
-
33646341179
-
-
National Institute of Standards and Technology, available at: http://trec.nist.gov/ (accessed 22 December 2004)
-
TREC (2004), "TREC: Text REtrieval Conference", National Institute of Standards and Technology, available at: http://trec.nist.gov/ (accessed 22 December 2004).
-
(2004)
TREC: Text REtrieval Conference
-
-
-
105
-
-
33646356174
-
Using library classification schemes for internet resources
-
available at: http://webdoc.sub.gwdg.de/ebook/aw/oclc/man/colloq/v-g.htm, (accessed 4 April 2006)
-
Vizine-Goetz, D. (1996), "Using library classification schemes for internet resources", OCLC Internet Cataloging Project Colloquium, available at: http://webdoc.sub.gwdg.de/ebook/aw/oclc/man/colloq/v-g.htm, (accessed 4 April 2006).
-
(1996)
OCLC Internet Cataloging Project Colloquium
-
-
Vizine-Goetz, D.1
-
106
-
-
0035785209
-
Automatic identification and organization of index terms for interactive browsing
-
Wacholder, N., Evans, D.K. and Klavans, J.L. (2001), "Automatic identification and organization of index terms for interactive browsing", Proceedings of the ACM-IEEE Joint Conference on Digital Libraries, Roanoke, Virginia, June, pp. 128-34.
-
(2001)
Proceedings of the ACM-IEEE Joint Conference on Digital Libraries, Roanoke, Virginia, June
, pp. 128-34
-
-
Wacholder, N.1
Evans, D.K.2
Klavans, J.L.3
-
107
-
-
20444365424
-
-
University of Wolverhampton, Wolverhampton, available at: www.scit.wlv.ac.uk/wwlib/position.html (accessed 22 December 2004)
-
Wallis, J. and Burden, P. (1995), "Towards a classification-based approach to resource discovery on the web", University of Wolverhampton, Wolverhampton, available at: www.scit.wlv.ac.uk/wwlib/position.html (accessed 22 December 2004).
-
(1995)
Towards a classification-based approach to resource discovery on the web
-
-
Wallis, J.1
Burden, P.2
-
108
-
-
0038156234
-
Evaluating contents-link coupled web page clustering for web search results
-
Wang, Y. and Kitsuregawa, M. (2002), "Evaluating contents-link coupled web page clustering for web search results", Proceedings of the Eleventh International Conference on Information and Knowledge Management, McLean, Virginia, USA, pp. 499-506.
-
(2002)
Proceedings of the Eleventh International Conference on Information and Knowledge Management, McLean, Virginia, USA
, pp. 499-506
-
-
Wang, Y.1
Kitsuregawa, M.2
-
109
-
-
33646351505
-
-
CMU World Wide Knowledge Base, available at: www-2.cs.cmu.edu/ ∼ webkb/ (accessed 22 December 2004)
-
WebKB (2001), CMU World Wide Knowledge Base, available at: www-2.cs.cmu.edu/ ∼ webkb/ (accessed 22 December 2004).
-
(2001)
-
-
-
110
-
-
0029717331
-
HyPursuit: A hierarchical network search engine that exploits content-link hypertext clustering
-
Weiss, R. et al. (1996), "HyPursuit: a hierarchical network search engine that exploits content-link hypertext clustering", Proceedings of the Seventh ACM Conference on Hypertext, Washington, DC, March, pp. 180-93.
-
(1996)
Proceedings of the Seventh ACM Conference on Hypertext, Washington, DC, March
, pp. 180-93
-
-
Weiss, R.1
-
111
-
-
3543147086
-
Recent trends in hierarchic document clustering: A critical review
-
Willet, P. (1988), "Recent trends in hierarchic document clustering: a critical review", Information Processing and Management, Vol. 24 No. 5, pp. 577-97.
-
(1988)
Information Processing and Management
, vol.24
, Issue.5
, pp. 577-97
-
-
Willet, P.1
-
112
-
-
33646369345
-
-
available at: http://dir.yahoo.com/ (accessed 8 August 2005)
-
Yahoo (2005), Yahoo Directory, available at: http://dir.yahoo.com/ (accessed 8 August 2005).
-
(2005)
Yahoo Directory
-
-
-
113
-
-
27144441097
-
An evaluation of statistical approaches to text categorization
-
Yang, Y. (1999), "An evaluation of statistical approaches to text categorization", Journal of Information Retrieval, Vol. 1 Nos 1/2, pp. 67-88.
-
(1999)
Journal of Information Retrieval
, vol.1
, Issue.12
, pp. 67-88
-
-
Yang, Y.1
-
114
-
-
0037375043
-
Visualization of large category map for internet browsing
-
Yang, C., Chen, H. and Hong, K. (2003), "Visualization of large category map for internet browsing", Decision Support Systems (DSS), Vol. 35 No. 1, pp. 89-102.
-
(2003)
Decision Support Systems (DSS)
, vol.35
, Issue.1
, pp. 89-102
-
-
Yang, C.1
Chen, H.2
Hong, K.3
-
115
-
-
0036498398
-
A study of approaches to hypertext categorization
-
Yang, Y., Slattery, S. and Ghani, R. (2002), "A study of approaches to hypertext categorization", Journal of Intelligent Information Systems, Vol. 8 Nos 2/3, pp. 219-41.
-
(2002)
Journal of Intelligent Information Systems
, vol.8
, Issue.23
, pp. 219-41
-
-
Yang, Y.1
Slattery, S.2
Ghani, R.3
-
116
-
-
0032268443
-
Web document clustering: A feasibility demonstration
-
Zamir, O. and Etzioni, O. (1998), "Web document clustering: a feasibility demonstration", ACM SIGIR'98, Australia, pp. 46-54.
-
(1998)
ACM SIGIR'98, Australia
, pp. 46-54
-
-
Zamir, O.1
Etzioni, O.2
-
118
-
-
0038156237
-
Evaluation of hierarchical clustering algorithms for document dataset
-
Zhao, Y. and Karypis, G. (2002), "Evaluation of hierarchical clustering algorithms for document dataset", Proceedings of the Eleventh International Conference on Information and Knowledge Management, McLean, Virginia, pp. 515-24.
-
(2002)
Proceedings of the Eleventh International Conference on Information and Knowledge Management, McLean, Virginia
, pp. 515-24
-
-
Zhao, Y.1
Karypis, G.2
|