Volume 28, Issue 2, 2005, Pages 129-146

The BankSearch web document dataset: Investigating unsupervised clustering and category similarity

Author keywords

Benchmark dataset; Clustering; Stemming; Stoplists; Text classification; Unsupervised learning

Indexed keywords

BENCHMARKING; INFORMATION RETRIEVAL SYSTEMS; INTERNET; SEARCH ENGINES;

EID: 10844283610     PISSN: 10848045     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.jnca.2004.01.002     Document Type: Article
Times cited: 18

References (27)
  • 2
    • 0028461417
    • Automated learning of decision rules for text categorization
    • Apte C, Damerau F, Weiss S. Automated learning of decision rules for text categorization. ACM Trans Inf Syst 1994;12(3):233-51.
    • (1994) ACM Trans Inf Syst , vol.12 , Issue.3 , pp. 233-251
    • Apte, C.1    Damerau, F.2    Weiss, S.3
  • 5
    • 0006019644
    • A probabilistic approach to full-text document clustering
    • SRI International
    • Goldszmidt M, Sahami M. A probabilistic approach to full-text document clustering. SRI International. Technical Report ITAD-433-MS-98-044; 1998. At URL: http://citeseer.nj.nec.com/goldszmidt98probabilistic.html.
    • (1998) Technical Report , vol.ITAD-433-MS-98-044
    • Goldszmidt, M.1    Sahami, M.2
  • 6
    • 0027001621
    • An evaluation of phrasal and clustered representations on a text categorization task
    • 15th ACM Int Conf Research and Development in Information Retrieval
    • Lewis DD. An evaluation of phrasal and clustered representations on a text categorization task. In Proceedings of SIGIR-92, 15th ACM Int Conf Research and Development in Information Retrieval 1992;37-50.
    • (1992) Proceedings of SIGIR-92 , pp. 37-50
    • Lewis, D.D.1
  • 7
    • 0001214384
    • A comparison of two learning algorithms for text categorization
    • 3rd Annual Symposium on Document Analysis and Information Retrieval
    • Lewis DD, Ringuette M. A comparison of two learning algorithms for text categorization. In Proceedings of SDAIR-94, 3rd Annual Symposium on Document Analysis and Information Retrieval 1994;81-93.
    • (1994) Proceedings of SDAIR-94 , pp. 81-93
    • Lewis, D.D.1    Ringuette, M.2
  • 8
    • 10844288348
    • LookSmart. At URL: http://www.looksmart.com
  • 10
    • 10844269144
    • The Google Search Engine. At URL: http://www.google.com
  • 12
    • 84948481845
    • An algorithm for suffix stripping
    • Porter MF. An algorithm for suffix stripping. Program 1980;14(3):130-7.
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.F.1
  • 14
    • 0016572913
    • A vector space model for automatic indexing
    • Salton G, Wong A, Yang C. A vector space model for automatic indexing. Commun ACM 1975;18(11):613-20.
    • (1975) Commun ACM , vol.18 , Issue.11 , pp. 613-620
    • Salton, G.1    Wong, A.2    Yang, C.3
  • 15
    • 0003755589
    • Machine learning in automated text categorisation: A survey
    • Istituto di Elaborazione dell'Informazione, C.N.R., Pisa, IT
    • Sebastiani F. Machine learning in automated text categorisation: a survey. Istituto di Elaborazione dell'Informazione, C.N.R., Pisa, IT. Technical Report IEI-B4-31-1999; 1999a. At URL: http://citeseer.nj.nec.com/article/sebastiani99machine.html.
    • (1999) Technical Report , vol.IEI-B4-31-1999
    • Sebastiani, F.1
  • 16
    • 0002621269
    • A tutorial on automated text categorisation
    • Amandi A, Zunino R, editors. 1st Argentinean Symposium on Artificial Intelligence, Buenos Aires, Argentina
    • Sebastiani F. A tutorial on automated text categorisation. In: Amandi A, Zunino R, editors. Proceedings of ASAI-99, 1st Argentinean Symposium on Artificial Intelligence, Buenos Aires, Argentina. 1999b. p. 7-35. At URL: http://citeseer.nj.nec.com/slonim01power.html.
    • (1999) Proceedings of ASAI-99 , pp. 7-35
    • Sebastiani, F.1
  • 21
    • 84942613917
    • Automatic web-page classification by using machine learning methods
    • Zhong N, Yao Y, Liu S, Ohsuga S, editors. Web intelligence: research and development. Berlin: Springer-Verlag
    • Tsukada M, Washio T, Motoda H. Automatic web-page classification by using machine learning methods. In: Zhong N, Yao Y, Liu S, Ohsuga S, editors. Web intelligence: research and development. Proceedings of the 1st Asia Pacific Web Conference on Web Intelligence, 2198. Berlin: Springer-Verlag; 2001. p. 303-13.
    • (2001) Proceedings of the 1st Asia Pacific Web Conference on Web Intelligence , vol.2198 , pp. 303-313
    • Tsukada, M.1    Washio, T.2    Motoda, H.3
  • 23
    • 0001700195
    • A neural network approach to topic spotting
    • 4th Annual Symposium on Document Analysis and Information Retrieval
    • Wiener E, Pedersen JO, Weigend AS. A neural network approach to topic spotting. In Proceedings of SDAIR-95, 4th Annual Symposium on Document Analysis and Information Retrieval 1995;317-32.
    • (1995) Proceedings of SDAIR-95 , pp. 317-332
    • Wiener, E.1    Pedersen, J.O.2    Weigend, A.S.3
  • 25
    • 0003141935
    • A comparative study on feature selection in text categorization
    • Yang Y, Pedersen JO. A comparative study on feature selection in text categorization. In Proceedings of ICML-97 1997;412-20.
    • (1997) Proceedings of ICML-97 , pp. 412-420
    • Yang, Y.1    Pedersen, J.O.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.