-
1
-
-
33244485264
-
Non-parametric classifier-independent feature selection
-
Abe N, Kudo M (2006) Non-parametric classifier-independent feature selection. Pattern Recogn 39: 737-746.
-
(2006)
Pattern Recogn
, vol.39
, pp. 737-746
-
-
Abe, N.1
Kudo, M.2
-
2
-
-
84928016636
-
The base-rate fallacy and the difficulty of intrusion detection
-
Axelsson S (2000) The base-rate fallacy and the difficulty of intrusion detection. ACM Trans Inf Syst Sec 3(3): 186-205.
-
(2000)
ACM Trans Inf Syst Sec
, vol.3
, Issue.3
, pp. 186-205
-
-
Axelsson, S.1
-
3
-
-
78149340977
-
Detecting defects with an interactive code review tool based on visualisation and machine learning
-
Boston, USA
-
Axelsson S, Baca D, Feldt R, Sidlauskas D, Kacan D (2009) Detecting defects with an interactive code review tool based on visualisation and machine learning. In: 21st international conference on software engineering and knowledge engineering, Boston, USA.
-
(2009)
21st international conference on software engineering and knowledge engineering
-
-
Axelsson, S.1
Baca, D.2
Feldt, R.3
Sidlauskas, D.4
Kacan, D.5
-
4
-
-
0029546874
-
Using linear algebra for intelligent information retrieval
-
Berry MW, Dumais ST, O'Brien GW (1995) Using linear algebra for intelligent information retrieval. SIAM Rev 37(4): 573-595.
-
(1995)
SIAM Rev
, vol.37
, Issue.4
, pp. 573-595
-
-
Berry, M.W.1
Dumais, S.T.2
O'Brien, G.W.3
-
6
-
-
34248374466
-
The normalized compression distance is resistant to noise
-
Cebrian M, Alfonseca M, Ortega A (2007) The normalized compression distance is resistant to noise. IEEE Trans Inf Theory 53(5): 1895-1900.
-
(2007)
IEEE Trans Inf Theory
, vol.53
, Issue.5
, pp. 1895-1900
-
-
Cebrian, M.1
Alfonseca, M.2
Ortega, A.3
-
7
-
-
51849162587
-
Common pitfalls using normalized compression distance: what to watch out for in a compressor
-
Cebrian M, Alfonseca M, Ortega A (2005) Common pitfalls using normalized compression distance: what to watch out for in a compressor. Commun Inf Syst 5(4): 367-400.
-
(2005)
Commun Inf Syst
, vol.5
, Issue.4
, pp. 367-400
-
-
Cebrian, M.1
Alfonseca, M.2
Ortega, A.3
-
8
-
-
52249086218
-
-
PhD thesis, Institute for Logic, Language and Computation Universiteit van Amsterdam, Plantage Muidergracht 24, 1018 TV Amsterdam
-
Cilibrasi R (2007) Statistical inference through data compression. PhD thesis, Institute for Logic, Language and Computation Universiteit van Amsterdam, Plantage Muidergracht 24, 1018 TV Amsterdam. http://www.illc.uva.nl/.
-
(2007)
Statistical inference through data compression
-
-
Cilibrasi, R.1
-
9
-
-
84989525001
-
Indexing by latent semantic analysis
-
Deerwester S, Dumais S, Furnas G, Landauer T, Harshman R (1990) Indexing by latent semantic analysis. J Am Soc Inf Sci 41(6): 391-407.
-
(1990)
J Am Soc Inf Sci
, vol.41
, Issue.6
, pp. 391-407
-
-
Deerwester, S.1
Dumais, S.2
Furnas, G.3
Landauer, T.4
Harshman, R.5
-
10
-
-
70350350699
-
The Good, the bad and the incorrectly classified: profiling cases for case-base editing
-
Delany SJ (2009) The Good, the bad and the incorrectly classified: profiling cases for case-base editing. In: 8th international conference on case-based reasoning, pp 135-149.
-
(2009)
8th international conference on case-based reasoning
, pp. 135-149
-
-
Delany, S.J.1
-
11
-
-
29644438050
-
Statistical comparisons of classifiers over multiple data sets
-
Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7: 1-30.
-
(2006)
J Mach Learn Res
, vol.7
, pp. 1-30
-
-
Demsar, J.1
-
13
-
-
38049032340
-
Novelty detection in patient histories: experiments with measures based on text compression
-
In: Berthold MR, Shawe-Taylor J, Lavrac N (eds) Springer, New York
-
Edsberg O, Nytro O, Rost TB (2007) Novelty detection in patient histories: experiments with measures based on text compression. In: Berthold MR, Shawe-Taylor J, Lavrac N (eds) Advances in intelligent data analysis VII. Springer, New York, pp 367-378.
-
(2007)
Advances in intelligent data analysis VII
, pp. 367-378
-
-
Edsberg, O.1
Nytro, O.2
Rost, T.B.3
-
15
-
-
34547753523
-
Compression-based classification of biological sequences and structures via the universal similarity metric: experimental assessment
-
Ferragina P, Giancarlo R, Greco V, Manzini G, Valiente G (2007) Compression-based classification of biological sequences and structures via the universal similarity metric: experimental assessment. BMC Bioinf 8(1).
-
(2007)
BMC Bioinf
, vol.8
, Issue.1
-
-
Ferragina, P.1
Giancarlo, R.2
Greco, V.3
Manzini, G.4
Valiente, G.5
-
16
-
-
0001837148
-
A comparison of alternative tests of significance for the problem of m rankings
-
Friedman M (1940) A comparison of alternative tests of significance for the problem of m rankings. Ann Math Stat 11: 86-92.
-
(1940)
Ann Math Stat
, vol.11
, pp. 86-92
-
-
Friedman, M.1
-
18
-
-
70350403231
-
User choices and regret: understanding users' decision process about consensually acquired spyware
-
Good N, Grossklags J, Thaw D, Perzanowski A, Mulligan DK, Konstan J (2006) User choices and regret: understanding users' decision process about consensually acquired spyware. I/S Law Policy Inf Soc 2(2): 283-344.
-
(2006)
I/S Law Policy Inf Soc
, vol.2
, Issue.2
, pp. 283-344
-
-
Good, N.1
Grossklags, J.2
Thaw, D.3
Perzanowski, A.4
Mulligan, D.K.5
Konstan, J.6
-
19
-
-
52149094995
-
Evaluating the impact of information distortion on normalized compression distance
-
In: Barbero A (ed). Springer, Berlin
-
Granados A, Cebrian M, Camacho D, Rodriguez FB (2008) Evaluating the impact of information distortion on normalized compression distance. In: Barbero A (ed) Coding Theory and Applications. Springer, Berlin, pp 69-79.
-
(2008)
Coding Theory and Applications
, pp. 69-79
-
-
Granados, A.1
Cebrian, M.2
Camacho, D.3
Rodriguez, F.B.4
-
21
-
-
0001750957
-
Approximations of the critical region of the Friedman statistic
-
Iman RL, Davenport JM (1980) Approximations of the critical region of the Friedman statistic. Commun Stat A 9(6): 571-595.
-
(1980)
Commun Stat A
, vol.9
, Issue.6
, pp. 571-595
-
-
Iman, R.L.1
Davenport, J.M.2
-
22
-
-
10644281769
-
Towards parameter-free data mining
-
ACM Press, New York, NY, USA
-
Keogh E, Lonardi S, Ratanamahatana CA (2004) Towards parameter-free data mining. In: Tenth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM Press, New York, NY, USA, pp 206-215.
-
(2004)
Tenth ACM SIGKDD international conference on Knowledge discovery and data mining
, pp. 206-215
-
-
Keogh, E.1
Lonardi, S.2
Ratanamahatana, C.A.3
-
23
-
-
33847394102
-
Compression-based data mining of sequential data
-
Keogh E, Lonardi S, Ratanamahatana CA, Wei L, Lee S-H, Handley J (2007) Compression-based data mining of sequential data. Data Min Knowl Discov 14(1): 99-129.
-
(2007)
Data Min Knowl Discov
, vol.14
, Issue.1
, pp. 99-129
-
-
Keogh, E.1
Lonardi, S.2
Ratanamahatana, C.A.3
Wei, L.4
Lee, S.-H.5
Handley, J.6
-
26
-
-
45949092426
-
Spyware prevention by classifying end user license agreements
-
In: Nguyen NT, Katarzyniak R (eds). Springer, Berlin
-
Lavesson N, Boldt M, Davidsson P, Jacobsson A (2008) Spyware prevention by classifying end user license agreements. In: Nguyen NT, Katarzyniak R (eds) New Challenges in Applied Intelligence Technologies, Studies in Computational Intelligence. Springer, Berlin.
-
(2008)
New Challenges in Applied Intelligence Technologies, Studies in Computational Intelligence
-
-
Lavesson, N.1
Boldt, M.2
Davidsson, P.3
Jacobsson, A.4
-
27
-
-
78651485505
-
Learning to detect spyware using end user license agreements
-
Lavesson N, Boldt M, Davidsson P, Jacobsson A (2011) Learning to detect spyware using end user license agreements. Knowl Inf Syst 26(2): 285-307.
-
(2011)
Knowl Inf Syst
, vol.26
, Issue.2
, pp. 285-307
-
-
Lavesson, N.1
Boldt, M.2
Davidsson, P.3
Jacobsson, A.4
-
28
-
-
19944407179
-
Similarity measures, author cocitation analysis, and information theory
-
Leydesdorff L (2005) Similarity measures, author cocitation analysis, and information theory. J Am Soc Inf Sci Technol 56(7): 769-772.
-
(2005)
J Am Soc Inf Sci Technol
, vol.56
, Issue.7
, pp. 769-772
-
-
Leydesdorff, L.1
-
29
-
-
10644294829
-
The similarity metric
-
Li M, Chen X, Li X, Ma B, Vitanyi PMB (2004) The similarity metric. IEEE Trans Inf Theory 50(12): 3250-3264.
-
(2004)
IEEE Trans Inf Theory
, vol.50
, Issue.12
, pp. 3250-3264
-
-
Li, M.1
Chen, X.2
Li, X.3
Ma, B.4
Vitanyi, P.M.B.5
-
30
-
-
70350539300
-
Parameter determination and feature selection for back-propagation network by particle swarm optimization
-
Lin S-W, Chen S-C, Wu W-J, Chen C-H (2009) Parameter determination and feature selection for back-propagation network by particle swarm optimization. Knowl Inf Syst 21(2): 249-266.
-
(2009)
Knowl Inf Syst
, vol.21
, Issue.2
, pp. 249-266
-
-
Lin, S.-W.1
Chen, S.-C.2
Wu, W.-J.3
Chen, C.-H.4
-
31
-
-
0001794236
-
Development of a stemming algorithm
-
Lovins JB (1968) Development of a stemming algorithm. Mech Transl Comput Linguist 11: 22-31.
-
(1968)
Mech Transl Comput Linguist
, vol.11
, pp. 22-31
-
-
Lovins, J.B.1
-
34
-
-
0002442796
-
Machine learning in automated text categorization
-
Sebastiani F (2002) Machine learning in automated text categorization. ACM Comput Surv 34(1): 1-47.
-
(2002)
ACM Comput Surv
, vol.34
, Issue.1
, pp. 1-47
-
-
Sebastiani, F.1
-
36
-
-
69849084283
-
Categorical proportional difference: a feature selection method for text categorization
-
In: Roddick JF, Li J, Christen P, Kennedy PJ (eds). ACS, Glenelg, South Australia
-
Simeon M, Hilderman R (2008) Categorical proportional difference: a feature selection method for text categorization. In: Roddick JF, Li J, Christen P, Kennedy PJ (eds) Seventh Australasian Data Mining Conference, volume 87 of CRPIT. ACS, Glenelg, South Australia, pp 201-208.
-
(2008)
Seventh Australasian Data Mining Conference, volume 87 of CRPIT
, pp. 201-208
-
-
Simeon, M.1
Hilderman, R.2
-
37
-
-
34248190076
-
Normalized compression distance for visual analysis of document collections
-
Telles GP, Minghim R, Paulovich FV (2007) Normalized compression distance for visual analysis of document collections. Comput Graph 31: 327-337.
-
(2007)
Comput Graph
, vol.31
, pp. 327-337
-
-
Telles, G.P.1
Minghim, R.2
Paulovich, F.V.3
-
39
-
-
67349109407
-
Using Wikipedia knowledge to improve text classification
-
Wang P, Hu J, Zeng HJ, Chen Z (2009) Using Wikipedia knowledge to improve text classification. Knowl Inf Syst 19: 265-281.
-
(2009)
Knowl Inf Syst
, vol.19
, pp. 265-281
-
-
Wang, P.1
Hu, J.2
Zeng, H.J.3
Chen, Z.4
-
41
-
-
0021405335
-
Data compression using adaptive coding and partial string matching
-
Cleary JG, Witten IH (1984) Data compression using adaptive coding and partial string matching. IEEE Trans Commun 32(4): 396-402.
-
(1984)
IEEE Trans Commun
, vol.32
, Issue.4
, pp. 396-402
-
-
Cleary, J.G.1
Witten, I.H.2
-
42
-
-
38649124934
-
A systematic study on parameter correlations in large-scale duplicate document detection
-
Ye S, Wen J-R, Ma W-Y (2008) A systematic study on parameter correlations in large-scale duplicate document detection. Knowl Inf Syst 14(2): 217-232.
-
(2008)
Knowl Inf Syst
, vol.14
, Issue.2
, pp. 217-232
-
-
Ye, S.1
Wen, J.-R.2
Ma, W.-Y.3
-
43
-
-
75949100148
-
Effectiveness of NAQ-tree as index structure for similarity search in high-dimensional metric space
-
Zhang M, Alhajj R (2010) Effectiveness of NAQ-tree as index structure for similarity search in high-dimensional metric space. Knowl Inf Syst 22(1): 1-26.
-
(2010)
Knowl Inf Syst
, vol.22
, Issue.1
, pp. 1-26
-
-
Zhang, M.1
Alhajj, R.2
-
44
-
-
33845536164
-
The class imbalance problem: a systematic study
-
Japkowicz N, Stephen S (2002) The class imbalance problem: a systematic study. Intell Data Anal 6(5): 429-449.
-
(2002)
Intell Data Anal
, vol.6
, Issue.5
, pp. 429-449
-
-
Japkowicz, N.1
Stephen, S.2