메뉴 건너뛰기




Volumn 32, Issue 1, 2012, Pages 167-189

Similarity assessment for removal of noisy end user license agreements

Author keywords

End user license agreement; Latent semantic analysis; Normalized compression distance; Spyware

Indexed keywords


EID: 84862665042     PISSN: 02191377     EISSN: 02193116     Source Type: Journal    
DOI: 10.1007/s10115-011-0438-9     Document Type: Article
Times cited : (7)

References (44)
  • 1
    • 33244485264 scopus 로고    scopus 로고
    • Non-parametric classifier-independent feature selection
    • Abe N, Kudo M (2006) Non-parametric classifier-independent feature selection. Pattern Recogn 39: 737-746.
    • (2006) Pattern Recogn , vol.39 , pp. 737-746
    • Abe, N.1    Kudo, M.2
  • 2
    • 84928016636 scopus 로고    scopus 로고
    • The base-rate fallacy and the difficulty of intrusion detection
    • Axelsson S (2000) The base-rate fallacy and the difficulty of intrusion detection. ACM Trans Inf Syst Sec 3(3): 186-205.
    • (2000) ACM Trans Inf Syst Sec , vol.3 , Issue.3 , pp. 186-205
    • Axelsson, S.1
  • 4
    • 0029546874 scopus 로고
    • Using linear algebra for intelligent information retrieval
    • Berry MW, Dumais ST, O'Brien GW (1995) Using linear algebra for intelligent information retrieval. SIAM Rev 37(4): 573-595.
    • (1995) SIAM Rev , vol.37 , Issue.4 , pp. 573-595
    • Berry, M.W.1    Dumais, S.T.2    O'Brien, G.W.3
  • 6
    • 34248374466 scopus 로고    scopus 로고
    • The normalized compression distance is resistant to noise
    • Cebrian M, Alfonseca M, Ortega A (2007) The normalized compression distance is resistant to noise. IEEE Trans Inf Theory 53(5): 1895-1900.
    • (2007) IEEE Trans Inf Theory , vol.53 , Issue.5 , pp. 1895-1900
    • Cebrian, M.1    Alfonseca, M.2    Ortega, A.3
  • 7
    • 51849162587 scopus 로고    scopus 로고
    • Common pitfalls using normalized compression distance: what to watch out for in a compressor
    • Cebrian M, Alfonseca M, Ortega A (2005) Common pitfalls using normalized compression distance: what to watch out for in a compressor. Commun Inf Syst 5(4): 367-400.
    • (2005) Commun Inf Syst , vol.5 , Issue.4 , pp. 367-400
    • Cebrian, M.1    Alfonseca, M.2    Ortega, A.3
  • 8
    • 52249086218 scopus 로고    scopus 로고
    • PhD thesis, Institute for Logic, Language and Computation Universiteit van Amsterdam, Plantage Muidergracht 24, 1018 TV Amsterdam
    • Cilibrasi R (2007) Statistical inference through data compression. PhD thesis, Institute for Logic, Language and Computation Universiteit van Amsterdam, Plantage Muidergracht 24, 1018 TV Amsterdam. http://www. illc. uva. nl/.
    • (2007) Statistical inference through data compression
    • Cilibrasi, R.1
  • 10
    • 70350350699 scopus 로고    scopus 로고
    • The Good, the bad and the incorrectly classified: Profiling cases for case-base editing
    • Delany SJ (2009) The Good, the bad and the incorrectly classified: profiling cases for case-base editing. In: 8th international conference on case-based reasoning, pp 135-149.
    • (2009) 8th international conference on case-based reasoning , pp. 135-149
    • Delany, S.J.1
  • 11
    • 29644438050 scopus 로고    scopus 로고
    • Statistical comparisons of classifiers over multiple data sets
    • Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7: 1-30.
    • (2006) J Mach Learn Res , vol.7 , pp. 1-30
    • Demsar, J.1
  • 13
    • 38049032340 scopus 로고    scopus 로고
    • Novelty detection in patient histories: experiments with measures based on text compression
    • In: Berthold MR, Shawe-Taylor J, Lavrac N (eds) Springer, New York
    • Edsberg O, Nytro O, Rost TB (2007) Novelty detection in patient histories: experiments with measures based on text compression. In: Berthold MR, Shawe-Taylor J, Lavrac N (eds) Advances in intelligent data analysis VII. Springer, New York, pp 367-378.
    • (2007) Advances in intelligent data analysis VII , pp. 367-378
    • Edsberg, O.1    Nytro, O.2    Rost, T.B.3
  • 15
    • 34547753523 scopus 로고    scopus 로고
    • Compression-based classification of biological sequences and structures via the universal similarity metric: experimental assessment
    • Ferragina P, Giancarlo R, Greco V, Manzini G, Valiente G (2007) Compression-based classification of biological sequences and structures via the universal similarity metric: experimental assessment. BMC Bioinf 8(1).
    • (2007) BMC Bioinf , vol.8 , Issue.1
    • Ferragina, P.1    Giancarlo, R.2    Greco, V.3    Manzini, G.4    Valiente, G.5
  • 16
    • 0001837148 scopus 로고
    • A comparison of alternative tests of significance for the problem of m rankings
    • Friedman M (1940) A comparison of alternative tests of significance for the problem of m rankings. Ann Math Stat 11: 86-92.
    • (1940) Ann Math Stat , vol.11 , pp. 86-92
    • Friedman, M.1
  • 18
    • 70350403231 scopus 로고    scopus 로고
    • User choices and regret: understanding users' decision process about consensually acquired spyware
    • Good N, Grossklags J, Thaw D, Perzanowski A, Mulligan DK, Konstan J (2006) User choices and regret: understanding users' decision process about consensually acquired spyware. I/S Law Policy Inf Soc 2(2): 283-344.
    • (2006) I/S Law Policy Inf Soc , vol.2 , Issue.2 , pp. 283-344
    • Good, N.1    Grossklags, J.2    Thaw, D.3    Perzanowski, A.4    Mulligan, D.K.5    Konstan, J.6
  • 19
    • 52149094995 scopus 로고    scopus 로고
    • Evaluating the impact of information distortion on normalized compression distance
    • In: Barbero A (ed). Springer, Berlin
    • Granados A, Cebrian M, Camacho D, Rodriguez FB (2008) Evaluating the impact of information distortion on normalized compression distance. In: Barbero A (ed) Coding Theory and Applications. Springer, Berlin, pp 69-79.
    • (2008) Coding Theory and Applications , pp. 69-79
    • Granados, A.1    Cebrian, M.2    Camacho, D.3    Rodriguez, F.B.4
  • 21
    • 0001750957 scopus 로고
    • Approximations of the critical region of the friedman statistic
    • Iman RL, Davenport JM (1980) Approximations of the critical region of the friedman statistic. Commun Stat A 9(6): 571-595.
    • (1980) Commun Stat A , vol.9 , Issue.6 , pp. 571-595
    • Iman, R.L.1    Davenport, J.M.2
  • 27
    • 78651485505 scopus 로고    scopus 로고
    • Learning to detect spyware using end user license agreements
    • Lavesson N, Boldt M, Davidsson P, Jacobsson A (2011) Learning to detect spyware using end user license agreements. Knowl Inf Syst 26(2): 285-307.
    • (2011) Knowl Inf Syst , vol.26 , Issue.2 , pp. 285-307
    • Lavesson, N.1    Boldt, M.2    Davidsson, P.3    Jacobsson, A.4
  • 28
    • 19944407179 scopus 로고    scopus 로고
    • Similarity measures, author cocitation analysis,and information theory
    • Leydesdorff L (2005) Similarity measures, author cocitation analysis, and information theory. J Am Soc Inf Sci Technol 56(7): 769-772.
    • (2005) J Am Soc Inf Sci Technol , vol.56 , Issue.7 , pp. 769-772
    • Leydesdorff, L.1
  • 30
    • 70350539300 scopus 로고    scopus 로고
    • Parameter determination and feature selection for back-propagation network by particle swarm optimization
    • Lin S-W, Chen S-C, Wu W-J, Chen C-H (2009) Parameter determination and feature selection for back-propagation network by particle swarm optimization. Knowl Inf Syst 21(2): 249-266.
    • (2009) Knowl Inf Syst , vol.21 , Issue.2 , pp. 249-266
    • Lin, S.-W.1    Chen, S.-C.2    Wu, W.-J.3    Chen, C.-H.4
  • 31
    • 0001794236 scopus 로고
    • Development of a stemming algorithm
    • Lovins JB (1968) Development of a stemming algorithm. Mech Transl Comput Linguist 11: 22-31.
    • (1968) Mech Transl Comput Linguist , vol.11 , pp. 22-31
    • Lovins, J.B.1
  • 34
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • Sebastiani F (2002) Machine learning in automated text categorization. ACM Comput Surv 34(1): 1-47.
    • (2002) ACM Comput Surv , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 36
    • 69849084283 scopus 로고    scopus 로고
    • Categorical proportional difference: a feature selection method for text categorization
    • In: Roddick JF, Li J, Christen P, Kennedy PJ (eds). ACS, Glenelg, South Australia
    • Simeon M, Hilderman R (2008) Categorical proportional difference: a feature selection method for text categorization. In: Roddick JF, Li J, Christen P, Kennedy PJ (eds) Seventh Australasian Data Mining Conference, volume 87 of CRPIT. ACS, Glenelg, South Australia, pp 201-208.
    • (2008) Seventh Australasian Data Mining Conference, volume 87 of CRPIT , pp. 201-208
    • Simeon, M.1    Hilderman, R.2
  • 37
    • 34248190076 scopus 로고    scopus 로고
    • Normalized compression distance for visual analysis of document collections
    • Telles GP, Minghim R, Paulovich FV (2007) Normalized compression distance for visual analysis of document collections. Comput Graph 31: 327-337.
    • (2007) Comput Graph , vol.31 , pp. 327-337
    • Telles, G.P.1    Minghim, R.2    Paulovich, F.V.3
  • 39
    • 67349109407 scopus 로고    scopus 로고
    • Using wikipedia knowledge to improve text classification
    • Wang P, Hu J, Zeng HJ, Chen Z (2009) Using wikipedia knowledge to improve text classification. Knowl Inf Syst 19: 265-281.
    • (2009) Knowl Inf Syst , vol.19 , pp. 265-281
    • Wang, P.1    Hu, J.2    Zeng, H.J.3    Chen, Z.4
  • 41
    • 0021405335 scopus 로고
    • Data compression using adaptive coding and partial string matching
    • Cleary JG, Witten IH (1984) Data compression using adaptive coding and partial string matching. IEEE Trans Commun 32(4): 396-402.
    • (1984) IEEE Trans Commun , vol.32 , Issue.4 , pp. 396-402
    • Cleary, J.G.1    Witten, I.H.2
  • 42
    • 38649124934 scopus 로고    scopus 로고
    • A systematic study on parameter correlations in large-scale duplicate document detection
    • Ye S, Wen J-R, Ma W-Y (2008) A systematic study on parameter correlations in large-scale duplicate document detection. Knowl Inf Syst 14(2): 217-232.
    • (2008) Knowl Inf Syst , vol.14 , Issue.2 , pp. 217-232
    • Ye, S.1    Wen, J.-R.2    Ma, W.-Y.3
  • 43
    • 75949100148 scopus 로고    scopus 로고
    • Effectiveness of NAQ-tree as index structure for similarity search in high-dimensional metric space
    • Zhang M, Alhajj R (2010) Effectiveness of NAQ-tree as index structure for similarity search in high-dimensional metric space. Knowl Inf Syst 22(1): 1-26.
    • (2010) Knowl Inf Syst , vol.22 , Issue.1 , pp. 1-26
    • Zhang, M.1    Alhajj, R.2
  • 44
    • 33845536164 scopus 로고    scopus 로고
    • The class imbalance problem: a systematic study
    • Japkowicz N, Stephen S (2002) The class imbalance problem: a systematic study. Intell Data Anal 6(5): 429-449.
    • (2002) Intell Data Anal , vol.6 , Issue.5 , pp. 429-449
    • Japkowicz, N.1    Stephen, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.