Volume 184, Issue 1, 2012, Pages 92-110

Distance metrics for high dimensional nearest neighborhood recovery: Compression and normalization

Author keywords

Dimensionality; GINI; Latent classes; Minkowski metrics; Nearest neighbors; Normalization

Indexed keywords

DIMENSIONALITY; GINI; LATENT CLASS; MINKOWSKI METRICS; NEAREST NEIGHBORS; NORMALIZATION;

EID: 80055063556     PISSN: 00200255     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.ins.2011.07.048     Document Type: Article
Times cited : (48)

References (55)
  • 1
    • 46249095180
    • Sentiment analysis in multiple languages: Feature selection for opinion classification in web forums
    • A. Abbasi, H. Chen, and A. Salem Sentiment analysis in multiple languages: feature selection for opinion classification in web forums ACM Transactions on Information Systems 26 2008 1 34
    • (2008) ACM Transactions on Information Systems , vol.26 , pp. 1-34
    • Abbasi, A.1    Chen, H.2    Salem, A.3
  • 3
    • 77952412927
    • Towards systematic design of distance functions for data mining applications
    • New York, NY
    • C.C. Aggarwal, Towards systematic design of distance functions for data mining applications, in: Proceedings of SIGKDD 2003, ACM, New York, NY, 2003, pp. 9-18.
    • (2003) Proceedings of SIGKDD 2003, ACM , pp. 9-18
    • Aggarwal, C.C.1
  • 4
    • 84949479246
    • On the surprising behavior of distance metrics in high dimensional space
    • C.C. Aggarwal, A. Hinneburg, D.A. Keim, On the surprising behavior of distance metrics in high dimensional space, in: Database Theory - ICDT 2001, 2001, pp. 420-434. (Pubitemid 33213340)
    • (2001) Lecture Notes in Computer Science , vol.1973 , pp. 420-434
    • Aggarwal, C.C.1    Hinneburg, A.2    Keim, D.A.3
  • 6
    • 33750723835
    • PARAMAP vs. isomap: A comparison of two nonlinear mapping algorithms
    • DOI 10.1007/s00357-006-0014-2
    • U. Akkucuk, and J.D. Carroll PARAMAP vs. Isomap: a comparison of two nonlinear mapping algorithms Journal of Classification 23 2006 221 254 (Pubitemid 44704842)
    • (2006) Journal of Classification , vol.23 , Issue.2 , pp. 221-254
    • Akkucuk, U.1    Carroll, J.D.2
  • 7
    • 34548536094
    • The high-dimension, low-sample-size geometric representation holds under mild conditions
    • DOI 10.1093/biomet/asm050
    • J. Ahn, J.S. Marron, K.M. Muller, and Y. Chi The high-dimension, low-sample-size geometric representation holds under mild conditions Biometrika 94 2007 760 766 (Pubitemid 47384265)
    • (2007) Biometrika , vol.94 , Issue.3 , pp. 760-766
    • Ahn, J.1    Marron, J.S.2    Muller, K.M.3    Chi, Y.-Y.4
  • 8
    • 0004020376
    • first ed. Princeton University Press Princeton, NJ
    • R. Bellman Adaptive Control Processes first ed. 1961 Princeton University Press Princeton, NJ
    • (1961) Adaptive Control Processes
    • Bellman, R.1
  • 11
    • 27144489164
    • A tutorial on support vector machines for pattern recognition
    • C.J.C. Burges A tutorial on support vector machines for pattern recognition Data Mining and Knowledge Discovery 2 1998 121 167 (Pubitemid 128695475)
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.2 , pp. 121-167
    • Burges, C.J.C.1
  • 12
    • 48949103145
    • Designing specific weighted similarity measures to improve collaborative filtering systems
    • P. Perner, LNCS Springer Verlag Berlin, Germany
    • L. Candillier, F. Meyer, and F. Fessant Designing specific weighted similarity measures to improve collaborative filtering systems P. Perner, Proceedings of the Industrial Conference on Data Mining (ICDM) 2008 LNCS vol. 5077 2008 Springer Verlag Berlin, Germany 242 255
    • (2008) Proceedings of the Industrial Conference on Data Mining (ICDM) 2008 , vol.5077 , pp. 242-255
    • Candillier, L.1    Meyer, F.2    Fessant, F.3
  • 14
    • 70350344696
    • Local multidimensional scaling for nonlinear dimension reduction, graph drawing, and proximity analysis
    • L. Chen, and A. Buja Local multidimensional scaling for nonlinear dimension reduction, graph drawing, and proximity analysis Journal of the American Statistical Association 104 2009 209 219
    • (2009) Journal of the American Statistical Association , vol.104 , pp. 209-219
    • Chen, L.1    Buja, A.2
  • 15
    • 0012057129
    • Measurement of inequality of incomes
    • C. Gini Measurement of inequality of incomes The Economic Journal 31 1921 124 126
    • (1921) The Economic Journal , vol.31 , pp. 124-126
    • Gini, C.1
  • 18
  • 20
    • 0000665083
    • Non-parametric estimation of a multivariate probability density
    • V.A. Epanechnikov Non-parametric estimation of a multivariate probability density Theory of Probability and its Applications 14 1969 153 158
    • (1969) Theory of Probability and Its Applications , vol.14 , pp. 153-158
    • Epanechnikov, V.A.1
  • 21
    • 0002516752
    • Spoken letter recognition
    • R.P. Lippmann, J. Moody, D.S. Touretzky, Morgan Kaufmann San Mateo, CA
    • M. Fanty, and R. Cole Spoken letter recognition R.P. Lippmann, J. Moody, D.S. Touretzky, Advances in Neural Information Processing Systems vol. 3 1990 Morgan Kaufmann San Mateo, CA 220 226
    • (1990) Advances in Neural Information Processing Systems , vol.3 , pp. 220-226
    • Fanty, M.1    Cole, R.2
  • 22
    • 0001165055
    • Frequency distribution of the values of the correlation coefficient in samples from an indefinitely large population
    • R.A. Fisher Frequency distribution of the values of the correlation coefficient in samples from an indefinitely large population Biometrika 10 1915 507 521
    • (1915) Biometrika , vol.10 , pp. 507-521
    • Fisher, R.A.1
  • 23
    • 70350238786
    • Is the Distance Compression Effect Overstated? Some Theory and Experimentation
    • Springer Verlag Berlin
    • S.L. France, and J.D. Carroll Is The Distance Compression Effect Overstated? Some Theory and Experimentation Proceedings of MLDM 2009 2009 Springer Verlag Berlin 280 294
    • (2009) Proceedings of MLDM 2009 , pp. 280-294
    • France, S.L.1    Carroll, J.D.2
  • 27
    • 77958111663
    • Aggregation functions: Construction methods, conjunctive, disjunctive and mixed classes
    • M. Grabisch, J. Marichal, R. Mesiar, and E. Pap Aggregation functions: construction methods, conjunctive, disjunctive and mixed classes Information Sciences 181 2011 23 43
    • (2011) Information Sciences , vol.181 , pp. 23-43
    • Grabisch, M.1    Marichal, J.2    Mesiar, R.3    Pap, E.4
  • 29
    • 3042829247
    • An empirical analysis of design choices in neighborhood-based collaborative filtering algorithms
    • J. Herlocker, J.A. Konstan, and J. Riedl An empirical analysis of design choices in neighborhood-based collaborative filtering algorithms Information Retrieval 5 2002 287 310
    • (2002) Information Retrieval , vol.5 , pp. 287-310
    • Herlocker, J.1    Konstan, J.A.2    Riedl, J.3
  • 33
    • 84874110943
    • A unified data mining solution for authorship analysis in anonymous textual communications
    • doi:10.1016/j.ins.2011.03.006
    • F. Iqbal, H. Binsalleeh, B.C.M. Fung, M. Debbabi, A unified data mining solution for authorship analysis in anonymous textual communications, Information Sciences (2011). doi:10.1016/j.ins.2011.03.006.
    • (2011) Information Sciences
    • Iqbal, F.1    Binsalleeh, H.2    Fung, B.C.M.3    Debbabi, M.4
  • 35
    • 77649340033
    • Collaborative filtering with ordinal scale-based implicit ratings for mobile music recommendations
    • S.K. Lee, Y.H. Cho, and S.H. Kim Collaborative filtering with ordinal scale-based implicit ratings for mobile music recommendations Information Sciences 180 2010 2142 2155
    • (2010) Information Sciences , vol.180 , pp. 2142-2155
    • Lee, S.K.1    Cho, Y.H.2    Kim, S.H.3
  • 37
    • 17544364135
    • On ultrametricity, data coding, and computation
    • DOI 10.1007/s00357-004-0015-y
    • F. Murtagh On ultrametricity, data coding, and computation Journal of Classification 21 2004 167 184 (Pubitemid 40936640)
    • (2004) Journal of Classification , vol.21 , Issue.2 , pp. 167-184
    • Murtagh, F.1
  • 38
    • 75549090468
    • The remarkable simplicity of very high dimensional data: Application of model-based clustering
    • F. Murtagh The remarkable simplicity of very high dimensional data: application of model-based clustering Journal of Classification 26 2009 249 277
    • (2009) Journal of Classification , vol.26 , pp. 249-277
    • Murtagh, F.1
  • 39
    • 33744509693
    • Defection detection: Measuring and understanding the predictive accuracy of customer churn models
    • DOI 10.1509/jmkr.43.2.204
    • S.A. Neslin, S. Gupta, W.A. Kamakura, J. Lu, and C.H. Mason Defection detection: measuring and understanding the predictive accuracy of customer churn models Journal of Marketing Research 43 2006 204 211 (Pubitemid 43813052)
    • (2006) Journal of Marketing Research , vol.43 , Issue.2 , pp. 204-211
    • Neslin, S.A.1    Gupta, S.2    Kamakura, W.A.3    Lu, J.4    Mason, C.H.5
  • 41
    • 2442432731
    • Similarity between Euclidean and cosine angle distance for nearest neighbor queries
    • H.M. Haddad, A. Omicini, R.L. Wainwright, L.M. Liebrock, ACM New York, NY
    • G. Qian, S. Sural, Y. Gu, and S. Pramanik Similarity between Euclidean and cosine angle distance for nearest neighbor queries H.M. Haddad, A. Omicini, R.L. Wainwright, L.M. Liebrock, Proceedings of the 2004 ACM Symposium on Applied Computing 2004 ACM New York, NY 1232 1237
    • (2004) Proceedings of the 2004 ACM Symposium on Applied Computing , pp. 1232-1237
    • Qian, G.1    Sural, S.2    Gu, Y.3    Pramanik, S.4
  • 44
    • 34548584050
    • Matrix comparison, Part 1: Motivation and important issues for measuring the resemblance between proximity measures or ordination results
    • DOI 10.1002/asi.20643
    • J.W. Schneider, and P. Borlund Matrix comparison, part 1: motivation and important issues for measuring the resemblance between proximity measures or ordination results Journal of the American Society for Information Science and Technology 58 2007 1586 1595 (Pubitemid 47397661)
    • (2007) Journal of the American Society for Information Science and Technology , vol.58 , Issue.11 , pp. 1586-1595
    • Schneider, J.W.1    Borlund, P.2
  • 46
    • 2442439674
    • A comparison of document clustering techniques
    • Boston, MA, USA, August 20, 2000
    • M. Steinbach, G. Karypis, V. Kumar, A comparison of document clustering techniques, KDD-2000 Workshop on Text Mining, 2000, Boston, MA, USA, August 20, 2000 pp. 1-20.
    • (2000) KDD-2000 Workshop on Text Mining , pp. 1-20
    • Steinbach, M.1    Karypis, G.2    Kumar, V.3
  • 49
    • 80055062766
    • accessed 3. 01. 2010
    • TREC Text REtrieval conference, http://www.trec.nist.gov, 2008 (accessed 3. 01. 2010).
    • (2008) TREC Text REtrieval Conference
  • 51
    • 61749090884
    • Distance metric learning for large margin nearest neighbor classification
    • K.Q. Weinberger, and L.K. Saul Distance metric learning for large margin nearest neighbor classification The Journal of Machine Learning Research 10 2009 207 244
    • (2009) The Journal of Machine Learning Research , vol.10 , pp. 207-244
    • Weinberger, K.Q.1    Saul, L.K.2
  • 53
    • 79953862205
    • On the effectiveness of subwords for lexical cohesion based story segmentation of Chinese broadcast news
    • L. Xie, Y.-L. Yang, and Z.-Q. Liu On the effectiveness of subwords for lexical cohesion based story segmentation of Chinese broadcast news Information Sciences 181 2011 2873 2891
    • (2011) Information Sciences , vol.181 , pp. 2873-2891
    • Xie, L.1    Yang, Y.-L.2    Liu, Z.-Q.3
  • 54
    • 0032286861
    • Orthogonal column latin hypercubes and their application in computer experiments
    • K.Q. Ye Orthogonal column Latin hypercubes and their application in computer experiments Journal of the American Statistical Association 93 1998 1430 1439 (Pubitemid 128385553)
    • (1998) Journal of the American Statistical Association , vol.93 , Issue.444 , pp. 1430-1439
    • Ye, K.Q.1
  • 55
    • 3543085722
    • Empirical and theoretical comparisons of selected criterion functions for document clustering
    • Y. Zhao, and G. Karypis Empirical and theoretical comparisons of selected criterion functions for document clustering Machine Learning 55 2004 311 331
    • (2004) Machine Learning , vol.55 , pp. 311-331
    • Zhao, Y.1    Karypis, G.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.