Volume 29, Issue 4, 2013, Pages 1024-1034

On the performance of high dimensional data clustering and classification algorithms

Author keywords

Classification; Clustering; Distributed stream processing; Granules; Hadoop; Machine learning; Mahout

Indexed keywords

ARTIFICIAL INTELLIGENCE; CLASSIFICATION (OF INFORMATION); DATA MINING; DISTRIBUTED PARAMETER CONTROL SYSTEMS; GRANULATION; LEARNING SYSTEMS; PATTERN RECOGNITION; PATTERN RECOGNITION SYSTEMS;

EID: 84863770788     PISSN: 0167739X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.future.2012.05.026     Document Type: Article
Times cited : (45)

References (34)
  • 1
    • 37549003336
    • MapReduce: simplified data processing on large clusters
    • [1] Dean, J., Ghemawat, S., MapReduce: simplified data processing on large clusters. Communications of the ACM 51 (2008), 107–113.
    • (2008) Communications of the ACM , vol.51 , pp. 107-113
    • Dean, J.1    Ghemawat, S.2
  • 2
    • 79960530372
    • Mahout in Action
    • Manning Publications (est.)
    • [2] Owen, S., et al. Mahout in Action. 2011, Manning Publications (est.).
    • (2011)
    • Owen, S.1
  • 3
    • 74049113467
    • Hadoop: The Definitive Guide
    • first ed. O'Reilly Media
    • [3] White, T., Hadoop: The Definitive Guide. first ed., 2009, O'Reilly Media.
    • (2009)
    • White, T.1
  • 4
    • 84892062680
    • A survey of clustering data mining techniques
    • J. Kogan C.K. Nicholas Springer
    • [4] Berkhin, P., A survey of clustering data mining techniques. Kogan, J., Nicholas, C.K., (eds.) Grouping Multidimensional Data: Recent Advances in Clustering, 2006, Springer, 25–83.
    • (2006) Grouping Multidimensional Data: Recent Advances in Clustering , pp. 25-83
    • Berkhin, P.1
  • 5
    • 84897546815
    • Clustering approach for hybrid recommender system, in: Proceedings IEEE/WIC International Conference on Web Intelligence, WI 2003, 2003, pp. 33–38.
    • [5] L. Qing, K. Byeong Man, Clustering approach for hybrid recommender system, in: Proceedings IEEE/WIC International Conference on Web Intelligence, WI 2003, 2003, pp. 33–38.
    • Qing, L.1
  • 7
    • 0035024021
    • Validating clustering for gene expression data
    • [7] Yeung, K.Y., et al. Validating clustering for gene expression data. Bioinformatics 17 (2001), 309–318.
    • (2001) Bioinformatics , vol.17 , pp. 309-318
    • Yeung, K.Y.1
  • 8
    • 80054748843
    • Classification of EEG during imagined mental tasks by forecasting with Elman Recurrent Neural Networks
    • [8] E.M. Forney, C.W. Anderson, Classification of EEG during imagined mental tasks by forecasting with Elman Recurrent Neural Networks, in: The 2011 International Joint Conference on Neural Networks, IJCNN, 2011, pp. 2749–2755.
    • (2011) The 2011 International Joint Conference on Neural Networks, IJCNN , pp. 2749-2755
    • Forney, E.M.1    Anderson, C.W.2
  • 9
    • 85014214696
    • An empirical comparison of supervised machine learning techniques in bioinformatics, Presented at the Proceedings of the First Asia-Pacific Bioinformatics Conference on Bioinformatics 2003—Volume 19, Adelaide, Australia, 2003.
    • [9] A.C. Tan, D. Gilbert, An empirical comparison of supervised machine learning techniques in bioinformatics, Presented at the Proceedings of the First Asia-Pacific Bioinformatics Conference on Bioinformatics 2003—Volume 19, Adelaide, Australia, 2003.
    • Tan, A.C.1    Gilbert, D.2
  • 10
    • 72049101634
    • Granules: a lightweight, streaming runtime for cloud computing with support for map-reduce, in: IEEE International Conference on Cluster Computing, New Orleans, LA, 2009.
    • [10] S. Pallickara, et al. Granules: a lightweight, streaming runtime for cloud computing with support for map-reduce, in: IEEE International Conference on Cluster Computing, New Orleans, LA, 2009.
    • Pallickara, S.1
  • 11
    • 62749104474
    • An Overview of the granules runtime for cloud computing, in: IEEE International Conference on e-Science, Indianapolis, 2008.
    • [11] S. Pallickara, et al. An Overview of the granules runtime for cloud computing, in: IEEE International Conference on e-Science, Indianapolis, 2008.
    • Pallickara, S.1
  • 13
    • 0004008854
    • Pattern Recognition with Fuzzy Objective Function Algorithms
    • Kluwer Academic Publishers Norwell, MA
    • [13] Bezdek, J.C., Pattern Recognition with Fuzzy Objective Function Algorithms. 1981, Kluwer Academic Publishers, Norwell, MA.
    • (1981)
    • Bezdek, J.C.1
  • 16
    • 85014256625
    • Naive (Bayes) at forty: the independence assumption in information retrieval.
    • [16] D. Lewis, Naive (Bayes) at forty: the independence assumption in information retrieval.
    • Lewis, D.1
  • 17
    • 0021410050
    • Learning characteristics of stochastic-gradient-descent algorithms: a general study, analysis, and critique
    • [17] Gardner, W.A., Learning characteristics of stochastic-gradient-descent algorithms: a general study, analysis, and critique. Signal Process. 6 (1984), 113–133.
    • (1984) Signal Process. , vol.6 , pp. 113-133
    • Gardner, W.A.1
  • 19
    • 53749089739
    • Fast support vector machine training and classification on graphics processors, Presented at the Proceedings of the 25th International Conference on Machine Learning, Helsinki, Finland, 2008.
    • [19] B. Catanzaro, et al. Fast support vector machine training and classification on graphics processors, Presented at the Proceedings of the 25th International Conference on Machine Learning, Helsinki, Finland, 2008.
    • Catanzaro, B.1
  • 20
    • 85014154567
    • DryadLINQ: a system for general-purpose distributed data-parallel computing using a high-level language, Presented at the Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation, San Diego, California, 2008.
    • [20] Y. Yu, et al. DryadLINQ: a system for general-purpose distributed data-parallel computing using a high-level language, Presented at the Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation, San Diego, California, 2008.
    • Yu, Y.1
  • 22
    • 85014148736
    • Map-reduce expansion of the ISGA genomic analysis web server, Presented at the CloudCom 2010, Indianapolis, USA, 2010.
    • [22] C. Hemmerich, et al. Map-reduce expansion of the ISGA genomic analysis web server, Presented at the CloudCom 2010, Indianapolis, USA, 2010.
    • Hemmerich, C.1
  • 23
    • 80052795428
    • Adaptive heterogeneous language support within a cloud runtime
    • [23] Ericson, K., Pallickara, S., Adaptive heterogeneous language support within a cloud runtime. Future Generation Computer Systems 28 (2012), 128–135.
    • (2012) Future Generation Computer Systems , vol.28 , pp. 128-135
    • Ericson, K.1    Pallickara, S.2
  • 24
    • 67650326696
    • The Hadoop distributed file system: architecture and design
    • [24] D. Borthakur, 2007, The Hadoop distributed file system: architecture and design. Available: http://hadoop.apache.org/common/docs/r0.18.0/hdfs_design.pdf.
    • (2007)
    • Borthakur, D.1
  • 25
    • 85014300991
    • Dryad: distributed data-parallel programs from sequential building blocks, in: 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems, Lisbon, Portugal, 2007.
    • [25] M. Isard, et al. Dryad: distributed data-parallel programs from sequential building blocks, in: 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems, Lisbon, Portugal, 2007.
    • Isard, M.1
  • 27
    • 79952376729
    • Analyzing electroencephalograms using cloud computing techniques, in: IEEE Conference on Cloud Computing Technology and Science, Indianapolis, USA, 2010.
    • [27] K. Ericson, et al. Analyzing electroencephalograms using cloud computing techniques, in: IEEE Conference on Cloud Computing Technology and Science, Indianapolis, USA, 2010.
    • Ericson, K.1
  • 28
    • 0034592784
    • Efficient clustering of high-dimensional data sets with application to reference matching, Presented at the Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Boston, Massachusetts, United States, 2000.
    • [28] A. McCallum, et al. Efficient clustering of high-dimensional data sets with application to reference matching, Presented at the Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Boston, Massachusetts, United States, 2000.
    • McCallum, A.1
  • 29
    • 85014234390
    • Reuters-21578 text categorization test collection, Distribution 1.0, A.T.L.-. Research, Ed., 1.0 ed: UCI Machine Learning Repository, 1997.
    • [29] D.D. Lewis, Reuters-21578 text categorization test collection, Distribution 1.0, A.T.L.-. Research, Ed., 1.0 ed: UCI Machine Learning Repository, 1997.
    • Lewis, D.D.1
  • 30
    • 85014269463
    • Twenty newsgroups data set, ed: UCI Machine Learning Repository, 1999.
    • [30] T. Mitchell, Twenty newsgroups data set, ed: UCI Machine Learning Repository, 1999.
    • Mitchell, T.1
  • 31
    • 0004312284
    • Artificial Neural Networks
    • Prentice-Hall of India
    • [31] Yegnanarayana, B., Artificial Neural Networks. 2004, Prentice-Hall of India.
    • (2004)
    • Yegnanarayana, B.1
  • 33
    • 85014193785
    • The netflix prize, KDD Cup and Workshop, 2007.
    • [33] J. Bennett, S. Lanning, The Netflix prize, KDD Cup and Workshop, 2007. www.netflixprize.com.
    • Bennett, J.1    Lanning, S.2
  • 34
    • 0037252945
    • Amazon.com recommendations: item-to-item collaborative filtering
    • [34] Linden, G., et al. Amazon.com recommendations: item-to-item collaborative filtering. IEEE Internet Computing 7 (2003), 76–80.
    • (2003) IEEE Internet Computing , vol.7 , pp. 76-80
    • Linden, G.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.