메뉴 건너뛰기




Volumn , Issue , 2009, Pages 400-409

A translation system for enabling data mining applications on GPUs

Author keywords

CUDA; Data mining; GPGPU

Indexed keywords

AUTOMATICALLY GENERATED; CODE GENERATION; COMPUTING POWER; CUDA; DATA MINING ALGORITHM; DATA MINING APPLICATIONS; EM CLUSTERING; GENERAL PURPOSE; GENERALIZED REDUCTION; K-MEANS CLUSTERING; PERFORMANCE IMPROVEMENTS; PROGRAM ANALYSIS; SCIENTIFIC DATA; TRANSLATION SYSTEMS;

EID: 70449707774     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1542275.1542331     Document Type: Conference Paper
Times cited : (38)

References (49)
  • 4
    • 70449720693 scopus 로고    scopus 로고
    • Sara Baghsorkhi, Melvin Lathara, and Wen mei Hwu. CUDA-lite: Reducing GPU Programming Complexity. In LCPC 2008, 2008.
    • Sara Baghsorkhi, Melvin Lathara, and Wen mei Hwu. CUDA-lite: Reducing GPU Programming Complexity. In LCPC 2008, 2008.
  • 5
    • 0029394470 scopus 로고    scopus 로고
    • Prithviraj Banerjee, John A. Chandy, Manish Gupta, Eugene W. Hodges IV, John G. Holm, Antonio Lain, Daniel J. Palermo, Shankar Ramaswamy, and Ernesto Su. The Paradigm Compiler for Distributed-Memory Multicomputers. IEEE Computer, 28(10):37-47, October 1995.
    • Prithviraj Banerjee, John A. Chandy, Manish Gupta, Eugene W. Hodges IV, John G. Holm, Antonio Lain, Daniel J. Palermo, Shankar Ramaswamy, and Ernesto Su. The Paradigm Compiler for Distributed-Memory Multicomputers. IEEE Computer, 28(10):37-47, October 1995.
  • 6
    • 57349180412 scopus 로고    scopus 로고
    • Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, and P. Sadayappan. A Compiler Framework for Optimization of Affine Loop Nests for GPGPUs. In International Conference on Supercomputing, pages 225-234, 2008.
    • Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, and P. Sadayappan. A Compiler Framework for Optimization of Affine Loop Nests for GPGPUs. In International Conference on Supercomputing, pages 225-234, 2008.
  • 8
    • 52949145167 scopus 로고    scopus 로고
    • Data-Intensive Supercomputing: The Case for DISC
    • Technical Report CMU-CS-07-128, School of Computer Science, Carnegie Mellon University
    • Randal E. Bryant. Data-Intensive Supercomputing: The Case for DISC. Technical Report CMU-CS-07-128, School of Computer Science, Carnegie Mellon University, 2007.
    • (2007)
    • Bryant, R.E.1
  • 10
    • 33746614750 scopus 로고    scopus 로고
    • A Graphics Hardware Accelerated Algorithm for Nearest Neighbor Search
    • Vassil N. Alexandrov, Geert Dick van Albada, Peter M.A. Sloot, and Jack Dongarra, editors, Computational Science, ICCS 2006, of, Springer
    • Benjamin Bustos, Oliver Deussen, Stefan Hiller, and Daniel Keim. A Graphics Hardware Accelerated Algorithm for Nearest Neighbor Search. In Vassil N. Alexandrov, Geert Dick van Albada, Peter M.A. Sloot, and Jack Dongarra, editors, Computational Science - ICCS 2006, volume 3994 of LNCS, pages 196-199. Springer, 2006.
    • (2006) LNCS , vol.3994 , pp. 196-199
    • Bustos, B.1    Deussen, O.2    Hiller, S.3    Keim, D.4
  • 11
    • 33646532283 scopus 로고    scopus 로고
    • Initial experiences porting a bioinformatics application to a graphics processor
    • Maria Charalambous, Pedro Trancoso, and Alexandros Stamatakis. Initial experiences porting a bioinformatics application to a graphics processor. In Panhellenic Conference on Informatics, pages 415-425, 2005.
    • (2005) Panhellenic Conference on Informatics , pp. 415-425
    • Charalambous, M.1    Trancoso, P.2    Stamatakis, A.3
  • 12
    • 70449717067 scopus 로고    scopus 로고
    • Shuai Che, Jiayuan Meng, and Jeremy W. Sheaffer. A Performance Study of General Purpose Applications on Graphics Processors
    • Shuai Che, Jiayuan Meng, and Jeremy W. Sheaffer. A Performance Study of General Purpose Applications on Graphics Processors.
  • 13
    • 0002607026 scopus 로고    scopus 로고
    • Bayesian classification (autoclass): Theory and practice
    • AAAI Press, MIT Press
    • P. Cheeseman and J. Stutz. Bayesian classification (autoclass): Theory and practice. In Advanced in Knowledge Discovery and Data Mining, pages 61-83. AAAI Press / MIT Press, 1996.
    • (1996) Advanced in Knowledge Discovery and Data Mining , pp. 61-83
    • Cheeseman, P.1    Stutz, J.2
  • 15
    • 85030321143 scopus 로고    scopus 로고
    • Mapreduce: Simplified data processing on large clusters
    • Jeffrey Dean and Sanjay Ghemawat. Mapreduce: Simplified data processing on large clusters. In OSDI, pages 137-150, 2004.
    • (2004) OSDI , pp. 137-150
    • Dean, J.1    Ghemawat, S.2
  • 16
    • 0002629270 scopus 로고
    • Maximum Likelihood Estimation from Incomplete Data via the EM Algorithm
    • Arthur Dempster, Nan Laird, and Donald Rubin. Maximum Likelihood Estimation from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, 39(1):1-38, 1977.
    • (1977) Journal of the Royal Statistical Society , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 22
    • 0030380793 scopus 로고    scopus 로고
    • Maximizing Multiprocessor Performance with the SUIF Compiler
    • December
    • M. Hall, S. Amarsinghe, B. Murphy, S. Liao, and M. Lam. Maximizing Multiprocessor Performance with the SUIF Compiler. IEEE Computer, (12), December 1996.
    • (1996) IEEE Computer , vol.12
    • Hall, M.1    Amarsinghe, S.2    Murphy, B.3    Liao, S.4    Lam, M.5
  • 26
    • 84976813879 scopus 로고
    • Compiling Fortran D for MIMD distributed-memory machines
    • August
    • Seema Hiranandani, Ken Kennedy, and Chau-Wen Tseng. Compiling Fortran D for MIMD distributed-memory machines. Communications of the ACM, 35(8):66-80, August 1992.
    • (1992) Communications of the ACM , vol.35 , Issue.8 , pp. 66-80
    • Hiranandani, S.1    Kennedy, K.2    Tseng, C.-W.3
  • 28
    • 70449713451 scopus 로고    scopus 로고
    • R. Jin and G. Agrawal. Shared memory parallelization of data mining algorithms: Techniques. citeseer.ist.psu.edu/article/jin02shared.html, 2002.
    • R. Jin and G. Agrawal. Shared memory parallelization of data mining algorithms: Techniques. citeseer.ist.psu.edu/article/jin02shared.html, 2002.
  • 30
    • 70449715278 scopus 로고    scopus 로고
    • Andreas Klockner. PyCuda, 2008.
    • (2008)
  • 31
    • 0026231040 scopus 로고
    • Compiling Global Name-Space Parallel Loops for Distributed Execution
    • October
    • C. Koelbel and P. Mehrotra. Compiling Global Name-Space Parallel Loops for Distributed Execution. IEEE Transactions on Parallel and Distributed Systems, 2(4):440-451, October 1991.
    • (1991) IEEE Transactions on Parallel and Distributed Systems , vol.2 , Issue.4 , pp. 440-451
    • Koelbel, C.1    Mehrotra, P.2
  • 33
    • 67650081010 scopus 로고    scopus 로고
    • OpenMP to GPGPU: A Compiler Framework for Automatic Translation and Optimization
    • Seyong Lee, Seung-Jai Min, and Rudolf Eigenmann. OpenMP to GPGPU: A Compiler Framework for Automatic Translation and Optimization. In PPoPP'09, 2009.
    • (2009) PPoPP'09
    • Lee, S.1    Min, S.-J.2    Eigenmann, R.3
  • 34
    • 33845187300 scopus 로고    scopus 로고
    • Parallelizing user-defined and implicit reductions globally on multiprocessors
    • Chris R. Jesshope and Colin Egan, editors, Asia-Pacific Computer Systems Architecture Conference, of, Springer
    • Shih-Wei Liao. Parallelizing user-defined and implicit reductions globally on multiprocessors. In Chris R. Jesshope and Colin Egan, editors, Asia-Pacific Computer Systems Architecture Conference, volume 4186 of Lecture Notes in Computer Science, pages 189-202. Springer, 2006.
    • (2006) Lecture Notes in Computer Science , vol.4186 , pp. 189-202
    • Liao, S.-W.1
  • 36
    • 0031674776 scopus 로고    scopus 로고
    • Optimization of Implicit Reductions for Distributed Memory Multiprocessors
    • Bo Lu and John Mellor-Crummey. Compiler, April
    • Bo Lu and John Mellor-Crummey. Compiler Optimization of Implicit Reductions for Distributed Memory Multiprocessors. In Proceedings of the 12th International Parallel Processing Symposium (IPPS), April 1998.
    • (1998) Proceedings of the 12th International Parallel Processing Symposium (IPPS)
  • 37
    • 0002431740 scopus 로고    scopus 로고
    • Automatic Construction of Decision Trees from Data: A Multi-disciplinary Survey
    • S. K. Murthy. Automatic Construction of Decision Trees from Data: A Multi-disciplinary Survey. Data Mining and Knowledge Discovery, 2(4):345-389, 1998.
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.4 , pp. 345-389
    • Murthy, S.K.1
  • 38
    • 70449715277 scopus 로고    scopus 로고
    • NVidia. NVIDIA CUDA Compute Unified Device Architecture Programming Guide. version 2.0. http://developer.download.nvidia.com/compute/cuda/2.0-Beta2/ docs/Programming-Guide-2.0beta2.pdf, June 7 2008.
    • NVidia. NVIDIA CUDA Compute Unified Device Architecture Programming Guide. version 2.0. http://developer.download.nvidia.com/compute/cuda/2.0-Beta2/ docs/Programming-Guide-2.0beta2.pdf, June 7 2008.
  • 40
    • 77951558943 scopus 로고    scopus 로고
    • A Performance-oriented Data Parallel Virtual Machine for GPUs
    • New York, NY, USA, ACM
    • Mark Peercy, Mark Segal, and Derek Gerstmann. A Performance-oriented Data Parallel Virtual Machine for GPUs. In SIGGRAPH '06: ACM SIGGRAPH 2006 Sketches, page 184, New York, NY, USA, 2006. ACM.
    • (2006) SIGGRAPH '06: ACM SIGGRAPH 2006 Sketches , pp. 184
    • Peercy, M.1    Segal, M.2    Gerstmann, D.3
  • 41
    • 0031631999 scopus 로고    scopus 로고
    • The Role of Associativity and Commutativity in the Detection and Transformation of Loop-Level Parallelism
    • ACM Press, July
    • William M. Pottenger. The Role of Associativity and Commutativity in the Detection and Transformation of Loop-Level Parallelism. In Conference Proceedings of the 1998 International Conference on Supercomputing (ICS), pages 188-195. ACM Press, July 1998.
    • (1998) Conference Proceedings of the 1998 International Conference on Supercomputing (ICS) , pp. 188-195
    • Pottenger, W.M.1
  • 44
    • 58449109179 scopus 로고    scopus 로고
    • John Stratton, Sam Stone, and Wen mei Hwu. MCUDA: An Efficient Implementation of CUDA Kernels for Multi-Core CPUs. In 21st Annual Workshop on Languages and Compilers for Parallel Computing (LCPC'2008), July 2008.
    • John Stratton, Sam Stone, and Wen mei Hwu. MCUDA: An Efficient Implementation of CUDA Kernels for Multi-Core CPUs. In 21st Annual Workshop on Languages and Compilers for Parallel Computing (LCPC'2008), July 2008.
  • 49
    • 0027543560 scopus 로고
    • Compiling for Distributed-Memory Systems
    • February, In Special Section on Languages and Compilers for Parallel Machines
    • Hans P. Zima and Barbara Mary Chapman. Compiling for Distributed-Memory Systems. Proceedings of the IEEE, 81(2):264-287, February 1993. In Special Section on Languages and Compilers for Parallel Machines.
    • (1993) Proceedings of the IEEE , vol.81 , Issue.2 , pp. 264-287
    • Zima, H.P.1    Mary Chapman, B.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.