SCOPUS Information Search Platform

Proceedings of the International Conference on Supercomputing

Volume , Issue , 2009, Pages 400-409

A translation system for enabling data mining applications on GPUs

(2) Ma, Wenjing ᵃ; Agrawal, Gagan ᵃ

ᵃ Ohio State University (United States)

Author keywords

CUDA; Data mining; GPGPU

Indexed keywords

AUTOMATICALLY GENERATED; CODE GENERATION; COMPUTING POWER; CUDA; DATA MINING ALGORITHM; DATA MINING APPLICATIONS; EM CLUSTERING; GENERAL PURPOSE; GENERALIZED REDUCTION; K-MEANS CLUSTERING; PERFORMANCE IMPROVEMENTS; PROGRAM ANALYSIS; SCIENTIFIC DATA; TRANSLATION SYSTEMS;

DATA MINING; INTELLIGENT CONTROL; PRINCIPAL COMPONENT ANALYSIS; PROGRAM PROCESSORS;

CLUSTER ANALYSIS;

EID: 70449707774 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1542275.1542331 Document Type: Conference Paper

Times cited : (38)

References (49)

1
- 0031623811
- Using Integer Sets for Data-parallel Program Analysis and Optimization
- June
- Vikram Adve and John Mellor-Crummey. Using Integer Sets for Data-parallel Program Analysis and Optimization. In Proceedings of the SIGPLAN '98 Conference on Programming Language Design and Implementation, June 1998.
- (1998) Proceedings of the SIGPLAN '98 Conference on Programming Language Design and Implementation
- Adve, V.¹ Mellor-Crummey, J.²

2
- 0030403087
- Parallel Mining of Association Rules
- June
- R. Agrawal and J. Shafer. Parallel Mining of Association Rules. IEEE Transactions on Knowledge and Data Engineering, 8(6):962-969, June 1996.
- (1996) IEEE Transactions on Knowledge and Data Engineering , vol.8 , Issue.6 , pp. 962-969
- Agrawal, R.¹ Shafer, J.²

3
- 84963865510
- Flow Insensitive Points-To Sets
- 00:0081
- P. Anderson, D. Binkley, G. Rosay, and T. Teitelbaum. Flow Insensitive Points-To Sets. scam, 00:0081, 2001.
- (2001) scam
- Anderson, P.¹ Binkley, D.² Rosay, G.³ Teitelbaum, T.⁴

4
- 70449720693
- Sara Baghsorkhi, Melvin Lathara, and Wen-mei Hwu. CUDA-lite: Reducing GPU Programming Complexity. In LCPC 2008, 2008.

5
- 0029394470
- Prithviraj Banerjee, John A. Chandy, Manish Gupta, Eugene W. Hodges IV, John G. Holm, Antonio Lain, Daniel J. Palermo, Shankar Ramaswamy, and Ernesto Su. The Paradigm Compiler for Distributed-Memory Multicomputers. IEEE Computer, 28(10):37-47, October 1995.

6
- 57349180412
- Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, and P. Sadayappan. A Compiler Framework for Optimization of Affine Loop Nests for GPGPUs. In International Conference on Supercomputing, pages 225-234, 2008.

7
- 0030382364
- Parallel programming with Polaris
- December
- W. Blume, R. Doallo, R. Eigenmann, J. Grout, J. Hoeflinger, T. Lawrence, J. Lee, D. Padua, Y. Paek, B. Pottenger, L. Rauchwerger, and P. Tu. Parallel programming with Polaris. IEEE Computer, 29(12):78-82, December 1996.
- (1996) IEEE Computer , vol.29 , Issue.12 , pp. 78-82
- Blume, W.¹ Doallo, R.² Eigenmann, R.³ Grout, J.⁴ Hoeflinger, J.⁵ Lawrence, T.⁶ Lee, J.⁷ Padua, D.⁸ Paek, Y.⁹ Pottenger, B.¹⁰ Rauchwerger, L.¹¹ Tu, P.¹²

8
- 52949145167
- Data-Intensive Supercomputing: The Case for DISC
- Technical Report CMU-CS-07-128, School of Computer Science, Carnegie Mellon University
- Randal E. Bryant. Data-Intensive Supercomputing: The Case for DISC. Technical Report CMU-CS-07-128, School of Computer Science, Carnegie Mellon University, 2007.
- (2007)
- Bryant, R.E.¹

9
- 61849183365
- I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan. Brook for GPUs: Stream Computing on Graphics Hardware, 2004.
- (2004) Brook for GPUs: Stream Computing on Graphics Hardware
- Buck, I.¹ Foley, T.² Horn, D.³ Sugerman, J.⁴ Fatahalian, K.⁵ Houston, M.⁶ Hanrahan, P.⁷

10
- 33746614750
- A Graphics Hardware Accelerated Algorithm for Nearest Neighbor Search
- Vassil N. Alexandrov, Geert Dick van Albada, Peter M.A. Sloot, and Jack Dongarra, editors, Computational Science - ICCS 2006, volume 3994 of LNCS, Springer
- Benjamin Bustos, Oliver Deussen, Stefan Hiller, and Daniel Keim. A Graphics Hardware Accelerated Algorithm for Nearest Neighbor Search. In Vassil N. Alexandrov, Geert Dick van Albada, Peter M.A. Sloot, and Jack Dongarra, editors, Computational Science - ICCS 2006, volume 3994 of LNCS, pages 196-199. Springer, 2006.
- (2006) LNCS , vol.3994 , pp. 196-199
- Bustos, B.¹ Deussen, O.² Hiller, S.³ Keim, D.⁴

11
- 33646532283
- Initial experiences porting a bioinformatics application to a graphics processor
- Maria Charalambous, Pedro Trancoso, and Alexandros Stamatakis. Initial experiences porting a bioinformatics application to a graphics processor. In Panhellenic Conference on Informatics, pages 415-425, 2005.
- (2005) Panhellenic Conference on Informatics , pp. 415-425
- Charalambous, M.¹ Trancoso, P.² Stamatakis, A.³

12
- 70449717067
- Shuai Che, Jiayuan Meng, and Jeremy W. Sheaffer. A Performance Study of General Purpose Applications on Graphics Processors.

13
- 0002607026
- Bayesian classification (autoclass): Theory and practice
- AAAI Press, MIT Press
- P. Cheeseman and J. Stutz. Bayesian classification (autoclass): Theory and practice. In Advances in Knowledge Discovery and Data Mining, pages 61-83. AAAI Press / MIT Press, 1996.
- (1996) Advances in Knowledge Discovery and Data Mining , pp. 61-83
- Cheeseman, P.¹ Stutz, J.²

14
- 70449699793
- General-Purpose Sparse Matrix Building Blocks using the NVIDIA CUDA Technology Platform
- Oct
- Matthias Christen, Olaf Schenk, and Helmar Burkhart. General-Purpose Sparse Matrix Building Blocks using the NVIDIA CUDA Technology Platform. In First Workshop on General Purpose Processing on Graphics Processing Units, Oct 2007.
- (2007) First Workshop on General Purpose Processing on Graphics Processing Units
- Christen, M.¹ Schenk, O.² Burkhart, H.³

15
- 85030321143
- Mapreduce: Simplified data processing on large clusters
- Jeffrey Dean and Sanjay Ghemawat. Mapreduce: Simplified data processing on large clusters. In OSDI, pages 137-150, 2004.
- (2004) OSDI , pp. 137-150
- Dean, J.¹ Ghemawat, S.²

16
- 0002629270
- Maximum Likelihood Estimation from Incomplete Data via the EM Algorithm
- Arthur Dempster, Nan Laird, and Donald Rubin. Maximum Likelihood Estimation from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, 39(1):1-38, 1977.
- (1977) Journal of the Royal Statistical Society , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

17
- 84934299651
- GPU Cluster for High Performance Computing
- Washington, DC, USA, IEEE Computer Society
- Zhe Fan, Feng Qiu, Arie Kaufman, and Suzanne Yoakum-Stover. GPU Cluster for High Performance Computing. In SC '04: Proceedings of the 2004 ACM/IEEE conference on Supercomputing, page 47, Washington, DC, USA, 2004. IEEE Computer Society.
- (2004) SC '04: Proceedings of the 2004 ACM/IEEE conference on Supercomputing , pp. 47
- Fan, Z.¹ Qiu, F.² Kaufman, A.³ Yoakum-Stover, S.⁴

18
- 84867705757
- Vincent Garcia, Eric Debreuve, and Michel Barlaud. Fast k Nearest Neighbor Search using GPU, 2008.
- (2008) Fast k Nearest Neighbor Search using GPU
- Garcia, V.¹ Debreuve, E.² Barlaud, M.³

19
- 33947607609
- GPUTeraSort: High Performance Graphics Co-processor Sorting for Large Database Management
- New York, NY, USA, ACM
- Naga Govindaraju, Jim Gray, Ritesh Kumar, and Dinesh Manocha. GPUTeraSort: High Performance Graphics Co-processor Sorting for Large Database Management. In SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of data, pages 325-336, New York, NY, USA, 2006. ACM.
- (2006) SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of data , pp. 325-336
- Govindaraju, N.¹ Gray, J.² Kumar, R.³ Manocha, D.⁴

20
- 0029722997
- Static Analysis to Reduce Synchronization Costs in Data-Parallel Programs
- ACM Press, January
- Manish Gupta and Edith Schonberg. Static Analysis to Reduce Synchronization Costs in Data-Parallel Programs. In Conference Record of the 23rd ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, pages 322-332. ACM Press, January 1996.
- (1996) Conference Record of the 23rd ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages , pp. 322-332
- Gupta, M.¹ Schonberg, E.²

21
- 79961196144
- Jun
- Jesse D. Hall and John C. Hart. GPU Acceleration of Iterative Clustering. Jun 2004.
- (2004) GPU Acceleration of Iterative Clustering
- Hall, J.D.¹ Hart, J.C.²

22
- 0030380793
- Maximizing Multiprocessor Performance with the SUIF Compiler
- December
- M. Hall, S. Amarasinghe, B. Murphy, S. Liao, and M. Lam. Maximizing Multiprocessor Performance with the SUIF Compiler. IEEE Computer, (12), December 1996.
- (1996) IEEE Computer , vol.12
- Hall, M.¹ Amarasinghe, S.² Murphy, B.³ Liao, S.⁴ Lam, M.⁵

23
- 0003238191
- Improving Compiler and Runtime Support for Irregular Reductions
- August
- H. Han and Chau-Wen Tseng. Improving Compiler and Runtime Support for Irregular Reductions. In Proceedings of the 11th Workshop on Languages and Compilers for Parallel Computing, August 1998.
- (1998) Proceedings of the 11th Workshop on Languages and Compilers for Parallel Computing
- Han, H.¹ Tseng, C.-W.²

24
- 0003585297
- Morgan Kaufmann Publishers
- Jiawei Han and Micheline Kamber. Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, 2000.
- (2000) Data Mining: Concepts and Techniques
- Han, J.¹ Kamber, M.²

25
- 63549097654
- Mars: A MapReduce Framework on Graphics Processors
- Bingsheng He, Wenbin Fang, Qiong Luo, Naga K. Govindaraju, and Tuyong Wang. Mars: A MapReduce Framework on Graphics Processors. In PACT08: IEEE International Conference on Parallel Architecture and Compilation Techniques 2008, 2008.
- (2008) PACT08: IEEE International Conference on Parallel Architecture and Compilation Techniques 2008
- He, B.¹ Fang, W.² Luo, Q.³ Govindaraju, N.K.⁴ Wang, T.⁵

26
- 84976813879
- Compiling Fortran D for MIMD distributed-memory machines
- August
- Seema Hiranandani, Ken Kennedy, and Chau-Wen Tseng. Compiling Fortran D for MIMD distributed-memory machines. Communications of the ACM, 35(8):66-80, August 1992.
- (1992) Communications of the ACM , vol.35 , Issue.8 , pp. 66-80
- Hiranandani, S.¹ Kennedy, K.² Tseng, C.-W.³

27
- 0004161991
- Prentice Hall
- A. K. Jain and R. C. Dubes. Algorithms for Clustering Data. Prentice Hall, 1988.
- (1988) Algorithms for Clustering Data
- Jain, A.K.¹ Dubes, R.C.²

28
- 70449713451
- R. Jin and G. Agrawal. Shared memory parallelization of data mining algorithms: Techniques. citeseer.ist.psu.edu/article/jin02shared.html, 2002.

29
- 12444324974
- A Middleware for Developing Parallel Data Mining Implementations
- April
- Ruoming Jin and Gagan Agrawal. A Middleware for Developing Parallel Data Mining Implementations. In Proceedings of the first SIAM conference on Data Mining, April 2001.
- (2001) Proceedings of the first SIAM conference on Data Mining
- Jin, R.¹ Agrawal, G.²

30
- 70449715278
- Andreas Klockner. PyCuda, 2008.
- (2008)

31
- 0026231040
- Compiling Global Name-Space Parallel Loops for Distributed Execution
- October
- C. Koelbel and P. Mehrotra. Compiling Global Name-Space Parallel Loops for Distributed Execution. IEEE Transactions on Parallel and Distributed Systems, 2(4):440-451, October 1991.
- (1991) IEEE Transactions on Parallel and Distributed Systems , vol.2 , Issue.4 , pp. 440-451
- Koelbel, C.¹ Mehrotra, P.²

32
- 3042658703
- LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation
- Palo Alto, California, Mar
- Chris Lattner and Vikram Adve. LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation. In Proceedings of the 2004 International Symposium on Code Generation and Optimization (CGO'04), Palo Alto, California, Mar 2004.
- (2004) Proceedings of the 2004 International Symposium on Code Generation and Optimization (CGO'04)
- Lattner, C.¹ Adve, V.²

33
- 67650081010
- OpenMP to GPGPU: A Compiler Framework for Automatic Translation and Optimization
- Seyong Lee, Seung-Jai Min, and Rudolf Eigenmann. OpenMP to GPGPU: A Compiler Framework for Automatic Translation and Optimization. In PPoPP'09, 2009.
- (2009) PPoPP'09
- Lee, S.¹ Min, S.-J.² Eigenmann, R.³

34
- 33845187300
- Parallelizing user-defined and implicit reductions globally on multiprocessors
- Chris R. Jesshope and Colin Egan, editors, Asia-Pacific Computer Systems Architecture Conference, volume 4186 of Lecture Notes in Computer Science, Springer
- Shih-Wei Liao. Parallelizing user-defined and implicit reductions globally on multiprocessors. In Chris R. Jesshope and Colin Egan, editors, Asia-Pacific Computer Systems Architecture Conference, volume 4186 of Lecture Notes in Computer Science, pages 189-202. Springer, 2006.
- (2006) Lecture Notes in Computer Science , vol.4186 , pp. 189-202
- Liao, S.-W.¹

35
- 0005006119
- On the automatic parallelization of sparse and irregular Fortran programs
- May
- Yuan Lin and David Padua. On the automatic parallelization of sparse and irregular Fortran programs. In Proceedings of the Workshop on Languages, Compilers, and Runtime Systems for Scalable Computers (LCR - 98), May 1998.
- (1998) Proceedings of the Workshop on Languages, Compilers, and Runtime Systems for Scalable Computers (LCR - 98)
- Lin, Y.¹ Padua, D.²

36
- 0031674776
- Compiler Optimization of Implicit Reductions for Distributed Memory Multiprocessors
- April
- Bo Lu and John Mellor-Crummey. Compiler Optimization of Implicit Reductions for Distributed Memory Multiprocessors. In Proceedings of the 12th International Parallel Processing Symposium (IPPS), April 1998.
- (1998) Proceedings of the 12th International Parallel Processing Symposium (IPPS)

37
- 0002431740
- Automatic Construction of Decision Trees from Data: A Multi-disciplinary Survey
- S. K. Murthy. Automatic Construction of Decision Trees from Data: A Multi-disciplinary Survey. Data Mining and Knowledge Discovery, 2(4):345-389, 1998.
- (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.4 , pp. 345-389
- Murthy, S.K.¹

38
- 70449715277
- NVidia. NVIDIA CUDA Compute Unified Device Architecture Programming Guide. version 2.0. http://developer.download.nvidia.com/compute/cuda/2.0-Beta2/docs/Programming-Guide-2.0beta2.pdf, June 7 2008.

39
- 0039845361
- SQLEM: Fast Clustering in SQL Using the EM Algorithm
- ACM Press, June
- C. Ordonez and P. Cereghini. SQLEM: Fast Clustering in SQL Using the EM Algorithm. In Proceedings of the ACM SIGMOD Conference on Management of Data, pages 559-570. ACM Press, June 2000.
- (2000) Proceedings of the ACM SIGMOD Conference on Management of Data , pp. 559-570
- Ordonez, C.¹ Cereghini, P.²

40
- 77951558943
- A Performance-oriented Data Parallel Virtual Machine for GPUs
- New York, NY, USA, ACM
- Mark Peercy, Mark Segal, and Derek Gerstmann. A Performance-oriented Data Parallel Virtual Machine for GPUs. In SIGGRAPH '06: ACM SIGGRAPH 2006 Sketches, page 184, New York, NY, USA, 2006. ACM.
- (2006) SIGGRAPH '06: ACM SIGGRAPH 2006 Sketches , pp. 184
- Peercy, M.¹ Segal, M.² Gerstmann, D.³

41
- 0031631999
- The Role of Associativity and Commutativity in the Detection and Transformation of Loop-Level Parallelism
- ACM Press, July
- William M. Pottenger. The Role of Associativity and Commutativity in the Detection and Transformation of Loop-Level Parallelism. In Conference Proceedings of the 1998 International Conference on Supercomputing (ICS), pages 188-195. ACM Press, July 1998.
- (1998) Conference Proceedings of the 1998 International Conference on Supercomputing (ICS) , pp. 188-195
- Pottenger, W.M.¹

42
- 10444224900
- Photon Mapping on Programmable Graphics Hardware
- Eurographics Association
- Timothy J. Purcell, Craig Donner, Mike Cammarano, Henrik Wann Jensen, and Pat Hanrahan. Photon Mapping on Programmable Graphics Hardware. In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware, pages 41-50. Eurographics Association, 2003.
- (2003) Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware , pp. 41-50
- Purcell, T.J.¹ Donner, C.² Cammarano, M.³ Jensen, H.W.⁴ Hanrahan, P.⁵

43
- 57749177628
- Fast Parallel GPU-Sorting Using a Hybrid Algorithm
- Oct
- Erik Sintorn and Ulf Assarsson. Fast Parallel GPU-Sorting Using a Hybrid Algorithm. In First Workshop on General Purpose Processing on Graphics Processing Units, Oct 2007.
- (2007) First Workshop on General Purpose Processing on Graphics Processing Units
- Sintorn, E.¹ Assarsson, U.²

44
- 58449109179
- John Stratton, Sam Stone, and Wen-mei Hwu. MCUDA: An Efficient Implementation of CUDA Kernels for Multi-Core CPUs. In 21st Annual Workshop on Languages and Compilers for Parallel Computing (LCPC'2008), July 2008.

45
- 33947595619
- Accelerator: Using Data Parallelism to Program GPUs for General-purpose Uses
- New York, NY, USA, ACM
- David Tarditi, Sidd Puri, and Jose Oglesby. Accelerator: Using Data Parallelism to Program GPUs for General-purpose Uses. In ASPLOS-XII: Proceedings of the 12th international conference on Architectural support for programming languages and operating systems, pages 325-335, New York, NY, USA, 2006. ACM.
- (2006) ASPLOS-XII: Proceedings of the 12th international conference on Architectural support for programming languages and operating systems , pp. 325-335
- Tarditi, D.¹ Puri, S.² Oglesby, J.³

46
- 33845330575
- Exploring Graphics Processor Performance for General Purpose Applications
- Pedro Trancoso and Maria Charalambous. Exploring Graphics Processor Performance for General Purpose Applications. In Eighth Euromicro Symposium on Digital Systems Design (DSD 2005), pages 306-313, 2005.
- (2005) Eighth Euromicro Symposium on Digital Systems Design (DSD 2005) , pp. 306-313
- Trancoso, P.¹ Charalambous, M.²

47
- 84880310111
- Neil Trevett. OpenCL: The Open Standard for Heterogeneous Parallel Programming, 2008.
- (2008) OpenCL: The Open Standard for Heterogeneous Parallel Programming
- Trevett, N.¹

48
- 0033703286
- Adaptive Reduction Parallelization Techniques
- ACM Press, May
- Hao Yu and Lawrence Rauchwerger. Adaptive Reduction Parallelization Techniques. In Proceedings of the 2000 International Conference on Supercomputing, pages 66-75. ACM Press, May 2000.
- (2000) Proceedings of the 2000 International Conference on Supercomputing , pp. 66-75
- Yu, H.¹ Rauchwerger, L.²

49
- 0027543560
- Compiling for Distributed-Memory Systems
- February, In Special Section on Languages and Compilers for Parallel Machines
- Hans P. Zima and Barbara Mary Chapman. Compiling for Distributed-Memory Systems. Proceedings of the IEEE, 81(2):264-287, February 1993. In Special Section on Languages and Compilers for Parallel Machines.
- (1993) Proceedings of the IEEE , vol.81 , Issue.2 , pp. 264-287
- Zima, H.P.¹ Chapman, B.M.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.