메뉴 건너뛰기




Volumn , Issue , 2009, Pages

A framework for efficient and scalable execution of domain-specific templates on GPUs

Author keywords

[No Author keywords available]

Indexed keywords

ABSTRACTION LEVEL; CODE GENERATORS; COMPUTING INDUSTRY; CONVOLUTIONAL NEURAL NETWORK; CRITICAL PROBLEMS; DOMAIN SPECIFIC; GPU COMPUTING; GPU IMPLEMENTATION; GPU PROGRAMMING; GRAPHICS CARD; GRAPHICS PROCESSING UNITS; INPUT DATAS; LARGE DATA; LARGE DATASETS; MANY-CORE COMPUTING; MEMORY FOOTPRINT; OPERATOR-SPLITTING; PARALLEL OPERATORS; PERFORMANCE IMPROVEMENTS; SOFTWARE FRAMEWORKS; VIDEO ANALYSIS;

EID: 70450029523     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2009.5161039     Document Type: Conference Paper
Times cited : (39)

References (21)
  • 2
    • 70450101161 scopus 로고    scopus 로고
    • GPGPU community website
    • GPGPU community website. http ://www. gpgpu . org.
  • 3
    • 70450117428 scopus 로고    scopus 로고
    • Torch5 library. http://torch5.sourceforge.net.
    • Torch5 library
  • 4
    • 70449932975 scopus 로고    scopus 로고
    • Advanced Micro Devices, Inc
    • Advanced Micro Devices, Inc. AMD Stream Computing SDK. http://ati.amd.com/technology/ streamcomputing/index.html.
    • AMD Stream Computing SDK
  • 8
    • 70450036077 scopus 로고    scopus 로고
    • P. Dubey. A Platform 2015 Workload Model: Recognition, Mining and Synthesis Moves Computers to the Era of Tera, 2007. ftp ://download. intel. com/technology/computing/archinnov/ platform2015/download/RMS.pdf.
    • P. Dubey. A Platform 2015 Workload Model: Recognition, Mining and Synthesis Moves Computers to the Era of Tera, 2007. ftp ://download. intel. com/technology/computing/archinnov/ platform2015/download/RMS.pdf.
  • 12
    • 34748865391 scopus 로고    scopus 로고
    • T .J. Knight, J. Young, Park, M. Ren, M. Houston, M. Erez, K. Fatahalian, A. Aiken, W. J. Dally, and P. Hanrahan. Compilation for explicitly managed memory hierarchies. In PPoPP '07: Proceedings of the 12th ACM SIGPLAN Symposium on Principles and practice of parallel programming, March 2007.
    • T .J. Knight, J. Young, Park, M. Ren, M. Houston, M. Erez, K. Fatahalian, A. Aiken, W. J. Dally, and P. Hanrahan. Compilation for explicitly managed memory hierarchies. In PPoPP '07: Proceedings of the 12th ACM SIGPLAN Symposium on Principles and practice of parallel programming, March 2007.
  • 15
    • 70450109401 scopus 로고    scopus 로고
    • NVIDIA Corporation. NVIDIA CUDA, 2007. http:// nvidia.com/cuda.
    • NVIDIA Corporation. NVIDIA CUDA, 2007. http:// nvidia.com/cuda.
  • 16
    • 70450115089 scopus 로고    scopus 로고
    • Toward automatic parallelization and auto-tuning of affine kernels for GPUs
    • July
    • J. Ramanujam. Toward automatic parallelization and auto-tuning of affine kernels for GPUs. In Workshop on Automatic Tuning for Petascale Systems, July 2008.
    • (2008) Workshop on Automatic Tuning for Petascale Systems
    • Ramanujam, J.1
  • 18
    • 79959466764 scopus 로고    scopus 로고
    • S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. mei W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, pages 73-82, New York, NY, USA, 2008. ACM.
    • S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. mei W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, pages 73-82, New York, NY, USA, 2008. ACM.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.