메뉴 건너뛰기




Volumn , Issue , 2009, Pages 42-48

Improving performance of matrix multiplication and FFT on GPU

Author keywords

CUDA; FFT; GPU; Matrix multiplication

Indexed keywords

IMPROVING PERFORMANCE; MANY-CORE; MATRIX MULTIPLICATION; MEMORY BANDWIDTHS; PEAK PERFORMANCE; PRECISION MATRIX;

EID: 77949596908     PISSN: 15219097     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICPADS.2009.8     Document Type: Conference Paper
Times cited : (35)

References (9)
  • 2
    • 77949605109 scopus 로고    scopus 로고
    • NVIDIA Corp. CUDA CUFFT Library, Version 1.1. 2007
    • NVIDIA Corp. CUDA CUFFT Library, Version 1.1. 2007.
  • 5
    • 77949607214 scopus 로고    scopus 로고
    • Memory Locality Exploitation Strategies for FFT on the CUDA Architecture
    • June
    • E. Gutierrez, S. Romero, M. A. Trenas, and E. L. Zapata. Memory Locality Exploitation Strategies for FFT on the CUDA Architecture. VECPAR 2008, June 2008.
    • (2008) VECPAR 2008
    • Gutierrez, E.1    Romero, S.2    Trenas, M.A.3    Zapata, E.L.4
  • 6
    • 79959466764 scopus 로고    scopus 로고
    • S. Ryoo, C. I. R. amd S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. W. Hwu. Optimization Principles and Application Performance Evaluation of a Multithreaded GPU using CUDA. In Proceedings of the 13th ACM SIGPLAN, pages 73C82. ACM Press, 2008.
    • S. Ryoo, C. I. R. amd S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. W. Hwu. Optimization Principles and Application Performance Evaluation of a Multithreaded GPU using CUDA. In Proceedings of the 13th ACM SIGPLAN, pages 73C82. ACM Press, 2008.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.