



Volume 72, Issue 9, 2012, Pages 1117-1126

Performance models for asynchronous data transfers on consumer Graphics Processing Units

Author keywords

Asynchronous transfers; CUDA; GPU; Overlapping of communication and computation; Streams

Indexed keywords

APPLICATION PROGRAMMING INTERFACES (API); APPLICATION PROGRAMS; COMPUTER GRAPHICS; COMPUTER GRAPHICS EQUIPMENT; DATA TRANSFER; MEMORY ARCHITECTURE; PROGRAM PROCESSORS; SOFTWARE DESIGN;

EID: 84865705401     PISSN: 07437315     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.jpdc.2011.07.011     Document Type: Article
Times cited: 38

References (20)
  • 5
    • 70450231944
    • An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness
    • ACM, New York, NY, USA
    • Sunpyo Hong, Hyesoon Kim, An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness, in: Proceedings of the 36th Annual International Symposium on Computer Architecture, ISCA'09, ACM, New York, NY, USA, 2009, pp. 152-163.
    • (2009) Proceedings of the 36th Annual International Symposium on Computer Architecture, ISCA'09 , pp. 152-163
    • Hong, S.1    Kim, H.2
  • 7
    • 85030487244
    • Khronos Group, OpenCL. http://www.khronos.org/opencl/.
  • 11
    • 79955066309
    • August
    • NVIDIA, CUDA C best practices guide 3.2, August 2010. http://developer.download.nvidia.com/compute/cuda/3-2/toolkit/docs/CUDA-C-Best-Practices-Guide.pdf.
    • (2010) CUDA C Best Practices Guide 3.2
  • 12
    • 79955074605
    • September
    • NVIDIA, CUDA C programming guide 3.2, September 2010. http://developer.download.nvidia.com/compute/cuda/3-2/toolkit/docs/CUDA-C-Programming-Guide.pdf.
    • (2010) CUDA C Programming Guide 3.2
  • 14
    • 84873478761
    • NVIDIA, CUDA Zone. http://www.nvidia.com/object/cuda-home-new.html.
    • CUDA Zone
  • 15
    • 84870669626
    • Peripheral Component Interconnect Special Interest Group, PCI Express. http://www.pcisig.com/.
    • PCI Express
  • 17
    • 57349086588
    • White Paper
    • V. Podlozhnyuk, Histogram calculation in CUDA, White Paper, 2007. http://developer.download.nvidia.com/compute/cuda/1-1/Website/projects/histogram256/doc/histogram.pdf.
    • (2007) Histogram Calculation in CUDA
    • Podlozhnyuk, V.1
  • 19
    • 77954735057
    • Improving linpack performance on SMP clusters with asynchronous MPI programming
    • Ta Quoc Viet, Tsutomu Yoshinaga, Improving linpack performance on SMP clusters with asynchronous MPI programming, IPSJ Digital Courier 2 (2006) 598-606.
    • (2006) IPSJ Digital Courier , vol.2 , pp. 598-606
    • Quoc Viet, T.1    Yoshinaga, T.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.