



Volume 24, Issue 11, 2013, Pages 2273-2282

Performance modeling of atomic additions on GPU scratchpad memory

Author keywords

atomic operations; CUDA; GPU; histogram; K-means; performance model; shared memory

Indexed keywords

ATOMIC OPERATION; CUDA; GPU; HISTOGRAM; K-MEANS; PERFORMANCE MODEL; SHARED MEMORY;

EID: 84885108092     PISSN: 10459219     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPDS.2012.319     Document Type: Article
Times cited: 35

References (28)
  • 1
    • 79958694059
    • NVIDIA, "CUDA Zone," http://developer.nvidia.com/category/zone/cuda-zone, 2011.
    • (2011) CUDA Zone
  • 2
    • 84885144878
    • Khronos Group
    • Khronos Group, "OpenCL," http://www.khronos.org/opencl/, 2011.
    • (2011) OpenCL
  • 4
    • 80053211847
    • May
    • NVIDIA, "CUDA C Programming Guide 4.0," http://developer.download.nvidia.com/compute/DevZone/docs/html/C/doc/CUDACProgrammingGuide.pdf, May 2011.
    • (2011) CUDA C Programming Guide 4.0
  • 5
    • 80053212291
    • May
    • NVIDIA, "CUDA C Best Practices Guide 4.0," http://developer.download.nvidia.com/compute/DevZone/docs/html/C/doc/CUDACBestPracticesGuide.pdf, May 2011.
    • (2011) CUDA C Best Practices Guide 4.0
  • 8
    • 70450231944
    • An Analytical Model for a GPU Architecture with Memory-Level and Thread-Level Parallelism Awareness
    • S. Hong and H. Kim, "An Analytical Model for a GPU Architecture with Memory-Level and Thread-Level Parallelism Awareness," Proc. 36th Ann. Int'l Symp. Computer Architecture (ISCA '09), pp. 152-163, 2009.
    • (2009) Proc. 36th Ann. Int'l Symp. Computer Architecture (ISCA '09), pp. 152-163
    • Hong, S.; Kim, H.
  • 13
    • 84879549908
    • White Paper
    • NVIDIA, "Fermi Compute Architecture. White Paper," http://www.nvidia.com/content/PDF/fermiwhitepapers/NVIDIA FermiComputeArchitectureWhitepaper.pdf, 2009.
    • (2009) Fermi Compute Architecture
  • 15
  • 20
  • 23
    • 51449118065
    • A performance study of general-purpose applications on graphics processors using CUDA
    • Oct.
    • S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, and K. Skadron, "A Performance Study of General-Purpose Applications on Graphics Processors Using CUDA," J. Parallel Distributed Computing, vol. 68, no. 10, pp. 1370-1380, Oct. 2008.
    • (2008) J. Parallel Distributed Computing, vol. 68, Issue 10, pp. 1370-1380
    • Che, S.; Boyer, M.; Meng, J.; Tarjan, D.; Sheaffer, J.W.; Skadron, K.
  • 25
    • 0032492432
    • Independent component filters of natural images compared with simple cells in primary visual cortex
    • Mar.
    • J. H. v. Hateren and A. v. d. Schaaf, "Independent Component Filters of Natural Images Compared with Simple Cells in Primary Visual Cortex," Proceedings: Biological Sciences, vol. 265, no. 1394, pp. 359-366, Mar. 1998.
    • (1998) Proceedings: Biological Sciences, vol. 265, Issue 1394, pp. 359-366
    • Hateren, J.H.; Schaaf, A.
  • 28
    • 57349086588
    • White Paper
    • V. Podlozhnyuk, "Histogram Calculation in CUDA. White Paper," http://developer.download.nvidia.com/compute/cuda/11/Website/projects/histogram256/doc/histogram.pdf, 2007.
    • (2007) Histogram Calculation in CUDA
    • Podlozhnyuk, V.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.