메뉴 건너뛰기




Volumn , Issue , 2010, Pages 115-125

Streamlining GPU applications on the fly - Thread divergence elimination through runtime thread-data remapping

Author keywords

CPU GPU pipelining; data transformation; GPGPU; thread divergence; thread data remapping

Indexed keywords

COMPUTING POWER; CONDITIONAL BRANCH; COST EFFICIENCY; DATA LAYOUTS; DATA TRANSFORMATION; GPGPU; GRAPHIC PROCESSING UNITS; HIGH PERFORMANCE COMPUTING; MASSIVE DATA; NON-TRIVIAL; ON THE FLIES; PERFORMANCE DEGRADATION; PERFORMANCE IMPROVEMENTS; REMAPPING; RUN-TIME THREADS; RUNTIMES; SYSTEMATIC INVESTIGATIONS;

EID: 77954724148     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1810085.1810104     Document Type: Conference Paper
Times cited : (72)

References (18)
  • 1
    • 84870629709 scopus 로고    scopus 로고
    • NVIDIA CUDA. http://www.nvidia.com/cuda.
    • NVIDIA CUDA
  • 6
    • 1642502420 scopus 로고    scopus 로고
    • Improving effective bandwidth through compiler enhancement of global cache reuse
    • C. Ding and K. Kennedy. Improving effective bandwidth through compiler enhancement of global cache reuse. Journal of Parallel and Distributed Computing, 64(1):108-134, 2004.
    • (2004) Journal of Parallel and Distributed Computing , vol.64 , Issue.1 , pp. 108-134
    • Ding, C.1    Kennedy, K.2
  • 18
    • 41249094477 scopus 로고    scopus 로고
    • Lattice boltzmann based pde solver on the gpu
    • Y. Zhao. Lattice boltzmann based pde solver on the gpu. The Visual Computer, (5):323-333, 2008.
    • (2008) The Visual Computer , Issue.5 , pp. 323-333
    • Zhao, Y.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.