메뉴 건너뛰기




Volumn , Issue , 2012, Pages 33-42

Fast and efficient automatic memory management for GPUs using compiler-assisted runtime coherence scheme

Author keywords

Automatic; Data transfers; GPU; Memory management; Software coherence

Indexed keywords

AUTOMATIC; AUTOMATIC MEMORY MANAGEMENT; BENCHMARK SUITES; COMPILER ANALYSIS; COMPILER-ASSISTED; ERROR PRONES; GPU; MANUAL MEMORY-MANAGEMENT; MEMORY MANAGEMENT; MEMORY MANAGER; PERFORMANCE POTENTIALS; RODINIA; RUNTIMES;

EID: 84867509022     PISSN: 1089795X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2370816.2370824     Document Type: Conference Paper
Times cited : (40)

References (21)
  • 1
    • 79959456077 scopus 로고    scopus 로고
    • Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories
    • M. M. Baskaran, U. Bondhugula, et al. Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories. In PPoPP, 2008.
    • (2008) PPoPP
    • Baskaran, M.M.1    Bondhugula, U.2
  • 3
    • 31744441529 scopus 로고    scopus 로고
    • X10: An object-oriented approach to non-uniform cluster computing
    • P. Charles, C. Grothoff, V. Saraswat, et al. X10: An object-oriented approach to non-uniform cluster computing. In OOPSLA, 2005.
    • (2005) OOPSLA
    • Charles, P.1    Grothoff, C.2    Saraswat, V.3
  • 4
    • 70649092154 scopus 로고    scopus 로고
    • Rodinia: A benchmark suite for heterogeneous computing
    • S. Che, M. Boyer, J. Meng, et al. Rodinia: A benchmark suite for heterogeneous computing. In IISWC, 2009.
    • (2009) IISWC
    • Che, S.1    Boyer, M.2    Meng, J.3
  • 5
    • 33646009337 scopus 로고    scopus 로고
    • Optimizing compiler for the CELL processor
    • A. E. Eichenberger, K. O'Brien, et al. Optimizing compiler for the CELL processor. In PACT, 2005.
    • (2005) PACT
    • Eichenberger, A.E.1    O'brien, K.2
  • 6
    • 77952251540 scopus 로고    scopus 로고
    • An asymmetric distributed shared memory model for heterogeneous parallel systems
    • I. Gelado, J. E. Stone, et al. An asymmetric distributed shared memory model for heterogeneous parallel systems. In ASPLOS, 2010.
    • (2010) ASPLOS
    • Gelado, I.1    Stone, J.E.2
  • 8
    • 79959904195 scopus 로고    scopus 로고
    • Automatic CPU-GPU communication management and optimization
    • T. B. Jablin, P. Prabhu, et al. Automatic CPU-GPU communication management and optimization. In PLDI, 2011.
    • (2011) PLDI
    • Jablin, T.B.1    Prabhu, P.2
  • 9
    • 84863423999 scopus 로고    scopus 로고
    • Dynamically managed data for CPU-GPU architectures
    • March
    • T. B. Jablin, J. A. Jablin, et al. Dynamically Managed Data for CPU-GPU architectures. In CGO, March 2012.
    • (2012) CGO
    • Jablin, T.B.1    Jablin, J.A.2
  • 11
    • 78650802947 scopus 로고    scopus 로고
    • OpenMPC: Extended Open MP programming and tuning for GPUs
    • S. Lee and R. Eigenmann. OpenMPC: Extended OpenMP programming and tuning for GPUs. In SC, 2010.
    • (2010) SC
    • Lee, S.1    Eigenmann, R.2
  • 12
    • 67650081010 scopus 로고    scopus 로고
    • OpenMP to GPGPU: A compiler framework for automatic translation and optimization
    • S. Lee, S.-J. Min, and R. Eigenmann. OpenMP to GPGPU: a compiler framework for automatic translation and optimization. In PPoPP, 2009.
    • (2009) PPoPP
    • Lee, S.1    Min, S.-J.2    Eigenmann, R.3
  • 13
    • 77954995885 scopus 로고    scopus 로고
    • Debunking the 100X GPU vs. CPU myth: An evaluation of throughput computing on CPU and GPU
    • V. W. Lee, C. Kim, et al. Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU. In ISCA, 2010.
    • (2010) ISCA
    • Lee, V.W.1    Kim, C.2
  • 17
    • 70450263364 scopus 로고    scopus 로고
    • Programming model for a heterogeneous x86 platform
    • B. Saha, X. Zhou, et al. Programming model for a heterogeneous x86 platform. In PLDI, 2009.
    • (2009) PLDI
    • Saha, B.1    Zhou, X.2
  • 18
    • 49249086142 scopus 로고    scopus 로고
    • Larrabee: A many-core x86 architecture for visual computing
    • L. Seiler, D. Carmean, et al. Larrabee: A many-core x86 architecture for visual computing. ACM Trans. Graph., 27(3), 2008.
    • (2008) ACM Trans. Graph , vol.27 , Issue.3
    • Seiler, L.1    Carmean, D.2
  • 20
    • 70350441970 scopus 로고    scopus 로고
    • URL
    • TOP500.org. The Top 500. URL http://www.top500.org/.
    • TOP500.org
  • 21
    • 84867556832 scopus 로고    scopus 로고
    • URL, x10 lang.org. X10 2.1 cuda
    • x10 lang.org. X10 2.1 cuda. URL http://docs.codehaus.org/display/ XTENLANG/X10+2.1+CUDA.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.