메뉴 건너뛰기




Volumn , Issue , 2014, Pages 272-283

MRPB: Memory request prioritization for massively parallel processors

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER ARCHITECTURE; COMPUTER GRAPHICS; PROGRAM PROCESSORS; SUPERCOMPUTERS;

EID: 84903985058     PISSN: 15300897     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/HPCA.2014.6835938     Document Type: Conference Paper
Times cited : (142)

References (26)
  • 2
    • 70649092154 scopus 로고    scopus 로고
    • Rodinia: A benchmark suite for heterogeneous computing
    • S. Che et al. Rodinia: A benchmark suite for heterogeneous computing. In Intl. Symp. Workload Characterization, 2009.
    • (2009) Intl. Symp. Workload Characterization
    • Che, S.1
  • 4
    • 79951707102 scopus 로고    scopus 로고
    • Memory latency reduction via thread throttling
    • H.-Y. Cheng et al. Memory latency reduction via thread throttling. In Intl. Symp. Microarchitecture, 2010.
    • (2010) Intl. Symp. Microarchitecture
    • Cheng, H.-Y.1
  • 8
    • 80052533471 scopus 로고    scopus 로고
    • Energy-efficient mechanisms for managing thread context in throughput processors
    • M. Gebhart et al. Energy-efficient mechanisms for managing thread context in throughput processors. In Intl. Symp. Computer Architecture, 2011.
    • (2011) Intl. Symp. Computer Architecture
    • Gebhart, M.1
  • 9
    • 84870700173 scopus 로고    scopus 로고
    • Auto-tuning a high-level language targeted to GPU codes
    • S. Grauer-Gray et al. Auto-tuning a high-level language targeted to GPU codes. In Innovative Parallel Computing, 2012.
    • (2012) Innovative Parallel Computing
    • Grauer-Gray, S.1
  • 10
    • 67650635164 scopus 로고    scopus 로고
    • Many-core vs many-thread machines: Stay away from the valley
    • January-June
    • Z. Guz et al. Many-core vs. many-thread machines: Stay away from the valley. IEEE Computer Architecture Letters, 8(1):25-28, January-June 2009.
    • (2009) IEEE Computer Architecture Letters , vol.8 , Issue.1 , pp. 25-28
    • Guz, Z.1
  • 11
    • 0030677581 scopus 로고    scopus 로고
    • The design and analysis of a cache architecture for texture mapping
    • Z. S. Hakura and A. Gupta. The design and analysis of a cache architecture for texture mapping. In Intl. Symp. Computer Architecture, 1997.
    • (1997) Intl. Symp. Computer Architecture
    • Hakura, Z.S.1    Gupta, A.2
  • 12
    • 77954998134 scopus 로고    scopus 로고
    • High performance cache replacement using re-reference interval prediction (RRIP)
    • A. Jaleel et al. High performance cache replacement using re-reference interval prediction (RRIP). In Intl. Symp. on Computer Architecture, 2010.
    • (2010) Intl. Symp. on Computer Architecture
    • Jaleel, A.1
  • 13
    • 84864068497 scopus 로고    scopus 로고
    • Characterizing and improving the use of demand-fetched caches in GPUs
    • W. Jia, K. A. Shaw, and M. Martonosi. Characterizing and improving the use of demand-fetched caches in GPUs. In Intl. Conf. on Supercomputing, 2012.
    • (2012) Intl. Conf. on Supercomputing
    • Jia, W.1    Shaw, K.A.2    Martonosi, M.3
  • 14
    • 84904014280 scopus 로고    scopus 로고
    • Many-thread aware prefetching mechanisms for GPGPU applications
    • J. Lee et al. Many-thread aware prefetching mechanisms for GPGPU applications. In Intl. Symp. Microarchitecture, 2010.
    • (2010) Intl. Symp. Microarchitecture
    • Lee, J.1
  • 16
    • 77950987305 scopus 로고    scopus 로고
    • Avoiding cache thrashing due to private data placement in last-level cache for manycore scaling
    • J. Meng and K. Skadron. Avoiding cache thrashing due to private data placement in last-level cache for manycore scaling. In Intl. Conf. Computer Design, 2009.
    • (2009) Intl. Conf. Computer Design
    • Meng, J.1    Skadron, K.2
  • 17
    • 77954976292 scopus 로고    scopus 로고
    • Dynamic warp subdivision for integrated branch and memory divergence tolerance
    • J. Meng, D. Tarjan, and K. Skadron. Dynamic warp subdivision for integrated branch and memory divergence tolerance. In Intl. Symp. Computer Architecture, 2010.
    • (2010) Intl. Symp. Computer Architecture
    • Meng, J.1    Tarjan, D.2    Skadron, K.3
  • 19
    • 84863342255 scopus 로고    scopus 로고
    • Improving GPU performance via large warps and two-level warp scheduling
    • V. Narasiman et al. Improving GPU performance via large warps and two-level warp scheduling. In Intl. Symp. Microarchitecture, 2011.
    • (2011) Intl. Symp. Microarchitecture
    • Narasiman, V.1
  • 25
    • 84870691946 scopus 로고    scopus 로고
    • DL: A data layout transformation system for heterogeneous computing
    • I.-J. Sung, D. Liu, and W.-M. Hwu. DL: A data layout transformation system for heterogeneous computing. In Innovative Parallel Computing, 2012.
    • (2012) Innovative Parallel Computing
    • Sung, I.-J.1    Liu, D.2    Hwu, W.-M.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.