메뉴 건너뛰기




Volumn , Issue , 2010, Pages

Exploring GPGPU workloads: Characterization methodology, analysis and microarchitecture evaluation implications

Author keywords

[No Author keywords available]

Indexed keywords

BENCHMARK SUITES; BRANCH DIVERGENCES; CLUSTERING ANALYSIS; DESIGN SPACES; DIMENSIONALITY REDUCTION; EVALUATION METRICS; FUNCTIONAL BLOCK; GPU ARCHITECTURES; HIGH-PERFORMANCE COMPUTING; K-MEANS; LARGE ARRAYS; MICRO ARCHITECTURES; MICRO-ARCHITECTURE DESIGN; NEAREST NEIGHBORS; RODINIA; SIMILARITY SCORES; WORKLOAD CHARACTERISTICS;

EID: 78751477137     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IISWC.2010.5649549     Document Type: Conference Paper
Times cited : (62)

References (51)
  • 1
    • 76749123978 scopus 로고    scopus 로고
    • Complexity effective memory access scheduling for many-core accelerator architectures
    • G. Yuan, A. Bakhoda, and T. Aamodt, Complexity Effective Memory Access Scheduling for Many-Core Accelerator Architectures, MICRO, 2009.
    • (2009) MICRO
    • Yuan, G.1    Bakhoda, A.2    Aamodt, T.3
  • 2
    • 47349104432 scopus 로고    scopus 로고
    • Dynamic warp formation and scheduling for efficient gpu control flow
    • W. Fung, I. Sham, G. Yuan, and T. Aamodt, Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow, MICRO, 2007.
    • (2007) MICRO
    • Fung, W.1    Sham, I.2    Yuan, G.3    Aamodt, T.4
  • 6
  • 10
    • 70649095417 scopus 로고    scopus 로고
    • On the (Dis)similarity of transactional memory workloads
    • C. Hughe, J. Poe, A. Qouneh, and T. Li, On the (Dis)similarity of Transactional Memory Workloads, IISWC, 2009.
    • (2009) IISWC
    • Hughe, C.1    Poe, J.2    Qouneh, A.3    Li, T.4
  • 13
    • 0034226001 scopus 로고    scopus 로고
    • SPEC CPU2000: Measuring CPU performance in the new millennium
    • July
    • J. Henning, SPEC CPU2000: Measuring CPU Performance in the New Millennium, IEEE Computer, pp. 28-35, July 2000.
    • (2000) IEEE Computer , pp. 28-35
    • Henning, J.1
  • 14
    • 0031339427 scopus 로고    scopus 로고
    • MediaBench: A tool for evaluating and synthesizing multimedia and communication systems
    • C. Lee, M. Potkonjak, and W. Smith, MediaBench: A Tool for Evaluating and Synthesizing Multimedia and Communication Systems, MICRO, 1997.
    • (1997) MICRO
    • Lee, C.1    Potkonjak, M.2    Smith, W.3
  • 16
    • 0029194459 scopus 로고
    • The SPLASH-2 programs: Characterization and methodological considerations
    • S. Woo, M. Ohara, E. Torrie, J. Singh, and A. Gupta, The SPLASH-2 Programs: Characterization and Methodological Considerations, ISCA, 1995.
    • (1995) ISCA
    • Woo, S.1    Ohara, M.2    Torrie, E.3    Singh, J.4    Gupta, A.5
  • 17
    • 70649115330 scopus 로고    scopus 로고
    • STAMP: Stanford transactional memory applications for multi-processing
    • C. Minh, J. Chung, C. Kozyrakis, and K. Olukotun, STAMP: Stanford Transactional Memory Applications for Multi-Processing, IISWC, 2008.
    • (2008) IISWC
    • Minh, C.1    Chung, J.2    Kozyrakis, C.3    Olukotun, K.4
  • 20
    • 78651550268 scopus 로고    scopus 로고
    • Scalable parallel programming with CUDA
    • Mar.
    • J. Nickolls, I. Buck, M. Garland, and K. Skadron, Scalable Parallel Programming with CUDA, Queue 6, 2 (Mar. 2008), 40-53.
    • (2008) Queue , vol.6 , Issue.2 , pp. 40-53
    • Nickolls, J.1    Buck, I.2    Garland, M.3    Skadron, K.4
  • 21
    • 44849137198 scopus 로고    scopus 로고
    • NVIDIA tesla: A unified graphics and computing architecture
    • E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym, NVIDIA Tesla: A Unified Graphics and Computing Architecture, Micro, vol.28, no.2, 2008.
    • (2008) Micro , vol.28 , Issue.2
    • Lindholm, E.1    Nickolls, J.2    Oberman, S.3    Montrym, J.4
  • 23
    • 78751492924 scopus 로고    scopus 로고
    • Technical overview
    • AMD Inc
    • Technical Overview, ATI Stream Computing, AMD Inc, 2009.
    • (2009) ATI Stream Computing
  • 24
    • 70649104826 scopus 로고    scopus 로고
    • A characterization and analysis of PTX kernels
    • A. Kerr, G. Diamos, and S. Yalamanchilli, A Characterization and Analysis of PTX Kernels, IISWC 2009.
    • (2009) IISWC
    • Kerr, A.1    Diamos, G.2    Yalamanchilli, S.3
  • 26
    • 78751510969 scopus 로고    scopus 로고
    • Parboil Benchmark suite. URL: http://impact.crhc.illinois.edu/parboil. php.
  • 28
    • 78751471889 scopus 로고    scopus 로고
    • http://www.nvidia.com/object/cuda-sdks.html
  • 33
  • 34
    • 78751498561 scopus 로고    scopus 로고
    • Billconan and Kavinguy
    • Billconan and Kavinguy, A Neural Network on GPU. http://www.codeproject. com/KB/graphics/GPUNN.aspx.
    • A Neural Network on GPU
  • 35
    • 78751541238 scopus 로고    scopus 로고
    • Pcchen
    • Pcchen. N-Queens Solver, http://forums.nvidia.com/index.php?showtopic= 76893, 2008.
    • (2008) N-Queens Solver
  • 36
    • 85015171905 scopus 로고    scopus 로고
    • Maxime
    • Maxime. Ray tracing. http://www.nvidia.com/cuda.
    • Ray Tracing
  • 37
    • 57349130987 scopus 로고    scopus 로고
    • StoreGPU: Exploiting graphics processing units to accelerate distributed storage systems
    • S. Al-Kiswany, A. Gharaibeh, E. Santos-Neto, G. Yuan, and M. Ripeanu, StoreGPU: Exploiting Graphics Processing Units to accelerate Distributed Storage Systems, HPDC, 2008.
    • (2008) HPDC
    • Al-Kiswany, S.1    Gharaibeh, A.2    Santos-Neto, E.3    Yuan, G.4    Ripeanu, M.5
  • 40
    • 38849131252 scopus 로고    scopus 로고
    • High-throughput sequence alignment using graphics processing units
    • M. Schatz, C. Trapnell, A. Delcher, and A. Varshney, High-throughput Sequence Alignment using Graphics Processing Units, BMC Bioinformatics, 8(1): 474, 2007.
    • (2007) BMC Bioinformatics , vol.8 , Issue.1 , pp. 474
    • Schatz, M.1    Trapnell, C.2    Delcher, A.3    Varshney, A.4
  • 42
    • 51049111938 scopus 로고    scopus 로고
    • CUDA compatible GPU as an efficient hardware accelerator for AES cryptography
    • S. Manavski, CUDA compatible GPU as an Efficient Hardware Accelerator for AES Cryptography, ICSPC, 2007.
    • (2007) ICSPC
    • Manavski, S.1
  • 43
    • 51049099597 scopus 로고    scopus 로고
    • GPU acceleration of numerical weather prediction
    • J. Michalakes and M. Vachharajani, GPU Acceleration of Numerical Weather Prediction, IPDPS, 2008.
    • (2008) IPDPS
    • Michalakes, J.1    Vachharajani, M.2
  • 45
    • 78751510968 scopus 로고    scopus 로고
    • StatSoft, Inc. STATISTICA, http://www.statsoft.com/.
    • Statistica
  • 47
    • 78751548339 scopus 로고    scopus 로고
    • Analysis of benchmark characteristics and benchmark performance prediction
    • R. H. Saavedra and A. J. Smith, Analysis of Benchmark Characteristics and Benchmark Performance Prediction, ACM Trans. Computer Systems, 1998.
    • (1998) ACM Trans. Computer Systems
    • Saavedra, R.H.1    Smith, A.J.2
  • 48
    • 34548329985 scopus 로고    scopus 로고
    • Microarchitecture-independent workload characterization
    • K. Hoste and L. Eeckhout. Microarchitecture-independent Workload Characterization. IEEE Micro, 27(3):63.-72, 2007.
    • (2007) IEEE Micro , vol.27 , Issue.3 , pp. 63-72
    • Hoste, K.1    Eeckhout, L.2
  • 49
    • 78751541744 scopus 로고    scopus 로고
    • http://www.khronos.org/opencl/
  • 51
    • 35348913704 scopus 로고    scopus 로고
    • Analysis of redundancy and application balance in the SPEC CPU2006 benchmark suite
    • A. Phansalkar, A. Joshi, and L. John, Analysis of Redundancy and Application Balance in the SPEC CPU2006 Benchmark Suite, ISCA, 2007.
    • (2007) ISCA
    • Phansalkar, A.1    Joshi, A.2    John, L.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.