메뉴 건너뛰기




Volumn 32, Issue 6, 2012, Pages 4-16

Redefining the role of the CPU in the era of CPU-GPU integration

Author keywords

CPU architecture; CPU GPU systems; heterogeneous designs

Indexed keywords

BRANCH PREDICTION; CPU ARCHITECTURE; CPU DESIGN; INSTRUCTION LEVEL PARALLELISM; THREAD LEVEL PARALLELISM;

EID: 84875981232     PISSN: 02721732     EISSN: None     Source Type: Journal    
DOI: 10.1109/MM.2012.57     Document Type: Article
Times cited : (47)

References (26)
  • 3
    • 77954995885 scopus 로고    scopus 로고
    • Debunking the 100x GPU vs. CPU myth: An Evaluation of throughput computing on CPU and GPU
    • ACM
    • V.W. Lee et al., "Debunking the 100x GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU," Proc. 37th Ann. Int'l Symp. Computer Architecture (ISCA 10), ACM, 2010, pp. 451-460.
    • (2010) Proc. 37th Ann. Int'l Symp. Computer Architecture (ISCA 10) , pp. 451-460
    • Lee, V.W.1
  • 5
    • 34247174509 scopus 로고    scopus 로고
    • Core architecture optimization for heterogeneous chip multiprocessors
    • DOI 10.1145/1152154.1152162, PACT 2006 - Proceedings of the Fifteenth International Conference on Parallel Architectures and Compilation Techniques
    • R. Kumar, D.M. Tullsen, and N.P. Jouppi, "Core Architecture Optimization for Heterogeneous Chip Multiprocessors," Proc. 15th Int'l Conf. Parallel Architecture and Compilation Techniques (PACT 06), ACM, 2006, pp. 23-32. (Pubitemid 46601078)
    • (2006) Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT , vol.2006 , pp. 23-32
    • Kumar, R.1    Tullsen, D.M.2    Jouppi, N.P.3
  • 8
    • 77952340798 scopus 로고    scopus 로고
    • Performance insights on executing nongraphics applications on CUDA on the NVIDIA GeForce 8800 GTX
    • W.M. Hwu et al., "Performance Insights on Executing Nongraphics Applications on CUDA on the NVIDIA GeForce 8800 GTX," Hot Chips 19, 2007, http://www.hotchips. org/archives/hc19.
    • (2007) Hot Chips , vol.19
    • Hwu, W.M.1
  • 9
    • 84876238543 scopus 로고    scopus 로고
    • Scope for performance enhancement of CMU sphinx by parallelizing with OpenCL
    • Aug
    • S.C. Harish et al., "Scope for Performance Enhancement of CMU Sphinx by Parallelizing with OpenCL," J. Wisdom Based Computing, Aug. 2011, pp. 43-46.
    • (2011) J. Wisdom Based Computing , pp. 43-46
    • Harish, S.C.1
  • 10
    • 84861416065 scopus 로고    scopus 로고
    • Parallelization of particle filter algorithms
    • Springer-Verlag
    • M.A. Goodrum et al., "Parallelization of Particle Filter Algorithms," Proc. Int'l Conf. Computer Architecture, Springer-Verlag, 2010, pp. 139-149.
    • (2010) Proc. Int'l Conf. Computer Architecture , pp. 139-149
    • Goodrum, M.A.1
  • 11
    • 51049106282 scopus 로고    scopus 로고
    • Options pricing on the GPU
    • M. Pharr and R. Fernando, eds Addison-Wesley chapter 45
    • C. Kolb and M. Pharr, "Options Pricing on the GPU," GPU Gems 2, M. Pharr and R. Fernando, eds., Addison-Wesley, 2005, chapter 45.
    • (2005) GPU Gems 2
    • Kolb, C.1    Pharr, M.2
  • 12
    • 77949647837 scopus 로고    scopus 로고
    • Program Optimization of array-intensive SPEC2K benchmarks on multithreaded GPU using CUDA and brook+
    • IEEE CS
    • G. Wang et al., "Program Optimization of Array-Intensive SPEC2K Benchmarks on Multithreaded GPU Using CUDA and Brook+," Proc. 15th Int'l Conf. Parallel and Distributed Systems, IEEE CS, 2009, pp. 292-299.
    • (2009) Proc. 15th Int'l Conf. Parallel and Distributed Systems , pp. 292-299
    • Wang, G.1
  • 14
    • 70450029279 scopus 로고    scopus 로고
    • Evaluating the use of GPUs in liver image segmentation and HMMER database searches
    • IEEE CS doi:10.1109/IPDPS.2009.5161073
    • J. Walters et al., "Evaluating the Use of GPUs in Liver Image Segmentation and HMMER Database Searches," Proc. IEEE Int'l Symp. Parallel & Distributed Processing, IEEE CS, 2009, doi:10.1109/IPDPS.2009.5161073.
    • (2009) Proc. IEEE Int'l Symp. Parallel & Distributed Processing
    • Walters, J.1
  • 21
    • 34548329985 scopus 로고    scopus 로고
    • Microarchitecture-independent workload characterization
    • DOI 10.1109/MM.2007.56
    • K. Hoste and L. Eeckhout, "Microarchitecture-Independent Workload Characterization," IEEE Micro, May/June 2007, pp. 63-72. (Pubitemid 47337548)
    • (2007) IEEE Micro , vol.27 , Issue.3 , pp. 63-72
    • Hoste, K.1    Eeckhout, L.2
  • 23
    • 42549154520 scopus 로고    scopus 로고
    • The L-TAGE branch predictor
    • May
    • A. Seznec, "The L-TAGE Branch Predictor," J. Instruction-Level Parallelism, May 2007; http://www.jilp.org/vol9/v9paper6.pdf.
    • (2007) J. Instruction-Level Parallelism
    • Seznec, A.1
  • 26
    • 0036949391 scopus 로고    scopus 로고
    • A stateless, content-directed data prefetching mechanism
    • DOI 10.1145/635508.605427
    • R. Cooksey, S. Jourdan, and D. Grunwald, "A Stateless, Content Directed Data Prefetching Mechanism," Proc. 10th Int'l Conf. Architectural Support for Programming Languages and Operating Systems, ACM, 2002, pp. 279-290. (Pubitemid 44892240)
    • (2002) Operating Systems Review (ACM) , vol.36 , Issue.5 , pp. 279-290
    • Cooksey, R.1    Jourdan, S.2    Grunwald, D.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.