Volume , Issue , 2014, Pages

GpuTejas: A parallel simulator for GPU architectures

Author keywords

Cycle level; GPU; NVIDIA; Parallel Architectural Simulation; Simulator; Tesla; Timing model

Indexed keywords

BENCHMARKING; PARALLEL ARCHITECTURES; PROGRAM PROCESSORS; SIMULATORS

EID: 84977857181     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/HiPC.2014.7116897     Document Type: Conference Paper
Times cited: 22

References (23)
  • 2
    • 44849137198
    • NVIDIA tesla: A unified graphics and computing architecture
    • E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym, "Nvidia tesla: A unified graphics and computing architecture," Micro, IEEE, Vol. 28, no. 2, pp. 39-55, March 2008.
    • (2008) Micro, IEEE , vol.28 , Issue.2 , pp. 39-55
    • Lindholm, E.1    Nickolls, J.2    Oberman, S.3    Montrym, J.4
  • 5
    • 84860558447
    • AMD. (2011) HD 6900 Series Instruction Set Architecture. [Online]. Available: http://developer.amd.com/wordpress/media/2012/10/AMDHD6900SeriesInstructionSetArchitecture.pdf
    • (2011) HD 6900 Series Instruction Set Architecture
  • 7
    • 78149233155
    • Ocelot: A dynamic optimization framework for bulk-synchronous applications in heterogeneous systems
    • G. F. Diamos, A. R. Kerr, S. Yalamanchili, and N. Clark, "Ocelot: A dynamic optimization framework for bulk-synchronous applications in heterogeneous systems," in PACT, 2010.
    • (2010) PACT
    • Diamos, G.F.1    Kerr, A.R.2    Yalamanchili, S.3    Clark, N.4
  • 9
    • 33750834456
    • Attila: A cycle-level execution-driven simulator for modern GPU architectures
    • V. del Barrio, C. Gonzalez, J. Roca, A. Fernandez, and R. Espasa, "Attila: a cycle-level execution-driven simulator for modern GPU architectures," in ISPASS, March 2006.
    • (2006) ISPASS
    • Del Barrio, V.1    Gonzalez, C.2    Roca, J.3    Fernandez, A.4    Espasa, R.5
  • 10
    • 84867504986
    • Multi2sim: A simulation framework for cpu-GPU computing
    • R. Ubal, B. Jang, P. Mistry, D. Schaa, and D. Kaeli, "Multi2sim: A simulation framework for cpu-GPU computing," in PACT, 2012.
    • (2012) PACT
    • Ubal, R.1    Jang, B.2    Mistry, P.3    Schaa, D.4    Kaeli, D.5
  • 13
    • 79957500177
    • A reconfigurable simulator for large-scale heterogeneous multicore architectures
    • J. Meng and K. Skadron, "A reconfigurable simulator for large-scale heterogeneous multicore architectures," in ISPASS, 2011.
    • (2011) ISPASS
    • Meng, J.1    Skadron, K.2
  • 14
    • 84885631725
    • Characterizing the performance benefits of fused cpu/GPU systems using fusionsim
    • V. Zakharenko, T. Aamodt, and A. Moshovos, "Characterizing the performance benefits of fused cpu/GPU systems using fusionsim," in DATE, 2013.
    • (2013) DATE
    • Zakharenko, V.1    Aamodt, T.2    Moshovos, A.3
  • 15
    • 84881446418
    • Parallel GPU architecture simulation framework exploiting work allocation unit parallelism
    • S. Lee and W. W. Ro, "Parallel GPU architecture simulation framework exploiting work allocation unit parallelism," in ISPASS, 2013.
    • (2013) ISPASS
    • Lee, S.1    Ro, W.W.2
  • 16
  • 17
    • 14744292475
    • Scientific computing with Java and c++: A case study using functional magnetic resonance neuroimages
    • R. A. Vivanco and N. J. Pizzi, "Scientific computing with java and c++: a case study using functional magnetic resonance neuroimages," Software: Practice and Experience, Vol. 35, no. 3, pp. 237-254, 2005.
    • (2005) Software: Practice and Experience , vol.35 , Issue.3 , pp. 237-254
    • Vivanco, R.A.1    Pizzi, N.J.2
  • 18
    • 84988226681
    • Benchmarking Java against c and fortran for scientific applications
    • J. M. Bull, L. A. Smith, L. Pottage, and R. Freeman, "Benchmarking java against c and fortran for scientific applications," in ACM ISCOPE, 2001.
    • (2001) ACM ISCOPE
    • Bull, J.M.1    Smith, L.A.2    Pottage, L.3    Freeman, R.4
  • 21
    • 84884869750
    • Lock-free and wait-free slot scheduling algorithms
    • P. Aggarwal and S. Sarangi, "Lock-free and wait-free slot scheduling algorithms," in IPDPS, 2013.
    • (2013) IPDPS
    • Aggarwal, P.1    Sarangi, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.