메뉴 건너뛰기




Volumn , Issue , 2010, Pages 351-360

Barra: A parallel functional simulator for GPGPU

Author keywords

[No Author keywords available]

Indexed keywords

EXECUTABLES; FUNCTIONAL LEVELS; FUNCTIONAL SIMULATORS; GENERAL PURPOSE; GRAPHICS PROCESSING UNIT; INSTRUCTION SET; MICRO-ARCHITECTURE DESIGN;

EID: 78049512154     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/MASCOTS.2010.43     Document Type: Conference Paper
Times cited : (67)

References (30)
  • 1
    • 85034047153 scopus 로고    scopus 로고
    • [Online], Available
    • AMD R600-Family Instruction Set Architecture, Advanced Micro Device, Inc., 2008. [Online]. Available: http://ati.amd.com/technology/streamcomputing/ R600ISA.pdf.
    • (2008) AMD R600-Family Instruction Set Architecture
  • 2
    • 36749086936 scopus 로고    scopus 로고
    • UNISIM: An open simulation environment and library for complex architecture design and collaborative development
    • [Online], Available
    • D. August, J. Chang, S. Girbal, D. Gracia-Perez, G. Mouchard, D. A. Penry, O. Temam, and N. Vachharajani, "UNISIM: an open simulation environment and library for complex architecture design and collaborative development," IEEE Computer Architecture Letters, vol. 6, no. 2, pp. 45-48, 2007. [Online]. Available: http://dx.doi.org/10.1109/L-CA.2007.12.
    • (2007) IEEE Computer Architecture Letters , vol.6 , Issue.2 , pp. 45-48
    • August, D.1    Chang, J.2    Girbal, S.3    Gracia-Perez, D.4    Mouchard, G.5    Penry, D.A.6    Temam, O.7    Vachharajani, N.8
  • 4
    • 0036469652 scopus 로고    scopus 로고
    • Simplescalar: An infrastructure for computer system modeling
    • [Online], Available
    • T. Austin, E. Larson, and D. Ernst, "Simplescalar: an infrastructure for computer system modeling," Computer, vol. 35, no. 2, pp. 59-67, 2002. [Online]. Available: http://dx.doi.org/10.1109/2.982917.
    • (2002) Computer , vol.35 , Issue.2 , pp. 59-67
    • Austin, T.1    Larson, E.2    Ernst, D.3
  • 6
    • 33846535493 scopus 로고    scopus 로고
    • The M5 simulator: Modeling networked systems
    • [Online], Available
    • N. L. Binkert, R. G. Dreslinski, L. R. Hsu, K. T. Lim, A. G. Saidi, and S. K. Reinhardt, "The M5 simulator: modeling networked systems," IEEE Micro, vol. 26, no. 4, pp. 52-60, 2006. [Online]. Available: http://dx.doi.org/10.1109/MM.2006.82.
    • (2006) IEEE Micro , vol.26 , Issue.4 , pp. 52-60
    • Binkert, N.L.1    Dreslinski, R.G.2    Hsu, L.R.3    Lim, K.T.4    Saidi, A.G.5    Reinhardt, S.K.6
  • 7
    • 78049487794 scopus 로고    scopus 로고
    • Comparaison d'algorithmes de branchements pour le simulateur de processeur graphique Barra
    • [Online], Available
    • S. Collange, M. Daumas, D. Defour, and D. Parello, "Comparaison d'algorithmes de branchements pour le simulateur de processeur graphique Barra," in 13ème Symposium sur les Architectures Nouvelles de Machines, 2009, pp. 1-12. [Online]. Available: http://hal.archives-ouvertes.fr/ hal-00397697.
    • (2009) 13ème Symposium Sur Les Architectures Nouvelles de Machines , pp. 1-12
    • Collange, S.1    Daumas, M.2    Defour, D.3    Parello, D.4
  • 9
    • 84856559490 scopus 로고    scopus 로고
    • Dynamic detection of uniform and affine vectors in GPGPU computations
    • [Online], Available
    • S. Collange, D. Defour, and Y. Zhang, "Dynamic detection of uniform and affine vectors in GPGPU computations," in Third workshop on Highly Parallel Processing on a Chip, 2009, pp. 1-10. [Online]. Available: http://hal.archives-ouvertes.fr/hal-00396719/.
    • (2009) Third Workshop on Highly Parallel Processing on A Chip , pp. 1-10
    • Collange, S.1    Defour, D.2    Zhang, Y.3
  • 11
    • 70649094184 scopus 로고    scopus 로고
    • Translating GPU binaries to tiered SIMD architectures with Ocelot
    • GIT-CERCS-09-01, [Online], Available
    • G. Diamos, A. Kerr, and M. Kesavan, "Translating GPU binaries to tiered SIMD architectures with Ocelot," Georgia Institute of Technology, CERCS technical report GIT-CERCS-09-01, 2009. [Online]. Available: http://hdl.handle.net/1853/27246.
    • (2009) Georgia Institute of Technology, CERCS Technical Report
    • Diamos, G.1    Kerr, A.2    Kesavan, M.3
  • 12
    • 84976676590 scopus 로고
    • Parallel discrete event simulation
    • [Online], Available
    • R. M. Fujimoto, "Parallel discrete event simulation," Communications of the ACM, vol. 33, no. 10, pp. 30-53, 1990. [Online]. Available: http://doi.acm.org/10.1145/84537.84545.
    • (1990) Communications of the ACM , vol.33 , Issue.10 , pp. 30-53
    • Fujimoto, R.M.1
  • 14
    • 44849137198 scopus 로고    scopus 로고
    • NVIDIA Tesla: A unified graphics and computing architecture
    • [Online], Available
    • J. E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym, "NVIDIA Tesla: a unified graphics and computing architecture," IEEE Micro, vol. 28, no. 2, pp. 39-55, 2008. [Online]. Available: http://dx.doi.org/10.1109/MM.2008. 31.
    • (2008) IEEE Micro , vol.28 , Issue.2 , pp. 39-55
    • Lindholm, J.E.1    Nickolls, J.2    Oberman, S.3    Montrym, J.4
  • 18
    • 70349100958 scopus 로고    scopus 로고
    • The OpenCL specification
    • [Online], Available
    • A. Munshi, "The OpenCL specification," Khronos OpenCL Working Group, Tech. Rep. 1.0 revision 48, 2009. [Online]. Available: http://www.khronos.org/registry/cl/specs/opencl-1.0.48.pdf.
    • (2009) Khronos OpenCL Working Group, Tech. Rep. 1.0 Revision , vol.48
    • Munshi, A.1
  • 19
    • 84873052000 scopus 로고    scopus 로고
    • version 2.3. [Online], Available
    • CUDA Compute Unified Device Architecture Programming Guide, NVIDIA, 2009, version 2.3. [Online]. Available: http://developer.download.nvidia.com/compute/ cuda/23/toolkit/docs/NVIDIACUDAProgrammingGuide2.3.pdf.
    • (2009) CUDA Compute Unified Device Architecture Programming Guide
  • 20
    • 78049523808 scopus 로고    scopus 로고
    • version 2.3. [Online], Available
    • The NVIDIA CUDA Debugger, NVIDIA, 2009, version 2.3. [Online]. Available: http://developer.download.nvidia.com/compute/cuda/23/toolkit/docs/ CUDAGDBUserManual2.3beta.pdf.
    • (2009) The NVIDIA CUDA Debugger
  • 22
    • 27944432620 scopus 로고    scopus 로고
    • A high-performance area-efficient multifunction interpolator
    • I. Koren and P. Kornerup, Eds., Cape Cod, Massachusetts, [Online], Available
    • S. F. Oberman and M. Siu, "A high-performance area-efficient multifunction interpolator," in Proceedings of the 17th IEEE Symposium on Computer Arithmetic, I. Koren and P. Kornerup, Eds., Cape Cod, Massachusetts, 2005, pp. 272-279. [Online]. Available: http://dx.doi.org/10.1109/ARITH.2005.7.
    • (2005) Proceedings of the 17th IEEE Symposium on Computer Arithmetic , pp. 272-279
    • Oberman, S.F.1    Siu, M.2
  • 23
    • 78049506293 scopus 로고    scopus 로고
    • Improving cyclelevel modular simulation by vectorization
    • Lille, France, [Online], Available
    • D. Parello, M. Bouache, and B. Goossens, "Improving cyclelevel modular simulation by vectorization," in Rapid Simulation and Performance Evaluation: Methods and Tools, Lille, France, 2009. [Online]. Available: http://www2.lifl.fr/rapido/Rapido%2709/Rapido09Proceed/parello.pdf.
    • (2009) Rapid Simulation and Performance Evaluation: Methods and Tools
    • Parello, D.1    Bouache, M.2    Goossens, B.3
  • 24
  • 25
    • 0030653560 scopus 로고    scopus 로고
    • Using the SimOS machine simulator to study complex computer systems
    • [Online], Available
    • M. Rosenblum, E. Bugnion, S. Devine, and S. A. Herrod, "Using the SimOS machine simulator to study complex computer systems," ACM Transactions on Modeling and Computer Simulation, vol. 7, no. 1, pp. 78-103, 1997. [Online]. Available: http://doi.acm.org/10.1145/244804.244807.
    • (1997) ACM Transactions on Modeling and Computer Simulation , vol.7 , Issue.1 , pp. 78-103
    • Rosenblum, M.1    Bugnion, E.2    Devine, S.3    Herrod, S.A.4
  • 27
    • 67549107026 scopus 로고    scopus 로고
    • Quantitative analysis of the speed/accuracy trade-off in transaction level modeling
    • [Online], Available
    • G. Schirner and R. Dömer, "Quantitative analysis of the speed/accuracy trade-off in transaction level modeling," ACM Transactions in Embedded Computing Systems, vol. 8, no. 1, pp. 1-29, 2008. [Online]. Available: http://doi.acm.org/10.1145/1457246.1457250.
    • (2008) ACM Transactions in Embedded Computing Systems , vol.8 , Issue.1 , pp. 1-29
    • Schirner, G.1    Dömer, R.2
  • 29
  • 30
    • 33748289310 scopus 로고    scopus 로고
    • SimFlex: Statistical sampling of computer system simulation
    • [Online], Available
    • T. F. Wenisch, R. E. Wunderlich, M. Ferdman, A. Ailamaki, B. Falsafi, and J. C. Hoe, "SimFlex: Statistical sampling of computer system simulation," IEEE Micro, vol. 26, no. 4, pp. 18-31, 2006. [Online]. Available: http://dx.doi.org/10.1109/MM.2006.79.
    • (2006) IEEE Micro , vol.26 , Issue.4 , pp. 18-31
    • Wenisch, T.F.1    Wunderlich, R.E.2    Ferdman, M.3    Ailamaki, A.4    Falsafi, B.5    Hoe, J.C.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.