-
1
-
-
85034047153
-
-
[Online], Available
-
AMD R600-Family Instruction Set Architecture, Advanced Micro Devices, Inc., 2008. [Online]. Available: http://ati.amd.com/technology/streamcomputing/R600ISA.pdf.
-
(2008)
AMD R600-Family Instruction Set Architecture
-
-
-
2
-
-
36749086936
-
UNISIM: An open simulation environment and library for complex architecture design and collaborative development
-
[Online], Available
-
D. August, J. Chang, S. Girbal, D. Gracia-Perez, G. Mouchard, D. A. Penry, O. Temam, and N. Vachharajani, "UNISIM: an open simulation environment and library for complex architecture design and collaborative development," IEEE Computer Architecture Letters, vol. 6, no. 2, pp. 45-48, 2007. [Online]. Available: http://dx.doi.org/10.1109/L-CA.2007.12.
-
(2007)
IEEE Computer Architecture Letters
, vol.6
, Issue.2
, pp. 45-48
-
-
August, D.1
Chang, J.2
Girbal, S.3
Gracia-Perez, D.4
Mouchard, G.5
Penry, D.A.6
Temam, O.7
Vachharajani, N.8
-
3
-
-
24044461043
-
Achieving structural and composable modeling of complex systems
-
[Online], Available
-
D. I. August, S. Malik, L.-S. Peh, V. Pai, M. Vachharajani, and P. Willmann, "Achieving structural and composable modeling of complex systems," International Journal of Parallel Programming, vol. 33, no. 2, pp. 81-101, 2005. [Online]. Available: http://dx.doi.org/10.1007/s10766-005-3569-3.
-
(2005)
International Journal of Parallel Programming
, vol.33
, Issue.2
, pp. 81-101
-
-
August, D.I.1
Malik, S.2
Peh, L.-S.3
Pai, V.4
Vachharajani, M.5
Willmann, P.6
-
4
-
-
0036469652
-
SimpleScalar: An infrastructure for computer system modeling
-
[Online], Available
-
T. Austin, E. Larson, and D. Ernst, "SimpleScalar: an infrastructure for computer system modeling," Computer, vol. 35, no. 2, pp. 59-67, 2002. [Online]. Available: http://dx.doi.org/10.1109/2.982917.
-
(2002)
Computer
, vol.35
, Issue.2
, pp. 59-67
-
-
Austin, T.1
Larson, E.2
Ernst, D.3
-
5
-
-
70349169075
-
Analyzing CUDA workloads using a detailed GPU simulator
-
Boston, [Online], Available
-
A. Bakhoda, G. Yuan, W. W. L. Fung, H. Wong, and T. M. Aamodt, "Analyzing CUDA workloads using a detailed GPU simulator," in Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, Boston, 2009, pp. 163-174. [Online]. Available: http://dx.doi.org/10.1109/ISPASS.2009.4919648.
-
(2009)
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software
, pp. 163-174
-
-
Bakhoda, A.1
Yuan, G.2
Fung, W.W.L.3
Wong, H.4
Aamodt, T.M.5
-
6
-
-
33846535493
-
The M5 simulator: Modeling networked systems
-
[Online], Available
-
N. L. Binkert, R. G. Dreslinski, L. R. Hsu, K. T. Lim, A. G. Saidi, and S. K. Reinhardt, "The M5 simulator: modeling networked systems," IEEE Micro, vol. 26, no. 4, pp. 52-60, 2006. [Online]. Available: http://dx.doi.org/10.1109/MM.2006.82.
-
(2006)
IEEE Micro
, vol.26
, Issue.4
, pp. 52-60
-
-
Binkert, N.L.1
Dreslinski, R.G.2
Hsu, L.R.3
Lim, K.T.4
Saidi, A.G.5
Reinhardt, S.K.6
-
7
-
-
78049487794
-
Comparaison d'algorithmes de branchements pour le simulateur de processeur graphique Barra
-
[Online], Available
-
S. Collange, M. Daumas, D. Defour, and D. Parello, "Comparaison d'algorithmes de branchements pour le simulateur de processeur graphique Barra," in 13ème Symposium sur les Architectures Nouvelles de Machines, 2009, pp. 1-12. [Online]. Available: http://hal.archives-ouvertes.fr/hal-00397697.
-
(2009)
13ème Symposium Sur Les Architectures Nouvelles de Machines
, pp. 1-12
-
-
Collange, S.1
Daumas, M.2
Defour, D.3
Parello, D.4
-
8
-
-
77951006223
-
Power consumption of GPUs from a software perspective
-
[Online], Available
-
S. Collange, D. Defour, and A. Tisserand, "Power consumption of GPUs from a software perspective," in 9th International Conference on Computational Science, 2009, pp. 922-931. [Online]. Available: http://hal.archives-ouvertes.fr/hal-00348672/.
-
(2009)
9th International Conference on Computational Science
, pp. 922-931
-
-
Collange, S.1
Defour, D.2
Tisserand, A.3
-
9
-
-
84856559490
-
Dynamic detection of uniform and affine vectors in GPGPU computations
-
[Online], Available
-
S. Collange, D. Defour, and Y. Zhang, "Dynamic detection of uniform and affine vectors in GPGPU computations," in Third workshop on Highly Parallel Processing on a Chip, 2009, pp. 1-10. [Online]. Available: http://hal.archives-ouvertes.fr/hal-00396719/.
-
(2009)
Third Workshop on Highly Parallel Processing on a Chip
, pp. 1-10
-
-
Collange, S.1
Defour, D.2
Zhang, Y.3
-
11
-
-
70649094184
-
Translating GPU binaries to tiered SIMD architectures with Ocelot
-
GIT-CERCS-09-01, [Online], Available
-
G. Diamos, A. Kerr, and M. Kesavan, "Translating GPU binaries to tiered SIMD architectures with Ocelot," Georgia Institute of Technology, CERCS technical report GIT-CERCS-09-01, 2009. [Online]. Available: http://hdl.handle.net/1853/27246.
-
(2009)
Georgia Institute of Technology, CERCS Technical Report
-
-
Diamos, G.1
Kerr, A.2
Kesavan, M.3
-
12
-
-
84976676590
-
Parallel discrete event simulation
-
[Online], Available
-
R. M. Fujimoto, "Parallel discrete event simulation," Communications of the ACM, vol. 33, no. 10, pp. 30-53, 1990. [Online]. Available: http://doi.acm.org/10.1145/84537.84545.
-
(1990)
Communications of the ACM
, vol.33
, Issue.10
, pp. 30-53
-
-
Fujimoto, R.M.1
-
13
-
-
77954596367
-
-
US Patent Office, US Patent 7339592 B2, [Online], Available
-
E. Lindholm, M. Y. Siu, S. S. Moy, S. Liu, and J. R. Nickolls, "Simulating multiported memories using lower port count memories," US Patent Office, US Patent 7339592 B2, 2008. [Online]. Available: http://www.google.com/patents?q=7339592.
-
(2008)
Simulating Multiported Memories Using Lower Port Count Memories
-
-
Lindholm, E.1
Siu, M.Y.2
Moy, S.S.3
Liu, S.4
Nickolls, J.R.5
-
14
-
-
44849137198
-
NVIDIA Tesla: A unified graphics and computing architecture
-
[Online], Available
-
J. E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym, "NVIDIA Tesla: a unified graphics and computing architecture," IEEE Micro, vol. 28, no. 2, pp. 39-55, 2008. [Online]. Available: http://dx.doi.org/10.1109/MM.2008.31.
-
(2008)
IEEE Micro
, vol.28
, Issue.2
, pp. 39-55
-
-
Lindholm, J.E.1
Nickolls, J.2
Oberman, S.3
Montrym, J.4
-
15
-
-
0036469676
-
Simics: A full system simulation platform
-
[Online], Available
-
P. S. Magnusson, M. Christensson, J. Eskilson, D. Forsgren, G. Hållberg, J. Högberg, F. Larsson, A. Moestedt, and B. Werner, "Simics: A full system simulation platform," Computer, vol. 35, no. 2, pp. 50-58, 2002. [Online]. Available: http://dx.doi.org/10.1109/2.982916.
-
(2002)
Computer
, vol.35
, Issue.2
, pp. 50-58
-
-
Magnusson, P.S.1
Christensson, M.2
Eskilson, J.3
Forsgren, D.4
Hållberg, G.5
Högberg, J.6
Larsson, F.7
Moestedt, A.8
Werner, B.9
-
16
-
-
33748870886
-
Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset
-
[Online], Available
-
M. M. K. Martin, D. J. Sorin, B. M. Beckmann, M. R. Marty, M. Xu, A. R. Alameldeen, K. E. Moore, M. D. Hill, and D. A. Wood, "Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset," SIGARCH Computer Architecture News, vol. 33, no. 4, pp. 92-99, 2005. [Online]. Available: http://doi.acm.org/10.1145/1105734.1105747.
-
(2005)
SIGARCH Computer Architecture News
, vol.33
, Issue.4
, pp. 92-99
-
-
Martin, M.M.K.1
Sorin, D.J.2
Beckmann, B.M.3
Marty, M.R.4
Xu, M.5
Alameldeen, A.R.6
Moore, K.E.7
Hill, M.D.8
Wood, D.A.9
-
17
-
-
33749375698
-
Shader performance analysis on a modern GPU architecture
-
Barcelona, Spain, [Online], Available
-
V. Moya, C. Gonzalez, J. Roca, A. Fernandez, and R. Espasa, "Shader performance analysis on a modern GPU architecture," in Proceedings of the 38th annual IEEE/ACM International Symposium on Microarchitecture, Barcelona, Spain, 2005, pp. 355-364. [Online]. Available: http://dx.doi.org/10.1109/MICRO.2005.30.
-
(2005)
Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 355-364
-
-
Moya, V.1
Gonzalez, C.2
Roca, J.3
Fernandez, A.4
Espasa, R.5
-
18
-
-
70349100958
-
The OpenCL specification
-
[Online], Available
-
A. Munshi, "The OpenCL specification," Khronos OpenCL Working Group, Tech. Rep. 1.0 revision 48, 2009. [Online]. Available: http://www.khronos.org/registry/cl/specs/opencl-1.0.48.pdf.
-
(2009)
Khronos OpenCL Working Group, Tech. Rep. 1.0 revision 48
-
-
Munshi, A.1
-
19
-
-
84873052000
-
-
version 2.3. [Online], Available
-
CUDA Compute Unified Device Architecture Programming Guide, NVIDIA, 2009, version 2.3. [Online]. Available: http://developer.download.nvidia.com/compute/cuda/23/toolkit/docs/NVIDIACUDAProgrammingGuide2.3.pdf.
-
(2009)
CUDA Compute Unified Device Architecture Programming Guide
-
-
-
20
-
-
78049523808
-
-
version 2.3. [Online], Available
-
The NVIDIA CUDA Debugger, NVIDIA, 2009, version 2.3. [Online]. Available: http://developer.download.nvidia.com/compute/cuda/23/toolkit/docs/CUDAGDBUserManual2.3beta.pdf.
-
(2009)
The NVIDIA CUDA Debugger
-
-
-
22
-
-
27944432620
-
A high-performance area-efficient multifunction interpolator
-
I. Koren and P. Kornerup, Eds., Cape Cod, Massachusetts, [Online], Available
-
S. F. Oberman and M. Siu, "A high-performance area-efficient multifunction interpolator," in Proceedings of the 17th IEEE Symposium on Computer Arithmetic, I. Koren and P. Kornerup, Eds., Cape Cod, Massachusetts, 2005, pp. 272-279. [Online]. Available: http://dx.doi.org/10.1109/ARITH.2005.7.
-
(2005)
Proceedings of the 17th IEEE Symposium on Computer Arithmetic
, pp. 272-279
-
-
Oberman, S.F.1
Siu, M.2
-
23
-
-
78049506293
-
Improving cycle-level modular simulation by vectorization
-
Lille, France, [Online], Available
-
D. Parello, M. Bouache, and B. Goossens, "Improving cycle-level modular simulation by vectorization," in Rapid Simulation and Performance Evaluation: Methods and Tools, Lille, France, 2009. [Online]. Available: http://www2.lifl.fr/rapido/Rapido%2709/Rapido09Proceed/parello.pdf.
-
(2009)
Rapid Simulation and Performance Evaluation: Methods and Tools
-
-
Parello, D.1
Bouache, M.2
Goossens, B.3
-
24
-
-
17644388982
-
MicroLib: A case for the quantitative comparison of micro-architecture mechanisms
-
Portland, Oregon, [Online], Available
-
D. G. Perez, G. Mouchard, and O. Temam, "MicroLib: a case for the quantitative comparison of micro-architecture mechanisms," in Proceedings of the 37th annual IEEE/ACM International Symposium on Microarchitecture, Portland, Oregon, 2004, pp. 43-54. [Online]. Available: http://dx.doi.org/10.1109/MICRO.2004.25.
-
(2004)
Proceedings of the 37th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 43-54
-
-
Perez, D.G.1
Mouchard, G.2
Temam, O.3
-
25
-
-
0030653560
-
Using the SimOS machine simulator to study complex computer systems
-
[Online], Available
-
M. Rosenblum, E. Bugnion, S. Devine, and S. A. Herrod, "Using the SimOS machine simulator to study complex computer systems," ACM Transactions on Modeling and Computer Simulation, vol. 7, no. 1, pp. 78-103, 1997. [Online]. Available: http://doi.acm.org/10.1145/244804.244807.
-
(1997)
ACM Transactions on Modeling and Computer Simulation
, vol.7
, Issue.1
, pp. 78-103
-
-
Rosenblum, M.1
Bugnion, E.2
Devine, S.3
Herrod, S.A.4
-
27
-
-
67549107026
-
Quantitative analysis of the speed/accuracy trade-off in transaction level modeling
-
[Online], Available
-
G. Schirner and R. Dömer, "Quantitative analysis of the speed/accuracy trade-off in transaction level modeling," ACM Transactions on Embedded Computing Systems, vol. 8, no. 1, pp. 1-29, 2008. [Online]. Available: http://doi.acm.org/10.1145/1457246.1457250.
-
(2008)
ACM Transactions on Embedded Computing Systems
, vol.8
, Issue.1
, pp. 1-29
-
-
Schirner, G.1
Dömer, R.2
-
28
-
-
78650725832
-
A flexible simulation framework for graphics architectures
-
Grenoble, France, [Online], Available
-
J. W. Sheaffer, D. Luebke, and K. Skadron, "A flexible simulation framework for graphics architectures," in Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware, Grenoble, France, 2004, pp. 85-94. [Online]. Available: http://doi.acm.org/10.1145/1058129.1058142.
-
(2004)
Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware
, pp. 85-94
-
-
Sheaffer, J.W.1
Luebke, D.2
Skadron, K.3
-
30
-
-
33748289310
-
SimFlex: Statistical sampling of computer system simulation
-
[Online], Available
-
T. F. Wenisch, R. E. Wunderlich, M. Ferdman, A. Ailamaki, B. Falsafi, and J. C. Hoe, "SimFlex: Statistical sampling of computer system simulation," IEEE Micro, vol. 26, no. 4, pp. 18-31, 2006. [Online]. Available: http://dx.doi.org/10.1109/MM.2006.79.
-
(2006)
IEEE Micro
, vol.26
, Issue.4
, pp. 18-31
-
-
Wenisch, T.F.1
Wunderlich, R.E.2
Ferdman, M.3
Ailamaki, A.4
Falsafi, B.5
Hoe, J.C.6