-
1
-
-
77949509509
-
-
home page, online
-
Gravit home page. [online]. http://gravit.slowchop.com.
-
Gravit
-
-
-
2
-
-
77949527681
-
-
Open64. http://www.open64.net.
-
Open64
-
-
-
3
-
-
84900342836
-
SPEComp: A new benchmark suite for measuring parallel computer performance
-
V. Aslot, M. Domeika, R. Eigenmann, G. Gaertner, W. Jones, and B. Parady. SPEComp: A new benchmark suite for measuring parallel computer performance. Lecture Notes in Computer Science, pages 1-10, 2001.
-
(2001)
Lecture Notes in Computer Science
, pp. 1-10
-
-
Aslot, V.
Domeika, M.
Eigenmann, R.
Gaertner, G.
Jones, W.
Parady, B.
-
4
-
-
33846349887
-
A hierarchical O(N log N) force-calculation algorithm
-
J. Barnes and P. Hut. A hierarchical O(N log N) force-calculation algorithm. Nature, 324(6096):446-449, 1986.
-
(1986)
Nature
, vol.324
, Issue.6096
, pp. 446-449
-
-
Barnes, J.
Hut, P.
-
5
-
-
0032630166
-
-
T. M. Chilimbi, B. Davidson, and J. R. Larus. Cache-conscious structure definition. In PLDI '99: Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation, pages 13-24, New York, NY, USA, 1999. ACM Press. Separate a class into a "hot" class and a "cold" class.
-
-
-
-
6
-
-
17244375796
-
-
T. M. Chilimbi, M. D. Hill, and J. R. Larus. Cache-conscious structure layout. In PLDI '99: Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation, pages 1-12, New York, NY, USA, 1999. ACM Press. (1) Organize tree-like data structure together in cache. (2) Allocate contemporary elements in a cache block as much as possible.
-
-
-
-
12
-
-
31844446709
-
Automatic pool allocation: Improving performance by controlling data structure layout in the heap
-
C. Lattner and V. Adve. Automatic pool allocation: improving performance by controlling data structure layout in the heap. SIGPLAN Not., 40(6):129-142, 2005.
-
(2005)
SIGPLAN Not
, vol.40
, Issue.6
, pp. 129-142
-
-
Lattner, C.
Adve, V.
-
13
-
-
77949526544
-
-
NVIDIA Corporation. NVIDIA CUDA Compute Unified Device Architecture Programming Guide, 1.1 edition, November 2007.
-
-
-
-
14
-
-
0033076195
-
Augmenting Loop Tiling with Data Alignment for Improved Cache Performance
-
February
-
P. Panda, H. Nakamura, N. Dutt, and A. Nicolau. Augmenting Loop Tiling with Data Alignment for Improved Cache Performance. IEEE Trans. on Computers, 48(2):142-149, February 1999.
-
(1999)
IEEE Trans. on Computers
, vol.48
, Issue.2
, pp. 142-149
-
-
Panda, P.
Nakamura, H.
Dutt, N.
Nicolau, A.
-
16
-
-
43449094719
-
Program optimization space pruning for a multithreaded GPU
-
New York, NY, USA, ACM
-
S. Ryoo, C. I. Rodrigues, S. S. Stone, S. S. Baghsorkhi, S.-Z. Ueng, J. A. Stratton, and W.-m. W. Hwu. Program optimization space pruning for a multithreaded GPU. In CGO '08: Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization, pages 195-204, New York, NY, USA, 2008. ACM.
-
(2008)
CGO '08: Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization
, pp. 195-204
-
-
Ryoo, S.
Rodrigues, C.I.
Stone, S.S.
Baghsorkhi, S.S.
Ueng, S.-Z.
Stratton, J.A.
Hwu, W.-M.W.
-
17
-
-
0343462141
-
Automated Empirical Optimizations of Software and the ATLAS Project
-
R. Whaley, A. Petitet, and J. Dongarra. Automated Empirical Optimizations of Software and the ATLAS Project. Parallel Computing, 27(1-2):3-35, 2001.
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.
Petitet, A.
Dongarra, J.
-
18
-
-
0003651470
-
-
Addison-Wesley Longman Publishing Co, Inc. Boston, MA, USA
-
M. Woo and M. Sheridan. OpenGL Programming Guide: The Official Guide to Learning OpenGL, Version 1.2. Addison-Wesley Longman Publishing Co., Inc. Boston, MA, USA, 1999.
-
(1999)
OpenGL Programming Guide: The Official Guide to Learning OpenGL, Version 1.2
-
-
Woo, M.
Sheridan, M.
-
19
-
-
0038378242
-
A Comparison of Empirical and Model-driven Optimization
-
June
-
K. Yotov, X. Li, G. Ren, M. Cibulskis, G. DeJong, M. Garzarán, D. Padua, K. Pingali, P. Stodghill, and P. Wu. A Comparison of Empirical and Model-driven Optimization. In Proc. of Programming Language Design and Implementation, pages 63-76, June 2003.
-
(2003)
Proc. of Programming Language Design and Implementation
, pp. 63-76
-
-
Yotov, K.
Li, X.
Ren, G.
Cibulskis, M.
DeJong, G.
Garzarán, M.
Padua, D.
Pingali, K.
Stodghill, P.
Wu, P.