-
2
-
-
0034268943
-
A portable programming interface for performance evaluation on modern processors
-
S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci. A portable programming interface for performance evaluation on modern processors. International Journal of High Performance Computing Applications, 14(3): 189-204, 2000.
-
(2000)
International Journal of High Performance Computing Applications
, vol.14
, Issue.3
, pp. 189-204
-
-
Browne, S.1
Dongarra, J.2
Garner, N.3
Ho, G.4
Mucci, P.5
-
3
-
-
21244474546
-
Predicting inter-thread cache contention on a chip multi-processor architecture
-
D. Chandra, R Guo, S. Kim, and Y. Solihin. Predicting inter-thread cache contention on a chip multi-processor architecture. In Proc. of the 11th Int. Symposium on High-Performance Computer Architecture (HPCA), pages 340-351, 2005.
-
(2005)
Proc. of the 11th Int. Symposium on High-Performance Computer Architecture (HPCA)
, pp. 340-351
-
-
Chandra, D.1
Guo, R.2
Kim, S.3
Solihin, Y.4
-
4
-
-
35248852476
-
Scheduling threads for constructive cache sharing on CMPs
-
S. Chen, P. B. Gibbons, M. Kozuch, V. Liaskovitis, A. Ail-amaki, G. E. Blelloch, B. Falsafl, L. Fix, N. Hardavellas, T. C. Mowry, and C. Wilkerson. Scheduling threads for constructive cache sharing on CMPs. In Proc, of the 19th ACM Symposium on Parallel algorithms and architectures (SPAA), pages 105-115, 2007.
-
(2007)
Proc, of the 19th ACM Symposium on Parallel algorithms and architectures (SPAA)
, pp. 105-115
-
-
Chen, S.1
Gibbons, P.B.2
Kozuch, M.3
Liaskovitis, V.4
Ail-amaki, A.5
Blelloch, G.E.6
Falsafl, B.7
Fix, L.8
Hardavellas, N.9
Mowry, T.C.10
Wilkerson, C.11
-
5
-
-
0003197949
-
University of Florida Sparse Matrix Collection
-
June
-
T. Davis. University of Florida Sparse Matrix Collection. NA Digest, 97(23), June 1997. http://www.cise.ufl.edu/research/sparse/matrices.
-
(1997)
NA Digest
, vol.97
, Issue.23
-
-
Davis, T.1
-
7
-
-
0035370397
-
Modeling data locality for the sparse matrix-vector product using distance measures
-
D. B. Heras, J. C. Cabaleiro, and F. F. Rivera. Modeling data locality for the sparse matrix-vector product using distance measures. Parallel Computing, 27:897-912, 2001.
-
(2001)
Parallel Computing
, vol.27
, pp. 897-912
-
-
Heras, D.B.1
Cabaleiro, J.C.2
Rivera, F.F.3
-
9
-
-
1542501019
-
SPARSITY: Framework for optimizing sparse matrix-vector multiply
-
February
-
E. J. Im, K. A. Yelick, and R. Vuduc. SPARSITY: Framework for optimizing sparse matrix-vector multiply. International Journal of High Performance Computing Applications, 18(1): 135-158, February 2004.
-
(2004)
International Journal of High Performance Computing Applications
, vol.18
, Issue.1
, pp. 135-158
-
-
Im, E.J.1
Yelick, K.A.2
Vuduc, R.3
-
11
-
-
0031199614
-
Converting thread-level parallelism to instruction-level parallelism via simultaneous multithreading
-
J. L. Lo, J. S. Emer, H. M. Levy, R. L. Stamm, and D. M. Tullsen. Converting thread-level parallelism to instruction-level parallelism via simultaneous multithreading. ACM Transactions on Computer Systems, 15(3): 322-354, 1997.
-
(1997)
ACM Transactions on Computer Systems
, vol.15
, Issue.3
, pp. 322-354
-
-
Lo, J.L.1
Emer, J.S.2
Levy, H.M.3
Stamm, R.L.4
Tullsen, D.M.5
-
12
-
-
0001087280
-
Hyper-Threading technology architecture and microarchitecture
-
D. T. Marr, F. Binns, D. L. Hill, G. Hinton, D. A. Koufary, J. A. Miller, and M. Upton. Hyper-Threading technology architecture and microarchitecture. Intel Technology Journal Ql, 2002.
-
(2002)
Intel Technology Journal Ql
-
-
Marr, D.T.1
Binns, F.2
Hill, D.L.3
Hinton, G.4
Koufary, D.A.5
Miller, J.A.6
Upton, M.7
-
14
-
-
0036734103
-
Effects of ordering strategies and programming paradigms on sparse matrix computations
-
L. Oliker, X. Li, P. Husbands, and R. Biswas. Effects of ordering strategies and programming paradigms on sparse matrix computations. SIAM Review, 44(3): 373-393, 2002.
-
(2002)
SIAM Review
, vol.44
, Issue.3
, pp. 373-393
-
-
Oliker, L.1
Li, X.2
Husbands, P.3
Biswas, R.4
-
15
-
-
3042618790
-
Improving the locality of the sparse matrix-vector product on shared memory multiprocessors
-
J. C. Pichel, D. B. Heras, J. C. Cabaleiro, and F. F. Rivera. Improving the locality of the sparse matrix-vector product on shared memory multiprocessors. In Euromicro Conf. on Parallel, Distributed and Network-based Processing (PDP), pages 66-71, 2004.
-
(2004)
Euromicro Conf. on Parallel, Distributed and Network-based Processing (PDP)
, pp. 66-71
-
-
Pichel, J.C.1
Heras, D.B.2
Cabaleiro, J.C.3
Rivera, F.F.4
-
16
-
-
25644439819
-
Performance optimization of irregular codes based on the combination of reordering and blocking techniques
-
J. C. Pichel, D. B. Heras, J. C. Cabaleiro, and F. F. Rivera. Performance optimization of irregular codes based on the combination of reordering and blocking techniques. Parallel Computing, 31(8-9): 858-876, 2005.
-
(2005)
Parallel Computing
, vol.31
, Issue.8-9
, pp. 858-876
-
-
Pichel, J.C.1
Heras, D.B.2
Cabaleiro, J.C.3
Rivera, F.F.4
-
17
-
-
3042576437
-
Improving performance of sparse matrix-vector multiplication
-
A. Pinar and M. Heath. Improving performance of sparse matrix-vector multiplication. In Proc. of Supercomputing, 1999.
-
(1999)
Proc. of Supercomputing
-
-
Pinar, A.1
Heath, M.2
-
23
-
-
56749158843
-
Optimization of sparse matrix-vector multiply on emerging multicore platforms
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel. Optimization of sparse matrix-vector multiply on emerging multicore platforms. In Proc. of Supercomputing (SC), 2007.
-
(2007)
Proc. of Supercomputing (SC)
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
-
24
-
-
47249123399
-
Cachescouts: Fine-grain monitoring of shared caches in CMP platforms
-
L. Zhao, R. Iyer, R. Illikkal, J. Moses, S. Makineni, and D. Newell. Cachescouts: Fine-grain monitoring of shared caches in CMP platforms. In Proc. of the 16th Int. Conference on Parallel Architecture and Compilation Techniques (PACT), pages 339-352, 2007.
-
(2007)
Proc. of the 16th Int. Conference on Parallel Architecture and Compilation Techniques (PACT)
, pp. 339-352
-
-
Zhao, L.1
Iyer, R.2
Illikkal, R.3
Moses, J.4
Makineni, S.5
Newell, D.6
|