-
1
-
-
0029666641
-
Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor
-
Philadelphia, U.S.A
-
Tullsen DM, Eggers SJ, Emer JS, Levy HM, Lo JL, Stamm RL. Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor. ISCA, Philadelphia, U.S.A., 1996; 191-202.
-
(1996)
ISCA
, pp. 191-202
-
-
Tullsen, D.M.1
Eggers, S.J.2
Emer, J.S.3
Levy, H.M.4
Lo, J.L.5
Stamm, R.L.6
-
2
-
-
0031199614
-
Converting thread-level parallelism to instruction-level parallelism via simultaneous multithreading
-
Lo JL, Emer JS, Levy HM, Stamm RL, Tullsen DM. Converting thread-level parallelism to instruction-level parallelism via simultaneous multithreading. ACM Transactions on Computer Systems 1997; 15(3):322-354.
-
(1997)
ACM Transactions on Computer Systems
, vol.15
, Issue.3
, pp. 322-354
-
-
Lo, J.L.1
Emer, J.S.2
Levy, H.M.3
Stamm, R.L.4
Tullsen, D.M.5
-
3
-
-
0029200683
-
Simultaneous multithreading: Maximizing on-chip parallelism
-
Santa Margherita Ligure, Italy
-
Tullsen DM, Eggers S, Levy HM. Simultaneous multithreading: Maximizing on-chip parallelism. Proceedings of the 22th Annual International Symposium on Computer Architecture, Santa Margherita Ligure, Italy, 1995; 392-403.
-
(1995)
Proceedings of the 22th Annual International Symposium on Computer Architecture
, pp. 392-403
-
-
Tullsen, D.M.1
Eggers, S.2
Levy, H.M.3
-
6
-
-
5144222846
-
The block lanczos method for linear systems with multiple right-hand sides
-
Guennounia AE, Jbilou K, Sadok H. The block lanczos method for linear systems with multiple right-hand sides. Applied Numerical Mathematics 2004; 51(2-3):243-256.
-
(2004)
Applied Numerical Mathematics
, vol.51
, Issue.2-3
, pp. 243-256
-
-
Guennounia, A.E.1
Jbilou, K.2
Sadok, H.3
-
8
-
-
0001087280
-
Hyper-Threading technology architecture and microarchitecture
-
Marr DT, Binns F, Hill DL, Hinton G, Koufaty DA, Miller JA, Upton M. Hyper-Threading technology architecture and microarchitecture. Intel Technology Journal Q1 2002; 6(1):4-15.
-
(2002)
Intel Technology Journal Q1
, vol.6
, Issue.1
, pp. 4-15
-
-
Marr, D.T.1
Binns, F.2
Hill, D.L.3
Hinton, G.4
Koufaty, D.A.5
Miller, J.A.6
Upton, M.7
-
9
-
-
0001803542
-
Several strategies for reducing the bandwidth of matrices
-
Rose DJ, Willoughby RA (eds.). Plenum Press: New York
-
Cuthill E, McKee J. Several strategies for reducing the bandwidth of matrices. Sparse Matrices and their Applications, Rose DJ, Willoughby RA (eds.). Plenum Press: New York, 1972.
-
(1972)
Sparse Matrices and Their Applications
-
-
Cuthill, E.1
McKee, J.2
-
11
-
-
0036734103
-
Effects of ordering strategies and programming paradigms on sparse matrix computations
-
Oliker L, Li X, Husbands P, Biswas R. Effects of ordering strategies and programming paradigms on sparse matrix computations. SIAM Review 2002; 44(3):373-393.
-
(2002)
SIAM Review
, vol.44
, Issue.3
, pp. 373-393
-
-
Oliker, L.1
Li, X.2
Husbands, P.3
Biswas, R.4
-
12
-
-
0033189408
-
Memory hierarchy performance prediction for blocked sparse algorithms
-
Fraguela BB, Doallo R, Zapata EL. Memory hierarchy performance prediction for blocked sparse algorithms. Parallel Processing Letters 1999; 9(3):347-360.
-
(1999)
Parallel Processing Letters
, vol.9
, Issue.3
, pp. 347-360
-
-
Fraguela, B.B.1
Doallo, R.2
Zapata, E.L.3
-
13
-
-
0029713939
-
Block algorithms for sparse matrix computations on high performance workstations
-
Philadelphia, U.S.A
-
Navarro JJ, García E, Larriba-Pey JL, Juan T. Block algorithms for sparse matrix computations on high performance workstations. Proceedings of the IEEE International Conference on Supercomputing (ICS '96), Philadelphia, U.S.A., 1996; 301-309.
-
(1996)
Proceedings of the IEEE International Conference on Supercomputing (ICS '96)
, pp. 301-309
-
-
Navarro, J.J.1
García, E.2
Larriba-Pey, J.L.3
Juan, T.4
-
15
-
-
3042576437
-
Improving performance of sparse matrix-vector multiplication
-
Portland, OR
-
Pinar A, Heath M. Improving performance of sparse matrix-vector multiplication. Proceedings of Supercomputing, Portland, OR, 1999.
-
(1999)
Proceedings of Supercomputing
-
-
Pinar, A.1
Heath, M.2
-
17
-
-
25644439819
-
Performance optimization of irregular codes based on the combination of reordering and blocking techniques
-
Pichel JC, Heras DB, Cabaleiro JC, Rivera FF. Performance optimization of irregular codes based on the combination of reordering and blocking techniques. Parallel Computing 2005; 31(8-9):858-876.
-
(2005)
Parallel Computing
, vol.31
, Issue.8-9
, pp. 858-876
-
-
Pichel, J.C.1
Heras, D.B.2
Cabaleiro, J.C.3
Rivera, F.F.4
-
18
-
-
1542710739
-
Sparse tiling for stationary iterative methods
-
Strout MM, Carter L, Ferrante J, Kreaseck B. Sparse tiling for stationary iterative methods. International Journal of High Performance Computing Applications 2004; 18(1):95-114.
-
(2004)
International Journal of High Performance Computing Applications
, vol.18
, Issue.1
, pp. 95-114
-
-
Strout, M.M.1
Carter, L.2
Ferrante, J.3
Kreaseck, B.4
-
19
-
-
3042573689
-
Dynamic cache partitioning for simultaneous multithreading systems
-
Anaheim, CA, U.S.A
-
Suh G, Devadas S, Rudolph L. Dynamic cache partitioning for simultaneous multithreading systems. Proceeding ofthe 13th IASTED International Conference on Parallel and Distributed Computing System, Anaheim, CA, U.S.A., 2001.
-
(2001)
Proceeding Ofthe 13th IASTED International Conference on Parallel and Distributed Computing System
-
-
Suh, G.1
Devadas, S.2
Rudolph, L.3
-
20
-
-
0242370926
-
Code and data transformations for improving shared cache performance on SMT processors
-
Tokyo-Odaiba, Japan
-
Nikolopoulos DS. Code and data transformations for improving shared cache performance on SMT processors. International Symposium on High Performance Computing, Tokyo-Odaiba, Japan, 2003; 54-69.
-
(2003)
International Symposium on High Performance Computing
, pp. 54-69
-
-
Nikolopoulos, D.S.1
-
21
-
-
72649092106
-
Maximizing TLP with loop-parallelization on SMT
-
Austin, U.S.A
-
Puppin D, Tullsen DM. Maximizing TLP with loop-parallelization on SMT. Fifth Workshop on Multithreaded Execution, Architecture, and Compilation, Austin, U.S.A., 2001.
-
(2001)
Fifth Workshop on Multithreaded Execution, Architecture, and Compilation
-
-
Puppin, D.1
Tullsen, D.M.2
-
22
-
-
56749158843
-
Optimization of sparse matrix-vector multiply on emerging multicore platforms
-
Reno, U.S.A
-
Williams S, Oliker L, Vuduc R, Shalf J, Yelick K, Demmel J. Optimization of sparse matrix-vector multiply on emerging multicore platforms. Proceedings ofSupercomputing (SC), Reno, U.S.A., 2007.
-
(2007)
Proceedings Of Supercomputing (SC)
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
-
23
-
-
0035370397
-
Modeling data locality for the sparse matrix-vector product using distance measures
-
Heras DB, Cabaleiro JC, Rivera FF. Modeling data locality for the sparse matrix-vector product using distance measures. Parallel Computing 2001; 27:897-912.
-
(2001)
Parallel Computing
, vol.27
, pp. 897-912
-
-
Heras, D.B.1
Cabaleiro, J.C.2
Rivera, F.F.3
-
25
-
-
0031364101
-
Tuning compiler optimizations for simultaneous multithreading
-
Research Triangle Park, North Carolina, U.S.A
-
Lo JL, Eggers SJ, Levy HM, Parekh SS, Tullsen DM. Tuning compiler optimizations for simultaneous multithreading. International Symposium on Microarchitecture, Research Triangle Park, North Carolina, U.S.A., 1997; 114-124.
-
(1997)
International Symposium on Microarchitecture
, pp. 114-124
-
-
Lo, J.L.1
Eggers, S.J.2
Levy, H.M.3
Parekh, S.S.4
Tullsen, D.M.5
-
26
-
-
0042415671
-
An overview of the sparse basic linear algebra subprograms: The new standard from the BLAS technical forum
-
Duff I, Heroux M, Pozo R. An overview of the sparse basic linear algebra subprograms: The new standard from the BLAS technical forum. ACM Transactions on Mathematical Software 2002; 28(2):239-267.
-
(2002)
ACM Transactions on Mathematical Software
, vol.28
, Issue.2
, pp. 239-267
-
-
Duff, I.1
Heroux, M.2
Pozo, R.3
-
28
-
-
3042618790
-
Improving the locality of the sparse matrix-vector product on shared memory multiprocessors
-
PDP2004,A Coruna, Galicia, Spain
-
Pichel JC, Heras DB, Cabaleiro JC, Rivera FF. Improving the locality of the sparse matrix-vector product on shared memory multiprocessors. Euromicro Conference on Parallel, Distributed and Network-based Processing, PDP2004,A Coruna, Galicia, Spain, 2004; 66-71.
-
(2004)
Euromicro Conference on Parallel, Distributed and Network-based Processing
, pp. 66-71
-
-
Pichel, J.C.1
Heras, D.B.2
Cabaleiro, J.C.3
Rivera, F.F.4
-
29
-
-
0035450031
-
Modelling and improving locality for the sparse matrix-vector product on cache memories
-
Heras DB, Blanco V, Cabaleiro JC, Rivera FF. Modelling and improving locality for the sparse matrix-vector product on cache memories. Future Generation Computer Systems. Special Issue on High Performance Numerical Methods and Application 2001; 18(1):55-67.
-
(2001)
Future Generation Computer Systems. Special Issue on High Performance Numerical Methods and Application
, vol.18
, Issue.1
, pp. 55-67
-
-
Heras, D.B.1
Blanco, V.2
Cabaleiro, J.C.3
Rivera, F.F.4
-
30
-
-
84884063278
-
-
Princeton University Press: Princeton, NJ, U.S.A
-
Applegate D, Bixby R, Chvatal V, Cook W. The Traveling Salesman Problem: A Computational Study. Princeton University Press: Princeton, NJ, U.S.A., 2006.
-
(2006)
The Traveling Salesman Problem: A Computational Study
-
-
Applegate, D.1
Bixby, R.2
Chvatal, V.3
Cook, W.4
-
31
-
-
0003197949
-
University of Florida sparse matrix collection
-
15 October 2007
-
Davis T. University of Florida Sparse Matrix Collection. NA Digest 1997; 97(23). http://www.cise.ufl.edu/research/ sparse/matrices [15 October 2007].
-
(1997)
NA Digest
, vol.97
, pp. 23
-
-
Davis, T.1
-
32
-
-
0003734628
-
-
Department of Computer Science, University of Minnesota
-
Karypis G, Kumar V. METIS: A software package for partitioning unstructured graphs, partitioning meshes, and computing fill-reducing orderings of sparse matrices. Department of Computer Science, University of Minnesota, 1997.
-
(1997)
METIS: A Software Package for Partitioning Unstructured Graphs, Partitioning Meshes, and Computing Fill-reducing Orderings of Sparse Matrices
-
-
Karypis, G.1
Kumar, V.2
-
33
-
-
0003278283
-
The microarchitecture of the Pentium 4 processor
-
Hinton G, Sager D, Upton M, Boggs D, Carmean D, Kyker A, Roussel P. The microarchitecture of the Pentium 4 processor. Intel Technology Journal Q1 2001; 1-13.
-
(2001)
Intel Technology Journal Q1
, pp. 1-13
-
-
Hinton, G.1
Sager, D.2
Upton, M.3
Boggs, D.4
Carmean, D.5
Kyker, A.6
Roussel, P.7
-
35
-
-
0034268943
-
A portable programming interface for performance evaluation on modern processors
-
Browne S, Dongarra J, Garner N, Ho G, Mucci P. A portable programming interface for performance evaluation on modern processors. International Journal ofHigh Performance Computing Applications 2000; 14(3):189-204.
-
(2000)
International Journal OfHigh Performance Computing Applications
, vol.14
, Issue.3
, pp. 189-204
-
-
Browne, S.1
Dongarra, J.2
Garner, N.3
Ho, G.4
Mucci, P.5
|