-
1
-
-
51049093517
-
-
http://www.top500.org/lists/2007/06.
-
-
-
-
3
-
-
4143085072
-
Will microprocessor become simpler?
-
Nov
-
D. Alpert. Will microprocessor become simpler? Microprocessor Report, Nov 2003.
-
(2003)
Microprocessor Report
-
-
Alpert, D.1
-
4
-
-
35248845008
-
Is the schedule clause really necessary in openmp?
-
Jun
-
E. Ayguade, B. Blainey, A. Duran, J. Labarta, F. Martnez, X. Martorell, and R. Silvera. Is the schedule clause really necessary in openmp? In In Proceedings of the International Workshop of OpenMP Applications and Tools , Lecture Notes in Computer Science. Toronto, Canada., pages 147-159, Jun 2003.
-
(2003)
In Proceedings of the International Workshop of OpenMP Applications and Tools , Lecture Notes in Computer Science. Toronto, Canada
, pp. 147-159
-
-
Ayguade, E.1
Blainey, B.2
Duran, A.3
Labarta, J.4
Martnez, F.5
Martorell, X.6
Silvera, R.7
-
6
-
-
33744824945
-
Predictable performance in smt processors: Synergy between the os and smts
-
F.J. Cazorla, P.M. W. Knijnenburg, R. Sakellariou, E. Fernandez, A. Ramirez, and M. Valero. Predictable performance in smt processors: Synergy between the os and smts. IEEE Trans. Comput., 55(7):785-799, 2006.
-
(2006)
IEEE Trans. Comput
, vol.55
, Issue.7
, pp. 785-799
-
-
Cazorla, F.J.1
Knijnenburg, P.M.W.2
Sakellariou, R.3
Fernandez, E.4
Ramirez, A.5
Valero, M.6
-
7
-
-
12444293782
-
The performance impact of computational efficiency on HPC clusters with hyper-threading technology
-
O. Celebioglu, A. Saify, T. Leng, J. Hsieh, V. Mashayekhi, and R. Rooholamini. The performance impact of computational efficiency on HPC clusters with hyper-threading technology. In 18th International Parallel and Dis tributed Processing Symposium (IPDPS'04) - Workshop 14, 2004.
-
(2004)
18th International Parallel and Dis tributed Processing Symposium (IPDPS'04) - Workshop 14
-
-
Celebioglu, O.1
Saify, A.2
Leng, T.3
Hsieh, J.4
Mashayekhi, V.5
Rooholamini, R.6
-
8
-
-
33847261237
-
-
M. Curtis-Maury, T. Wang, C. Antonopoulos, and D. Nikolopoulos. Integrating multiple forms of multithreaded execution on multi-smt systems: A study with scientific applications. In Proceedings of the Second International Conference on the Quantitative Evaluation of Systems (QEST'05), pages 199-, 2005.
-
M. Curtis-Maury, T. Wang, C. Antonopoulos, and D. Nikolopoulos. Integrating multiple forms of multithreaded execution on multi-smt systems: A study with scientific applications. In Proceedings of the Second International Conference on the Quantitative Evaluation of Systems (QEST'05), pages 199-, 2005.
-
-
-
-
9
-
-
32844460093
-
Automatic thread distribution for nested parallelism in openmp
-
Cambridge, Massachusetts, USA, June
-
A. Duran, M. Gonzalez, J. Corbalan, X. Martorell, E.Ayguade, J. Labarta, and R. Silvera. Automatic thread distribution for nested parallelism in openmp. In .International Conference on Supercomputing (ICS05).In Proceedings of the 19th ACM International Conference on Supercomputing. Cambridge, Massachusetts, USA, pages 121-130, June 2005.
-
(2005)
International Conference on Supercomputing (ICS05).In Proceedings of the 19th ACM International Conference on Supercomputing
, pp. 121-130
-
-
Duran, A.1
Gonzalez, M.2
Corbalan, J.3
Martorell, X.4
Ayguade, E.5
Labarta, J.6
Silvera, R.7
-
10
-
-
46449121306
-
Advanced POWER Virtualization on IBM eServer p5 Servers: Architecture and Performance Considerations
-
B. Gibbs, B. Atyam, F. Berres, B. Blanchard, L. Castillo, P. Coelho, N. Guerin, L. Liu, C. Diniz Maciel, and C. Thirumalai. Advanced POWER Virtualization on IBM eServer p5 Servers: Architecture and Performance Considerations. IBM Redbook, 2005.
-
(2005)
IBM Redbook
-
-
Gibbs, B.1
Atyam, B.2
Berres, F.3
Blanchard, B.4
Castillo, L.5
Coelho, P.6
Guerin, N.7
Liu, L.8
Diniz Maciel, C.9
Thirumalai, C.10
-
11
-
-
21544447553
-
Analysis of System Overhead on Parallel Computers
-
Rome, Italy, December, Available from
-
R. Gioiosa, F. Petrini, K. Davis, and F. Lebaillif-Delamare. Analysis of System Overhead on Parallel Computers. In The 4th IEEE International Symposium on Signal Processing and Information Technolog y (ISSPIT 2004), Rome, Italy, December 2004. Available from http://bravo.ce.uniroma2.it/home/ gioiosa/pub/isspit04.pdf.
-
(2004)
The 4th IEEE International Symposium on Signal Processing and Information Technolog y (ISSPIT 2004)
-
-
Gioiosa, R.1
Petrini, F.2
Davis, K.3
Lebaillif-Delamare, F.4
-
13
-
-
51049100784
-
-
IBM. Cell broadband engine architecture
-
IBM. Cell broadband engine architecture.
-
-
-
-
14
-
-
51049101443
-
-
IBM. Cell broadband engine programming handbook
-
IBM. Cell broadband engine programming handbook.
-
-
-
-
15
-
-
51049122086
-
-
IBM. PowerPC Architecture book: Book I: User Instruction Set Architecture
-
IBM. PowerPC Architecture book: Book I: User Instruction Set Architecture.
-
-
-
-
16
-
-
51049101224
-
-
IBM. PowerPC Architecture book: Book II: PowerPC Virtual Environment Architecture
-
IBM. PowerPC Architecture book: Book II: PowerPC Virtual Environment Architecture.
-
-
-
-
17
-
-
51049124447
-
-
IBM. PowerPC Architecture book: Book III: PowerPC Operating Environment Architecture
-
IBM. PowerPC Architecture book: Book III: PowerPC Operating Environment Architecture.
-
-
-
-
18
-
-
33645501924
-
Performance characteristics of the multi-zone nas parallel benchmarks
-
H. Jin and R.F. Van der Wijngaart. Performance characteristics of the multi-zone nas parallel benchmarks. J. Parallel Distrib. Comput., 66(5):674-685, 2006.
-
(2006)
J. Parallel Distrib. Comput
, vol.66
, Issue.5
, pp. 674-685
-
-
Jin, H.1
Van der Wijngaart, R.F.2
-
19
-
-
3042669130
-
Ibm power5 chip: A dual-core multithreaded processor
-
R. Kalla, B. Sinharoy, and J.M. Tendler. Ibm power5 chip: A dual-core multithreaded processor. IEEE Micro, 24:40-47, 2004.
-
(2004)
IEEE Micro
, vol.24
, pp. 40-47
-
-
Kalla, R.1
Sinharoy, B.2
Tendler, J.M.3
-
20
-
-
0005317101
-
Dip: A parallel program development environment
-
J. Labarta, S. Girona, V. Pillet, T. Cortes, and L. Gregoris. Dip: A parallel program development environment. In Euro-Par, Vol. II, pages 665-674, 1996.
-
(1996)
Euro-Par
, vol.2
, pp. 665-674
-
-
Labarta, J.1
Girona, S.2
Pillet, V.3
Cortes, T.4
Gregoris, L.5
-
21
-
-
37549032725
-
IBM power6 microarchitecture
-
H.Q. Le, W.J. Starke, J.S. Fields, F.P. O'Connell, D.Q. Nguyen, B.J. Ronchetti, W.M. Sauer, E.M. Schwarz, and M. T. Vaden. IBM power6 microarchitecture. IBM Journal of Research and Development, 51:639-662, 2007.
-
(2007)
IBM Journal of Research and Development
, vol.51
, pp. 639-662
-
-
Le, H.Q.1
Starke, W.J.2
Fields, J.S.3
O'Connell, F.P.4
Nguyen, D.Q.5
Ronchetti, B.J.6
Sauer, W.M.7
Schwarz, E.M.8
Vaden, M.T.9
-
22
-
-
0001087280
-
Hyper-threading technology architecture and microarchitecture
-
Feb
-
D. T. Marr, F. Binns, D.L. Hill, G. Hinton, D.A. Koufaty, J. A. Miller, and M. Upton. Hyper-threading technology architecture and microarchitecture. Intel Technology Journal, 6(1), Feb 2002.
-
(2002)
Intel Technology Journal
, vol.6
, Issue.1
-
-
Marr, D.T.1
Binns, F.2
Hill, D.L.3
Hinton, G.4
Koufaty, D.A.5
Miller, J.A.6
Upton, M.7
-
24
-
-
2442670256
-
-
NASA. Nas parallel benchmarks. http://www.nas.nasa.gov/Resources/ Software/npb.html.
-
Nas parallel benchmarks
-
-
-
25
-
-
84877019178
-
The Case of the Missing Supercomputer Performance: Achieving Optimal Performance on the 8,192 Processors of ASCI Q
-
Phoenix, Arizona, November 10-16
-
F. Petrini, D. Kerbyson, and S. Pakin. The Case of the Missing Supercomputer Performance: Achieving Optimal Performance on the 8,192 Processors of ASCI Q. In ACM/IEEE SC2003, Phoenix, Arizona, November 10-16, 2003.
-
(2003)
ACM/IEEE SC2003
-
-
Petrini, F.1
Kerbyson, D.2
Pakin, S.3
-
26
-
-
34548042910
-
Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches
-
M.K. Qureshi and Y.N. Patt. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches. In MICRO, 2006.
-
(2006)
MICRO
-
-
Qureshi, M.K.1
Patt, Y.N.2
-
27
-
-
51049115285
-
-
K. Schloegel, G. Karypis, and V. Kumar. Parallel multilevel algorithms for multi-constraint graph partitioning. Technical report
-
K. Schloegel, G. Karypis, and V. Kumar. Parallel multilevel algorithms for multi-constraint graph partitioning. Technical report.
-
-
-
-
28
-
-
51049103129
-
-
M.J. Serrano, R. Wood, and M. Nemirovsky. A study of multistreamed superscalar processors. Technical Report #93-05, University of California, Santa Barbara, 1993.
-
M.J. Serrano, R. Wood, and M. Nemirovsky. A study of multistreamed superscalar processors. Technical Report #93-05, University of California, Santa Barbara, 1993.
-
-
-
-
30
-
-
25844437046
-
POWER5 system microarchitecture
-
B. Sinharoy, R. N. Kalla, J. M. Tendler, R. J. Eickemeyer, and J. B. Joyner. POWER5 system microarchitecture. IBM Journal of Research and Development, 49(4/5):505-521, 2005.
-
(2005)
IBM Journal of Research and Development
, vol.49
, Issue.4-5
, pp. 505-521
-
-
Sinharoy, B.1
Kalla, R.N.2
Tendler, J.M.3
Eickemeyer, R.J.4
Joyner, J.B.5
-
31
-
-
0037171091
-
The siesta method for ab initio ordern materials simulation
-
J.M. Soler, E. Artacho, J.D. Gale, A. Garca, J. Junquera, P. Ordejn, and D. Snchez-Portal. The siesta method for ab initio ordern materials simulation. Journal of Physics: Condensed Matter, 14(11), 2002.
-
(2002)
Journal of Physics: Condensed Matter
, vol.14
, Issue.11
-
-
Soler, J.M.1
Artacho, E.2
Gale, J.D.3
Garca, A.4
Junquera, J.5
Ordejn, P.6
Snchez-Portal, D.7
-
32
-
-
32844469374
-
System noise, os clock ticks, and fine-grained parallel applications
-
New York, NY, USA, ACM Press
-
D. Tsafrir, Y. Etsion, D.G. Feitelson, and S. Kirkpatrick. System noise, os clock ticks, and fine-grained parallel applications. In ICS '05: Proceedings of the 19th annual international conference on Supercomputing, pages 303-312, New York, NY, USA, 2005. ACM Press.
-
(2005)
ICS '05: Proceedings of the 19th annual international conference on Supercomputing
, pp. 303-312
-
-
Tsafrir, D.1
Etsion, Y.2
Feitelson, D.G.3
Kirkpatrick, S.4
|