-
1
-
-
84873433370
-
Tilepro64 processor-product brief
-
Tilera
-
Tilera, "TILEPro64 Processor-Product Brief," Tilera, Tech. Rep., 2012.
-
(2012)
Tilera, Tech. Rep.
-
-
-
2
-
-
84872539869
-
-
[Online]. Available
-
NVIDIA, "NVIDIA Next Generation CUDA Compute Architecture:Fermi, " 2012. [Online]. Available: http://www.nvidia.com/content/PDF/ fermiwhitepapers/NVIDIAFermiComputeArchitectureWhitepaper.pdf
-
(2012)
NVIDIA Next Generation CUDA Compute Architecture:Fermi
-
-
-
3
-
-
51049083725
-
-
[Online]. Available
-
TOP500, "TOP500 Supercomputer Site." [Online]. Available: http://www.top500.org
-
TOP500 Supercomputer Site
-
-
-
4
-
-
78650178980
-
Feedback-directed page placement for ccnuma via hardware-generated memory traces
-
J. Marathe, V. Thakkar, and F. Mueller, "Feedback-Directed Page Placement for ccNUMA via Hardware-Generated Memory Traces," Journal of Parallel and Distributed Computing, vol. 70, no. 12, pp. 1204-1219, 2010.
-
(2010)
Journal of Parallel and Distributed Computing
, vol.70
, Issue.12
, pp. 1204-1219
-
-
Marathe, J.1
Thakkar, V.2
Mueller, F.3
-
5
-
-
84958986918
-
Upmlib: A runtime system for tuning the memory performance of openmp programs on scalable shared-memory multiprocessors
-
D. S. Nikolopoulos, T. S. Papatheodorou, C. D. Polychronopoulos, J. Labarta, and E. Ayguadé, "UPMLIB: A Runtime System for Tuning the Memory Performance of OpenMP Programs on Scalable Shared-Memory Multiprocessors," in The 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, 2000, pp. 85-99.
-
(2000)
The 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
, pp. 85-99
-
-
Nikolopoulos, D.S.1
Papatheodorou, T.S.2
Polychronopoulos, C.D.3
Labarta, J.4
Ayguadé, E.5
-
6
-
-
0006011065
-
User-level dynamic page migration for multiprogrammed shared-memory multiprocessors
-
D. S. Nikolopoulos, T. S. Papatheodorou, C. D. Polychronopoulos, J. Labarta, and E. Ayguade, "User-level dynamic page migration for multiprogrammed shared-memory multiprocessors," in Proc. of the 2000 International Conference on Parallel Processing, 2000.
-
(2000)
Proc. of the 2000 International Conference on Parallel Processing
-
-
Nikolopoulos, D.S.1
Papatheodorou, T.S.2
Polychronopoulos, C.D.3
Labarta, J.4
Ayguade, E.5
-
7
-
-
47249165359
-
Thread clustering: Sharing-aware scheduling on smp-cmp-smt multiprocessors
-
D. Tam, R. Azimi, and M. Stumm, "Thread clustering: sharing-aware scheduling on SMP-CMP-SMT multiprocessors," in Proc. of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007, ser. EuroSys '07, 2007.
-
(2007)
Proc. of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007, Ser. EuroSys '07
-
-
Tam, D.1
Azimi, R.2
Stumm, M.3
-
8
-
-
77952247820
-
Enhancing operating system support for multicore processors by using hardware performance monitoring
-
R. Azimi, D. K. Tam, L. Soares, and M. Stumm, "Enhancing operating system support for multicore processors by using hardware performance monitoring," SIGOPS Oper. Syst. Rev., 2009.
-
(2009)
SIGOPS Oper. Syst. Rev.
-
-
Azimi, R.1
Tam, D.K.2
Soares, L.3
Stumm, M.4
-
9
-
-
63549125482
-
Prediction models for multi-dimensional power-performance optimization on many cores
-
M. Curtis-Maury, A. Shah, F. Blagojevic, D. S. Nikolopoulos, B. R. de Supinski, and M. Schulz, "Prediction Models for Multi-Dimensional Power-Performance Optimization on Many Cores," in Proc. of the 17th international conference on Parallel architectures and compilation techniques, 2008.
-
(2008)
Proc. of the 17th International Conference on Parallel Architectures and Compilation Techniques
-
-
Curtis-Maury, M.1
Shah, A.2
Blagojevic, F.3
Nikolopoulos, D.S.4
De Supinski, B.R.5
Schulz, M.6
-
10
-
-
77953990600
-
Hybrid mpi/openmp power-aware computing
-
D. Li, B. de Supinski, M. Schulz, K. Cameron, and D. Nikolopoulos, "Hybrid MPI/OpenMP Power-Aware Computing," in IEEE International Symposium on Parallel Distributed Processing, 2010.
-
(2010)
IEEE International Symposium on Parallel Distributed Processing
-
-
Li, D.1
De Supinski, B.2
Schulz, M.3
Cameron, K.4
Nikolopoulos, D.5
-
11
-
-
84863014417
-
Critical path-based thread placement for numa systems
-
C. Su, D. Li, D. Nikolopoulos, M. Grove, K. W. Cameron, and B. R. de Supinski, "Critical path-based thread placement for numa systems," in The second international workshop on Performance modeling, benchmarking and simulation of high performance computing systems, 2011.
-
(2011)
The Second International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems
-
-
Su, C.1
Li, D.2
Nikolopoulos, D.3
Grove, M.4
Cameron, K.W.5
De Supinski, B.R.6
-
12
-
-
77953970995
-
Power-aware mpi task aggregation prediction for high-end computing systems
-
D. Li, D. Nikolopoulos, K. Cameron, B. de Supinski, and M. Schulz, "Power-Aware MPI Task Aggregation Prediction for High-End Computing Systems," in IEEE International Symposium on Parallel Distributed Processing, 2010.
-
(2010)
IEEE International Symposium on Parallel Distributed Processing
-
-
Li, D.1
Nikolopoulos, D.2
Cameron, K.3
De Supinski, B.4
Schulz, M.5
-
13
-
-
84873469116
-
Comparing scalability prediction strategies on an smp of cmps
-
K. Singh, M. Curtis-Maury, S. A. McKee, F. Blagojević, D. S. Nikolopoulos, B. R. de Supinski, and M. Schulz, "Comparing Scalability Prediction Strategies on an SMP of CMPs," in Proc. of the 16th international Euro-Par conference on Parallel processing, 2010.
-
(2010)
Proc. of the 16th International Euro-Par Conference on Parallel Processing
-
-
Singh, K.1
Curtis-Maury, M.2
McKee, S.A.3
Blagojević, F.4
Nikolopoulos, D.S.5
De Supinski, B.R.6
Schulz, M.7
-
14
-
-
53349118980
-
Identifying energy-efficient concurrency levels using machine learning
-
M. Curtis-Maury, K. Singh, S. A. McKee, F. Blagojevic, D. S. Nikolopoulos, B. R. de Supinski, and M. Schulz, "Identifying Energy-Efficient Concurrency Levels Using Machine Learning," in Proc. of the 2007 IEEE International Conference on Cluster Computing, 2007.
-
(2007)
Proc. of the 2007 IEEE International Conference on Cluster Computing
-
-
Curtis-Maury, M.1
Singh, K.2
McKee, S.A.3
Blagojevic, F.4
Nikolopoulos, D.S.5
De Supinski, B.R.6
Schulz, M.7
-
15
-
-
56849098650
-
Data and thread affinity in openmp programs
-
C. Terboven, D. an Mey, D. Schmidl, H. Jin, and T. Reichstein, "Data and Thread Affinity in OpenMP Programs," in Proc. of the 2008 Workshop on Memory Access on Future Processors: A Solved Problem?, 2008.
-
(2008)
Proc. of the 2008 Workshop on Memory Access on Future Processors: A Solved Problem?
-
-
Terboven, C.1
An Mey, D.2
Schmidl, D.3
Jin, H.4
Reichstein, T.5
-
16
-
-
73649124643
-
Memory affinity for hierarchical shared memory multiprocessors
-
C. Ribeiro, J.-F. Mehaut, A. Carissimi, M. Castro, and L. Fernandes, "Memory Affinity for Hierarchical Shared Memory Multiprocessors," in Computer Architecture and High Performance Computing, 2009. SBAC-PAD '09. 21st International Symposium on, 2009.
-
(2009)
Computer Architecture and High Performance Computing, 2009. SBAC-PAD '09. 21st International Symposium on
-
-
Ribeiro, C.1
Mehaut, J.-F.2
Carissimi, A.3
Castro, M.4
Fernandes, L.5
-
17
-
-
77954040030
-
Dynamic task and data placement over numa architectures: An openmp runtime perspective
-
F. Broquedis, N. Furmento, B. Goglin, R. Namyst, and P.-A. Wacrenier, "Dynamic Task and Data Placement over NUMA Architectures: An OpenMP Runtime Perspective," in Proc. of the 5th International Workshop on OpenMP: Evolving OpenMP in an Age of Extreme Parallelism, 2009.
-
(2009)
Proc. of the 5th International Workshop on OpenMP: Evolving OpenMP in An Age of Extreme Parallelism
-
-
Broquedis, F.1
Furmento, N.2
Goglin, B.3
Namyst, R.4
Wacrenier, P.-A.5
-
20
-
-
33646222013
-
Papi: A portable interface to hardware performance counters
-
P. J. Mucci, S. Browne, C. Deane, and G. Ho, "PAPI: A Portable Interface to Hardware Performance Counters," in In Proceedings of the Department of Defense HPCMP Users Group Conference, 1999, pp. 7-10.
-
(1999)
Proceedings of the Department of Defense HPCMP Users Group Conference
, pp. 7-10
-
-
Mucci, P.J.1
Browne, S.2
Deane, C.3
Ho, G.4
-
22
-
-
84873457282
-
-
[Online]. Available
-
"WattsUp Meter Tool." [Online]. Available: https://www. wattsupmeters.com
-
WattsUp Meter Tool
-
-
-
24
-
-
84873419352
-
Performance and the nas parallel benchmarks
-
D. H. Bailey, "Performance and the NAS Parallel Benchmarks," International Journal of High Performance Computing Applications, vol. 5, no. 3, pp. 63-73, 1994.
-
(1994)
International Journal of High Performance Computing Applications
, vol.5
, Issue.3
, pp. 63-73
-
-
Bailey, D.H.1
-
25
-
-
0027621747
-
The sequoia 2000 benchmark
-
M. Stonebraker, J. Frew, K. Gardels, and J. Meredith, "The Sequoia 2000 Benchmark," in Proc. of the 1993 ACM SIGMOD International Conference on Management of Data, 1993.
-
(1993)
Proc. of the 1993 ACM SIGMOD International Conference on Management of Data
-
-
Stonebraker, M.1
Frew, J.2
Gardels, K.3
Meredith, J.4
-
26
-
-
34248374123
-
Online power-performance adaptation of multithreaded programs using hardware event-based prediction
-
M. Curtis-Maury, J. Dzierwa, C. D. Antonopoulos, and D. S. Nikolopoulos, "Online Power-Performance Adaptation of Multithreaded Programs Using Hardware Event-Based Prediction," in Proc. of the 20th annual international conference on Supercomputing, 2006.
-
(2006)
Proc. of the 20th Annual International Conference on Supercomputing
-
-
Curtis-Maury, M.1
Dzierwa, J.2
Antonopoulos, C.D.3
Nikolopoulos, D.S.4
|