-
1
-
-
0034436574
-
Extending OpenMP for NUMA machines
-
J. Bircsak, P. Craig, R. Crowell, Z. Cvetanovic, J. Harris, C. A. Nelson, and C. D. Offner. Extending OpenMP for NUMA machines. Scientific Programming, 8:163-181, 2000.
-
(2000)
Scientific Programming
, vol.8
, pp. 163-181
-
-
Bircsak, J.1
Craig, P.2
Crowell, R.3
Cvetanovic, Z.4
Harris, J.5
Nelson, C.A.6
Offner, C.D.7
-
4
-
-
0346043334
-
Data distribution support on distributed shared memory multiprocessors
-
ACM Press
-
R. Chandra, D.-K. Chen, R. Cox, D. E. Maydan, N. Nedeljkovic, and J. M. Anderson. Data distribution support on distributed shared memory multiprocessors. In Proceedings of the ACM SIGPLAN 1997 conference on Programming language design and implementation, pages 334-345. ACM Press, 1997.
-
(1997)
Proceedings of the ACM SIGPLAN 1997 Conference on Programming Language Design and Implementation
, pp. 334-345
-
-
Chandra, R.1
Chen, D.-K.2
Cox, R.3
Maydan, D.E.4
Nedeljkovic, N.5
Anderson, J.M.6
-
5
-
-
84976707347
-
Scheduling and page migration for multiprocessor compute servers
-
ACM Press
-
R. Chandra, S. Devine, B. Verghese, A. Gupta, and M. Rosenblum. Scheduling and page migration for multiprocessor compute servers. In Proceedings of the sixth international conference on Architectural support for programming languages and operating systems, pages 12-24. ACM Press, 1994.
-
(1994)
Proceedings of the Sixth International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 12-24
-
-
Chandra, R.1
Devine, S.2
Verghese, B.3
Gupta, A.4
Rosenblum, M.5
-
9
-
-
0042348170
-
-
Doctoral thesis, Mathematics and Computer Science, Department of Information Technology, University of Uppsala, May
-
F. Edelvik. Hybrid Solvers for the Maxwell Equations in Time-Domain. Doctoral thesis, Mathematics and Computer Science, Department of Information Technology, University of Uppsala, May 2002. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-2156.
-
(2002)
Hybrid Solvers for the Maxwell Equations in Time-domain
-
-
Edelvik, F.1
-
10
-
-
0016939622
-
An algorithm for reducing the bandwidth and profile of a sparse matrix
-
April
-
N. E. Gibbs, W. G. Poole, Jr., and P. K. Stockmeyer. An Algorithm for Reducing the Bandwidth and Profile of a Sparse Matrix. SIAM Journal on Numerical Analysis, 13(2):236-250, April 1976.
-
(1976)
SIAM Journal on Numerical Analysis
, vol.13
, Issue.2
, pp. 236-250
-
-
Gibbs, N.E.1
Poole, W.G., Jr.2
Stockmeyer, P.K.3
-
11
-
-
23844446314
-
Performance of PDE solvers on a self-optimizing NUMA architecture
-
S. Holmgren, M. Nordén, J. Rantakokko, and D. Wallin. Performance of PDE Solvers on a Self-Optimizing NUMA Architecture. Parallel Algorithms and Applications, 17(4):285-299, 2002.
-
(2002)
Parallel Algorithms and Applications
, vol.17
, Issue.4
, pp. 285-299
-
-
Holmgren, S.1
Nordén, M.2
Rantakokko, J.3
Wallin, D.4
-
12
-
-
0003648799
-
The OpenMP implementation of NAS parallel benchmarks and its performance
-
NASA Ames Research Center
-
H. Jin, M. Frumkin, and J. Yan. The OpenMP Implementation of NAS Parallel Benchmarks and Its Performance. NAS Technical Report NAS-99-011, NASA Ames Research Center, 1999.
-
(1999)
NAS Technical Report NAS-99-011
-
-
Jin, H.1
Frumkin, M.2
Yan, J.3
-
13
-
-
35048880638
-
Improving geographical locality of data for shared memory implementations of PDE solvers
-
Computational Science - ICCS 2004: 4th International Conference, Krakow, Poland, June 6-9, 2004, Proceedings, Part II
-
H. Löf, M. Nordén, and S. Holmgren. Improving Geographical Locality of Data for Shared Memory Implementations of PDE Solvers. In Computational Science - ICCS 2004: 4th International Conference, Krakow, Poland, June 6-9, 2004, Proceedings, Part II, volume 3037 of LNCS, pages 9-16, http://www.springerlink.com/openurl.asp?genre=article&issn=0302-9743&volume=3037&spage=9, 2004.
-
(2004)
LNCS
, vol.3037
, pp. 9-16
-
-
Löf, H.1
Nordén, M.2
Holmgren, S.3
-
14
-
-
32844457970
-
Algorithmic optimizations of a parallel industrial CEM solver
-
Dept. of Information Technology, Uppsala University
-
H. Löf and J. Rantakokko. Algorithmic Optimizations of a Parallel Industrial CEM Solver. Technical report, Dept. of Information Technology, Uppsala University, 2004.
-
(2004)
Technical Report
-
-
Löf, H.1
Rantakokko, J.2
-
16
-
-
0034436544
-
A transparent runtime data distribution engine for OpenMP
-
D. S. Nikolopoulos, T. S. Papatheodorou, C. D. Polychronopoulos, J. Labarta, and E. Ayguade. A transparent runtime data distribution engine for OpenMP. Scientific Programming, 8:143-162, 2000.
-
(2000)
Scientific Programming
, vol.8
, pp. 143-162
-
-
Nikolopoulos, D.S.1
Papatheodorou, T.S.2
Polychronopoulos, C.D.3
Labarta, J.4
Ayguade, E.5
-
19
-
-
0025438154
-
Translation-lookaside buffer consistency
-
P. J. Teller. Translation-Lookaside Buffer Consistency. Computer, 23(6):26-36, 1990.
-
(1990)
Computer
, vol.23
, Issue.6
, pp. 26-36
-
-
Teller, P.J.1
-
20
-
-
23944431623
-
Using hardware counters to automatically improve memory performance
-
Washington, DC, USA, IEEE Computer Society
-
M. M. Tikir and J. K. Hollingsworth. Using Hardware Counters to Automatically Improve Memory Performance. In SC '04: Proceedings of the 2004 ACM/IEEE conference on Supercomputing, page 46, Washington, DC, USA, 2004. IEEE Computer Society.
-
(2004)
SC '04: Proceedings of the 2004 ACM/IEEE Conference on Supercomputing
, pp. 46
-
-
Tikir, M.M.1
Hollingsworth, J.K.2
-
21
-
-
0030263788
-
Operating system support for improving data locality on CC-NUMA compute servers
-
ACM Press
-
B. Verghese, S. Devine, A. Gupta, and M. Rosenblum. Operating system support for improving data locality on CC-NUMA compute servers. In Proceedings of the seventh international conference on Architectural support for programming languages and operating systems, pages 279-289. ACM Press, 1996.
-
(1996)
Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 279-289
-
-
Verghese, B.1
Devine, S.2
Gupta, A.3
Rosenblum, M.4
-
22
-
-
85129613053
-
Dynamic page placement to improve locality in CC-NUMA multiprocessors for TPC-C
-
New York, NY, USA, ACM Press
-
K. M. Wilson and B. B. Aglietti. Dynamic page placement to improve locality in CC-NUMA multiprocessors for TPC-C. In Supercomputing '01: Proceedings of the 2001 ACM/IEEE conference on Supercomputing, page 33, New York, NY, USA, 2001. ACM Press.
-
(2001)
Supercomputing '01: Proceedings of the 2001 ACM/IEEE Conference on Supercomputing
, pp. 33-33
-
-
Wilson, K.M.1
Aglietti, B.B.2