-
1
-
-
34247373734
-
Evaluation of OpenMP for the Cyclops multithreaded architecture
-
OpenMP Shared Memory Parallel Programming: International Workshop on OpenMP Applications and Tools, WOMPAT 2003, of, Toronto, Canada, June 26-27
-
George S. Almási, Eduard Ayguadé, Cǎlin Caşcaval, José Castaños, Jesús Labarta, Francisco Martínez, Xavier Martorell, and José Moreira. Evaluation of OpenMP for the Cyclops multithreaded architecture. In OpenMP Shared Memory Parallel Programming: International Workshop on OpenMP Applications and Tools, WOMPAT 2003, volume 2716 of Lecture Notes in Computer Science., pages 69-83, Toronto, Canada, June 26-27, 2003.
-
(2003)
Lecture Notes in Computer Science
, vol.2716
, pp. 69-83
-
-
Almási, G.S.1
Ayguadé, E.2
Caşcaval, C.3
Castaños, J.4
Labarta, J.5
Martínez, F.6
Martorell, X.7
Moreira, J.8
-
2
-
-
11144357668
-
Demonstrating the scalability of a molecular dynamics application on a petaflops computer
-
August
-
George S. Almási, Cǎlin Caşcaval, José G. Castaños, Monty Denneau, Wilm Donath, Maria Eleftheriou, Mark Giampapa, Howard Ho, Derek Lieber, José E. Moreira, Dennis Newns, Marc Snir, and Henry S. Warren, Jr. Demonstrating the scalability of a molecular dynamics application on a petaflops computer. International Journal of Parallel Programming, 30(4):317-351, August 2002.
-
(2002)
International Journal of Parallel Programming
, vol.30
, Issue.4
, pp. 317-351
-
-
Almási, G.S.1
Caşcaval, C.2
Castaños, J.G.3
Denneau, M.4
Donath, W.5
Eleftheriou, M.6
Giampapa, M.7
Ho, H.8
Lieber, D.9
Moreira, J.E.10
Newns, D.11
Snir, M.12
Warren Jr., H.S.13
-
3
-
-
0025211006
-
The performance of spin lock alternatives for shared-memory multiprocessors
-
January
-
Thomas E. Anderson. The performance of spin lock alternatives for shared-memory multiprocessors. IEEE Transactions on Parallel and Distributed Systems, 1(1):6-16, January 1990.
-
(1990)
IEEE Transactions on Parallel and Distributed Systems
, vol.1
, Issue.1
, pp. 6-16
-
-
Anderson, T.E.1
-
4
-
-
0034290658
-
Performance characteristics for OpenMP constructs on different parallel computer architectures
-
Rudolf Berrendorf and Guido Nieken. Performance characteristics for OpenMP constructs on different parallel computer architectures. Concurrency - Practice and Experience, 12(12):1261-1273, 2000.
-
(2000)
Concurrency - Practice and Experience
, vol.12
, Issue.12
, pp. 1261-1273
-
-
Berrendorf, R.1
Nieken, G.2
-
5
-
-
34247381935
-
-
J. Mark Bull. Measuring synchronization and scheduling overheads in OpenMP. In Proceedings of the First European Workshop on OpenMP, Lund, Sweden, September 30 - October 1, 1999.
-
J. Mark Bull. Measuring synchronization and scheduling overheads in OpenMP. In Proceedings of the First European Workshop on OpenMP, Lund, Sweden, September 30 - October 1, 1999.
-
-
-
-
6
-
-
34247371741
-
-
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao. FAST: A functionally accurate simulation toolset for the Cyclops64 cellular architecture. In Proceedings of the Workshop on Modeling, Benchmarking and Simulation, pages 11-20, Madison, Wisconsin, June 4, 2005. Held in conjunction with the 32nd Annual International Symposium on Computer Architecture.
-
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao. FAST: A functionally accurate simulation toolset for the Cyclops64 cellular architecture. In Proceedings of the Workshop on Modeling, Benchmarking and Simulation, pages 11-20, Madison, Wisconsin, June 4, 2005. Held in conjunction with the 32nd Annual International Symposium on Computer Architecture.
-
-
-
-
7
-
-
33751064392
-
-
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao. Toward a, software infrastructure for the Cyclops-64 cellular architecture. In Proceedings of the 20th International Symposium on High Performance Computing Systems and Applications, St. John's, Newfoundland and Labrador, Canada, May 14-17, 2006.
-
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao. Toward a, software infrastructure for the Cyclops-64 cellular architecture. In Proceedings of the 20th International Symposium on High Performance Computing Systems and Applications, St. John's, Newfoundland and Labrador, Canada, May 14-17, 2006.
-
-
-
-
8
-
-
1142293069
-
Performance characteristics of OpenMP constructs, and application benchmarks on a large symmetric multiprocessor
-
New York, June 23-26
-
Nathan R. Fredrickson, Ahmad Afsahi, and Ying Qian. Performance characteristics of OpenMP constructs, and application benchmarks on a large symmetric multiprocessor. In Proceedings of the 2003 International Conference on Supercomputing, pages 140-149, New York, June 23-26 2003.
-
(2003)
Proceedings of the 2003 International Conference on Supercomputing
, pp. 140-149
-
-
Fredrickson, N.R.1
Afsahi, A.2
Qian, Y.3
-
9
-
-
0025438298
-
Synchronization algorithms for shared-memory multiprocessors
-
June
-
Gary Graunke and Shreekant Thakkar. Synchronization algorithms for shared-memory multiprocessors. Computer, 23:60-69, June 1990.
-
(1990)
Computer
, vol.23
, pp. 60-69
-
-
Graunke, G.1
Thakkar, S.2
-
11
-
-
84956970069
-
A pragmatic implementation of non-blocking linked-lists
-
Proceedings of the 15th International Conference on Distributed Computing, number in, Lisbon, Portugal, October 3-5
-
Timothy L. Harris. A pragmatic implementation of non-blocking linked-lists. In Proceedings of the 15th International Conference on Distributed Computing, number 2180 in Lecture Notes in Computer Science, pages 300-314, Lisbon, Portugal, October 3-5, 2001.
-
(2001)
Lecture Notes in Computer Science
, vol.2180
, pp. 300-314
-
-
Harris, T.L.1
-
12
-
-
8344241113
-
A scalable lock-free stack algorithm
-
Barcelona, Spain, June 27-30
-
Danny Hendler, Nir Shavit, and Lena Yerushalmi. A scalable lock-free stack algorithm. In Proceedings of the 16th Annual ACM Symposium on Parallel Algorithms and Architectures, pages 206-215, Barcelona, Spain, June 27-30, 2004.
-
(2004)
Proceedings of the 16th Annual ACM Symposium on Parallel Algorithms and Architectures
, pp. 206-215
-
-
Hendler, D.1
Shavit, N.2
Yerushalmi, L.3
-
13
-
-
27544489038
-
Nonblocking memory management support for dynamic-sized data structures
-
May
-
Maurice Herlihy, Victor Luchangco, Paul Martin, and Mark Moir. Nonblocking memory management support for dynamic-sized data structures. ACM Transactions on Computer Systems, 23(2):146-196, May 2005.
-
(2005)
ACM Transactions on Computer Systems
, vol.23
, Issue.2
, pp. 146-196
-
-
Herlihy, M.1
Luchangco, V.2
Martin, P.3
Moir, M.4
-
14
-
-
0027262011
-
-
Maurice Herlihy and J. Eliot B. Moss. Transactional memory: Architectural support for lock-free data structures. In Proceedings of the 20th Annual International Symposium on Computer Architecture, pages 289-300, San Diego, California, May 17-19, 1993.
-
Maurice Herlihy and J. Eliot B. Moss. Transactional memory: Architectural support for lock-free data structures. In Proceedings of the 20th Annual International Symposium on Computer Architecture, pages 289-300, San Diego, California, May 17-19, 1993.
-
-
-
-
15
-
-
34247346411
-
-
IBM system/370 extended architecture, publication no. SA22-7085
-
IBM system/370 extended architecture, Principle of operation, publication no. SA22-7085, 1983.
-
(1983)
Principle of operation
-
-
-
16
-
-
0032627704
-
Evaluating synchronization on shared address space multiprocessors: Methodology and performance
-
June
-
Sanjeev Kumar, Dongming Jiang, Rohit Chandra, and Jaswinder Pal Singh. Evaluating synchronization on shared address space multiprocessors: Methodology and performance. ACM SIGMETRICS Performance Evaluation Review, 27(1):23-34, June 1999.
-
(1999)
ACM SIGMETRICS Performance Evaluation Review
, vol.27
, Issue.1
, pp. 23-34
-
-
Kumar, S.1
Jiang, D.2
Chandra, R.3
Pal Singh, J.4
-
17
-
-
84944046879
-
Performance evaluation of the Omni OpenMP compiler
-
Proceedings of the 3rd International Symposium on High Performance Computing, of, Tokyo, Japan, October 16-18
-
Kazuhiro Kusano, Shigehisa Satoh, and Mitsuhisa Sato. Performance evaluation of the Omni OpenMP compiler. In Proceedings of the 3rd International Symposium on High Performance Computing, volume 1940 of Lecture Notes in Computer Science, pages 403-414, Tokyo, Japan, October 16-18, 2000.
-
(2000)
Lecture Notes in Computer Science
, vol.1940
, pp. 403-414
-
-
Kusano, K.1
Satoh, S.2
Sato, M.3
-
19
-
-
84976718540
-
Algorithms for scalable synchronization on shared-memory multiprocessors
-
February
-
John M. Mellor-Crummey and Michael L. Scott. Algorithms for scalable synchronization on shared-memory multiprocessors. ACM Transactions on Computer Systems, 9(1):21-65, February 1991.
-
(1991)
ACM Transactions on Computer Systems
, vol.9
, Issue.1
, pp. 21-65
-
-
Mellor-Crummey, J.M.1
Scott, M.L.2
-
22
-
-
3042671335
-
Hazard pointers: Safe memory reclamation for lock-free objects
-
Maged M. Michael. Hazard pointers: Safe memory reclamation for lock-free objects. IEEE Trans. Parallel Distrib. Syst, 15(6):491-504, 2004.
-
(2004)
IEEE Trans. Parallel Distrib. Syst
, vol.15
, Issue.6
, pp. 491-504
-
-
Michael, M.M.1
-
23
-
-
0029723606
-
Simple, fast, and practical non-blocking and blocking concurrent queue algorithms
-
New York, USA, May
-
Maged M. Michael and Michael L. Scott. Simple, fast, and practical non-blocking and blocking concurrent queue algorithms. In Proceedings of the 15th Annual ACM Symposium on Principles of Distributed Computing, pages 267-275, New York, USA, May 1996.
-
(1996)
Proceedings of the 15th Annual ACM Symposium on Principles of Distributed Computing
, pp. 267-275
-
-
Michael, M.M.1
Scott, M.L.2
-
24
-
-
34247326336
-
Architecture Review Board. OpenMP FORTRAN application program interface
-
Technical Report 2.0, November
-
OpenMP Architecture Review Board. OpenMP FORTRAN application program interface. Technical Report 2.0, November 2000.
-
(2000)
-
-
Open, M.P.1
-
25
-
-
0037660155
-
OpenMP C and C++ application program interface
-
OpenMP Architecture Review Board, Technical Report 2.0, March
-
OpenMP Architecture Review Board. OpenMP C and C++ application program interface. Technical Report 2.0, March 2002.
-
(2002)
-
-
-
26
-
-
68749102026
-
-
Achal Prabhakar, Vladimir Getov, and Barbara Chapman. Performance comparisons of basic OpenMP constructs. In Proceedings of the 4th International Symposium on High Performance Computing, number 2327 in Lecture Notes in Computer Science, pages 413-424, Kansai Science City, Japan, May 15-17, 2002.
-
Achal Prabhakar, Vladimir Getov, and Barbara Chapman. Performance comparisons of basic OpenMP constructs. In Proceedings of the 4th International Symposium on High Performance Computing, number 2327 in Lecture Notes in Computer Science, pages 413-424, Kansai Science City, Japan, May 15-17, 2002.
-
-
-
-
27
-
-
33746289072
-
Optimizing NANOS OpenMP for the IBM Cyclops multithreaded architecture
-
Denver, Colorado, April 4-8
-
David Ródenas, Xavier Martorell, Eduard Ayguadé, Jesús Labarta, George Almási, Cǎlin Caşcaval, José Castaños, and José Moreira. Optimizing NANOS OpenMP for the IBM Cyclops multithreaded architecture. In Proceedings of the 19th International Parallel and Distributed Processing Symposium, page 110, Denver, Colorado, April 4-8, 2005.
-
(2005)
Proceedings of the 19th International Parallel and Distributed Processing Symposium
, pp. 110
-
-
Ródenas, D.1
Martorell, X.2
Ayguadé, E.3
Labarta, J.4
Almási, G.5
Caşcaval, C.6
Castaños, J.7
Moreira, J.8
-
28
-
-
0021183678
-
Dynamic decentralized cache schemes for MIMD parallel processors
-
Ann Arbor, Michigan, June 5-7
-
Larry Rudolph and Zary Segall. Dynamic decentralized cache schemes for MIMD parallel processors. In Proceedings of the 11th Annual International Symposium on Computer Architecture, pages 340-347, Ann Arbor, Michigan, June 5-7, 1984.
-
(1984)
Proceedings of the 11th Annual International Symposium on Computer Architecture
, pp. 340-347
-
-
Rudolph, L.1
Segall, Z.2
-
29
-
-
0029181248
-
Lock-free linked lists using compare-and-swap
-
Ottawa, Ontario, Canada, August 2-23
-
John D. Valois. Lock-free linked lists using compare-and-swap. In Proceedings of the 14th Annual ACM Symposium of Distributed Computing, pages 214-222, Ottawa, Ontario, Canada, August 2-23, 1995.
-
(1995)
Proceedings of the 14th Annual ACM Symposium of Distributed Computing
, pp. 214-222
-
-
Valois, J.D.1
|