메뉴 건너뛰기




Volumn 2006, Issue , 2006, Pages 41-50

Landing OpenMP on cyclops-64: An efficient mapping of OpenMP to a many-core system-on-a-chip

Author keywords

Chip multiprocessor; OpenMP; Performance evaluation; Run time system; System on a chip

Indexed keywords

DATA STRUCTURES; EMBEDDED SYSTEMS; MICROPROCESSOR CHIPS; MULTIPROCESSING SYSTEMS; OPTIMIZATION;

EID: 34247379126     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1128022.1128030     Document Type: Conference Paper
Times cited : (20)

References (29)
  • 1
    • 34247373734 scopus 로고    scopus 로고
    • Evaluation of OpenMP for the Cyclops multithreaded architecture
    • OpenMP Shared Memory Parallel Programming: International Workshop on OpenMP Applications and Tools, WOMPAT 2003, of, Toronto, Canada, June 26-27
    • George S. Almási, Eduard Ayguadé, Cǎlin Caşcaval, José Castaños, Jesús Labarta, Francisco Martínez, Xavier Martorell, and José Moreira. Evaluation of OpenMP for the Cyclops multithreaded architecture. In OpenMP Shared Memory Parallel Programming: International Workshop on OpenMP Applications and Tools, WOMPAT 2003, volume 2716 of Lecture Notes in Computer Science., pages 69-83, Toronto, Canada, June 26-27, 2003.
    • (2003) Lecture Notes in Computer Science , vol.2716 , pp. 69-83
    • Almási, G.S.1    Ayguadé, E.2    Caşcaval, C.3    Castaños, J.4    Labarta, J.5    Martínez, F.6    Martorell, X.7    Moreira, J.8
  • 3
    • 0025211006 scopus 로고
    • The performance of spin lock alternatives for shared-memory multiprocessors
    • January
    • Thomas E. Anderson. The performance of spin lock alternatives for shared-memory multiprocessors. IEEE Transactions on Parallel and Distributed Systems, 1(1):6-16, January 1990.
    • (1990) IEEE Transactions on Parallel and Distributed Systems , vol.1 , Issue.1 , pp. 6-16
    • Anderson, T.E.1
  • 4
    • 0034290658 scopus 로고    scopus 로고
    • Performance characteristics for OpenMP constructs on different parallel computer architectures
    • Rudolf Berrendorf and Guido Nieken. Performance characteristics for OpenMP constructs on different parallel computer architectures. Concurrency - Practice and Experience, 12(12):1261-1273, 2000.
    • (2000) Concurrency - Practice and Experience , vol.12 , Issue.12 , pp. 1261-1273
    • Berrendorf, R.1    Nieken, G.2
  • 5
    • 34247381935 scopus 로고    scopus 로고
    • J. Mark Bull. Measuring synchronization and scheduling overheads in OpenMP. In Proceedings of the First European Workshop on OpenMP, Lund, Sweden, September 30 - October 1, 1999.
    • J. Mark Bull. Measuring synchronization and scheduling overheads in OpenMP. In Proceedings of the First European Workshop on OpenMP, Lund, Sweden, September 30 - October 1, 1999.
  • 6
    • 34247371741 scopus 로고    scopus 로고
    • Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao. FAST: A functionally accurate simulation toolset for the Cyclops64 cellular architecture. In Proceedings of the Workshop on Modeling, Benchmarking and Simulation, pages 11-20, Madison, Wisconsin, June 4, 2005. Held in conjunction with the 32nd Annual International Symposium on Computer Architecture.
    • Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao. FAST: A functionally accurate simulation toolset for the Cyclops64 cellular architecture. In Proceedings of the Workshop on Modeling, Benchmarking and Simulation, pages 11-20, Madison, Wisconsin, June 4, 2005. Held in conjunction with the 32nd Annual International Symposium on Computer Architecture.
  • 7
    • 33751064392 scopus 로고    scopus 로고
    • Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao. Toward a, software infrastructure for the Cyclops-64 cellular architecture. In Proceedings of the 20th International Symposium on High Performance Computing Systems and Applications, St. John's, Newfoundland and Labrador, Canada, May 14-17, 2006.
    • Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao. Toward a, software infrastructure for the Cyclops-64 cellular architecture. In Proceedings of the 20th International Symposium on High Performance Computing Systems and Applications, St. John's, Newfoundland and Labrador, Canada, May 14-17, 2006.
  • 8
    • 1142293069 scopus 로고    scopus 로고
    • Performance characteristics of OpenMP constructs, and application benchmarks on a large symmetric multiprocessor
    • New York, June 23-26
    • Nathan R. Fredrickson, Ahmad Afsahi, and Ying Qian. Performance characteristics of OpenMP constructs, and application benchmarks on a large symmetric multiprocessor. In Proceedings of the 2003 International Conference on Supercomputing, pages 140-149, New York, June 23-26 2003.
    • (2003) Proceedings of the 2003 International Conference on Supercomputing , pp. 140-149
    • Fredrickson, N.R.1    Afsahi, A.2    Qian, Y.3
  • 9
    • 0025438298 scopus 로고
    • Synchronization algorithms for shared-memory multiprocessors
    • June
    • Gary Graunke and Shreekant Thakkar. Synchronization algorithms for shared-memory multiprocessors. Computer, 23:60-69, June 1990.
    • (1990) Computer , vol.23 , pp. 60-69
    • Graunke, G.1    Thakkar, S.2
  • 11
    • 84956970069 scopus 로고    scopus 로고
    • A pragmatic implementation of non-blocking linked-lists
    • Proceedings of the 15th International Conference on Distributed Computing, number in, Lisbon, Portugal, October 3-5
    • Timothy L. Harris. A pragmatic implementation of non-blocking linked-lists. In Proceedings of the 15th International Conference on Distributed Computing, number 2180 in Lecture Notes in Computer Science, pages 300-314, Lisbon, Portugal, October 3-5, 2001.
    • (2001) Lecture Notes in Computer Science , vol.2180 , pp. 300-314
    • Harris, T.L.1
  • 13
    • 27544489038 scopus 로고    scopus 로고
    • Nonblocking memory management support for dynamic-sized data structures
    • May
    • Maurice Herlihy, Victor Luchangco, Paul Martin, and Mark Moir. Nonblocking memory management support for dynamic-sized data structures. ACM Transactions on Computer Systems, 23(2):146-196, May 2005.
    • (2005) ACM Transactions on Computer Systems , vol.23 , Issue.2 , pp. 146-196
    • Herlihy, M.1    Luchangco, V.2    Martin, P.3    Moir, M.4
  • 14
    • 0027262011 scopus 로고    scopus 로고
    • Maurice Herlihy and J. Eliot B. Moss. Transactional memory: Architectural support for lock-free data structures. In Proceedings of the 20th Annual International Symposium on Computer Architecture, pages 289-300, San Diego, California, May 17-19, 1993.
    • Maurice Herlihy and J. Eliot B. Moss. Transactional memory: Architectural support for lock-free data structures. In Proceedings of the 20th Annual International Symposium on Computer Architecture, pages 289-300, San Diego, California, May 17-19, 1993.
  • 15
    • 34247346411 scopus 로고
    • IBM system/370 extended architecture, publication no. SA22-7085
    • IBM system/370 extended architecture, Principle of operation, publication no. SA22-7085, 1983.
    • (1983) Principle of operation
  • 16
    • 0032627704 scopus 로고    scopus 로고
    • Evaluating synchronization on shared address space multiprocessors: Methodology and performance
    • June
    • Sanjeev Kumar, Dongming Jiang, Rohit Chandra, and Jaswinder Pal Singh. Evaluating synchronization on shared address space multiprocessors: Methodology and performance. ACM SIGMETRICS Performance Evaluation Review, 27(1):23-34, June 1999.
    • (1999) ACM SIGMETRICS Performance Evaluation Review , vol.27 , Issue.1 , pp. 23-34
    • Kumar, S.1    Jiang, D.2    Chandra, R.3    Pal Singh, J.4
  • 17
    • 84944046879 scopus 로고    scopus 로고
    • Performance evaluation of the Omni OpenMP compiler
    • Proceedings of the 3rd International Symposium on High Performance Computing, of, Tokyo, Japan, October 16-18
    • Kazuhiro Kusano, Shigehisa Satoh, and Mitsuhisa Sato. Performance evaluation of the Omni OpenMP compiler. In Proceedings of the 3rd International Symposium on High Performance Computing, volume 1940 of Lecture Notes in Computer Science, pages 403-414, Tokyo, Japan, October 16-18, 2000.
    • (2000) Lecture Notes in Computer Science , vol.1940 , pp. 403-414
    • Kusano, K.1    Satoh, S.2    Sato, M.3
  • 19
    • 84976718540 scopus 로고
    • Algorithms for scalable synchronization on shared-memory multiprocessors
    • February
    • John M. Mellor-Crummey and Michael L. Scott. Algorithms for scalable synchronization on shared-memory multiprocessors. ACM Transactions on Computer Systems, 9(1):21-65, February 1991.
    • (1991) ACM Transactions on Computer Systems , vol.9 , Issue.1 , pp. 21-65
    • Mellor-Crummey, J.M.1    Scott, M.L.2
  • 22
    • 3042671335 scopus 로고    scopus 로고
    • Hazard pointers: Safe memory reclamation for lock-free objects
    • Maged M. Michael. Hazard pointers: Safe memory reclamation for lock-free objects. IEEE Trans. Parallel Distrib. Syst, 15(6):491-504, 2004.
    • (2004) IEEE Trans. Parallel Distrib. Syst , vol.15 , Issue.6 , pp. 491-504
    • Michael, M.M.1
  • 24
    • 34247326336 scopus 로고    scopus 로고
    • Architecture Review Board. OpenMP FORTRAN application program interface
    • Technical Report 2.0, November
    • OpenMP Architecture Review Board. OpenMP FORTRAN application program interface. Technical Report 2.0, November 2000.
    • (2000)
    • Open, M.P.1
  • 25
    • 0037660155 scopus 로고    scopus 로고
    • OpenMP C and C++ application program interface
    • OpenMP Architecture Review Board, Technical Report 2.0, March
    • OpenMP Architecture Review Board. OpenMP C and C++ application program interface. Technical Report 2.0, March 2002.
    • (2002)
  • 26
    • 68749102026 scopus 로고    scopus 로고
    • Achal Prabhakar, Vladimir Getov, and Barbara Chapman. Performance comparisons of basic OpenMP constructs. In Proceedings of the 4th International Symposium on High Performance Computing, number 2327 in Lecture Notes in Computer Science, pages 413-424, Kansai Science City, Japan, May 15-17, 2002.
    • Achal Prabhakar, Vladimir Getov, and Barbara Chapman. Performance comparisons of basic OpenMP constructs. In Proceedings of the 4th International Symposium on High Performance Computing, number 2327 in Lecture Notes in Computer Science, pages 413-424, Kansai Science City, Japan, May 15-17, 2002.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.