메뉴 건너뛰기




Volumn 30, Issue 4, 2002, Pages 225-255

Runtime vs. Manual Data Distribution for Architecture-Agnostic Shared-Memory Programming Models

Author keywords

Data distribution; OpenMP; Operating systems; Performance evaluation; Runtime systems

Indexed keywords

DATA DISTRIBUTION; OPERATING SYSTEMS; PERFORMANCE EVALUATION; RUNTIME SYSTEMS;

EID: 1942475956     PISSN: 08857458     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1019899812171     Document Type: Conference Paper
Times cited : (3)

References (39)
  • 4
    • 1942448564 scopus 로고    scopus 로고
    • Intel OpenMP C++/Fortran Compiler for Hyper-Threading Technology: Implementation and Performance
    • January
    • X. Tian, A. Bik, M. Girkar, P. Grey, H. Saito, and E. Su, Intel OpenMP C++/Fortran Compiler for Hyper-Threading Technology: Implementation and Performance. Intel Technology Journal, 6(1) (January 2002).
    • (2002) Intel Technology Journal , vol.6 , Issue.1
    • Tian, X.1    Bik, A.2    Girkar, M.3    Grey, P.4    Saito, H.5    Su, E.6
  • 9
    • 0029666629 scopus 로고    scopus 로고
    • STiNG: A CC-NUMA Computer System for the Commercial Marketplace
    • Philadelphia, Pennsylvania, May
    • T. Lovett and R. Clapp, STiNG: A CC-NUMA Computer System for the Commercial Marketplace, Proc. 23rd Int'l. Symp. Computer Architecture (ISCA'96), Philadelphia, Pennsylvania, pp. 308-317 (May 1996).
    • (1996) Proc. 23rd Int'l. Symp. Computer Architecture (ISCA'96) , pp. 308-317
    • Lovett, T.1    Clapp, R.2
  • 10
    • 0029666642 scopus 로고    scopus 로고
    • Application and Architectural Bottlenecks in Large-Scale Distributed Shared Memory Machines
    • Philadelphia, Pennsylvania, June
    • C. Holt, J. P. Singh, and J. Hennessy, Application and Architectural Bottlenecks in Large-Scale Distributed Shared Memory Machines, Proc. 23rd Int'l. Symp. Computer Architecture (ISCA'96), Philadelphia, Pennsylvania, pp. 134-145 (June 1996).
    • (1996) Proc. 23rd Int'l. Symp. Computer Architecture (ISCA'96) , pp. 134-145
    • Holt, C.1    Singh, J.P.2    Hennessy, J.3
  • 11
    • 0032671417 scopus 로고    scopus 로고
    • Scaling Application Performance on a Cache-Coherent Multiprocessor
    • Atlanta, Georgia, May
    • D. Jiang and J. P. Singh, Scaling Application Performance on a Cache-Coherent Multiprocessor, Proc. 26th Int'l. Symp. Computer Architecture (ISCA'99), Atlanta, Georgia, pp. 305-316 (May 1999).
    • (1999) Proc. 26th Int'l. Symp. Computer Architecture (ISCA'99) , pp. 305-316
    • Jiang, D.1    Singh, J.P.2
  • 12
    • 84937401938 scopus 로고    scopus 로고
    • Exploiting Data Locality on Scalable Shared Memory Machines with Data Parallel Programs
    • Munich, Germany, August
    • S. Benkner and T. Brandes, Exploiting Data Locality on Scalable Shared Memory Machines with Data Parallel Programs, Proc. 6th Int'l. EuroPar Conf. (EuroPar'2000), Munich, Germany, pp. 647-657 (August 2000).
    • (2000) Proc. 6th Int'l. EuroPar Conf. (EuroPar'2000) , pp. 647-657
    • Benkner, S.1    Brandes, T.2
  • 14
    • 0003573830 scopus 로고    scopus 로고
    • A User's View of OpenMP: The Good, The Bad and the Ugly
    • San Diego, California (July)
    • W. Gropp, A User's View of OpenMP: The Good, The Bad and the Ugly, Workshop on OpenMP Applications and Tools (WOMPAT'2000), San Diego, California (July 2000).
    • (2000) Workshop on OpenMP Applications and Tools (WOMPAT'2000)
    • Gropp, W.1
  • 24
    • 0003565855 scopus 로고    scopus 로고
    • High Performance FORTRAN Forum, High Performance FORTRAN Language Specification, Version 2.0
    • Center for Research on Parallel Computation, Rice University (January)
    • High Performance FORTRAN Forum, High Performance FORTRAN Language Specification, Version 2.0. Technical Report CRPCTR-92225, Center for Research on Parallel Computation, Rice University (January 1997).
    • (1997) Technical Report , vol.CRPCTR-92225
  • 25
    • 0003605996 scopus 로고
    • The NAS Parallel Benchmarks 2.0
    • Numerical Aerodynamic Simulation Facility, NASA Ames Research Center (December)
    • D. Bailey, T. Harris, W. Saphir, R. V. der Wijngaart, A. Woo, and M. Yarrow, The NAS Parallel Benchmarks 2.0, Technical Report NAS-95-020, Numerical Aerodynamic Simulation Facility, NASA Ames Research Center (December 1995).
    • (1995) Technical Report , vol.NAS-95-020
    • Bailey, D.1    Harris, T.2    Saphir, W.3    Der Wijngaart, R.V.4    Woo, A.5    Yarrow, M.6
  • 26
    • 0003648799 scopus 로고    scopus 로고
    • The OpenMP Implementation of the NAS Parallel Benchmarks and its Performance
    • NASA Ames Research Center, October
    • H. Jin, M. Frumkin, and J. Yan. The OpenMP Implementation of the NAS Parallel Benchmarks and its Performance, Technical Report NAS-99-011, NASA Ames Research Center, (October 1999).
    • (1999) Technical Report , vol.NAS-99-011
    • Jin, H.1    Frumkin, M.2    Yan, J.3
  • 29
    • 29144465623 scopus 로고    scopus 로고
    • December
    • Standard Performance Evaluation Corporation, SPEC CPU2000 Benchmarks. http://www.spec.org (December 2000).
    • (2000) SPEC CPU2000 Benchmarks
  • 31
    • 84976735460 scopus 로고
    • The Privatizing DOALL Test: A Run-Time Technique for DOALL Loop Identification and Array Privatization
    • Manchester, United Kingdom, July
    • L. Rauchwerger and D. Padua, The Privatizing DOALL Test: A Run-Time Technique for DOALL Loop Identification and Array Privatization, Proc. Eigth ACM Int'l. Conf. Supercomputing (ICS'94), Manchester, United Kingdom, pp. 33-43 (July 1994).
    • (1994) Proc. Eigth ACM Int'l. Conf. Supercomputing (ICS'94) , pp. 33-43
    • Rauchwerger, L.1    Padua, D.2
  • 32
    • 0023535689 scopus 로고
    • Guided Self-Scheduling: A Practical Scheduling Scheme for Parallel Supercomputers
    • December
    • C. Polychronopoulos and D. Kuck. Guided Self-Scheduling: A Practical Scheduling Scheme for Parallel Supercomputers, IEEE Trans. Comput., C-36(12):1485-1495 (December 1987).
    • (1987) IEEE Trans. Comput. , vol.C-36 , Issue.12 , pp. 1485-1495
    • Polychronopoulos, C.1    Kuck, D.2
  • 34
    • 0029218614 scopus 로고
    • Using Simple Page Placement Schemes to Reduce the Cost of Cache Fills in Coherent Shared-Memory Systems
    • Santa Barbara, California, April
    • M. Marchetti, L. Kontothanassis, R. Bianchini, and M. Scott. Using Simple Page Placement Schemes to Reduce the Cost of Cache Fills in Coherent Shared-Memory Systems, Proc. 9th IEEE Int'l. Parallel Proc. Symp. (IPPS'95), Santa Barbara, California, pp. 380-385 (April 1995).
    • (1995) Proc. 9th IEEE Int'l. Parallel Proc. Symp. (IPPS'95) , pp. 380-385
    • Marchetti, M.1    Kontothanassis, L.2    Bianchini, R.3    Scott, M.4
  • 35
    • 0028419803 scopus 로고
    • Using Processor Affinity in Loop Scheduling on Shared-Memory Multiprocessors
    • April
    • E. Markatos and T. LeBlanc. Using Processor Affinity in Loop Scheduling on Shared-Memory Multiprocessors. IEEE Trans. Parallel and Distributed Systems, 5(4):379-400 (April 1994).
    • (1994) IEEE Trans. Parallel and Distributed Systems , vol.5 , Issue.4 , pp. 379-400
    • Markatos, E.1    LeBlanc, T.2
  • 36
    • 0003750394 scopus 로고    scopus 로고
    • Implementation of NAS Parallel Benchmarks in High Performance FORTRAN
    • NASA Ames Research Center (September)
    • M. Frumkin, H. Jin, and J. Yan, Implementation of NAS Parallel Benchmarks in High Performance FORTRAN. Technical Report NAS-98-009, NASA Ames Research Center (September 1998).
    • (1998) Technical Report , vol.NAS-98-009
    • Frumkin, M.1    Jin, H.2    Yan, J.3
  • 38
    • 84966587922 scopus 로고    scopus 로고
    • Quantifying and Resolving Remote Memory Access Contention on Hardware DSM Multiprocessors
    • Fort Lauderdale, Florida (April)
    • D. Nikolopoulos, Quantifying and Resolving Remote Memory Access Contention on Hardware DSM Multiprocessors, Proc. 16th IEEE/ACM Int'l. Parallel and Distributed Proc. Symp. (IPDPS'02), Fort Lauderdale, Florida (April 2002).
    • (2002) Proc. 16th IEEE/ACM Int'l. Parallel and Distributed Proc. Symp. (IPDPS'02)
    • Nikolopoulos, D.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.