SCOPUS Information Retrieval Platform

International Journal of Parallel Programming

Volume 29, Issue 3, 2001, Pages 249-282

The architectural and operating system implications on the performance of synchronization on ccNUMA multiprocessors

(2) Nikolopoulos, Dimitrios S. (a); Papatheodorou, Theodore S. (b)

a United States of America (United States)

b UNIVERSITY OF PATRAS (Greece)

Author keywords

ccNUMA; Operating systems; Performance evaluation; Shared memory multiprocessors; Synchronization

Indexed keywords

CCNUMA; PERFORMANCE EVALUATION; SHARED-MEMORY MULTIPROCESSORS; SYNCHRONIZATION ALGORITHMS;

ALGORITHMS; COMPUTER ARCHITECTURE; COMPUTER OPERATING SYSTEMS; COMPUTER SIMULATION; SCHEDULING; SOFTWARE ENGINEERING; SUPERVISORY AND EXECUTIVE PROGRAMS; SYNCHRONIZATION;

MULTIPROCESSING SYSTEMS;

EID: 2942721725 PISSN: 08857458 EISSN: None Source Type: Journal
DOI: None Document Type: Article

Times cited : (11)

References (44)

1
- 0032671417
- Scaling Application Performance on a Cache-Coherent Multiprocessor
- Atlanta, Georgia, May
- D. Jiang and J. P. Singh, Scaling Application Performance on a Cache-Coherent Multiprocessor, Proc. 26th Int'l. Symp. Computer Architecture (ISCA'99), Atlanta, Georgia, pp. 305-316 (May 1999).
- (1999) Proc. 26th Int'l. Symp. Computer Architecture (ISCA'99) , pp. 305-316
- Jiang, D.¹ Singh, J.P.²

2
- 84963769243
- Scal-Tool: Pinpointing and Quantifying Scalability Bottlenecks in DSM Multiprocessors
- Portland, Oregon, November
- Y. Solihin, V. Lahm, and J. Torrellas, Scal-Tool: Pinpointing and Quantifying Scalability Bottlenecks in DSM Multiprocessors, Proc. ACM/IEEE Supercomputing: High Performance Networking and Computing Conf. (SC'99), Portland, Oregon (November 1999).
- (1999) Proc. ACM/IEEE Supercomputing: High Performance Networking and Computing Conf. (SC'99)
- Solihin, Y.¹ Lahm, V.² Torrellas, J.³

3
- 0025211006
- The Performance of Spin Lock Alternatives for Shared-Memory Multiprocessors
- January
- T. Anderson, The Performance of Spin Lock Alternatives for Shared-Memory Multiprocessors, IEEE Trans. Parallel Distributed Syst., 1(1):6-16 (January 1990).
- (1990) IEEE Trans. Parallel Distributed Syst. , vol.1 , Issue.1 , pp. 6-16
- Anderson, T.¹

4
- 84976722900
- The Impact of Operating System Scheduling Policies and Synchronization Methods on the Performance of Parallel Applications
- San Diego, California, June
- A. Gupta, A. Tucker, and S. Urushibara, The Impact of Operating System Scheduling Policies and Synchronization Methods on the Performance of Parallel Applications, Proc. ACM SIGMETRICS Conf. Measurement and Modeling of Computer Systems (SIGMETRICS'91), San Diego, California, pp. 120-132 (June 1991).
- (1991) Proc. ACM SIGMETRICS Conf. Measurement and Modeling of Computer Systems (SIGMETRICS'91) , pp. 120-132
- Gupta, A.¹ Tucker, A.² Urushibara, S.³

5
- 84976718540
- Algorithms for Scalable Synchronization on Shared-Memory Multiprocessors
- February
- J. Mellor-Crummey and M. Scott, Algorithms for Scalable Synchronization on Shared-Memory Multiprocessors, ACM Trans. Computer Syst., 9(1):21-65 (February 1991).
- (1991) ACM Trans. Computer Syst. , vol.9 , Issue.1 , pp. 21-65
- Mellor-Crummey, J.¹ Scott, M.²

6
- 0032627704
- Evaluating Synchronization on Shared Address Space Multiprocessors: Methodology and Performance
- Atlanta, Georgia, May
- S. Kumar, D. Jiang, R. Chandra, and J. P. Singh, Evaluating Synchronization on Shared Address Space Multiprocessors: Methodology and Performance, Proc. ACM SIGMETRICS Conf. on Measurement and Modeling of Computer Systems (SIGMETRICS'99), Atlanta, Georgia, pp. 23-24 (May 1999).
- (1999) Proc. ACM SIGMETRICS Conf. on Measurement and Modeling of Computer Systems (SIGMETRICS'99) , pp. 23-24
- Kumar, S.¹ Jiang, D.² Chandra, R.³ Singh, J.P.⁴

7
- 0032644668
- A Quantitative Architectural Evaluation of Synchronization Algorithms and Disciplines on ccNUMA Systems: The Case of the SGI Origin2000
- Rhodes, Greece, June
- D. Nikolopoulos and T. Papatheodorou, A Quantitative Architectural Evaluation of Synchronization Algorithms and Disciplines on ccNUMA Systems: The Case of the SGI Origin2000, Proc. 13th ACM Int'l. Conf. Supercomputing (ICS'99), Rhodes, Greece, pp. 319-328 (June 1999).
- (1999) Proc. 13th ACM Int'l. Conf. Supercomputing (ICS'99) , pp. 319-328
- Nikolopoulos, D.¹ Papatheodorou, T.²

8
- 0030685587
- Efficient Synchronization: Let Them Eat QOLB
- Denver, Colorado, June
- A. Kägi and J. Goodman, Efficient Synchronization: Let Them Eat QOLB, Proc. 24th Int'l. Symp. Computer Architecture (ISCA'97), Denver, Colorado, pp. 171-181 (June 1997).
- (1997) Proc. 24th Int'l. Symp. Computer Architecture (ISCA'97) , pp. 171-181
- Kägi, A.¹ Goodman, J.²

9
- 0032762010
- MP-LOCKs: Replacing H/W Synchronization Primitives with Message Passing
- Orlando, Florida, January
- C. Kuo, J. Carter, and R. Kuramkote, MP-LOCKs: Replacing H/W Synchronization Primitives with Message Passing, Proc. Fifth Int'l. Symp. High Performance Computer Architecture (HPCA-5), Orlando, Florida, pp. 284-288 (January 1999).
- (1999) Proc. Fifth Int'l. Symp. High Performance Computer Architecture (HPCA-5) , pp. 284-288
- Kuo, C.¹ Carter, J.² Kuramkote, R.³

10
- 0034581396
- Improving the Throughput of Synchronization by Insertion of Delays
- Toulouse, France, January
- R. Rajwar, A. Kägi, and J. Goodman, Improving the Throughput of Synchronization by Insertion of Delays, Proc. Sixth Int'l. Symp. High Performance Computer Architecture (HPCA-6), Toulouse, France, pp. 156-165 (January 2000).
- (2000) Proc. Sixth Int'l. Symp. High Performance Computer Architecture (HPCA-6) , pp. 156-165
- Rajwar, R.¹ Kägi, A.² Goodman, J.³

11
- 84957872108
- The Impact of Speeding Up Critical Sections with Data Prefetching and Forwarding
- Bloomingdale, Illinois, August
- P. Trancoso and J. Torrellas, The Impact of Speeding Up Critical Sections with Data Prefetching and Forwarding, Proc. 1996 Int'l. Conf. Parallel Processing (ICPP'96), Bloomingdale, Illinois, pp. 79-86 (August 1996).
- (1996) Proc. 1996 Int'l. Conf. Parallel Processing (ICPP'96) , pp. 79-86
- Trancoso, P.¹ Torrellas, J.²

12
- 0012711061
- Eliminating Synchronization Overhead in Automatically Parallelized Programs Using Dynamic Feedback
- February
- P. Diniz and M. Rinard, Eliminating Synchronization Overhead in Automatically Parallelized Programs Using Dynamic Feedback, ACM Trans. Comp. Syst., 17(2):89-132 (February 1999).
- (1999) ACM Trans. Comp. Syst. , vol.17 , Issue.2 , pp. 89-132
- Diniz, P.¹ Rinard, M.²

13
- 0031162140
- A Circular List-Based Mutual Exclusion Scheme for Large Shared-Memory Multiprocessors
- June
- S. Fu and N. Tzeng, A Circular List-Based Mutual Exclusion Scheme for Large Shared-Memory Multiprocessors, IEEE Trans. Parallel Distributed Syst., 8(6):628-639 (June 1997).
- (1997) IEEE Trans. Parallel Distributed Syst. , vol.8 , Issue.6 , pp. 628-639
- Fu, S.¹ Tzeng, N.²

14
- 0032678895
- Fast and Fair Mutual Exclusion for Shared Memory Systems
- Austin, Texas
- T. Huang, Fast and Fair Mutual Exclusion for Shared Memory Systems, Proc. 19th IEEE Int'l. Conf. Distributed Computing Systems (ICDCS'99), Austin, Texas, pp. 224-231 (1999).
- (1999) Proc. 19th IEEE Int'l. Conf. Distributed Computing Systems (ICDCS'99) , pp. 224-231
- Huang, T.¹

15
- 80053227555
- Reactive Synchronization Algorithms for Multiprocessors
- San Jose, California, October
- B. Lim and A. Agarwal, Reactive Synchronization Algorithms for Multiprocessors, Proc. Sixth Int'l. Conf. Architectural Support for Progr. Lang. Oper. Syst. (ASPLOS-VI), San Jose, California, pp. 25-35 (October 1994).
- (1994) Proc. Sixth Int'l. Conf. Architectural Support for Progr. Lang. Oper. Syst. (ASPLOS-VI) , pp. 25-35
- Lim, B.¹ Agarwal, A.²

16
- 0031123529
- Isotach Networks
- April
- P. Reynolds, C. Williams, and R. Wagner, Isotach Networks, IEEE Trans. Parallel Distributed Syst., 8(4):337-348 (April 1997).
- (1997) IEEE Trans. Parallel Distributed Syst. , vol.8 , Issue.4 , pp. 337-348
- Reynolds, P.¹ Williams, C.² Wagner, R.³

17
- 70450049370
- Empirical Studies of Competitive Spinning for a Shared-Memory Multiprocessor
- Pacific Grove, California, October
- A. Karlin, K. Li, M. Manasse, and S. Owicki, Empirical Studies of Competitive Spinning for a Shared-Memory Multiprocessor, Proc. 13th ACM Symp. Oper. System Principles (SOSP'91), Pacific Grove, California, pp. 41-55 (October 1991).
- (1991) Proc. 13th ACM Symp. Oper. System Principles (SOSP'91) , pp. 41-55
- Karlin, A.¹ Li, K.² Manasse, M.³ Owicki, S.⁴

18
- 0031073624
- Scheduler-Conscious Synchronization
- February
- L. Kontothanassis, R. Wisniewski, and M. Scott, Scheduler-Conscious Synchronization, ACM Trans. Computer Syst., 15(1):3-40 (February 1997).
- (1997) ACM Trans. Computer Syst. , vol.15 , Issue.1 , pp. 3-40
- Kontothanassis, L.¹ Wisniewski, R.² Scott, M.³

19
- 0027646857
- Waiting Algorithms for Synchronization in Large-Scale Multiprocessors
- August
- B. H. Lim and A. Agarwal, Waiting Algorithms for Synchronization in Large-Scale Multiprocessors, ACM Trans. Computer Syst., 11(3):253-294 (August 1993).
- (1993) ACM Trans. Computer Syst. , vol.11 , Issue.3 , pp. 253-294
- Lim, B.H.¹ Agarwal, A.²

20
- 0001439212
- Gang Scheduling Performance Benefits for Fine-grain Synchronization
- December
- D. Feitelson and L. Rudolph, Gang Scheduling Performance Benefits for Fine-grain Synchronization, J. Parallel Distributed Computing, 16(4):306-318 (December 1992).
- (1992) J. Parallel Distributed Computing , vol.16 , Issue.4 , pp. 306-318
- Feitelson, D.¹ Rudolph, L.²

21
- 0020252515
- Scheduling Techniques for Concurrent Systems
- Miami, Florida, October
- J. Ousterhout, Scheduling Techniques for Concurrent Systems, Proc. Third Int'l. Conf. Distributed Computing Systems (ICDCS'82), Miami, Florida, pp. 22-30 (October 1982).
- (1982) Proc. Third Int'l. Conf. Distributed Computing Systems (ICDCS'82) , pp. 22-30
- Ousterhout, J.¹

22
- 0031628001
- Thread Scheduling for Multiprogrammed Multiprocessors
- Puerto Vallarta, Mexico, June
- N. Arora, R. Blumofe, and C. Greg-Plaxton, Thread Scheduling for Multiprogrammed Multiprocessors, Proc. Tenth ACM Symp. Parallel Algorithms and Architectures (SPAA'98), Puerto Vallarta, Mexico, pp. 119-129 (June 1998).
- (1998) Proc. Tenth ACM Symp. Parallel Algorithms and Architectures (SPAA'98) , pp. 119-129
- Arora, N.¹ Blumofe, R.² Greg-Plaxton, C.³

23
- 0025917643
- Wait-free Synchronization
- January
- M. Herlihy, Wait-free Synchronization, ACM Trans. Progr. Lang. Syst., 13(1): 124-149 (January 1991).
- (1991) ACM Trans. Progr. Lang. Syst. , vol.13 , Issue.1 , pp. 124-149
- Herlihy, M.¹

24
- 0031675416
- Managing Concurrent Access for Shared-Memory Active Messages
- Orlando, Florida, April
- S. Lumetta and D. Culler, Managing Concurrent Access for Shared-Memory Active Messages, Proc. of First IEEE/ACM Joint Int'l. Parallel Processing Symp. and Symp. Parallel and Distributed Processing (IPPS/SPDP'98), Orlando, Florida, pp. 272-276 (April 1998).
- (1998) Proc. of First IEEE/ACM Joint Int'l. Parallel Processing Symp. and Symp. Parallel and Distributed Processing (IPPS/SPDP'98) , pp. 272-276
- Lumetta, S.¹ Culler, D.²

25
- 0002477257
- Nonblocking Algorithms and Preemption-Safe Locking on Multiprogrammed Shared-Memory Multiprocessors
- May
- M. Michael and M. Scott, Nonblocking Algorithms and Preemption-Safe Locking on Multiprogrammed Shared-Memory Multiprocessors, J. Parallel Distributed Computing, 51(1):1-26 (May 1998).
- (1998) J. Parallel Distributed Computing , vol.51 , Issue.1 , pp. 1-26
- Michael, M.¹ Scott, M.²

26
- 0028436588
- A Nonblocking Algorithm for Shared Queues Using Compare-and-Swap
- May
- S. Prakash, D. Lee, and T. Johnson, A Nonblocking Algorithm for Shared Queues Using Compare-and-Swap, IEEE Trans. Computers, 43(5):548-559 (May 1994).
- (1994) IEEE Trans. Computers , vol.43 , Issue.5 , pp. 548-559
- Prakash, S.¹ Lee, D.² Johnson, T.³

27
- 0003433568
- Ph.D. thesis, Rensselaer Polytechnic Institute, Department of Computer Science
- J. Valois, Lock-Free Data Structures, Ph.D. thesis, Rensselaer Polytechnic Institute, Department of Computer Science (1995).
- (1995) Lock-Free Data Structures
- Valois, J.¹

28
- 0030685588
- The SGI Origin: A ccNUMA Highly Scalable Server
- Denver, Colorado, June
- J. Laudon and D. Lenoski, The SGI Origin: A ccNUMA Highly Scalable Server, Proc. 24th Int'l. Symp. Computer Architecture (ISCA'97), Denver, Colorado, pp. 241-251 (June 1997).
- (1997) Proc. 24th Int'l. Symp. Computer Architecture (ISCA'97) , pp. 241-251
- Laudon, J.¹ Lenoski, D.²

29
- 0033905330
- Fast Synchronization on Scalable Cache-Coherent Multiprocessors using Hybrid Primitives
- Cancun, Mexico, May
- D. Nikolopoulos and T. Papatheodorou, Fast Synchronization on Scalable Cache-Coherent Multiprocessors using Hybrid Primitives, Proc. 14th IEEE/ACM Int'l. Parallel and Distributed Processing Symp. (IPDPS'2000), Cancun, Mexico, pp. 711-719 (May 2000).
- (2000) Proc. 14th IEEE/ACM Int'l. Parallel and Distributed Processing Symp. (IPDPS'2000) , pp. 711-719
- Nikolopoulos, D.¹ Papatheodorou, T.²

30
- 0003662159
- Morgan Kaufmann
- D. Culler, J. P. Singh, and A. Gupta, Parallel Computer Architecture: A Hardware/Software Approach, Morgan Kaufmann (1998).
- (1998) Parallel Computer Architecture: A Hardware/Software Approach
- Culler, D.¹ Singh, J.P.² Gupta, A.³

31
- 0021183678
- Dynamic Decentralized Cache Schemes for MIMD Parallel Processors
- Ann Arbor, Michigan, June
- L. Rudolph and Z. Segall, Dynamic Decentralized Cache Schemes for MIMD Parallel Processors, Proc. 11th Int'l. Symp. Computer Architecture (ISCA'84), Ann Arbor, Michigan, pp. 340-347 (June 1984).
- (1984) Proc. 11th Int'l. Symp. Computer Architecture (ISCA'84) , pp. 340-347
- Rudolph, L.¹ Segall, Z.²

32
- 85034779732
- Axioms for Concurrent Objects
- Munich, Germany, January
- M. Herlihy and J. Wing, Axioms for Concurrent Objects, Proc. 14th ACM Symp. Principles Progr. Lang. (POPL'87), Munich, Germany, pp. 13-26 (January 1987).
- (1987) Proc. 14th ACM Symp. Principles Progr. Lang. (POPL'87) , pp. 13-26
- Herlihy, M.¹ Wing, J.²

33
- 0029723606
- Simple, Fast and Practical Nonblocking and Blocking Concurrent Queue Algorithms
- Philadelphia, Pennsylvania
- M. Michael and M. Scott, Simple, Fast and Practical Nonblocking and Blocking Concurrent Queue Algorithms, Proc. 15th ACM Symp. Principles of Distributed Computing (PODC'96), Philadelphia, Pennsylvania, pp. 267-276 (1996).
- (1996) Proc. 15th ACM Symp. Principles of Distributed Computing (PODC'96) , pp. 267-276
- Michael, M.¹ Scott, M.²

34
- 0011611820
- First-Class User-Level Threads
- Pacific Grove, California, October
- B. Marsh, M. Scott, T. LeBlanc, and E. Markatos, First-Class User-Level Threads, Proc. 13th ACM Symp. Oper. Syst. Principles (SOSP'91), Pacific Grove, California, pp. 110-121 (October 1991).
- (1991) Proc. 13th ACM Symp. Oper. Syst. Principles (SOSP'91) , pp. 110-121
- Marsh, B.¹ Scott, M.² LeBlanc, T.³ Markatos, E.⁴

35
- 38549118932
- A Tool to Schedule Parallel Applications on Multiprocessors: The NANOS CPU Manager
- in conjunction with IEEE IPDPS'2000, Lecture Notes in Computer Science, Cancun, Mexico, May
- X. Martorell, J. Corbalan, D. Nikolopoulos, N. Navarro, E. Polychronopoulos, T. Papatheodorou, and J. Labarta, A Tool to Schedule Parallel Applications on Multiprocessors: The NANOS CPU Manager, Proc. Sixth Workshop on Job Scheduling Strategies for Parallel Processing, in conjunction with IEEE IPDPS'2000, Lecture Notes in Computer Science, Vol. 1911, Cancun, Mexico, pp. 87-112 (May 2000).
- (2000) Proc. Sixth Workshop on Job Scheduling Strategies for Parallel Processing , vol.1911 , pp. 87-112
- Martorell, X.¹ Corbalan, J.² Nikolopoulos, D.³ Navarro, N.⁴ Polychronopoulos, E.⁵ Papatheodorou, T.⁶ Labarta, J.⁷

36
- 57649203666
- IRIX 6.5 Man Pages, November
- Silicon Graphics Inc. IRIX 6.5 Man Pages, http://techpubs.sgi.com (November 1999).
- (1999)

37
- 0029179077
- The SPLASH-2 Programs: Characterization and Methodological Considerations
- Santa Margherita Ligure, Italy, June
- S. C. Woo, M. Ohara, E. Torrie, J. P. Singh, and A. Gupta, The SPLASH-2 Programs: Characterization and Methodological Considerations, Proc. 22nd Int'l. Symp. Computer Architecture (ISCA'95), Santa Margherita Ligure, Italy, pp. 24-37 (June 1995).
- (1995) Proc. 22nd Int'l. Symp. Computer Architecture (ISCA'95) , pp. 24-37
- Woo, S.C.¹ Ohara, M.² Torrie, E.³ Singh, J.P.⁴ Gupta, A.⁵

38
- 10844272942
- December
- Standard Performance Evaluation Corporation, SPEC CPU95 Documentation. http://www.spec.org (December 1999).
- (1999) SPEC CPU95 Documentation

39
- 0032669174
- Thread Fork/Join Techniques for Multi-Level Parallelism Exploitation in NUMA Multiprocessors
- Rhodes, Greece, June
- X. Martorell, E. Ayguadé, N. Navarro, J. Corbalan, M. Gonzàlez, and J. Labarta, Thread Fork/Join Techniques for Multi-Level Parallelism Exploitation in NUMA Multiprocessors, Proc. 13th ACM Int'l. Conf. Supercomputing (ICS'99), Rhodes, Greece, pp. 294-301 (June 1999).
- (1999) Proc. 13th ACM Int'l. Conf. Supercomputing (ICS'99) , pp. 294-301
- Martorell, X.¹ Ayguadé, E.² Navarro, N.³ Corbalan, J.⁴ Gonzàlez, M.⁵ Labarta, J.⁶

40
- 0003554135
- November
- OpenMP Architecture Review Board, OpenMP Fortran Application Programming Interface, Version 1.1 (November 1999).
- (1999) OpenMP Fortran Application Programming Interface, Version 1.1

41
- 10844236981
- An Integrated Kernel and User-Level Paradigm for Efficient Multiprogramming
- University of Illinois at Urbana-Champaign, June
- D. Craig, An Integrated Kernel and User-Level Paradigm for Efficient Multiprogramming. Technical Report CSRD No. 1533, University of Illinois at Urbana-Champaign (June 1999).
- (1999) Technical Report CSRD No. 1533 , vol.1533
- Craig, D.¹

42
- 0000444590
- Evaluating the Performance of Cache-Affinity Scheduling in Shared-Memory Multiprocessors
- February
- J. Torrellas, A. Tucker, and A. Gupta, Evaluating the Performance of Cache-Affinity Scheduling in Shared-Memory Multiprocessors, J. Parallel Distributed Computing, 24(2): 139-151 (February 1995).
- (1995) J. Parallel Distributed Computing , vol.24 , Issue.2 , pp. 139-151
- Torrellas, J.¹ Tucker, A.² Gupta, A.³

43
- 0029199162
- Empirical Evaluation of the Cray T3D: A Compiler Perspective
- St. Margherita Ligure, Italy, June
- R. Arpaci, D. Culler, A. Krishnamurthy, S. Steinberg, and K. Yelick, Empirical Evaluation of the Cray T3D: A Compiler Perspective, Proc. 22nd Int'l. Symp. Computer Architecture (ISCA'95), St. Margherita Ligure, Italy, pp. 320-331 (June 1995).
- (1995) Proc. 22nd Int'l. Symp. Computer Architecture (ISCA'95) , pp. 320-331
- Arpaci, R.¹ Culler, D.² Krishnamurthy, A.³ Steinberg, S.⁴ Yelick, K.⁵

44
- 0030259457
- Synchronization and Communication in the T3E Multiprocessor
- Cambridge, Massachusetts, October
- S. Scott, Synchronization and Communication in the T3E Multiprocessor, Proc. Seventh Int'l. Conf. Architectural Support for Progr. Lang. Oper. Syst. (ASPLOS-VII), Cambridge, Massachusetts, pp. 26-36 (October 1996).
- (1996) Proc. Seventh Int'l. Conf. Architectural Support for Progr. Lang. Oper. Syst. (ASPLOS-VII) , pp. 26-36
- Scott, S.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.