SCOPUS Information Retrieval Platform

Annual ACM Symposium on Parallel Algorithms and Architectures

Volume 16, 2004, Pages 235-244

Effectively sharing a cache among threads

(2) Blelloch, Guy E. a; Gibbons, Phillip B. b

a CARNEGIE MELLON UNIVERSITY (United States)

b INTEL RESEARCH (United States)

Author keywords

Chip multiprocessors; Multithreaded architectures; Scheduling algorithms; Shared cache

Indexed keywords

COMPUTATION THEORY; LOGIC DESIGN; MICROPROCESSOR CHIPS; MULTIPROCESSING SYSTEMS; PROGRAM PROCESSORS; VIRTUAL STORAGE;

CHIP MULTIPROCESSORS; MULTITHREAD ARCHITECTURE; SCHEDULING ALGORITHMS; SHARED CACHE;

CACHE MEMORY;

EID: 8344240379 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1007912.1007948 Document Type: Conference Paper

Times cited : (76)

References (31)

1
- 0036590708
- The data locality of work stealing
- U. A. Acar, G. E. Blelloch, and R. D. Blumofe. The data locality of work stealing. Theory of Computing Systems, 35(3):321-347, 2002.
- (2002) Theory of Computing Systems , vol.35 , Issue.3 , pp. 321-347
- Acar, U.A.¹ Blelloch, G.E.² Blumofe, R.D.³

2
- 0024656760
- An analytical cache model
- A. Agarwal, M. Horowitz, and J. L. Hennessy. An analytical cache model. ACM Trans. on Computer Systems, 7(2):184-215, 1989.
- (1989) ACM Trans. on Computer Systems , vol.7 , Issue.2 , pp. 184-215
- Agarwal, A.¹ Horowitz, M.² Hennessy, J.L.³

3
- 0036038481
- Cache-oblivious priority queue and graph algorithm applications
- L. Arge, M. A. Bender, E. D. Demaine, B. Holland-Minkley, and J. I. Munro. Cache-oblivious priority queue and graph algorithm applications. In Proc. 34th ACM Symp. on Theory of Computing (STOC), pages 268-276, May 2002.
- (2002) Proc. 34th ACM Symp. on Theory of Computing (STOC) , pp. 268-276
- Arge, L.¹ Bender, M.A.² Demaine, E.D.³ Holland-Minkley, B.⁴ Munro, J.I.⁵

4
- 0033722744
- Piranha: A scalable architecture based on single-chip multiprocessing
- L. A. Barroso, K. Gharachorloo, R. McNamara, A. Nowatzyk, S. Qadeer, B. Sano, S. Smith, R. Stets, and B. Verghese. Piranha: A scalable architecture based on single-chip multiprocessing. In Proc. 27th ACM International Symp. on Computer Architecture (ISCA), pages 282-293, June 2000.
- (2000) Proc. 27th ACM International Symp. on Computer Architecture (ISCA) , pp. 282-293
- Barroso, L.A.¹ Gharachorloo, K.² McNamara, R.³ Nowatzyk, A.⁴ Qadeer, S.⁵ Sano, B.⁶ Smith, S.⁷ Stets, R.⁸ Verghese, B.⁹

5
- 0033713203
- Application-controlled paging for a shared cache
- R. D. Barve, E. F. Grove, and J. S. Vitter. Application-controlled paging for a shared cache. SIAM Journal on Computing, 29(4):1290-1303, 2000.
- (2000) SIAM Journal on Computing , vol.29 , Issue.4 , pp. 1290-1303
- Barve, R.D.¹ Grove, E.F.² Vitter, J.S.³

6
- 0003003638
- A study of replacement algorithms for virtual storage computers
- L. A. Belady. A study of replacement algorithms for virtual storage computers. IBM Systems Journal, 5(2):78-101, 1966.
- (1966) IBM Systems Journal , vol.5 , Issue.2 , pp. 78-101
- Belady, L.A.¹

7
- 0003575841
- Provably efficient scheduling for languages with fine-grained parallelism
- G. E. Blelloch, P. B. Gibbons, and Y. Matias. Provably efficient scheduling for languages with fine-grained parallelism. Journal of the ACM, 46(2):281-321, 1999.
- (1999) Journal of the ACM , vol.46 , Issue.2 , pp. 281-321
- Blelloch, G.E.¹ Gibbons, P.B.² Matias, Y.³

8
- 0030707347
- Space-efficient scheduling of parallelism with synchronization variables
- G. E. Blelloch, P. B. Gibbons, Y. Matias, and G. J. Narlikar. Space-efficient scheduling of parallelism with synchronization variables. In Proc. 9th ACM Symp. on Parallel Algorithms and Architectures (SPAA), pages 12-23, June 1997.
- (1997) Proc. 9th ACM Symp. on Parallel Algorithms and Architectures (SPAA) , pp. 12-23
- Blelloch, G.E.¹ Gibbons, P.B.² Matias, Y.³ Narlikar, G.J.⁴

9
- 0030387154
- An analysis of dag-consistent distributed shared-memory algorithms
- R. D. Blumofe, M. Frigo, C. F. Joerg, C. E. Leiserson, and K. H. Randall. An analysis of dag-consistent distributed shared-memory algorithms. In Proc. 8th ACM Symp. on Parallel Algorithms and Architectures (SPAA), pages 297-308, June 1996.
- (1996) Proc. 8th ACM Symp. on Parallel Algorithms and Architectures (SPAA) , pp. 297-308
- Blumofe, R.D.¹ Frigo, M.² Joerg, C.F.³ Leiserson, C.E.⁴ Randall, K.H.⁵

10
- 0000269759
- Scheduling multithreaded computations by work stealing
- R. D. Blumofe and C. E. Leiserson. Scheduling multithreaded computations by work stealing. Journal of the ACM, 46(5):720-748, 1999.
- (1999) Journal of the ACM , vol.46 , Issue.5 , pp. 720-748
- Blumofe, R.D.¹ Leiserson, C.E.²

11
- 0032095557
- Performance of shared caches on multithreaded architectures
- Y.-Y. Chen, J.-K. Peir, and C.-T. King. Performance of shared caches on multithreaded architectures. Journal of Information Science and Engineering, 14(2):499-514, 1998.
- (1998) Journal of Information Science and Engineering , vol.14 , Issue.2 , pp. 499-514
- Chen, Y.-Y.¹ Peir, J.-K.² King, C.-T.³

12
- 0034832577
- Low-contention depth-first scheduling of parallel computations with write-once synchronization variables
- P. Fatourou. Low-contention depth-first scheduling of parallel computations with write-once synchronization variables. In Proc. 13th ACM Symp. on Parallel Algorithms and Architectures (SPAA), pages 189-198, July 2001.
- (2001) Proc. 13th ACM Symp. on Parallel Algorithms and Architectures (SPAA) , pp. 189-198
- Fatourou, P.¹

13
- 0033350255
- Cache-oblivious algorithms
- M. Frigo, C. E. Leiserson, H. Prokop, and S. Ramachandran. Cache-oblivious algorithms. In Proc. 40th IEEE Symp. on Foundations of Computer Science (FOCS), pages 285-298, Oct. 1999.
- (1999) Proc. 40th IEEE Symp. on Foundations of Computer Science (FOCS) , pp. 285-298
- Frigo, M.¹ Leiserson, C.E.² Prokop, H.³ Ramachandran, S.⁴

14
- 0033880036
- The Stanford Hydra CMP
- L. Hammond, B. A. Hubbert, M. Siu, M. K. Prabhu, M. Chen, and K. Olukotun. The Stanford Hydra CMP. IEEE Micro, 20(2):71-84, 2000.
- (2000) IEEE Micro , vol.20 , Issue.2 , pp. 71-84
- Hammond, L.¹ Hubbert, B.A.² Siu, M.³ Prabhu, M.K.⁴ Chen, M.⁵ Olukotun, K.⁶

15
- 0031235242
- A single-chip multiprocessor
- L. Hammond, B. Nayfeh, and K. Olukotun. A single-chip multiprocessor. IEEE Computer, 30(9):79-85, 1997.
- (1997) IEEE Computer , vol.30 , Issue.9 , pp. 79-85
- Hammond, L.¹ Nayfeh, B.² Olukotun, K.³

16
- 84958948520
- Competitive analysis of paging
- S. Irani. Competitive analysis of paging. In Online Algorithms. Springer, 1998. LNCS, 1442:52-73.
- (1998) Online Algorithms , vol.1442 , pp. 52-73
- Irani, S.¹

17
- 0025429331
- Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
- N. P. Jouppi. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers. In Proc. 17th ACM International Symp. on Computer Architecture (ISCA), pages 364-373, May 1990.
- (1990) Proc. 17th ACM International Symp. on Computer Architecture (ISCA) , pp. 364-373
- Jouppi, N.P.¹

18
- 4644226743
- Simultaneous multi-threading implementation in POWER5
- R. Kalla, B. Sinharoy, and J. Tendler. Simultaneous multi-threading implementation in POWER5. In 15th IEEE Hot Chips, Aug. 2003.
- (2003) 15th IEEE Hot Chips
- Kalla, R.¹ Sinharoy, B.² Tendler, J.³

19
- 0001087280
- Hyper-threading technology architecture and microarchitecture, white paper
- D. T. Marr, F. Binns, D. L. Hill, G. Hinton, D. A. Koufaty, J. A. Miller, and M. Upton. Hyper-threading technology architecture and microarchitecture, white paper. Intel Technical Journal, 6(1), Feb. 2002.
- (2002) Intel Technical Journal , vol.6 , Issue.1
- Marr, D.T.¹ Binns, F.² Hill, D.L.³ Hinton, G.⁴ Koufaty, D.A.⁵ Miller, J.A.⁶ Upton, M.⁷

20
- 0034823696
- Towards a first vertical prototyping of an extremely fine-grained parallel programming approach
- D. Naishlos, J. Nuzman, C.-W. Tseng, and U. Vishkin. Towards a first vertical prototyping of an extremely fine-grained parallel programming approach. In Proc. 13th ACM Symp. on Parallel Algorithms and Architectures (SPAA), pages 93-102, July 2001.
- (2001) Proc. 13th ACM Symp. on Parallel Algorithms and Architectures (SPAA) , pp. 93-102
- Naishlos, D.¹ Nuzman, J.² Tseng, C.-W.³ Vishkin, U.⁴

21
- 0036489340
- Scheduling threads for low space requirement and good locality
- G. J. Narlikar. Scheduling threads for low space requirement and good locality. Theory of Computing Systems, 35(2):151-187, 2002.
- (2002) Theory of Computing Systems , vol.35 , Issue.2 , pp. 151-187
- Narlikar, G.J.¹

22
- 0040362680
- Space-efficient scheduling of nested parallelism
- G. J. Narlikar and G. E. Blelloch. Space-efficient scheduling of nested parallelism. ACM Trans. on Programming Languages and Systems, 21(1):138-173, 1999.
- (1999) ACM Trans. on Programming Languages and Systems , vol.21 , Issue.1 , pp. 138-173
- Narlikar, G.J.¹ Blelloch, G.E.²

23
- 0025629433
- Analysis of multithreaded architectures for parallel computing
- R. H. Saavedra-Barrera, D. E. Culler, and T. von Eicken. Analysis of multithreaded architectures for parallel computing. In Proc. 2nd ACM Symp. on Parallel Algorithms and Architectures (SPAA), pages 169-178, July 1990.
- (1990) Proc. 2nd ACM Symp. on Parallel Algorithms and Architectures (SPAA) , pp. 169-178
- Saavedra-Barrera, R.H.¹ Culler, D.E.² von Eicken, T.³

24
- 0022012946
- Amortized efficiency of list update and paging rules
- D. D. Sleator and R. E. Tarjan. Amortized efficiency of list update and paging rules. Communications of the ACM, 28(2):202-208, 1985.
- (1985) Communications of the ACM , vol.28 , Issue.2 , pp. 202-208
- Sleator, D.D.¹ Tarjan, R.E.²

25
- 0034826142
- Analytical cache models with application to cache partitioning
- G. E. Suh, S. Devadas, and L. Rudolph. Analytical cache models with application to cache partitioning. In Proc. 2001 ACM International Conference on Supercomputing, pages 1-12, June 2001.
- (2001) Proc. 2001 ACM International Conference on Supercomputing , pp. 1-12
- Suh, G.E.¹ Devadas, S.² Rudolph, L.³

26
- 1642371317
- Dynamic partitioning of shared cache memory
- G. E. Suh, L. Rudolph, and S. Devadas. Dynamic partitioning of shared cache memory. Journal of Supercomputing, 28(1):7-26, 2004.
- (2004) Journal of Supercomputing , vol.28 , Issue.1 , pp. 7-26
- Suh, G.E.¹ Rudolph, L.² Devadas, S.³

27
- 8344224107
- Power4 system microarchitecture, technical white paper
- J. M. Tendler, S. Dodson, S. Fields, H. Le, and B. Sinharoy. Power4 system microarchitecture, technical white paper. Technical Report 20, IBM Server Group, Oct. 2001.
- (2001) Technical Report , vol.20
- Tendler, J.M.¹ Dodson, S.² Fields, S.³ Le, H.⁴ Sinharoy, B.⁵

28
- 0023456387
- Footprints in the cache
- D. Thibaut and H. S. Stone. Footprints in the cache. ACM Trans. on Computer Systems, 5(4):305-329, 1987.
- (1987) ACM Trans. on Computer Systems , vol.5 , Issue.4 , pp. 305-329
- Thibaut, D.¹ Stone, H.S.²

29
- 0026881152
- Improving disk cache hit-ratios through cache partitioning
- D. Thibaut and H. S. Stone. Improving disk cache hit-ratios through cache partitioning. IEEE Transactions on Computers, 41(6):665-676, 1992.
- (1992) IEEE Transactions on Computers , vol.41 , Issue.6 , pp. 665-676
- Thibaut, D.¹ Stone, H.S.²

30
- 0034316177
- The MAJC architecture: A synthesis of parallelism and scalability
- M. Tremblay, J. Chan, S. Chaudhry, A. W. Conigliaro, and S. S. Tse. The MAJC architecture: A synthesis of parallelism and scalability. IEEE Micro, 20(6):12-25, 2000.
- (2000) IEEE Micro , vol.20 , Issue.6 , pp. 12-25
- Tremblay, M.¹ Chan, J.² Chaudhry, S.³ Conigliaro, A.W.⁴ Tse, S.S.⁵

31
- 0029183524
- Simultaneous multithreading: Maximizing on-chip parallelism
- D. M. Tullsen, S. J. Eggers, and H. M. Levy. Simultaneous multithreading: Maximizing on-chip parallelism. In Proc. 22nd ACM International Symp. on Computer Architecture (ISCA), pages 392-403, June 1995.
- (1995) Proc. 22nd ACM International Symp. on Computer Architecture (ISCA) , pp. 392-403
- Tullsen, D.M.¹ Eggers, S.J.² Levy, H.M.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.