-
1
-
-
85075101839
-
-
W. J. Bolosky and M. L. Scott. False sharing and its effect on shared memory performance. In Proceedings of the USENIX SEDMS IV Conference, 1993.
-
W. J. Bolosky and M. L. Scott. False sharing and its effect on shared memory performance. In Proceedings of the USENIX SEDMS IV Conference, 1993.
-
-
-
-
2
-
-
0031600410
-
Cache-conscious data placement
-
New York, NY, USA, ACM Press
-
B. Calder, C. Krintz, S. John, and T. Austin. Cache-conscious data placement. In ASPLOS-VIII: Proceedings of the eighth international conference on Architectural support for programming languages and operating systems, pages 139-149, New York, NY, USA, 1998. ACM Press.
-
(1998)
ASPLOS-VIII: Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
, pp. 139-149
-
-
Calder, B.1
Krintz, C.2
John, S.3
Austin, T.4
-
3
-
-
17244376579
-
Cacheconscious structure definition
-
New York, NY, USA, ACM Press
-
T. M. Chilimbi, B. Davidson, and J. R. Larus. Cacheconscious structure definition. In PLDI '99: Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation, pages 13-24, New York, NY, USA, 1999. ACM Press.
-
(1999)
PLDI '99: Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation
, pp. 13-24
-
-
Chilimbi, T.M.1
Davidson, B.2
Larus, J.R.3
-
4
-
-
0032667164
-
Cache-conscious structure layout
-
New York, NY, USA, ACM Press
-
T. M. Chilimbi, M. D. Hill, and J. R. Larus. Cache-conscious structure layout. In PLDI '99: Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation, pages 1-12, New York, NY, USA, 1999. ACM Press.
-
(1999)
PLDI '99: Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation
, pp. 1-12
-
-
Chilimbi, T.M.1
Hill, M.D.2
Larus, J.R.3
-
5
-
-
33745218336
-
Cache-conscious coallocation of hot data streams
-
T. M. Chilimbi and R. Shaham. Cache-conscious coallocation of hot data streams. SIGPLAN Not., 41(6):252-262, 2006.
-
(2006)
SIGPLAN Not
, vol.41
, Issue.6
, pp. 252-262
-
-
Chilimbi, T.M.1
Shaham, R.2
-
6
-
-
0027309861
-
The detection and elimination of useless misses in multiprocessors
-
New York, NY, USA, ACM Press
-
M. Dubois, J. Skeppstedt, L. Ricciulli, K. Ramamurthy, and P. Stenstroemm. The detection and elimination of useless misses in multiprocessors. In WCA '93: Proceedings of the 20th annual international symposium on Computer architecture, pages 88-97, New York, NY, USA, 1993. ACM Press.
-
(1993)
WCA '93: Proceedings of the 20th annual international symposium on Computer architecture
, pp. 88-97
-
-
Dubois, M.1
Skeppstedt, J.2
Ricciulli, L.3
Ramamurthy, K.4
Stenstroemm, P.5
-
7
-
-
0034290539
-
A framework for performance analysis tools
-
R. Hundt. HP Caliper: A framework for performance analysis tools. IEEE Concurrency, 8(4):64-71, 2000.
-
(2000)
IEEE Concurrency
, vol.8
, Issue.4
, pp. 64-71
-
-
Hundt, R.1
Caliper, H.P.2
-
8
-
-
84885965517
-
Practical structure layout optimization and advice
-
Washington, DC, USA, IEEE Computer Society
-
R. Hundt, S. Mannarswamy, and D. Cnakrabarti. Practical structure layout optimization and advice. In CGO '06: Proceedings of the International Symposium on Code Generation and Optimization, pages 233-244, Washington, DC, USA, 2006. IEEE Computer Society.
-
(2006)
CGO '06: Proceedings of the International Symposium on Code Generation and Optimization
, pp. 233-244
-
-
Hundt, R.1
Mannarswamy, S.2
Cnakrabarti, D.3
-
9
-
-
0029192199
-
Reducing false sharing on shared memory multiprocessors through compile time data transformations
-
New York, NY, USA, ACM Press
-
T. E. Jeremiassen and S. J. Eggers. Reducing false sharing on shared memory multiprocessors through compile time data transformations. In PPOPP '95: Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming, pages 179-188, New York, NY, USA, 1995. ACM Press.
-
(1995)
PPOPP '95: Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
, pp. 179-188
-
-
Jeremiassen, T.E.1
Eggers, S.J.2
-
10
-
-
0029537212
-
A dynamic cache sub-block design to reduce false sharing
-
00:313
-
M. Kadiyala and L. Bhuyan. A dynamic cache sub-block design to reduce false sharing, iced, 00:313, 1995.
-
(1995)
iced
-
-
Kadiyala, M.1
Bhuyan, L.2
-
11
-
-
0001495548
-
Automated data-member layout of heap objects to improve memory-hierarchy performance
-
T. Kistler and M. Franz. Automated data-member layout of heap objects to improve memory-hierarchy performance. ACMTrans. Program. Lang. Syst., 22(3):490-505, 2000.
-
(2000)
ACMTrans. Program. Lang. Syst
, vol.22
, Issue.3
, pp. 490-505
-
-
Kistler, T.1
Franz, M.2
-
12
-
-
34247172677
-
Whole program optimization of global variable layout
-
Seattle, WA, USA, IEEE Computer Society
-
N. McIntosh, S. Mannarswamy, and R. Hundt. Whole program optimization of global variable layout. In PACT '06: Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques, Seattle, WA, USA, 2006. IEEE Computer Society.
-
(2006)
PACT '06: Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques
-
-
McIntosh, N.1
Mannarswamy, S.2
Hundt, R.3
-
13
-
-
3042558367
-
Syzygy - a framework for scalable cross-module IPO
-
Washington, DC, USA, IEEE Computer Society
-
S. Moon, X. D. Li, R. Hundt, D. R. Chakrabarti, L. A. Lozano, U. Srinivasan, and S.-M. Liu. Syzygy - a framework for scalable cross-module IPO. In CGO '04: Proceedings of the international symposium on Code generation and optimization, page 65, Washington, DC, USA, 2004. IEEE Computer Society.
-
(2004)
CGO '04: Proceedings of the international symposium on Code generation and optimization
, pp. 65
-
-
Moon, S.1
Li, X.D.2
Hundt, R.3
Chakrabarti, D.R.4
Lozano, L.A.5
Srinivasan, U.6
Liu, S.-M.7
-
14
-
-
0036040711
-
An efficient profile-analysis framework for data-layout optimizations
-
New York, NY, USA, ACM Press
-
S. Rubin, R. Bodik, and T. Chilimbi. An efficient profile-analysis framework for data-layout optimizations. In POPL '02: Proceedings of the 29th ACM SIGPLAN-SIGACT symposium on Principles of programming languages, pages 140-153, New York, NY, USA, 2002. ACM Press.
-
(2002)
POPL '02: Proceedings of the 29th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
, pp. 140-153
-
-
Rubin, S.1
Bodik, R.2
Chilimbi, T.3
-
15
-
-
0025440459
-
A survey of cache coherence schemes for multiprocessors
-
P. Stenstrom. A survey of cache coherence schemes for multiprocessors. Computer, 23(6):12-24, 1990.
-
(1990)
Computer
, vol.23
, Issue.6
, pp. 12-24
-
-
Stenstrom, P.1
-
16
-
-
84955558691
-
-
S. A. M. Talbot, A. J. Bennett, and P. H. J. Kelly. Cautious, machine-independent performance tuning for shared-memory multiprocessors. In Euro-Par '96: Proceedings of the Second International Euro-Par Conference on Parallel Processing, pages 106-113, London, UK, 1996. Springer-Verlag.
-
S. A. M. Talbot, A. J. Bennett, and P. H. J. Kelly. Cautious, machine-independent performance tuning for shared-memory multiprocessors. In Euro-Par '96: Proceedings of the Second International Euro-Par Conference on Parallel Processing, pages 106-113, London, UK, 1996. Springer-Verlag.
-
-
-
-
17
-
-
0028446907
-
False sharing and spatial locality in multiprocessor caches
-
J. Torrellas, H. S. Lam, and J. L. Hennessy. False sharing and spatial locality in multiprocessor caches. IEEE Trans. Comput., 43(6):651-663, 1994.
-
(1994)
IEEE Trans. Comput
, vol.43
, Issue.6
, pp. 651-663
-
-
Torrellas, J.1
Lam, H.S.2
Hennessy, J.L.3
-
18
-
-
85006879958
-
Improving cache behavior of dynamically allocated data structures
-
Washington, DC, USA, IEEE Computer Society
-
D. N. Truong, F. Bodin, and A. Seznec. Improving cache behavior of dynamically allocated data structures. In PACT '98: Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques, page 322, Washington, DC, USA, 1998. IEEE Computer Society.
-
(1998)
PACT '98: Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques
, pp. 322
-
-
Truong, D.N.1
Bodin, F.2
Seznec, A.3
-
19
-
-
8344272049
-
Array regrouping and structure splitting using whole-program reference affinity
-
New York, NY, USA, ACM Press
-
Y. Zhong, M. Orlovich, X. Shen, and C. Ding. Array regrouping and structure splitting using whole-program reference affinity. In PLDI '04: Proceedings of the ACM SIGPLAN 2004 conference on Programming language design and implementation, pages 255-266, New York, NY, USA, 2004. ACM Press.
-
(2004)
PLDI '04: Proceedings of the ACM SIGPLAN 2004 conference on Programming language design and implementation
, pp. 255-266
-
-
Zhong, Y.1
Orlovich, M.2
Shen, X.3
Ding, C.4
|