-
1
-
-
0023795996
-
An evaluation of directory schemes for cache coherence
-
(Honolulu, May). IEEE Computer Society Press, Los Alamitos, CA
-
th Annual International Symposium on Computer Architecture (Honolulu, May). IEEE Computer Society Press, Los Alamitos, CA, 1988, 280-298.
-
(1988)
th Annual International Symposium on Computer Architecture
, pp. 280-298
-
-
Agarwal, A.1
Simoni, R.2
Horowitz, M.3
Hennessy, J.4
-
2
-
-
85076910730
-
An analysis of Linux scalability to many cores
-
(Vancouver, Oct. 4-6). USENIX Association, Berkeley, CA
-
Boyd-Wickizer, S. Clements, A.T., Mao, Y., Pesterev, A., Kaashoek, M.F., Morris, R., and Zeldovich, N. An analysis of Linux scalability to many cores. In Proceedings of the Ninth USENIX Symposium on Operating Systems Design and Implementation (Vancouver, Oct. 4-6). USENIX Association, Berkeley, CA, 2010, 1-8.
-
(2010)
Proceedings of the Ninth USENIX Symposium on Operating Systems Design and Implementation
, pp. 1-8
-
-
Boyd-Wickizer, S.1
Clements, A.T.2
Mao, Y.3
Pesterev, A.4
Kaashoek, M.F.5
Morris, R.6
Zeldovich, N.7
-
3
-
-
33646388562
-
Scaling Linux to the extreme
-
Boston, June 27-July 2
-
Bryant, R. Scaling Linux to the extreme. In Proceedings of the Linux Symposium (Boston, June 27-July 2, 2004), 133-148.
-
(2004)
Proceedings of the Linux Symposium
, pp. 133-148
-
-
Bryant, R.1
-
4
-
-
79955397043
-
Bulldozer: An approach to multithreaded compute performance
-
Mar./Apr.
-
Butler, M., Barnes, L., Sarma, D.D., and Gelinas, B. Bulldozer: An approach to multithreaded compute performance. IEEE Micro 31, 2 (Mar./Apr. 2011), 6-15.
-
(2011)
IEEE Micro
, vol.31
, Issue.2
, pp. 6-15
-
-
Butler, M.1
Barnes, L.2
Sarma, D.D.3
Gelinas, B.4
-
5
-
-
84856527825
-
DeNovo: Rethinking the memory hierarchy for disciplined parallelism
-
(Galveston Island, TX, Oct. 10-14). IEEE Computer Society, Washington, D.C.
-
th International Conference on Parallel Architectures and Compilation Techniques (Galveston Island, TX, Oct. 10-14). IEEE Computer Society, Washington, D.C., 2011, 155-166.
-
(2011)
th International Conference on Parallel Architectures and Compilation Techniques
, pp. 155-166
-
-
Choi, B.1
Komuravelli, R.2
Sung, H.3
Smolinski, R.4
Honarmand, N.5
Adve, S.V.6
Adve, V.S.7
Carter, N.P.8
Chou, C.-T.9
-
6
-
-
77951200277
-
Cache hierarchy and memory subsystem of the AMD Opteron processor
-
Mar./Apr.
-
Conway, P., Kalyanasundharam, N., Donley, G., Lepak, K., and Hughes, B. Cache hierarchy and memory subsystem of the AMD Opteron processor. IEEE Micro 30, 2 (Mar./Apr. 2010), 16-29.
-
(2010)
IEEE Micro
, vol.30
, Issue.2
, pp. 16-29
-
-
Conway, P.1
Kalyanasundharam, N.2
Donley, G.3
Lepak, K.4
Hughes, B.5
-
7
-
-
79955887509
-
Cuckoo directory: Efficient and scalable CMP coherence
-
(San Antonio, TX, Feb. 12-16). IEEE Computer Society, Washington, D.C.
-
th Symposium on High-Performance Computer Architecture (San Antonio, TX, Feb. 12-16). IEEE Computer Society, Washington, D.C., 2011, 169-180.
-
(2011)
th Symposium on High-Performance Computer Architecture
, pp. 169-180
-
-
Ferdman, M.1
Lotfi-Kamran, P.2
Balet, K.3
Falsafi, B.4
-
8
-
-
0027699767
-
Cooperative shared memory: Software and hardware for scalable multiprocessors
-
Nov.
-
Hill, M.D., Larus, J.R., Reinhardt, S.K., and Wood, D.A. Cooperative shared memory: Software and hardware for scalable multiprocessors. ACM Transactions on Computer Systems 11, 4 (Nov. 1993), 300-318.
-
(1993)
ACM Transactions on Computer Systems
, vol.11
, Issue.4
, pp. 300-318
-
-
Hill, M.D.1
Larus, J.R.2
Reinhardt, S.K.3
Wood, D.A.4
-
9
-
-
0024903997
-
Evaluating associativity in CPU caches
-
Dec.
-
Hill, M.D. and Smith, A.J. Evaluating associativity in CPU caches. IEEE Transactions on Computers 38, 12 (Dec. 1989), 1612-1630.
-
(1989)
IEEE Transactions on Computers
, vol.38
, Issue.12
, pp. 1612-1630
-
-
Hill, M.D.1
Smith, A.J.2
-
10
-
-
77952123736
-
A 48-core IA-32 message-passing processor with DV FS in 45nm CMOS
-
San Francisco, Feb. 7-11
-
Howard, J. et al. A 48-core IA-32 message-passing processor with DV FS in 45nm CMOS. In Proceedings of the International Solid-State Circuits Conference (San Francisco, Feb. 7-11, 2010), 108-109.
-
(2010)
Proceedings of the International Solid-State Circuits Conference
, pp. 108-109
-
-
Howard, J.1
-
11
-
-
79951719036
-
Achieving noninclusive cache performance with inclusive caches: Temporal locality-aware cache management policies
-
(Atlanta, Dec. 4-8). IEEE Computer Society, Washington, D.C.
-
rd Annual IEEE/ACM International Symposium on Microarchitecture (Atlanta, Dec. 4-8). IEEE Computer Society, Washington, D.C., 2010, 151-162.
-
(2010)
rd Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 151-162
-
-
Jaleel, A.1
Borch, E.2
Bhandaru, M.3
Steely Jr., S.C.4
Emer, J.5
-
12
-
-
70450237431
-
Rigel: An architecture and scalable programming interface for a 1,000-core accelerator
-
(Austin, TX, June 20-24). ACM Press, New York
-
th Annual International Symposium on Computer Architecture (Austin, TX, June 20-24). ACM Press, New York, 2009, 140-151.
-
(2009)
th Annual International Symposium on Computer Architecture
, pp. 140-151
-
-
Kelm, J.H.1
Johnson, D.R.2
Johnson, M.R.3
Crago, N.C.4
Tuohy, W.5
Mahesri, A.6
Lumetta, S.S.7
Frank, M.I.8
Patel, S.J.9
-
13
-
-
79951831804
-
Cohesion: An adaptive hybrid memory model for accelerators
-
Jan./Feb.
-
Kelm, J.H., Johnson, D.R., Tuohy, W., Lumetta, S.S., and Patel, S.J. Cohesion: An adaptive hybrid memory model for accelerators. IEEE Micro 31, 1 (Jan./Feb. 2011), 42-55.
-
(2011)
IEEE Micro
, vol.31
, Issue.1
, pp. 42-55
-
-
Kelm, J.H.1
Johnson, D.R.2
Tuohy, W.3
Lumetta, S.S.4
Patel, S.J.5
-
15
-
-
77951154340
-
The GPU computing era
-
Mar./Apr.
-
Nickolls, J. and Dally, W.J. The GPU computing era. IEEE Micro 30, 2 (Mar./Apr. 2010), 56-69.
-
(2010)
IEEE Micro
, vol.30
, Issue.2
, pp. 56-69
-
-
Nickolls, J.1
Dally, W.J.2
-
16
-
-
51349168284
-
UltraSPARC T2: A highly treaded, powerefficient SPARC SOC
-
Jeju, Korea, Nov. 12-14
-
Shah, M., Barren, J., Brooks, J., Golla, R., Grohoski, G., Gura, N., Hetherington, R., Jordan, P., Luttrell, M., Olson, C., Sana, B., Sheahan, D., Spracklen, L., and Wynn, W. UltraSPARC T2: A highly treaded, powerefficient SPARC SOC. In Proceedings of the IEEE Asian Solid-State Circuits Conference (Jeju, Korea, Nov. 12-14, 2007), 22-25.
-
(2007)
Proceedings of the IEEE Asian Solid-State Circuits Conference
, pp. 22-25
-
-
Shah, M.1
Barren, J.2
Brooks, J.3
Golla, R.4
Grohoski, G.5
Gura, N.6
Hetherington, R.7
Jordan, P.8
Luttrell, M.9
Olson, C.10
Sana, B.11
Sheahan, D.12
Spracklen, L.13
Wynn, W.14
-
17
-
-
77957932159
-
Inside Intel next-generation Nehalem microarchitecture
-
Stanford, CA, Aug. 24-26
-
Singhal, R. Inside Intel next-generation Nehalem microarchitecture. Hot Chips 20 (Stanford, CA, Aug. 24-26, 2008).
-
(2008)
Hot Chips
, vol.20
-
-
Singhal, R.1
-
19
-
-
79951700287
-
Fractal coherence: Scalably verifiable cache coherence
-
(Atlanta, Dec. 4-8). IEEE Computer Society, Washington, D.C.
-
rd Annual IEEE/ACM International Symposium on Microarchitecture (Atlanta, Dec. 4-8). IEEE Computer Society, Washington, D.C., 2010, 471-482.
-
(2010)
rd Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 471-482
-
-
Zhang, M.1
Lebeck, A.R.2
Sorin, D.J.3
|