-
3
-
-
0003158656
-
Hitting the memory wall: Implications of the obvious
-
March
-
Wm A Wulf, Sally A McKee. Hitting the memory wall: Implications of the obvious. ACM SIGARPH Computer Architecture News, March 1995, 23(1): 20-24.
-
(1995)
ACM SIGARPH Computer Architecture News
, vol.23
, Issue.1
, pp. 20-24
-
-
Wulf, W.A.1
McKee, S.A.2
-
4
-
-
0029308368
-
Effective hardware-based data prefetching for high performance processors
-
Chen T F, Baer J L. Effective hardware-based data prefetching for high performance processors. IEEE Transactions on Computers, 1995, 44(5): 609-623.
-
(1995)
IEEE Transactions on Computers
, vol.44
, Issue.5
, pp. 609-623
-
-
Chen, T.F.1
Baer, J.L.2
-
5
-
-
84965078406
-
Fixed and adaptive sequential prefetching in shared-memory multiprocessors
-
Los Alamitos, CA, USA, CRC Press
-
Dahlgren F, Dubois M, Stenström P. Fixed and adaptive sequential prefetching in shared-memory multiprocessors. In Proc. International Conference on Parallel Processing (ICPP), Los Alamitos, CA, USA, CRC Press, 1993, Vol.1, pp.56-63.
-
(1993)
Proc. International Conference on Parallel Processing (ICPP)
, vol.1
, pp. 56-63
-
-
Dahlgren, F.1
Dubois, M.2
Stenström, P.3
-
10
-
-
0035691709
-
Dynamic speculative precomputation
-
Austin, Texas
-
Collins J, Tullsen D, Wang H, Shen J. Dynamic speculative precomputation. In Proc. the 84th International Symposium on Microarchitecture, Austin, Texas, 2001, pp.306-317.
-
(2001)
Proc. the 84th International Symposium on Microarchitecture
, pp. 306-317
-
-
Collins, J.1
Tullsen, D.2
Wang, H.3
Shen, J.4
-
12
-
-
68749089060
-
Prefetching linked data structures in systems with merged DRAM-logic [Thesis]. University of Illinois at Urbana-Champaign
-
Technical Report UIUCDCS-R-20012221, May
-
Hughes C J. Prefetching linked data structures in systems with merged DRAM-logic [Thesis]. University of Illinois at Urbana-Champaign, Technical Report UIUCDCS-R-20012221, May 2000.
-
(2000)
-
-
Hughes, C.J.1
-
13
-
-
0036036248
-
Post-pass binary adaptation tool for software-based speculative precomputation
-
Berlin, Germany
-
Liao S, Wang P, Wang H, Hoflehner G, Lavery D, Shen J. Post-pass binary adaptation tool for software-based speculative precomputation. In Proc. the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI'02), Berlin, Germany, 2002, pp.117-128.
-
(2002)
Proc. the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI'02)
, pp. 117-128
-
-
Liao, S.1
Wang, P.2
Wang, H.3
Hoflehner, G.4
Lavery, D.5
Shen, J.6
-
14
-
-
0034839064
-
Tolerating memory latency through software-controlled pre-execution in simultaneous multithreading processors
-
Göeborg, Sweden
-
Chi-Keung Luk. Tolerating memory latency through software-controlled pre-execution in simultaneous multithreading processors. In Proc. the 28th Annual International Symposium on Computer Architecture, Göeborg, Sweden, 2001, pp.40-51.
-
(2001)
Proc. the 28th Annual International Symposium on Computer Architecture
, pp. 40-51
-
-
Luk, C.1
-
17
-
-
0033705677
-
Data movement for linked data structures
-
Santa Fe, New Mexcio
-
Yang C L, Lebeck A R. Push vs. pull: Data movement for linked data structures. In Proc. the International Conference on Supercomputing (ICS), Santa Fe, New Mexcio, 2000, pp.76-186, pp.176-186.
-
(2000)
Proc. the International Conference on Supercomputing (ICS)
-
-
Yang, C.L.1
Lebeck, A.R.2
vs, P.3
pull4
-
20
-
-
34548714520
-
-
Xian-He Sun, Surendra Byna. Data-access memory servers for multi-processor environments. IIT CS TR-2005-001, November 2005, http://www.cs.iit.edu/~suren/research.html.
-
Xian-He Sun, Surendra Byna. Data-access memory servers for multi-processor environments. IIT CS TR-2005-001, November 2005, http://www.cs.iit.edu/~suren/research.html.
-
-
-
-
21
-
-
0003510233
-
Evaluating future microprocessors: The SimpleScalar tool set
-
Technical Report 1308, University of Wisconsin-Madison Computer Sciences
-
Burger D C, Austin T M, Bennett S. Evaluating future microprocessors: The SimpleScalar tool set. Technical Report 1308, University of Wisconsin-Madison Computer Sciences, 1996.
-
(1996)
-
-
Burger, D.C.1
Austin, T.M.2
Bennett, S.3
-
22
-
-
20444490760
-
Predicting the memory-access cost based on data access patterns
-
San Diego, September
-
Surendra Byna, Xian-He Sun, William Gropp, Rajeev Thakur. Predicting the memory-access cost based on data access patterns. In Proc. the IEEE International Conference on Cluster Computing, San Diego, September 2004, pp.327-336.
-
(2004)
Proc. the IEEE International Conference on Cluster Computing
, pp. 327-336
-
-
Byna, S.1
Sun, X.2
Gropp, W.3
Thakur, R.4
-
23
-
-
0034844454
-
Data prefetching by dependence graph pre-computation
-
Göeborg, Sweden
-
Annavaram M, Patel J M, Davidson E S. Data prefetching by dependence graph pre-computation. In Proc. the 28th International Symposium on Computer Architecture (ISCA), Göeborg, Sweden, 2001, pp.52-61.
-
(2001)
Proc. the 28th International Symposium on Computer Architecture (ISCA)
, pp. 52-61
-
-
Annavaram, M.1
Patel, J.M.2
Davidson, E.S.3
-
24
-
-
0035176199
-
Multi-chain prefetching: Effective exploitation of inter-chain memory parallelism for pointer-chasing codes
-
Barcelona, Spain
-
Kohout N, Choi S, Kim D, Yeung D. Multi-chain prefetching: Effective exploitation of inter-chain memory parallelism for pointer-chasing codes. In Proc. the 10th International Conference on Parallel Architectures and Compilation Techniques, Barcelona, Spain, 2001, pp.268-279.
-
(2001)
Proc. the 10th International Conference on Parallel Architectures and Compilation Techniques
, pp. 268-279
-
-
Kohout, N.1
Choi, S.2
Kim, D.3
Yeung, D.4
-
25
-
-
0031600692
-
Dependence based prefetching for linked data structures
-
San Jose, CA
-
Roth A, Moshovos A, Sohi G S. Dependence based prefetching for linked data structures. In Proc. the 8th International Conference on Architectural Support for Programming Languages and Operating Systems, San Jose, CA, 1998, pp.115-126.
-
(1998)
Proc. the 8th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 115-126
-
-
Roth, A.1
Moshovos, A.2
Sohi, G.S.3
-
29
-
-
34548725961
-
-
access. White paper, Intel Research website, Available online at
-
Jack Doweck. Inside Intel core microarchitecture and smart memory access. White paper, Intel Research website, Available online at http://download.intel. com/technology/architecture/sma.pdf, 2006.
-
(2006)
Inside Intel core microarchitecture and smart memory
-
-
Doweck, J.1
-
32
-
-
33644654831
-
Srinivasan. When prefetching improves/degrades performance
-
Ischia, Italy, May 04-06
-
Thomas R Puzak, A Hartstein, P G Emma, V Srinivasan. When prefetching improves/degrades performance. In Proc. the 2nd Conference on Computing Frontiers, Ischia, Italy, May 04-06, 2005, pp.342-352.
-
(2005)
Proc. the 2nd Conference on Computing Frontiers
, pp. 342-352
-
-
Thomas, R.1
Puzak, A.2
Hartstein, P.G.3
Emma, V.4
-
33
-
-
24644502365
-
-
Standard Performance Evaluation Corporation
-
Standard Performance Evaluation Corporation. SPEC Benchmarks, http://www.spec.org/.
-
SPEC Benchmarks
-
-
-
34
-
-
0025402476
-
A set of level 3 basic linear algebra subprograms
-
Jack J Dongarra, Jeremy Du Croz, Sven Hammarling, Iain Duff. A set of level 3 basic linear algebra subprograms. ACM Transactions on Mathematical Software, 1990, 16(1): 1-17.
-
(1990)
ACM Transactions on Mathematical Software
, vol.16
, Issue.1
, pp. 1-17
-
-
Dongarra, J.J.1
Croz, J.D.2
Hammarling, S.3
Duff, I.4
-
35
-
-
34548801156
-
Memory bandwidth and machine balance in current high performance computers
-
John D McCalpin. Memory bandwidth and machine balance in current high performance computers. IEEE Technical Committee on Computer Architecture, 1995, http://www.cs.virginia.edu/stream.
-
(1995)
IEEE Technical Committee on Computer Architecture
-
-
McCalpin, J.D.1
-
36
-
-
0035182089
-
Basic block distribution analysis to find periodic behavior and simulation points in applications
-
Barcelona, Spain
-
Sherwood T, Perelman E, Calder B. Basic block distribution analysis to find periodic behavior and simulation points in applications. In Proc. the International Conference on Parallel Architectures and Compilation Techniques, Barcelona, Spain, 2001, pp.3-14.
-
(2001)
Proc. the International Conference on Parallel Architectures and Compilation Techniques
, pp. 3-14
-
-
Sherwood, T.1
Perelman, E.2
Calder, B.3
-
37
-
-
0029341212
-
Sequential hardware prefetching in shared-memory multiprocessors
-
Dahlgren F, Dubois M, Stenström P. Sequential hardware prefetching in shared-memory multiprocessors. IEEE Transactions on Parallel and Distributed Systems, 1995, 6(7): 733-746.
-
(1995)
IEEE Transactions on Parallel and Distributed Systems
, vol.6
, Issue.7
, pp. 733-746
-
-
Dahlgren, F.1
Dubois, M.2
Stenström, P.3
-
39
-
-
0002031606
-
Tolerating latency through softwarecontrolled prefetching in shared-memory multiprocessors
-
June
-
Mowry T, Gupta A. Tolerating latency through softwarecontrolled prefetching in shared-memory multiprocessors. Journal of Parallel and Distributed Computing, June 1991, 12(2): 87-106.
-
(1991)
Journal of Parallel and Distributed Computing
, vol.12
, Issue.2
, pp. 87-106
-
-
Mowry, T.1
Gupta, A.2
-
40
-
-
0033075416
-
The impact of exploiting instruction-level parallelism on sharedmemory multiprocessors
-
February
-
Pai V S, Ranganathan P, Abdel-Shafi H, Adve S. The impact of exploiting instruction-level parallelism on sharedmemory multiprocessors. IEEE Transactions on Computers, February 1999, 48(2): 218-226.
-
(1999)
IEEE Transactions on Computers
, vol.48
, Issue.2
, pp. 218-226
-
-
Pai, V.S.1
Ranganathan, P.2
Abdel-Shafi, H.3
Adve, S.4
-
42
-
-
0036296856
-
Using a user-level memory thread for correlation prefetching
-
Anchorage, Alaska, May
-
Solihin Y, Lee J, Torrellas J. Using a user-level memory thread for correlation prefetching. In Proc. International Symposium on Computer Architecture, Anchorage, Alaska, May 2002, pp.171-182.
-
(2002)
Proc. International Symposium on Computer Architecture
, pp. 171-182
-
-
Solihin, Y.1
Lee, J.2
Torrellas, J.3
|