-
1
-
-
29144456000
-
A post-compiler approach to scratchpad mapping of code
-
Sep
-
F. Angiolini, F. Menichelli, A. Ferrero, L. Benini, and M. Olivieri. A post-compiler approach to scratchpad mapping of code. In CASES '04: Proceedings of the 2004 international conference on Compilers, architecture, and synthesis for embedded systems, pages 259-267, Sep 2004.
-
(2004)
CASES '04: Proceedings of the 2004 international conference on Compilers, architecture, and synthesis for embedded systems
, pp. 259-267
-
-
Angiolini, F.1
Menichelli, F.2
Ferrero, A.3
Benini, L.4
Olivieri, M.5
-
3
-
-
0036045884
-
Scratchpad memory: Design alternative for cache on-chip memory in embedded systems
-
R. Banakar, S. Steinke, B.-S. Lee, M. Balakrishnan, and P. Marwedel. Scratchpad memory: design alternative for cache on-chip memory in embedded systems. In CODES '02: Proceedings of the tenth international symposium on Hardware/software codesign, pages 73-78, 2002.
-
(2002)
CODES '02: Proceedings of the tenth international symposium on Hardware/software codesign
, pp. 73-78
-
-
Banakar, R.1
Steinke, S.2
Lee, B.-S.3
Balakrishnan, M.4
Marwedel, P.5
-
8
-
-
0003978993
-
Shade: A fast instruction-set simulator for execution profiling
-
93-12, UWCSE 93-06-06, Sun Microsystems Laboratories, Inc. and the University of Washington
-
R. F. Cmelik and D. Keppel. Shade: A fast instruction-set simulator for execution profiling. Technical Report SMLI 93-12, UWCSE 93-06-06, Sun Microsystems Laboratories, Inc. and the University of Washington, 1993.
-
(1993)
Technical Report SMLI
-
-
Cmelik, R.F.1
Keppel, D.2
-
9
-
-
84976746765
-
Virtual memory
-
P. J. Denning. Virtual memory. ACM Computing Surveys, 2(3):153-189, 1970.
-
(1970)
ACM Computing Surveys
, vol.2
, Issue.3
, pp. 153-189
-
-
Denning, P.J.1
-
10
-
-
84948988003
-
DELI: A new run-time control point
-
Nov
-
G. Desoli, N. Mateev, E. Duesterwald, P. Faraboschi, and J. A. Fisher. DELI: a new run-time control point. In MICRO 35: Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture, pages 257-268, Nov 2002.
-
(2002)
MICRO 35: Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture
, pp. 257-268
-
-
Desoli, G.1
Mateev, N.2
Duesterwald, E.3
Faraboschi, P.4
Fisher, J.A.5
-
13
-
-
0035365369
-
Dynamic binary translation and optimization
-
K. Ebcioglu, E. R. Altman, M. Gschwind, and S. W. Sathaye. Dynamic binary translation and optimization. IEEE Transactions on Computers, 50(6):529-548, 2001.
-
(2001)
IEEE Transactions on Computers
, vol.50
, Issue.6
, pp. 529-548
-
-
Ebcioglu, K.1
Altman, E.R.2
Gschwind, M.3
Sathaye, S.W.4
-
14
-
-
33646558229
-
Using advanced compiler technology to exploit the performance of the Cell Broadband Engine architecture
-
January
-
A. E. Eichenberger, J. K. O'Brien, K. M. O'Brien, P. Wu, T. Chen, P. H. Oden, D. A. Prener, J. C. Shepherd, B. So, Z. Sura, A. Wang, T. Zhang, P. Zhao, M. K. Gschwind, R. Archambault, Y. Gao, and R. Koo. Using advanced compiler technology to exploit the performance of the Cell Broadband Engine architecture. IBM Systems Journal, 45(1):59-84, January 2006.
-
(2006)
IBM Systems Journal
, vol.45
, Issue.1
, pp. 59-84
-
-
Eichenberger, A.E.1
O'Brien, J.K.2
O'Brien, K.M.3
Wu, P.4
Chen, T.5
Oden, P.H.6
Prener, D.A.7
Shepherd, J.C.8
So, B.9
Sura, Z.10
Wang, A.11
Zhang, T.12
Zhao, P.13
Gschwind, M.K.14
Archambault, R.15
Gao, Y.16
Koo, R.17
-
16
-
-
33646015987
-
Synergistic processing in Cell's multicore architecture
-
March-April
-
M. Gschwind, H. P. Hofstee, B. Flachs, M. Hopkins, Y. Watanabe, and T. Yamazaki. Synergistic processing in Cell's multicore architecture. IEEE Micro, 26(2):10-24, March-April 2006.
-
(2006)
IEEE Micro
, vol.26
, Issue.2
, pp. 10-24
-
-
Gschwind, M.1
Hofstee, H.P.2
Flachs, B.3
Hopkins, M.4
Watanabe, Y.5
Yamazaki, T.6
-
17
-
-
77954694130
-
Using complete machine simulation for software power estimation: The SoftWatt approach
-
S. Gurumurthi, A. Sivasubramaniam, M. J. Irwin, N. Vijaykrishnan, M. Kandemir, T. Li, and L. K. John. Using complete machine simulation for software power estimation: The SoftWatt approach. In HPCA '02: Proceedings of the Eighth International Symposium on High-Performance Computer Architecture, page 141, 2002.
-
(2002)
HPCA '02: Proceedings of the Eighth International Symposium on High-Performance Computer Architecture
, p. 141
-
-
Gurumurthi, S.1
Sivasubramaniam, A.2
Irwin, M.J.3
Vijaykrishnan, N.4
Kandemir, M.5
Li, T.6
John, L.K.7
-
20
-
-
0027595384
-
The superblock: An effective technique for VLIW and superscalar compilation
-
W.-M. W. Hwu, S. A. Mahlke, W. Y. Chen, P. P. Chang, N. J. Warter, R. A. Bringmann, R. G. Ouellette, R. E. Hank, T. Kiyohara, G. E. Haab, J. G. Holm, and D. M. Lavery. The superblock: an effective technique for VLIW and superscalar compilation. Journal of Supercomputing, 7(1-2):229-248, 1993.
-
(1993)
Journal of Supercomputing
, vol.7
, Issue.1-2
, pp. 229-248
-
-
Hwu, W.-M.W.1
Mahlke, S.A.2
Chen, W.Y.3
Chang, P.P.4
Warter, N.J.5
Bringmann, R.A.6
Ouellette, R.G.7
Hank, R.E.8
Kiyohara, T.9
Haab, G.E.10
Holm, J.G.11
Lavery, D.M.12
-
24
-
-
0031599861
-
Hardware-software tradeoffs in a direct Rambus implementation of the RAMpage memory hierarchy
-
P. Machanick, P. Salverda, and L. Pompe. Hardware-software tradeoffs in a direct Rambus implementation of the RAMpage memory hierarchy. ACM SIGPLAN Notices, 33(11):105-114, 1998.
-
(1998)
ACM SIGPLAN Notices
, vol.33
, Issue.11
, pp. 105-114
-
-
Machanick, P.1
Salverda, P.2
Pompe, L.3
-
25
-
-
84976808079
-
Mimic: A fast System/370 simulator
-
New York, NY, USA, ACM Press
-
C. May. Mimic: A fast System/370 simulator. In SIGPLAN '87: Papers of the Symposium on Interpreters and interpretive techniques, pages 1-13, New York, NY, USA, 1987. ACM Press.
-
(1987)
SIGPLAN '87: Papers of the Symposium on Interpreters and interpretive techniques
, pp. 1-13
-
-
May, C.1
-
26
-
-
0030285348
-
A 160-MHz, 32-b, 0.5-W CMOS RISC microprocessor
-
November
-
J. Montanaro, R. T. Witek, K. Anne, A. J. Black, E. M. Cooper, D. W. Dobberpuhl, P. M. Donahue, J. Eno, G. W. Hoeppner, D. Kruckemyer, T. H. Lee, P. C. M. Lin, L. Madden, D. Murray, M. H. Pearce, S. Santhanam, K. J. Snyder, R. Stephany, and S. C. Thierauf. A 160-MHz, 32-b, 0.5-W CMOS RISC microprocessor. IEEE JSSC, 31(11):1703-1714, November 1996.
-
(1996)
IEEE JSSC
, vol.31
, Issue.11
, pp. 1703-1714
-
-
Montanaro, J.1
Witek, R.T.2
Anne, K.3
Black, A.J.4
Cooper, E.M.5
Dobberpuhl, D.W.6
Donahue, P.M.7
Eno, J.8
Hoeppner, G.W.9
Kruckemyer, D.10
Lee, T.H.11
Lin, P.C.M.12
Madden, L.13
Murray, D.14
Pearce, M.H.15
Santhanam, S.16
Snyder, K.J.17
Stephany, R.18
Thierauf, S.C.19
-
27
-
-
0004100483
-
Hot pages: Software caching for Raw microprocessors
-
Technical Report LCS-TM-599, Massachusetts Institute of Technology Lab for Computer Science
-
C. Moritz, M. Frank, W. Lee, and S. Amarasinghe. Hot pages: Software caching for Raw microprocessors. Technical Report LCS-TM-599, Massachusetts Institute of Technology Lab for Computer Science, 1999.
-
(1999)
-
-
Moritz, C.1
Frank, M.2
Lee, W.3
Amarasinghe, S.4
-
28
-
-
33846508742
-
Novel caches for predictable computing
-
Technical Report CSTR-98-011, Department of Computer Science, University of Bristol, Oct
-
H. Muller, D. May, J. Irwin, and D. Page. Novel caches for predictable computing. Technical Report CSTR-98-011, Department of Computer Science, University of Bristol, Oct 1998.
-
(1998)
-
-
Muller, H.1
May, D.2
Irwin, J.3
Page, D.4
-
29
-
-
33845979119
-
The performance of a system for automatic segmentation of programs within an ALGOL compiler (GIER ALGOL)
-
P. Naur. The performance of a system for automatic segmentation of programs within an ALGOL compiler (GIER ALGOL). Communications of the ACM, 8(11):671-676, 1965.
-
(1965)
Communications of the ACM
, vol.8
, Issue.11
, pp. 671-676
-
-
Naur, P.1
-
30
-
-
33746730501
-
Operating systems: Program overlay techniques
-
R. J. Pankhurst. Operating systems: Program overlay techniques. Communications of the ACM, 11(2):119-125, 1968.
-
(1968)
Communications of the ACM
, vol.11
, Issue.2
, pp. 119-125
-
-
Pankhurst, R.J.1
-
31
-
-
33646824701
-
-
R. A. Ravindran, P. D. Nagarkar, G. S. Dasika, E. D. Marsman, R. M. Senger, S. A. Mahlke, and R. B. Brown. Compiler managed dynamic instruction placement in a low-power code cache. In CGO '05: Proceedings of the international symposium on Code generation and optimization, pages 179-190, March 2005.
-
-
-
-
32
-
-
0003450887
-
CACTI 3.0: An integrated cache timing, power and area model
-
Technical Report 2001/2, Compaq Western Research Laboratory, Dec
-
P. Shivakumar and N. P. Jouppi. CACTI 3.0: An integrated cache timing, power and area model. Technical Report 2001/2, Compaq Western Research Laboratory, Dec 2001.
-
(2001)
-
-
Shivakumar, P.1
Jouppi, N.P.2
-
33
-
-
84976708141
-
A proposal to establish a pseudo virtual memory via writable overlays
-
T. R. Spacek. A proposal to establish a pseudo virtual memory via writable overlays. Communications of the ACM, 15(6):421-426, 1972.
-
(1972)
Communications of the ACM
, vol.15
, Issue.6
, pp. 421-426
-
-
Spacek, T.R.1
-
34
-
-
84893786147
-
Assigning program and data objects to scratchpad for energy reduction
-
Mar
-
S. Steinke, L. Wehmeyer, B. Lee, and P. Marwedel. Assigning program and data objects to scratchpad for energy reduction. In DATE '02: Proceedings of the conference on Design, automation and test in Europe, pages 409-417, Mar 2002.
-
(2002)
DATE '02: Proceedings of the conference on Design, automation and test in Europe
, pp. 409-417
-
-
Steinke, S.1
Wehmeyer, L.2
Lee, B.3
Marwedel, P.4
-
35
-
-
0036505033
-
The Raw microprocessor: A computational fabric for software circuits and general-purpose programs
-
Mar
-
M. B. Taylor, J. Kim, J. E. Miller, D. Wentzlaff, F. Ghodrat, B. Greenwald, H. Hoffman, P. Johnson, J.-W. Lee, W. Lee, A. Ma, A. Saraf, M. Seneski, N. Shnidman, V. Strumpen, M. Frank, S. Amarasinghe, and A. Agarwal. The Raw microprocessor: A computational fabric for software circuits and general-purpose programs. IEEE Micro, 22(2):25-35, Mar 2002.
-
(2002)
IEEE Micro
, vol.22
, Issue.2
, pp. 25-35
-
-
Taylor, M.B.1
Kim, J.2
Miller, J.E.3
Wentzlaff, D.4
Ghodrat, F.5
Greenwald, B.6
Hoffman, H.7
Johnson, P.8
Lee, J.-W.9
Lee, W.10
Ma, A.11
Saraf, A.12
Seneski, M.13
Shnidman, N.14
Strumpen, V.15
Frank, M.16
Amarasinghe, S.17
Agarwal, A.18
-
36
-
-
4644353790
-
Evaluation of the Raw microprocessor: An exposed-wire-delay architecture for ILP and streams
-
Jun
-
M. B. Taylor, W. Lee, J. E. Miller, D. Wentzlaff, I. Bratt, B. Greenwald, H. Hoffmann, P. Johnson, J. Kim, J. Psota, A. Saraf, N. Shnidman, V. Strumpen, M. Frank, S. Amarasinghe, and A. Agarwal. Evaluation of the Raw microprocessor: An exposed-wire-delay architecture for ILP and streams. In ISCA '04: Proceedings of the 31st annual international symposium on Computer architecture, pages 2-13, Jun 2004.
-
(2004)
ISCA '04: Proceedings of the 31st annual international symposium on Computer architecture
, pp. 2-13
-
-
Taylor, M.B.1
Lee, W.2
Miller, J.E.3
Wentzlaff, D.4
Bratt, I.5
Greenwald, B.6
Hoffmann, H.7
Johnson, P.8
Kim, J.9
Psota, J.10
Saraf, A.11
Shnidman, N.12
Strumpen, V.13
Frank, M.14
Amarasinghe, S.15
Agarwal, A.16
-
38
-
-
0030149507
-
CACTI: An enhanced cache access and cycle time model
-
May
-
S. J. E. Wilton and N. P. Jouppi. CACTI: An enhanced cache access and cycle time model. IEEE JSSC, 31(5):677-688, May 1996.
-
(1996)
IEEE JSSC
, vol.31
, Issue.5
, pp. 677-688
-
-
Wilton, S.J.E.1
Jouppi, N.P.2
-
40
-
-
84949817426
-
Exploiting choice in resizable cache design to optimize deep-submicron processor energy-delay
-
Feb
-
S.-H. Yang, B. Falsafi, M. D. Powell, and T. N. Vijaykumar. Exploiting choice in resizable cache design to optimize deep-submicron processor energy-delay. In HPCA '02: Proceedings of the Eighth International Symposium on High-Performance Computer Architecture, pages 151-161, Feb 2002.
-
(2002)
HPCA '02: Proceedings of the Eighth International Symposium on High-Performance Computer Architecture
, pp. 151-161
-
-
Yang, S.-H.1
Falsafi, B.2
Powell, M.D.3
Vijaykumar, T.N.4