-
3
-
-
33746728297
-
Code size minimization and retargetable assembly for custom EPIC and VLIW instruction formats
-
S. Aditya, S. Mahlke, and B. Rau. Code size minimization and retargetable assembly for custom EPIC and VLIW instruction formats. ACM TODAES, 5(4):752-773, 2000.
-
(2000)
ACM TODAES
, vol.5
, Issue.4
, pp. 752-773
-
-
Aditya, S.1
Mahlke, S.2
Rau, B.3
-
4
-
-
0033717865
-
Clock rate versus IPC: The end of the road for conventional microarchitectures
-
June
-
V. Agarwal et al. Clock rate versus IPC: the end of the road for conventional microarchitectures. In Proc. 27th Intl. Symposium on Computer Architecture, pages 248-259, June 2000.
-
(2000)
Proc. 27th Intl. Symposium on Computer Architecture
, pp. 248-259
-
-
Agarwal, V.1
-
5
-
-
84944394490
-
Instruction replication for clustered microarchitectures
-
A. Aletá, J. M. Codina, A. Gonzalez, and D. Kaeli. Instruction replication for clustered microarchitectures. In MICRO 36: Proceedings of the 36th Annual IEEE/ACM International Symposium on Microarchitecture, page 326, 2003.
-
(2003)
MICRO 36: Proceedings of the 36th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 326
-
-
Aletá, A.1
Codina, J.M.2
Gonzalez, A.3
Kaeli, D.4
-
7
-
-
0025537017
-
Architecture and implementation of a VLIW supercomputer
-
IEEE Computer Society Press
-
R. Colwell et al. Architecture and implementation of a VLIW supercomputer. In Proc. 1990 Conference on Supercomputing, pages 910-919. IEEE Computer Society Press, 1990.
-
(1990)
Proc. 1990 Conference on Supercomputing
, pp. 910-919
-
-
Colwell, R.1
-
8
-
-
0030379247
-
Instruction fetch mechanisms for VLIW architectures with compressed encodings
-
Dec.
-
T. Conte et al. Instruction fetch mechanisms for VLIW architectures with compressed encodings. In Proc. 29th Intl. Symposium on Microarchitecture, pages 201-211, Dec. 1996.
-
(1996)
Proc. 29th Intl. Symposium on Microarchitecture
, pp. 201-211
-
-
Conte, T.1
-
9
-
-
0029701432
-
The design of a high-performance low-power microprocessor
-
D. Dobberpuhl. The design of a high-performance low-power microprocessor. In Proc. of ISLPED, pages 11-16, 1996.
-
(1996)
Proc. of ISLPED
, pp. 11-16
-
-
Dobberpuhl, D.1
-
15
-
-
84862452827
-
HPL-PD architecture specification: Version 1.1
-
Hewlett-Packard Laboratories, Feb.
-
V. Kathail, M. Schlansker, and B. Rau. HPL-PD architecture specification: Version 1.1. Technical Report HPL-93-80, Hewlett-Packard Laboratories, Feb. 2000.
-
(2000)
Technical Report
, vol.HPL-93-80
-
-
Kathail, V.1
Schlansker, M.2
Rau, B.3
-
16
-
-
0036292594
-
An instruction set and microarchitecture for instruction level distributed processing
-
June
-
H. Kim and J. Smith. An instruction set and microarchitecture for instruction level distributed processing. In Proc. 29th Intl. Symposium on Computer Architecture, pages 71-81, June 2002.
-
(2002)
Proc. 29th Intl. Symposium on Computer Architecture
, pp. 71-81
-
-
Kim, H.1
Smith, J.2
-
18
-
-
0033357301
-
Compiler-driven cached code compression schemes for embedded ILP processors
-
Dec.
-
S. Y. Larin and T. M. Conte. Compiler-driven cached code compression schemes for embedded ILP processors. In Proc. 32nd Intl. Symposium on Microarchitecture, pages 82-92, Dec. 1999.
-
(1999)
Proc. 32nd Intl. Symposium on Microarchitecture
, pp. 82-92
-
-
Larin, S.Y.1
Conte, T.M.2
-
19
-
-
0003272089
-
MediaBench: A tool for evaluating and synthesizing multimedia and communications systems
-
Dec.
-
C. Lee et al. MediaBench: A tool for evaluating and synthesizing multimedia and communications systems. In Proc. 30th Intl. Symposium on Microarchitecture, Dec. 1997.
-
(1997)
Proc. 30th Intl. Symposium on Microarchitecture
-
-
Lee, C.1
-
20
-
-
0003589265
-
Low-Cost Embedded Program Loop Caching Revisited
-
Univ. of Michigan
-
L. Lee et al. Low-Cost Embedded Program Loop Caching Revisited. Technical Report CSE-TR-411-99, Univ. of Michigan, 1999.
-
(1999)
Technical Report
, vol.CSE-TR-411-99
-
-
Lee, L.1
-
21
-
-
0029488328
-
Instruction selection using binate covering for code size optimization
-
S. Liao, S. Devadas, K. Keutzer, and S. Tjiang. Instruction selection using binate covering for code size optimization. In International Conference on Computer-Aided Design, pages 393-399, 1995.
-
(1995)
International Conference on Computer-Aided Design
, pp. 393-399
-
-
Liao, S.1
Devadas, S.2
Keutzer, K.3
Tjiang, S.4
-
22
-
-
0031232922
-
Will physical scalability sabotage performance gains?
-
D. Matzke. Will physical scalability sabotage performance gains? IEEE Computer, 30(9):37-39, 1997.
-
(1997)
IEEE Computer
, vol.30
, Issue.9
, pp. 37-39
-
-
Matzke, D.1
-
23
-
-
0027709629
-
Balancing fine- and medium-grained parallelism in scheduling loops for the XIMD architecture
-
C. Newburn, A. Huang, and J. Shen. Balancing fine- and medium-grained parallelism in scheduling loops for the XIMD architecture. In Proc. PACT-1993, pages 39-52, 1993.
-
(1993)
Proc. PACT-1993
, pp. 39-52
-
-
Newburn, C.1
Huang, A.2
Shen, J.3
-
24
-
-
0024480706
-
The Cydra 5 departmental supercomputer
-
B. R. Rau, D. W. L. Yen, W. Yen, and R. A. Towle. The Cydra 5 departmental supercomputer. In IEEE Computer, volume 22, pages 12-35, 1989.
-
(1989)
IEEE Computer
, vol.22
, pp. 12-35
-
-
Rau, B.R.1
Yen, D.W.L.2
Yen, W.3
Towle, R.A.4
-
26
-
-
0037669851
-
Exploiting ILP, TLP, and DLP with the polymorphous TRIPS architecture
-
June
-
K. Sankaralingam et al. Exploiting ILP, TLP, and DLP with the polymorphous TRIPS architecture. In Proc. 30th Intl. Symposium on Computer Architecture, pages 422-433, June 2003.
-
(2003)
Proc. 30th Intl. Symposium on Computer Architecture
, pp. 422-433
-
-
Sankaralingam, K.1
-
27
-
-
0003450887
-
CACTI 3.0: An integrated cache timing, power, and area model
-
Western Research Laboratory, Feb.
-
P. Shivakumar and N. P. Jouppi. CACTI 3.0: An integrated cache timing, power, and area model. Technical Report WRL-2001-2, Western Research Laboratory, Feb. 2001.
-
(2001)
Technical Report
, vol.WRL-2001-2
-
-
Shivakumar, P.1
Jouppi, N.P.2
-
29
-
-
0034854189
-
Modeling and minimization of interconnect energy dissipation in nanometer technologies
-
ACM Press
-
C. N. Taylor, S. Dey, and Y. Zhao. Modeling and minimization of interconnect energy dissipation in nanometer technologies. In Proceedings of the 38th conference on Design automation, pages 754-757. ACM Press, 2001.
-
(2001)
Proceedings of the 38th Conference on Design Automation
, pp. 754-757
-
-
Taylor, C.N.1
Dey, S.2
Zhao, Y.3
-
30
-
-
84955456130
-
Scalar operand networks: On-chip interconnect for ILP in partitioned architectures
-
Feb.
-
M. Taylor et al. Scalar operand networks: On-chip interconnect for ILP in partitioned architectures. In Proc. 9th Intl. Symposium on High-Performance Computer Architecture, pages 341-343, Feb. 2003.
-
(2003)
Proc. 9th Intl. Symposium on High-Performance Computer Architecture
, pp. 341-343
-
-
Taylor, M.1
-
31
-
-
4644353790
-
Evaluation of the Raw microprocessor: An exposed-wire-delay architecture for ILP and streams
-
June
-
M. Taylor et al. Evaluation of the Raw microprocessor: An exposed-wire-delay architecture for ILP and streams. In Proc. 31st Intl. Symposium on Computer Architecture, pages 2-13, June 2004.
-
(2004)
Proc. 31st Intl. Symposium on Computer Architecture
, pp. 2-13
-
-
Taylor, M.1
-
33
-
-
0026137273
-
A variable instruction stream extension to the VLIW architecture
-
A. Wolfe and J. P. Shen. A variable instruction stream extension to the VLIW architecture. In Proc. ASPLOS-IV, pages 2-14, 1991.
-
(1991)
Proc. ASPLOS-IV
, pp. 2-14
-
-
Wolfe, A.1
Shen, J.P.2