-
2
-
-
3242744876
-
ICTINEO: A tool for research on ilp
-
E. Ayguadé, C. Barrado, J. Labarta, J. Llosa, D. López, S. Moreno, D. Padua, E. Riera, and M. Valero. ICTINEO: A tool for research on ilp. In Proc. of the Supercomputing'96 (SC'96), Research Exhibit "Polaris at Work", 1996.
-
(1996)
Proc. of the Supercomputing'96 (SC'96), Research Exhibit "Polaris at Work"
-
-
Ayguadé, E.1
Barrado, C.2
Labarta, J.3
Llosa, J.4
López, D.5
Moreno, S.6
Padua, D.7
Riera, E.8
Valero, M.9
-
3
-
-
0003477925
-
The perfect club benchmarks: Effective performance evaluation of supercomputers
-
November
-
M. Berry, D. Chen, P. Koss, and D. Kuck. The Perfect Club benchmarks: Effective performance evaluation of supercomputers. Technical Report 827, Center for Supercomputing Research and Development, November 1988.
-
(1988)
Technical Report 827, Center for Supercomputing Research and Development
-
-
Berry, M.1
Chen, D.2
Koss, P.3
Kuck, D.4
-
6
-
-
0026157612
-
IMPACT: An architectural framework for multiple-instruction-issue processors
-
P. Chang, S. Mahlke, W. Chen, N. Warter, and W. Hwu. IMPACT: An architectural framework for multiple-instruction-issue processors. In Proc. of the 18th Int. Symp. on Computer Architecture, pages 266-275, 1991.
-
(1991)
Proc. of the 18th Int. Symp. on Computer Architecture
, pp. 266-275
-
-
Chang, P.1
Mahlke, S.2
Chen, W.3
Warter, N.4
Hwu, W.5
-
7
-
-
0019610938
-
An approach to scientific array processing: The architectural design of the AP120B/FPS-164 family
-
A. Charlesworth. An approach to scientific array processing: The architectural design of the AP120B/FPS-164 family. Computer, 14(9):18-27, 1981.
-
(1981)
Computer
, vol.14
, Issue.9
, pp. 18-27
-
-
Charlesworth, A.1
-
9
-
-
0033716803
-
Multiple-banked register file architectures
-
June
-
J. Cruz, A. Gonzalez, M. Valero, and N. Topham. Multiple-banked register file architectures. In Proc. , 27th Annual Internat. Symp. on Computer Architecture, June 2000.
-
(2000)
Proc. , 27th Annual Internat. Symp. on Computer Architecture
-
-
Cruz, J.1
Gonzalez, A.2
Valero, M.3
Topham, N.4
-
12
-
-
0033703885
-
Lx: A technology platform for customizable VLIW embedded porcessing
-
June
-
P. Faraboschi, G. Brown, G. Desoli, and F. Homewood. Lx: A technology platform for customizable VLIW embedded porcessing. In Proc. of the 27th Int. Symp. on Computer Architecture, pages 203-213, June 2000.
-
(2000)
Proc. of the 27th Int. Symp. on Computer Architecture
, pp. 203-213
-
-
Faraboschi, P.1
Brown, G.2
Desoli, G.3
Homewood, F.4
-
13
-
-
0031650008
-
Partitioned schedules for clustered vliw architectures
-
March
-
M. Fernandes, J. Llosa, and N. Topham. Partitioned schedules for clustered vliw architectures. In Proc. , 12th International Parallel Processing Symposium and 9th Symposium on Parallel and Distributed Processing (IPPS/SPDP'1998), pages 386-391, March 1998.
-
(1998)
Proc. , 12th International Parallel Processing Symposium and 9th Symposium on Parallel and Distributed Processing (IPPS/SPDP'1998)
, pp. 386-391
-
-
Fernandes, M.1
Llosa, J.2
Topham, N.3
-
15
-
-
0033888003
-
The tigersharc DSP architecture
-
January-February
-
J. Fridman and Z. Greefield. The tigersharc DSP architecture. IEEE Micro, pages 66-76, January-February 2000.
-
(2000)
IEEE Micro
, pp. 66-76
-
-
Fridman, J.1
Greefield, Z.2
-
16
-
-
0003318618
-
MAP1000 unfolds at Equator
-
December
-
P. N. Glaskowsky. MAP1000 unfolds at Equator. Microporcessor Report. , 12(16), December 1998.
-
(1998)
Microporcessor Report
, vol.12
, Issue.16
-
-
Glaskowsky, P.N.1
-
17
-
-
0036287089
-
The optimal useful logic depth per pipeline stage is 6-8 FO4
-
May
-
M. Hrishikesh, N. P. Jouppi, K. I. Farkas, D. Burger, S. W. Keckler, and P. Shivakumar. The optimal useful logic depth per pipeline stage is 6-8 FO4. In Proc. , 29th Annual Internat. Symp. on Computer Architecture, pages 14-24, May 2002.
-
(2002)
Proc. , 29th Annual Internat. Symp. on Computer Architecture
, pp. 14-24
-
-
Hrishikesh, M.1
Jouppi, N.P.2
Farkas, K.I.3
Burger, D.4
Keckler, S.W.5
Shivakumar, P.6
-
19
-
-
0027595384
-
The superblock: An effective technique for VLIW and superscalar compilation
-
W. Hwu, S. Mahlke, W. Chen, P. Chang, N. Warter, R. Bringmann, R. Ouellette, R. Hank, T. Kiyohara, G. Haab, J. Holm, and D. Lavery. The superblock: An effective technique for VLIW and superscalar compilation. Journal of Supercomputing, 7(1/2):229-248, 1993.
-
(1993)
Journal of Supercomputing
, vol.7
, Issue.1-2
, pp. 229-248
-
-
Hwu, W.1
Mahlke, S.2
Chen, W.3
Chang, P.4
Warter, N.5
Bringmann, R.6
Ouellette, R.7
Hank, R.8
Kiyohara, T.9
Haab, G.10
Holm, J.11
Lavery, D.12
-
20
-
-
0032639289
-
The alpha 21264 microprocessor
-
March
-
R. Kessler. The Alpha 21264 microprocessor. IEEE Micro, 19(2):24-36, March 1999.
-
(1999)
IEEE Micro
, vol.19
, Issue.2
, pp. 24-36
-
-
Kessler, R.1
-
23
-
-
0029488251
-
Hypernode reduction modulo scheduling
-
November
-
J. Llosa, M. Valero, E. Ayguadé, and A. González. Hypernode reduction modulo scheduling. In Proc. of the 28th Int. Symp. on Microarchitecture (MICRO-28), pages 350-360, November 1995.
-
(1995)
Proc. of the 28th Int. Symp. on Microarchitecture (MICRO-28)
, pp. 350-360
-
-
Llosa, J.1
Valero, M.2
Ayguadé, E.3
González, A.4
-
24
-
-
2342562830
-
Using Sacks to organize register files in VLIW machines
-
September
-
J. Llosa, M. Valero, J. Fortes, and E. Ayguadé. Using Sacks to organize register files in VLIW machines. In CONPAR 94-VAPP VI, September 1994.
-
(1994)
CONPAR 94-VAPP
, vol.6
-
-
Llosa, J.1
Valero, M.2
Fortes, J.3
Ayguadé, E.4
-
26
-
-
0002017307
-
Instruction-level parallel processing: History, overview and perspective
-
July
-
B. Rau and J. A. Fisher. Instruction-level parallel processing: History, overview and perspective. Journal of Supercomputing, 7(1/2):9-50, July 1993.
-
(1993)
Journal of Supercomputing
, vol.7
, Issue.1-2
, pp. 9-50
-
-
Rau, B.1
Fisher, J.A.2
-
27
-
-
0028768013
-
Iterative modulo scheduling: An algorithm for software pipelining loops
-
November
-
B. R. Rau. Iterative modulo scheduling: An algorithm for software pipelining loops. In Proc. of the 27th Int. Symp. on Microarchitecture (MICRO-27), pages 63-74, November 1994.
-
(1994)
Proc. of the 27th Int. Symp. on Microarchitecture (MICRO-27)
, pp. 63-74
-
-
Rau, B.R.1
-
28
-
-
0034581535
-
Register organization for media processing
-
January
-
S. Rixner, W. Dally, B. Khailany, P. Mattson, U. Kapasi, and J. Owens. Register organization for media processing. In Proc. , 6th High-Performance Computer Architecture (HPCA-6), pages 375-386, January 2000.
-
(2000)
Proc. , 6th High-Performance Computer Architecture (HPCA-6)
, pp. 375-386
-
-
Rixner, S.1
Dally, W.2
Khailany, B.3
Mattson, P.4
Kapasi, U.5
Owens, J.6
-
29
-
-
0017922490
-
CRAY-1 computer system
-
January
-
R. Rusell. CRAY-1 computer system. In Communications of the ACM, vol 21, pages 63-72, January 1978.
-
(1978)
Communications of the ACM
, vol.21
, pp. 63-72
-
-
Rusell, R.1
-
36
-
-
0034462834
-
Two-level hierarchical register file organization for vliw processors
-
December
-
J. Zalamea, J. Llosa, E. Ayguadé, and M. Valero. Two-level hierarchical register file organization for vliw processors. In Proc. of the 33rd Int. Symp. on Microarchitecture (MICRO-33), pages 137-146, December 2000.
-
(2000)
Proc. of the 33rd Int. Symp. on Microarchitecture (MICRO-33)
, pp. 137-146
-
-
Zalamea, J.1
Llosa, J.2
Ayguadé, E.3
Valero, M.4
-
37
-
-
0035691538
-
Modulo scheduling with integrated register spilling for clustered VLIW architectures
-
December
-
J. Zalamea, J. Llosa, E. Ayguadé, and M. Valero. Modulo scheduling with integrated register spilling for clustered VLIW architectures. In Proc. of the 34th Int. Symp. on Microarchitecture (MICRO-34), pages 160-169, December 2001.
-
(2001)
Proc. of the 34th Int. Symp. on Microarchitecture (MICRO-34)
, pp. 160-169
-
-
Zalamea, J.1
Llosa, J.2
Ayguadé, E.3
Valero, M.4
|