-
3
-
-
0037619265
-
Web search for a planet: the google cluster architecture
-
Barroso, L.A. Dean, J. and Holzle, U. (2003) ‘Web search for a planet: the google cluster architecture’, IEEE Micro, March–April, pp.22–28.
-
(2003)
IEEE Micro
, vol.March–April
, pp. 22-28
-
-
Barroso, L.A.1
Dean, J.2
Holzle, U.3
-
4
-
-
0033722744
-
Piranha: a scalable architecture based on single-chip multiprocessing
-
Barroso, L.A., Gharachorloo, K., McNamara, R., Nowatzyk, A., Qadeer, S., Sano, B., Smith, S., Stets, R. and Verghese, B. (2000) ‘Piranha: a scalable architecture based on single-chip multiprocessing’, Proceedings of the 27th Annual International Symposium on Computer Architecture, June, pp.282–293.
-
(2000)
Proceedings of the 27th Annual International Symposium on Computer Architecture
, vol.June
, pp. 282-293
-
-
Barroso, L.A.1
Gharachorloo, K.2
McNamara, R.3
Nowatzyk, A.4
Qadeer, S.5
Sano, B.6
Smith, S.7
Stets, R.8
Verghese, B.9
-
5
-
-
0036149420
-
Networks on chip: a new SoC paradigm
-
January
-
Benini, L. and Micheli, G.D. (2002) ‘Networks on chip: a new SoC paradigm’, IEEE Computer, Vol. 35, No. 1, January, pp.70–80.
-
(2002)
IEEE Computer
, vol.35
, Issue.1
, pp. 70-80
-
-
Benini, L.1
Micheli, G.D.2
-
7
-
-
0034848112
-
Route packets, not wires: on-chip interconnection networks
-
ACM
-
Dally, W.J. and Towles, B. (2001) ‘Route packets, not wires: on-chip interconnection networks’, Proceedings of the Design Automation Conference (DAC), ACM, June, pp.684–689.
-
(2001)
Proceedings of the Design Automation Conference (DAC)
, vol.June
, pp. 684-689
-
-
Dally, W.J.1
Towles, B.2
-
8
-
-
0002529086
-
Compaq chooses SMT for alpha
-
December
-
Diefendorff, K. (1999) ‘Compaq chooses SMT for alpha’, Microprocessor Report, December, Vol. 13, No. 16, pp.5–11.
-
(1999)
Microprocessor Report
, vol.13
, Issue.16
, pp. 5-11
-
-
Diefendorff, K.1
-
9
-
-
84950303625
-
A theory for deadlock-free dynamic network reconfiguration
-
Duato, J., Lysne, O., Pang, R. and Pinkston, T.M. (2003) ‘A theory for deadlock-free dynamic network reconfiguration’, USC Technical Report CENG-2003-06, Vol. (available at www.usc.edu/dept/ceng/pinkston/SMART.html), pp.1–30.
-
(2003)
USC Technical Report CENG-2003-06
, pp. 1-30
-
-
Duato, J.1
Lysne, O.2
Pang, R.3
Pinkston, T.M.4
-
10
-
-
0031374601
-
The multi-cluster architecture: reducing cycle time through partitioning
-
Farkas, K.I., Chow, P., Jouppi, N.P. and Vranesic, Z.G. (1997) ‘The multi-cluster architecture: reducing cycle time through partitioning’, Proceedings of International Symposium on Microarchitecture, December, pp.149–159.
-
(1997)
Proceedings of International Symposium on Microarchitecture
, vol.December
, pp. 149-159
-
-
Farkas, K.I.1
Chow, P.2
Jouppi, N.P.3
Vranesic, Z.G.4
-
11
-
-
0029547346
-
The M-machine multicomputer
-
Fillo, M., Keckler, S.W., Dally, W.J., Carter, N.P., Chang, A., Gurevich, Y. and Lee, W.S. (1995) ‘The M-machine multicomputer’, Proceedings of the 28th International Symposium Microarchitecture, December, pp.146–156.
-
(1995)
Proceedings of the 28th International Symposium Microarchitecture
, vol.December
, pp. 146-156
-
-
Fillo, M.1
Keckler, S.W.2
Dally, W.J.3
Carter, N.P.4
Chang, A.5
Gurevich, Y.6
Lee, W.S.7
-
12
-
-
12344257052
-
IBM raises curtain on Power5
-
October
-
Glaskowsky, P.N. (2003) ‘IBM raises curtain on Power5’, Microprocessor Report, October, Vol. 17, No. 10, pp.13, 14.
-
(2003)
Microprocessor Report
, vol.17
, Issue.10
, pp. 13-14
-
-
Glaskowsky, P.N.1
-
14
-
-
0033880036
-
The stanford hydra CMP
-
February
-
Hammond, L., Hubbert, B.A., Siu, M., Prabhu, M.K., Chen, M. and Olukotun, K. (2000) ‘The stanford hydra CMP’, IEEE Micro, February, Vol. 20, No. 2, pp.71–84.
-
(2000)
IEEE Micro
, vol.20
, Issue.2
, pp. 71-84
-
-
Hammond, L.1
Hubbert, B.A.2
Siu, M.3
Prabhu, M.K.4
Chen, M.5
Olukotun, K.6
-
15
-
-
33646922057
-
The future of wires
-
April
-
Ho, R., Mai, K.W. and Horowitz, M.A. (2001) ‘The future of wires’, Proceedings of the IEEE, Vol. 89, No. 4, April, pp.490–504.
-
(2001)
Proceedings of the IEEE
, vol.89
, Issue.4
, pp. 490-504
-
-
Ho, R.1
Mai, K.W.2
Horowitz, M.A.3
-
16
-
-
84955516546
-
A methodology for designing efficient on-chip interconnects on well-behaved communication patterns
-
IEEE Computer Society Press
-
Ho, W.H. and Pinkston, T.M. (2003) ‘A methodology for designing efficient on-chip interconnects on well-behaved communication patterns’, Proceedings of the 9th International Symposium on High-Performance Computer Architecture, IEEE Computer Society Press, February, pp.377–388.
-
(2003)
Proceedings of the 9th International Symposium on High-Performance Computer Architecture
, vol.February
, pp. 377-388
-
-
Ho, W.H.1
Pinkston, T.M.2
-
17
-
-
84893760422
-
Exploiting the routing flexibility for energy/performance aware mapping of regular NoC architectures
-
Munich, Germany, March
-
Hu, J. and Marculescu, R. (2003) ‘Exploiting the routing flexibility for energy/performance aware mapping of regular NoC architectures’, Proceedings of Design, Automation and Test in Europe Conference, Munich, Germany, March.
-
(2003)
Proceedings of Design, Automation and Test in Europe Conference
-
-
Hu, J.1
Marculescu, R.2
-
19
-
-
0032639289
-
The alpha 21264 microprocessor
-
March–April
-
Kessler, R.E. (1999) ‘The alpha 21264 microprocessor’, IEEE Micro, March–April, Vol. 19, No. 2, pp.24–36.
-
(1999)
IEEE Micro
, vol.19
, Issue.2
, pp. 24-36
-
-
Kessler, R.E.1
-
20
-
-
0036949388
-
An adaptive, non-uniform cache structure for wire-dominated on-chip caches
-
Kim, C., Burger, D. and Keckler, S.W. (2002) ‘An adaptive, non-uniform cache structure for wire-dominated on-chip caches’, Proceedings of International Conference on Architectural Support for Programming Languages and Operating Systems, October, pp.211–222.
-
(2002)
Proceedings of International Conference on Architectural Support for Programming Languages and Operating Systems
, vol.October
, pp. 211-222
-
-
Kim, C.1
Burger, D.2
Keckler, S.W.3
-
21
-
-
0034818863
-
Multiplex: unifying conventional and speculative thread-level parallelism on a chip multiprocessor
-
Kim, S.W., Ooi, C-L., Park, I., Eigenmann, R., Falsafi, B. and Vijaykumar, T.N. (2001) ‘Multiplex: unifying conventional and speculative thread-level parallelism on a chip multiprocessor’, Proceedings of International Conference on Supercomputing, June, pp.368–380.
-
(2001)
Proceedings of International Conference on Supercomputing
, vol.June
, pp. 368-380
-
-
Kim, S.W.1
Ooi, C.-L.2
Park, I.3
Eigenmann, R.4
Falsafi, B.5
Vijaykumar, T.N.6
-
22
-
-
0038633602
-
Hyperthreading technology in the netburst microarchitecture
-
March–April
-
Koufaty, D. and Marr, D.T. (2003) ‘Hyperthreading technology in the netburst microarchitecture’, IEEE Micro, March–April, Vol. 23, No. 2, pp.56–65.
-
(2003)
IEEE Micro
, vol.23
, Issue.2
, pp. 56-65
-
-
Koufaty, D.1
Marr, D.T.2
-
23
-
-
27544475709
-
Sun’s niagara pours on the cores
-
Krewell, K. (2004) ‘Sun’s niagara pours on the cores’, Microprocessor Report, September, pp.1–3.
-
(2004)
Microprocessor Report
, vol.September
, pp. 1-3
-
-
Krewell, K.1
-
25
-
-
0034512994
-
aSoC: a scalable, single chip communications architecture
-
Liang, J., Swaminathan, S. and Tessier, R. (2000) ‘aSoC: a scalable, single chip communications architecture’, International Conference on Parallel Architectures and Compilation Techniques, October, pp.37–46.
-
(2000)
International Conference on Parallel Architectures and Compilation Techniques
, vol.October
, pp. 37-46
-
-
Liang, J.1
Swaminathan, S.2
Tessier, R.3
-
26
-
-
84949497612
-
Fast dynamic reconfiguration in irregular networks
-
IEEE Computer Society, August 2000
-
Lysne, O. and Duato, J. (2005) ‘Fast dynamic reconfiguration in irregular networks’, The 2000 International Conference on Parallel Processing, IEEE Computer Society, August 2000, pp.449–458.
-
(2005)
The 2000 International Conference on Parallel Processing
, pp. 449-458
-
-
Lysne, O.1
Duato, J.2
-
27
-
-
77951115346
-
A methodology for developing dynamic network reconfiguration processes
-
IEEE Press
-
Lysne, O., Pinkston, T.M. and Duato, J.(2003) ‘A methodology for developing dynamic network reconfiguration processes’, Proceedings of the International Conference on Parallel Processing (ICPP’03), IEEE Press, October, pp.77–86.
-
(2003)
Proceedings of the International Conference on Parallel Processing (ICPP’03)
, vol.October
, pp. 77-86
-
-
Lysne, O.1
Pinkston, T.M.2
Duato, J.3
-
28
-
-
0035311079
-
Power: a first-class architectural design constraint
-
April
-
Mudge, T. (2001) ‘Power: a first-class architectural design constraint’, IEEE Computer, Vol. 34, No. 4, April, pp.52–58.
-
(2001)
IEEE Computer
, vol.34
, Issue.4
, pp. 52-58
-
-
Mudge, T.1
-
29
-
-
84950155377
-
The alpha 21364 network architecture
-
IEEE Computer Society Press
-
Mukherjee, S.S., Bannon, P., Lang, S., Spink, A. and Webb, D. (2001) ‘The alpha 21364 network architecture’, Symposium on High Performance Interconnects (HOT Interconnects 9), IEEE Computer Society Press, August, pp.113–117.
-
(2001)
Symposium on High Performance Interconnects (HOT Interconnects 9)
, vol.August
, pp. 113-117
-
-
Mukherjee, S.S.1
Bannon, P.2
Lang, S.3
Spink, A.4
Webb, D.5
-
30
-
-
0035693945
-
A design space evaluation of grid processor architectures
-
Nagarajan, R., Sankaralingam, K., Burger, D. and Keckler, S.W. (2001) ‘A design space evaluation of grid processor architectures’, Proceedings of the 34th Annual International Symposium on Microarchitecture, December, pp.40–51.
-
(2001)
Proceedings of the 34th Annual International Symposium on Microarchitecture
, vol.December
, pp. 40-51
-
-
Nagarajan, R.1
Sankaralingam, K.2
Burger, D.3
Keckler, S.W.4
-
31
-
-
0003926726
-
-
University of Wisconsin, Madison, Tech. Rep. CS-TR-1996-1328, November
-
Palacharla, S., Jouppi, N.P. and Smith, J.E. (1996) ‘Quantifying the complexity of superscalar processors’, University of Wisconsin, Madison, Tech. Rep. CS-TR-1996-1328, November.
-
(1996)
Quantifying the complexity of superscalar processors
-
-
Palacharla, S.1
Jouppi, N.P.2
Smith, J.E.3
-
33
-
-
84948777317
-
Efficient interconnects for clustered micro architectures
-
September
-
Parcerisa, J-M., Sahuquillo, J., Gonzalez, A. and Duato, J. (2002) ‘Efficient interconnects for clustered micro architectures’, Proceedings of 2002 International Conference on Parallel Architectures and Compilation Techniques, September.
-
(2002)
Proceedings of 2002 International Conference on Parallel Architectures and Compilation Techniques
-
-
Parcerisa, J.-M.1
Sahuquillo, J.2
Gonzalez, A.3
Duato, J.4
-
34
-
-
14844360899
-
Special issue on on-chip networks
-
February
-
Peh, L-S. and Pinkston, T.M. (2005) ‘Special issue on on-chip networks’, IEEE Transactions on Parallel and Distributed Systems, Vol. 16, No. 2, February.
-
(2005)
IEEE Transactions on Parallel and Distributed Systems
, vol.16
, Issue.2
-
-
Peh, L.-S.1
Pinkston, T.M.2
-
35
-
-
84950306169
-
What will have the greatest impact in 2010 processor, memory or interconnect architecture?
-
(Panel Moderator) February
-
Pinkston T.M. (2002) (Panel Moderator) ‘What will have the greatest impact in 2010 processor, memory or interconnect architecture?’, Proceedings of the 8th Int’l Symp. on High Performance Computer Architecture www.usc.edu/dept/ceng/pinkston/presentations/statistic.html, February.
-
(2002)
Proceedings of the 8th Int’l Symp. on High Performance Computer Architecture
-
-
Pinkston, T.M.1
-
36
-
-
0141724933
-
Deadlock-free dynamic reconfiguration schemes for increased network dependability
-
August
-
Pinkston, T.M., Pang, R. and Duato, J. (2003) ‘Deadlock-free dynamic reconfiguration schemes for increased network dependability’, IEEE Transactions on Parallel and Distributed Systems, August, Vol. 14, No. 8, pp.780–794.
-
(2003)
IEEE Transactions on Parallel and Distributed Systems
, vol.14
, Issue.8
, pp. 780-794
-
-
Pinkston, T.M.1
Pang, R.2
Duato, J.3
-
38
-
-
0037669851
-
Exploiting ILP, TLP, and DLP with the polymorphous TRIPS architecture
-
Sankaralingam, K., Nagarajan, R., Liu, H., Kim, C., Huh, J., Burger, D., Keckler, S.W. and Moore, C.R. (2003) ‘Exploiting ILP, TLP, and DLP with the polymorphous TRIPS architecture’, Proceedings of the International Symposium on Computer Architecture, June, pp.422–433.
-
(2003)
Proceedings of the International Symposium on Computer Architecture
, vol.June
, pp. 422-433
-
-
Sankaralingam, K.1
Nagarajan, R.2
Liu, H.3
Kim, C.4
Huh, J.5
Burger, D.6
Keckler, S.W.7
Moore, C.R.8
-
39
-
-
0034846659
-
Addressing the system-on-a-chip interconnect woes through communication-based design
-
Sgroi, M., Sheets, M., Mihal, A., Keutzer, K., Malik, S., Rabaey, J.M. and Sangiovanni-Vincentelli, A.L. (2001) ‘Addressing the system-on-a-chip interconnect woes through communication-based design’, Proceedings of Design Automation Conference, June, pp.667–672.
-
(2001)
Proceedings of Design Automation Conference
, vol.June
, pp. 667-672
-
-
Sgroi, M.1
Sheets, M.2
Mihal, A.3
Keutzer, K.4
Malik, S.5
Rabaey, J.M.6
Sangiovanni-Vincentelli, A.L.7
-
40
-
-
85008055290
-
Power-efficient interconnection networks: dynamic voltage scaling with links
-
May
-
Shang, L., Peh, L-S. and Jha, N.K. (2002) ‘Power-efficient interconnection networks: dynamic voltage scaling with links’, Computer Architecture Letters, May, Vol. 1, No. 2, pp.1–4.
-
(2002)
Computer Architecture Letters
, vol.1
, Issue.2
, pp. 1-4
-
-
Shang, L.1
Peh, L.-S.2
Jha, N.K.3
-
41
-
-
0031234685
-
Trace processors: moving to fourth-generation micro architectures
-
September
-
Smith, J.E. and Vajapeyam, S. (1997) ‘Trace processors: moving to fourth-generation micro architectures’, IEEE Computer, September, Vol. 30, No. 9, pp.68–74.
-
(1997)
IEEE Computer
, vol.30
, Issue.9
, pp. 68-74
-
-
Smith, J.E.1
Vajapeyam, S.2
-
42
-
-
0029178210
-
Multiscalar processors
-
Sohi, G.S., Breach, S.E. and Vijaykumar, T.N. (1995) ‘Multiscalar processors’, Proceedings of the 22nd Annual International Symposium on Computer Architecture, June, pp.414–425.
-
(1995)
Proceedings of the 22nd Annual International Symposium on Computer Architecture
, vol.June
, pp. 414-425
-
-
Sohi, G.S.1
Breach, S.E.2
Vijaykumar, T.N.3
-
43
-
-
0003588510
-
MAJC architecture tutorial
-
September
-
Sun microsystems (1999) ‘MAJC architecture tutorial’, White Paper, September.
-
(1999)
White Paper
-
-
Sun microsystems1
-
45
-
-
0036505033
-
The raw microprocessor: a computational fabric for software circuits and general-purpose programs
-
March–April
-
Taylor, M., Kim, J., Miller, J., Wentzlaff, D., Ghodrat, F., Greenwald, B., Hoffman, H., Johnson, P., Lee, J-W., Lee, W., Ma, A., Saraf, A., Seneski, M., Shnidman, N., Strumpen, V., Frank, M., Amarasinghe, S., and Agarwal, A. (2002) ‘The raw microprocessor: a computational fabric for software circuits and general-purpose programs’, IEEE Micro, March–April, Vol. 22, No. 2, pp.25–35.
-
(2002)
IEEE Micro
, vol.22
, Issue.2
, pp. 25-35
-
-
Taylor, M.1
Kim, J.2
Miller, J.3
Wentzlaff, D.4
Ghodrat, F.5
Greenwald, B.6
Hoffman, H.7
Johnson, P.8
Lee, J.-W.9
Lee, W.10
Ma, A.11
Saraf, A.12
Seneski, M.13
Shnidman, N.14
Strumpen, V.15
Frank, M.16
Amarasinghe, S.17
Agarwal, A.18
-
46
-
-
84955456130
-
Scalar operand networks: on-chip interconnect for ILP in partitioned architectures
-
IEEE Computer Society Press
-
Taylor, M.B., Lee, W., Amarasinghe, S. and Agarwal, S. (2003) ‘Scalar operand networks: on-chip interconnect for ILP in partitioned architectures’, Proceedings of the 9th International Symposium on High-Performance Computer Architecture, IEEE Computer Society Press, February, pp.341–353.
-
(2003)
Proceedings of the 9th International Symposium on High-Performance Computer Architecture
, vol.February
, pp. 341-353
-
-
Taylor, M.B.1
Lee, W.2
Amarasinghe, S.3
Agarwal, S.4
-
47
-
-
0036298603
-
POWER4 system microarchitecture
-
January
-
Tendler, J.M., Dodson, S., Fields, S., Le, H. and Sinharoy, B. (2002) ‘POWER4 system microarchitecture’, IBM Journal of Research and Development, Vol. 46, No. 1, January, pp.5–26.
-
(2002)
IBM Journal of Research and Development
, vol.46
, Issue.1
, pp. 5-26
-
-
Tendler, J.M.1
Dodson, S.2
Fields, S.3
Le, H.4
Sinharoy, B.5
-
48
-
-
0033344478
-
The Super-threaded processor architecture
-
September
-
Tsai, J-Y., Huang, J., Amlo, C., Lilja, D.J. and Yew, P-C. (1999) ‘The Super-threaded processor architecture’, IEEE Transactions on Computers, September, Vol. 48, No. 9, pp.881–902.
-
(1999)
IEEE Transactions on Computers
, vol.48
, Issue.9
, pp. 881-902
-
-
Tsai, J.-Y.1
Huang, J.2
Amlo, C.3
Lilja, D.J.4
Yew, P.-C.5
-
49
-
-
0029200683
-
Simultaneous multithreading: maximizing on-chip parallelism
-
Tullsen, D.M., Eggers, S. and Levy, H.M. (1995) ‘Simultaneous multithreading: maximizing on-chip parallelism’, Proceedings of the 22rd International Symposium on Computer Architecture, June, pp.392–403.
-
(1995)
Proceedings of the 22rd International Symposium on Computer Architecture
, vol.June
, pp. 392-403
-
-
Tullsen, D.M.1
Eggers, S.2
Levy, H.M.3
-
50
-
-
84862144932
-
Power-driven design of router micro-architectures in on-chip networks
-
IEEE Computer Society
-
Wang, H., Peh, L-H. and Malik, S. (2003) ‘Power-driven design of router micro-architectures in on-chip networks’, Proceedings of the 36th Annual IEEE/ACM International Symposium on Microarchitecture, IEEE Computer Society, December, pp.105–116.
-
(2003)
Proceedings of the 36th Annual IEEE/ACM International Symposium on Microarchitecture
, vol.December
, pp. 105-116
-
-
Wang, H.1
Peh, L.-H.2
Malik, S.3
-
52
-
-
84950303681
-
-
In this paper, we restrict ourselves to microsystems built from CMOS technology as opposed to other more exotic nanotechnologies, e.g., MEMs, molecular and quantum computing technologies, etc
-
In this paper, we restrict ourselves to microsystems built from CMOS technology as opposed to other more exotic nanotechnologies, e.g., MEMs, molecular and quantum computing technologies, etc.
-
-
-
-
53
-
-
84950304225
-
-
Power consumption issues are not considered in this work. For more discussion on the impact of technology scaling and architectural techniques on power consumption, the interested reader is directed to Zyuban and Kogge (2000), Mudge (2001) and Wang et al. (2003)
-
Power consumption issues are not considered in this work. For more discussion on the impact of technology scaling and architectural techniques on power consumption, the interested reader is directed to Zyuban and Kogge (2000), Mudge (2001) and Wang et al. (2003).
-
-
-
-
54
-
-
84950305046
-
-
The designation ‘FO4’ (or fan-out of four) is the delay (e.g., in picoseconds) of an inverter loaded by four identical inverters for a given technology
-
The designation ‘FO4’ (or fan-out of four) is the delay (e.g., in picoseconds) of an inverter loaded by four identical inverters for a given technology.
-
-
-
-
55
-
-
84950306412
-
-
Worst case environmental conditions of high temperature and low supply voltage were assumed
-
Worst case environmental conditions of high temperature and low supply voltage were assumed.
-
-
-
-
56
-
-
84950306515
-
-
The tile size in Table 1 is equivalent to that used in the Raw architecture (Taylor et al., 2002), which is 4 mm × 4 µm in 0.15 μm technology for 16 tiles per chip
-
The tile size in Table 1 is equivalent to that used in the Raw architecture (Taylor et al., 2002), which is 4 mm × 4 µm in 0.15 μm technology for 16 tiles per chip.
-
-
-
-
57
-
-
84950304469
-
-
This processor architecture project was canceled before any chips were ever built
-
This processor architecture project was canceled before any chips were ever built.
-
-
-
-
58
-
-
84950304006
-
-
We know of no hybrid designs that allow this as of yet, however these may emerge in the near future if the required logic complexity remains below the critical wire-limited bound
-
We know of no hybrid designs that allow this as of yet, however these may emerge in the near future if the required logic complexity remains below the critical wire-limited bound.
-
-
-
-
59
-
-
84950304914
-
-
For completion, also shown are some resources for communicating instructions for completion, but we focus on data communication
-
For completion, also shown are some resources for communicating instructions for completion, but we focus on data communication.
-
-
-
|