-
2
-
-
0025404493
-
Executing a program on the MIT tagged-token dataflow architecture
-
(Mar.)
-
Arvind and Nikhil, R. S. 1990. Executing a program on the MIT tagged-token dataflow architecture. IEEE Transactions on Computing 39, 3 (Mar.), 300-318.
-
(1990)
IEEE Transactions on Computing
, vol.39
, Issue.3
, pp. 300-318
-
-
Arvind1
Nikhil, R.S.2
-
3
-
-
0033722744
-
Piranha: A scalable architecture based on single-chip multiprocessing
-
Barroso, L. A., Gharachorloo, K., McNamara, R., Nowatzyk, A., Qadeer, S., Sano, B., Smith, S., Stets, R., and Verghese, B. 2000. Piranha: A scalable architecture based on single-chip multiprocessing. In Proceedings of the 27th Annual International Symposium on Computer Architecture, 282-293.
-
(2000)
Proceedings of the 27th Annual International Symposium on Computer Architecture
, pp. 282-293
-
-
Barroso, L.A.1
Gharachorloo, K.2
McNamara, R.3
Nowatzyk, A.4
Qadeer, S.5
Sano, B.6
Smith, S.7
Stets, R.8
Verghese, B.9
-
4
-
-
0011891821
-
PACT XPP—A self-reconfigurable data processing architecture
-
Baumgarte, V., May, F., Nückel, A., Vorbach, M., and Weinhardt, M. 2001. PACT XPP—A self-reconfigurable data processing architecture. In 1st International Conference on Engineering of Reconfigurable Systems and Algorithms.
-
(2001)
1st International Conference on Engineering of Reconfigurable Systems and Algorithms
-
-
Baumgarte, V.1
May, F.2
Nückel, A.3
Vorbach, M.4
Weinhardt, M.5
-
5
-
-
11844296497
-
Evaluation of multithreaded architecture for cellular computing
-
Casçaval, C., Castaos, J., Ceze, L., Denneau, M., Gupta, M., Lieber, D., Moreira, J. E., Strauss, K. Jr., 2002. Evaluation of multithreaded architecture for cellular computing. In Proceedings of the 8th International Symposium on High Performance Computer Architecture, 311-322.
-
(2002)
Proceedings of the 8th International Symposium on High Performance Computer Architecture
, pp. 311-322
-
-
Casçaval, C.1
Castaos, J.2
Ceze, L.3
Denneau, M.4
Gupta, M.5
Lieber, D.6
Moreira, J.E.7
Strauss, K.8
-
6
-
-
0026157612
-
IMPACT: An architectural framework for multiple-instruction-issue processors
-
Chang, P. P., Mahlke, S. A., Chen, W. Y., Warter, N. J., and Mei W., Hwu, W. 1991. IMPACT: An architectural framework for multiple-instruction-issue processors. In Proceedings of the 18th Annual International Symposium on Computer Architecture, 266-275.
-
(1991)
Proceedings of the 18th Annual International Symposium on Computer Architecture
, pp. 266-275
-
-
Chang, P.P.1
Mahlke, S.A.2
Chen, W.Y.3
Warter, N.J.4
Mei, W.5
Hwu, W.6
-
7
-
-
0033689702
-
Architectural support for scalable speculative parallelization in shared-memory multiprocessors
-
Cintra, M., Martínez, J. F., and Torrellas, J. 2000. Architectural support for scalable speculative parallelization in shared-memory multiprocessors. In Proceedings of the 27th Annual International Symposium on Computer Architecture, 13-24.
-
(2000)
Proceedings of the 27th Annual International Symposium on Computer Architecture
, pp. 13-24
-
-
Cintra, M.1
Martínez, J.F.2
Torrellas, J.3
-
8
-
-
0030684340
-
Configurable computing: The catalyst for high-performance architectures
-
Ebeling, C., Cronquist, D. C., and Franklin, P. 1997. Configurable computing: The catalyst for high-performance architectures. In International Conference on Application-Specific Systems, Architectures, and Processors, 364-372.
-
(1997)
International Conference on Application-Specific Systems, Architectures, and Processors
, pp. 364-372
-
-
Ebeling, C.1
Cronquist, D.C.2
Franklin, P.3
-
9
-
-
0036292604
-
Tarantula: A vector extension to the alpha architecture
-
Espasa, R., Ardanaz, F., Emer, J., Felix, S., Gago, J., Gramunt, R., Hernandez, I., Juan, T., Lowney, G., Mattina, M., and Seznec, A. 2002. Tarantula: A vector extension to the alpha architecture. In Proceedings ofISCA-29, 281-292.
-
(2002)
Proceedings ofISCA-29
, pp. 281-292
-
-
Espasa, R.1
Ardanaz, F.2
Emer, J.3
Felix, S.4
Gago, J.5
Gramunt, R.6
Hernandez, I.7
Juan, T.8
Lowney, G.9
Mattina, M.10
Seznec, A.11
-
11
-
-
0034174187
-
Piperench: A reconfigurable architecture and compiler
-
(April)
-
Goldstein, S. C., Schmit, H., Budiu, M., Cadambi, S., Moe, M., and Taylor, R. 2000. Piperench: A reconfigurable architecture and compiler. IEEE Computer 33, 4 (April), 70-77.
-
(2000)
IEEE Computer
, vol.33
, Issue.4
, pp. 70-77
-
-
Goldstein, S.C.1
Schmit, H.2
Budiu, M.3
Cadambi, S.4
Moe, M.5
Taylor, R.6
-
12
-
-
0030379515
-
Increasing the instruction fetch rate via block-structured instruction set architectures
-
Hao, E., Chang, P., Evers, M., and Patt, Y. 1996. Increasing the instruction fetch rate via block-structured instruction set architectures. In Proceedings of MICRO-29, 191-200.
-
(1996)
Proceedings of MICRO-29
, pp. 191-200
-
-
Hao, E.1
Chang, P.2
Evers, M.3
Patt, Y.4
-
14
-
-
0030837256
-
Control flow speculation in multiscalar processors
-
Jacobson, Q., Bennett, S., Sharma, N., and Smith, J. E. 1997. Control flow speculation in multiscalar processors. in Proceedings of the 3rd International Symposium on High Performance Computer Architecture, 218-229.
-
(1997)
Proceedings of the 3rd International Symposium on High Performance Computer Architecture
, pp. 218-229
-
-
Jacobson, Q.1
Bennett, S.2
Sharma, N.3
Smith, J.E.4
-
15
-
-
0033299230
-
FlexRAM: Toward an advanced intelligent memory system
-
Kang, Y., Huang, W., Yoo, S.-M., Keen, D., Ge, Z., Lam, V., Pattnaik, P., and Torrellas, J. 1999. FlexRAM: Toward an advanced intelligent memory system. in International Conference on Computer Design, 192-201.
-
(1999)
International Conference on Computer Design
, pp. 192-201
-
-
Kang, Y.1
Huang, W.2
Yoo, S.-M.3
Keen, D.4
Ge, Z.5
Lam, V.6
Pattnaik, P.7
Torrellas, J.8
-
17
-
-
0032639289
-
The alpha 21264 microprocessor
-
(March/April)
-
Kessler, R. 1999. The alpha 21264 microprocessor. IEEE Micro 19, 2 (March/April), 24-36.
-
(1999)
IEEE Micro
, vol.19
, Issue.2
, pp. 24-36
-
-
Kessler, R.1
-
18
-
-
0035271572
-
Imagine: Media processing with streams
-
(March/April)
-
Khailany, B., Dally, W. J., Rixner, S., Kapasi, Ü. J., Mattson, P., Namkoong, J., Owens, J. D., Towles, B., and Chang, A. 2001. Imagine: Media processing with streams. IEEE Micro 21, 2 (March/April), 35-46.
-
(2001)
IEEE Micro
, vol.21
, Issue.2
, pp. 35-46
-
-
Khailany, B.1
Dally, W.J.2
Rixner, S.3
Kapasi, Ü.J.4
Mattson, P.5
Namkoong, J.6
Owens, J.D.7
Towles, B.8
Chang, A.9
-
19
-
-
33845423872
-
An Adaptive, Non-uniform cache structure for wire-delay dominated on-chip caches
-
Kim, C., Burger, D., and Keckler, S. W. 2002. An Adaptive, Non-uniform cache structure for wire-delay dominated on-chip caches. In Proceedings of ASPLOS-10, 211-222.
-
(2002)
Proceedings of ASPLOS-10
, pp. 211-222
-
-
Kim, C.1
Burger, D.2
Keckler, S.W.3
-
21
-
-
0031339427
-
Mediabench: A tool for evaluating and synthesizing multimedia and communications systems
-
Lee, C., Potkonjak, M., and Mangione-Smith, W. H. 1997. Mediabench: A tool for evaluating and synthesizing multimedia and communications systems. in International Symposium on Microarchitecture, 330-335.
-
(1997)
International Symposium on Microarchitecture
, pp. 330-335
-
-
Lee, C.1
Potkonjak, M.2
Mangione-Smith, W.H.3
-
22
-
-
84886709991
-
Vsv: L2-miss-driven variable supply-voltage scaling for low power
-
Li, H., Cher, C.-Y., Vijaykumar, T., and Roy, K. 2003. Vsv: L2-miss-driven variable supply-voltage scaling for low power. In 36th Annual International Symposium on Microarchitecture, 19-28.
-
(2003)
36th Annual International Symposium on Microarchitecture
, pp. 19-28
-
-
Li, H.1
Cher, C.-Y.2
Vijaykumar, T.3
Roy, K.4
-
23
-
-
0002745357
-
Effective Compiler Support for Predicated Execution Using the Hyperblock
-
Mahlke, S. A., Lin, D. C., Chen, W. Y., Hank, R. E., and Bringmann, R. A. 1992. Effective Compiler Support for Predicated Execution Using the Hyperblock. In Proceedings of MICRO-25, 4554.
-
(1992)
Proceedings of MICRO-25
, pp. 4554
-
-
Mahlke, S.A.1
Lin, D.C.2
Chen, W.Y.3
Hank, R.E.4
Bringmann, R.A.5
-
24
-
-
0033688597
-
Smart memories: A modular reconfigurable architecture
-
Mai, K., Paaske, T., Jayasena, N., Ho, R., Dally, W. J., and Horowitz, M. 2000. Smart memories: A modular reconfigurable architecture. In Proceedings of ISCA-27, 161-171.
-
(2000)
Proceedings of ISCA-27
, pp. 161-171
-
-
Mai, K.1
Paaske, T.2
Jayasena, N.3
Ho, R.4
Dally, W.J.5
Horowitz, M.6
-
25
-
-
0006639334
-
Performance benefits of large execution atomic units in dynamically scheduled machines
-
Melvin, S. and Patt, Y. 1989. Performance benefits of large execution atomic units in dynamically scheduled machines. in 3rd International Conference on Supercomputing, 427-432.
-
(1989)
3rd International Conference on Supercomputing
, pp. 427-432
-
-
Melvin, S.1
Patt, Y.2
-
26
-
-
0035693945
-
A design space evaluation of grid processor architectures
-
Nagarajan, R., Sankaralingam, K., Burger, D., and Keckler, S. W. 2001. A design space evaluation of grid processor architectures. In Proceedings of MICRO-34, 40-51.
-
(2001)
Proceedings of MICRO-34
, pp. 40-51
-
-
Nagarajan, R.1
Sankaralingam, K.2
Burger, D.3
Keckler, S.W.4
-
27
-
-
0031594009
-
Active pages: A computation model for intelligent memory
-
Oskin, M., Chong, F. T., and Sherwood, T. 1998. Active pages: A computation model for intelligent memory. In Proceedings of the 25th International Symposium on Computer Architecture, 192-203.
-
(1998)
Proceedings of the 25th International Symposium on Computer Architecture
, pp. 192-203
-
-
Oskin, M.1
Chong, F.T.2
Sherwood, T.3
-
28
-
-
0032207001
-
The pews microarchitecture: Reducing complexity through data dependence based decentralization
-
(November)
-
Ranganathan, N. and Franklin, M. 1998. The pews microarchitecture: Reducing complexity through data dependence based decentralization. Microprocessors and Microsystems 22, 6 (November), 333-343.
-
(1998)
Microprocessors and Microsystems
, vol.22
, Issue.6
, pp. 333-343
-
-
Ranganathan, N.1
Franklin, M.2
-
29
-
-
4243514480
-
Combining hyperblocks and exit prediction to increase front-end bandwidth and performance
-
Department of Computer Sciences, The University of Texas at Austin
-
Ranganathan, N., Nagarajan, R., Burger, D., and Keckler, S. W. 2002. Combining hyperblocks and exit prediction to increase front-end bandwidth and performance. Tech. Rep. TR-02-41, Department of Computer Sciences, The University of Texas at Austin.
-
(2002)
Tech. Rep. TR-02-41
-
-
Ranganathan, N.1
Nagarajan, R.2
Burger, D.3
Keckler, S.W.4
-
30
-
-
0002017307
-
Instruction-level parallel processing: History, overview, and perspective
-
Rau, B. R. and Fisher, J. A. 1993. Instruction-level parallel processing: History, overview, and perspective. Journal of Supercomputing 7, 9-50.
-
(1993)
Journal of Supercomputing
, vol.7
, pp. 9-50
-
-
Rau, B.R.1
Fisher, J.A.2
-
31
-
-
0032312385
-
A bandwidth-efficient architecture for media processing
-
Rixner, S., Dally, W. J., Kapasi, U. J., Khailany, B., Lopez-Lagunas, A., Mattson, P. R., and Owens, J. D. 1998. A bandwidth-efficient architecture for media processing. In Proceedings on the 31st International Symposium on Microarchitecture, 3-13.
-
(1998)
Proceedings on the 31st International Symposium on Microarchitecture
, pp. 3-13
-
-
Rixner, S.1
Dally, W.J.2
Kapasi, U.J.3
Khailany, B.4
Lopez-Lagunas, A.5
Mattson, P.R.6
Owens, J.D.7
-
32
-
-
84944402628
-
Universal mechanisms for data-parallel architectures
-
Sankaralingam, K., Keckler, S. W., Mark, W. R., and Burger, D. 2003. Universal mechanisms for data-parallel architectures. In Proceedings of MICRO-36, 303-314.
-
(2003)
Proceedings of MICRO-36
, pp. 303-314
-
-
Sankaralingam, K.1
Keckler, S.W.2
Mark, W.R.3
Burger, D.4
-
33
-
-
33746585048
-
Dynamic frequency and voltage control for a multiple clock domain microarchitecture
-
Semeraro, G., Albonesi, D., Dropsho, S., Magklis, G., Dwarkadas, S., and Scott, M. 2002. Dynamic frequency and voltage control for a multiple clock domain microarchitecture. In 35th International Symposium on Microarchitecture, 356-367.
-
(2002)
35th International Symposium on Microarchitecture
, pp. 356-367
-
-
Semeraro, G.1
Albonesi, D.2
Dropsho, S.3
Magklis, G.4
Dwarkadas, S.5
Scott, M.6
-
34
-
-
84944387421
-
Scalable memory disambiguation for high ilp processors
-
Sethumadhavan, S., Desikan, R., Burger, D., Moore, C. R., and Keckler, S. W. 2003. Scalable memory disambiguation for high ilp processors. In 36th International Symposium on Microarchitecture, 399-410.
-
(2003)
36th International Symposium on Microarchitecture
, pp. 399-410
-
-
Sethumadhavan, S.1
Desikan, R.2
Burger, D.3
Moore, C.R.4
Keckler, S.W.5
-
35
-
-
33845437061
-
Automatically characterizing large scale program behavior
-
Sherwood, T., Perelman, E., Hamerly, G., and Calder, B. 2002. Automatically characterizing large scale program behavior. In International Conference on Architectural Support for Programming Languages and Operating Systems, 45-57.
-
(2002)
International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 45-57
-
-
Sherwood, T.1
Perelman, E.2
Hamerly, G.3
Calder, B.4
-
36
-
-
0029178210
-
Multiscalar processors
-
Sohi, G. S., Breach, S. E., and Vijaykumar, T. N. 1995. Multiscalar processors. In Proceedings of the 22nd International Symposium on Computer Architecture, 414-425.
-
(1995)
Proceedings of the 22nd International Symposium on Computer Architecture
, pp. 414-425
-
-
Sohi, G.S.1
Breach, S.E.2
Vijaykumar, T.N.3
-
37
-
-
0033703889
-
A scalable approach to thread-level speculation
-
Steffan, J. G., Colohan, C. B., Zhai, A., and Mowry, T. C. 2000. A scalable approach to thread-level speculation. In Proceedings of the 27th Annual International Symposium on Computer Architecture, 1-12.
-
(2000)
Proceedings of the 27th Annual International Symposium on Computer Architecture
, pp. 1-12
-
-
Steffan, J.G.1
Colohan, C.B.2
Zhai, A.3
Mowry, T.C.4
-
39
-
-
84944392428
-
Wavescalar
-
Swanson, S., Michelson, K., Schwerin, A., and Oskin, M. 2003. Wavescalar. In 36th Annual International Symposium on Microarchitecture, 291-302.
-
(2003)
36th Annual International Symposium on Microarchitecture
, pp. 291-302
-
-
Swanson, S.1
Michelson, K.2
Schwerin, A.3
Oskin, M.4
-
40
-
-
0038289667
-
Bottlenecks in multimedia processing with SIMD style extensions and architectural enhancements
-
Talla, D., John, L., and Burger, D. 2003. Bottlenecks in multimedia processing with SIMD style extensions and architectural enhancements. IEEE Transactions on Computers 52, 8, 35-46.
-
(2003)
IEEE Transactions on Computers
, vol.52
, Issue.8
, pp. 35-46
-
-
Talla, D.1
John, L.2
Burger, D.3
-
41
-
-
0036298603
-
POWER4 system microarchitecture
-
(Jan)
-
Tendler, J. M., Dodson, J. S., Fields, J., Le, H., and Sinharoy, B. 2001. POWER4 system microarchitecture. IBM Journal of Research and Development 26, 1 (Jan), 5-26.
-
(2001)
IBM Journal of Research and Development
, vol.26
, Issue.1
, pp. 5-26
-
-
Tendler, J.M.1
Dodson, J.S.2
Fields, J.3
Le, H.4
Sinharoy, B.5
-
42
-
-
85024284553
-
Mnemonic Instruction Set
-
March 2001.
-
TMS320C54X. DSP Reference Set, Volume 2: Mnemonic Instruction Set, Literature Number: SPRU172C, March 2001.
-
Literature Number: SPRU172C
, vol.2
-
-
-
43
-
-
0029200683
-
Simultaneous multithreading: Maximizing on-chip parallelism
-
Tullsen, D. M., Eggers, S. J., and Levy, H. M. 1995. Simultaneous multithreading: Maximizing on-chip parallelism. In Proceedings of ISCA-22, 392-403.
-
(1995)
Proceedings of ISCA-22
, pp. 392-403
-
-
Tullsen, D.M.1
Eggers, S.J.2
Levy, H.M.3
-
44
-
-
0031236158
-
Baring it all to software: RAW machines
-
(Sept)
-
Waingold, E., Taylor, M., Srikrishna, D., Sarkar, V., Lee, W., Lee, V., Kim, J., Frank, M., Finch, P., Barua, R., Babb, J., Amarasinghe, S., and Agarwal, A. 1997. Baring it all to software: RAW machines. IEEE Computer 30, 9 (Sept), 86-93.
-
(1997)
IEEE Computer
, vol.30
, Issue.9
, pp. 86-93
-
-
Waingold, E.1
Taylor, M.2
Srikrishna, D.3
Sarkar, V.4
Lee, W.5
Lee, V.6
Kim, J.7
Frank, M.8
Finch, P.9
Barua, R.10
Babb, J.11
Amarasinghe, S.12
Agarwal, A.13
|