SCOPUS Information Search Platform

ACM Transactions on Architecture and Code Optimization

Volume 1, Issue 1, 2004, Pages 62-93

TRIPS: A Polymorphous Architecture for Exploiting ILP, TLP, and DLP

(10) Sankaralingam, Karthikeyan a; Nagarajan, Ramadass a; Liu, Haiming a; Kim, Changkyu a; Huh, Jaehyuk a; Ranganathan, Nitya a; Burger, Doug a; Keckler, Stephen W. a; McDonald, Robert G. a; Moore, Charles R. a

a University of Texas at Austin (United States)

Author keywords

Computer Architecture; Computer Systems; Configurable Computing; High Performance Computing; Scalable

Indexed keywords

EID: 84905483003 PISSN: 15443566 EISSN: 15443973 Source Type: Journal
DOI: 10.1145/980152.980156 Document Type: Article

Times cited : (67)

References (45)

1
- 0031594003
- Dynamic ipc/clock rate optimization
- Albonesi, D. 1998. Dynamic ipc/clock rate optimization. In 25th International Symposium on Computer Architecture, 282-292.
- (1998) 25th International Symposium on Computer Architecture , pp. 282-292
- Albonesi, D.¹

2
- 0025404493
- Executing a program on the MIT tagged-token dataflow architecture
- (Mar.)
- Arvind and Nikhil, R. S. 1990. Executing a program on the MIT tagged-token dataflow architecture. IEEE Transactions on Computers 39, 3 (Mar.), 300-318.
- (1990) IEEE Transactions on Computers , vol.39 , Issue.3 , pp. 300-318
- Arvind¹ Nikhil, R.S.²

3
- 0033722744
- Piranha: A scalable architecture based on single-chip multiprocessing
- Barroso, L. A., Gharachorloo, K., McNamara, R., Nowatzyk, A., Qadeer, S., Sano, B., Smith, S., Stets, R., and Verghese, B. 2000. Piranha: A scalable architecture based on single-chip multiprocessing. In Proceedings of the 27th Annual International Symposium on Computer Architecture, 282-293.
- (2000) Proceedings of the 27th Annual International Symposium on Computer Architecture , pp. 282-293
- Barroso, L.A.¹ Gharachorloo, K.² McNamara, R.³ Nowatzyk, A.⁴ Qadeer, S.⁵ Sano, B.⁶ Smith, S.⁷ Stets, R.⁸ Verghese, B.⁹

4
- 0011891821
- PACT XPP—A self-reconfigurable data processing architecture
- Baumgarte, V., May, F., Nückel, A., Vorbach, M., and Weinhardt, M. 2001. PACT XPP—A self-reconfigurable data processing architecture. In 1st International Conference on Engineering of Reconfigurable Systems and Algorithms.
- (2001) 1st International Conference on Engineering of Reconfigurable Systems and Algorithms
- Baumgarte, V.¹ May, F.² Nückel, A.³ Vorbach, M.⁴ Weinhardt, M.⁵

5
- 11844296497
- Evaluation of multithreaded architecture for cellular computing
- Caşcaval, C., Castaños, J., Ceze, L., Denneau, M., Gupta, M., Lieber, D., Moreira, J. E., Strauss, K. Jr., 2002. Evaluation of multithreaded architecture for cellular computing. In Proceedings of the 8th International Symposium on High Performance Computer Architecture, 311-322.
- (2002) Proceedings of the 8th International Symposium on High Performance Computer Architecture , pp. 311-322
- Caşcaval, C.¹ Castaños, J.² Ceze, L.³ Denneau, M.⁴ Gupta, M.⁵ Lieber, D.⁶ Moreira, J.E.⁷ Strauss, K.⁸

6
- 0026157612
- IMPACT: An architectural framework for multiple-instruction-issue processors
- Chang, P. P., Mahlke, S. A., Chen, W. Y., Warter, N. J., and Hwu, W. W. 1991. IMPACT: An architectural framework for multiple-instruction-issue processors. In Proceedings of the 18th Annual International Symposium on Computer Architecture, 266-275.
- (1991) Proceedings of the 18th Annual International Symposium on Computer Architecture , pp. 266-275
- Chang, P.P.¹ Mahlke, S.A.² Chen, W.Y.³ Warter, N.J.⁴ Hwu, W.W.⁵

7
- 0033689702
- Architectural support for scalable speculative parallelization in shared-memory multiprocessors
- Cintra, M., Martínez, J. F., and Torrellas, J. 2000. Architectural support for scalable speculative parallelization in shared-memory multiprocessors. In Proceedings of the 27th Annual International Symposium on Computer Architecture, 13-24.
- (2000) Proceedings of the 27th Annual International Symposium on Computer Architecture , pp. 13-24
- Cintra, M.¹ Martínez, J.F.² Torrellas, J.³

8
- 0030684340
- Configurable computing: The catalyst for high-performance architectures
- Ebeling, C., Cronquist, D. C., and Franklin, P. 1997. Configurable computing: The catalyst for high-performance architectures. In International Conference on Application-Specific Systems, Architectures, and Processors, 364-372.
- (1997) International Conference on Application-Specific Systems, Architectures, and Processors , pp. 364-372
- Ebeling, C.¹ Cronquist, D.C.² Franklin, P.³

9
- 0036292604
- Tarantula: A vector extension to the alpha architecture
- Espasa, R., Ardanaz, F., Emer, J., Felix, S., Gago, J., Gramunt, R., Hernandez, I., Juan, T., Lowney, G., Mattina, M., and Seznec, A. 2002. Tarantula: A vector extension to the alpha architecture. In Proceedings of ISCA-29, 281-292.
- (2002) Proceedings of ISCA-29 , pp. 281-292
- Espasa, R.¹ Ardanaz, F.² Emer, J.³ Felix, S.⁴ Gago, J.⁵ Gramunt, R.⁶ Hernandez, I.⁷ Juan, T.⁸ Lowney, G.⁹ Mattina, M.¹⁰ Seznec, A.¹¹

10
- 4544221243
- Addison-Wesley, Reading, MA.
- Fernando, R. and Kilgard, M. J. 2003. The Cg Tutorial. Addison-Wesley, Reading, MA.
- (2003) The Cg Tutorial.
- Fernando, R.¹ Kilgard, M.J.²

11
- 0034174187
- Piperench: A reconfigurable architecture and compiler
- (April)
- Goldstein, S. C., Schmit, H., Budiu, M., Cadambi, S., Moe, M., and Taylor, R. 2000. Piperench: A reconfigurable architecture and compiler. IEEE Computer 33, 4 (April), 70-77.
- (2000) IEEE Computer , vol.33 , Issue.4 , pp. 70-77
- Goldstein, S.C.¹ Schmit, H.² Budiu, M.³ Cadambi, S.⁴ Moe, M.⁵ Taylor, R.⁶

12
- 0030379515
- Increasing the instruction fetch rate via block-structured instruction set architectures
- Hao, E., Chang, P., Evers, M., and Patt, Y. 1996. Increasing the instruction fetch rate via block-structured instruction set architectures. In Proceedings of MICRO-29, 191-200.
- (1996) Proceedings of MICRO-29 , pp. 191-200
- Hao, E.¹ Chang, P.² Evers, M.³ Patt, Y.⁴

13
- 0022584031
- HPSm, a high performance restricted data flow architecture having minimal functionality
- Hwu, W. and Patt, Y. 1986. HPSm, a high performance restricted data flow architecture having minimal functionality. In Proceedings of the International Symposium on Computer Architecture, 297-306.
- (1986) Proceedings of the International Symposium on Computer Architecture , pp. 297-306
- Hwu, W.¹ Patt, Y.²

14
- 0030837256
- Control flow speculation in multiscalar processors
- Jacobson, Q., Bennett, S., Sharma, N., and Smith, J. E. 1997. Control flow speculation in multiscalar processors. In Proceedings of the 3rd International Symposium on High Performance Computer Architecture, 218-229.
- (1997) Proceedings of the 3rd International Symposium on High Performance Computer Architecture , pp. 218-229
- Jacobson, Q.¹ Bennett, S.² Sharma, N.³ Smith, J.E.⁴

15
- 0033299230
- FlexRAM: Toward an advanced intelligent memory system
- Kang, Y., Huang, W., Yoo, S.-M., Keen, D., Ge, Z., Lam, V., Pattnaik, P., and Torrellas, J. 1999. FlexRAM: Toward an advanced intelligent memory system. In International Conference on Computer Design, 192-201.
- (1999) International Conference on Computer Design , pp. 192-201
- Kang, Y.¹ Huang, W.² Yoo, S.-M.³ Keen, D.⁴ Ge, Z.⁵ Lam, V.⁶ Pattnaik, P.⁷ Torrellas, J.⁸

16
- 84862452827
- Hewlett-Packard Laboratories.
- Kathail, V., Schlansker, M., and Rau, B. R. 2000. HPL-PD architecture specification: Version 1.1. Tech. Rep. HPL-93-80(R.1), Hewlett-Packard Laboratories.
- (2000) HPL-PD architecture specification: Version 1.1. Tech. Rep. HPL-93-80(R.1)
- Kathail, V.¹ Schlansker, M.² Rau, B.R.³

17
- 0032639289
- The alpha 21264 microprocessor
- (March/April)
- Kessler, R. 1999. The alpha 21264 microprocessor. IEEE Micro 19, 2 (March/April), 24-36.
- (1999) IEEE Micro , vol.19 , Issue.2 , pp. 24-36
- Kessler, R.¹

18
- 0035271572
- Imagine: Media processing with streams
- (March/April)
- Khailany, B., Dally, W. J., Rixner, S., Kapasi, U. J., Mattson, P., Namkoong, J., Owens, J. D., Towles, B., and Chang, A. 2001. Imagine: Media processing with streams. IEEE Micro 21, 2 (March/April), 35-46.
- (2001) IEEE Micro , vol.21 , Issue.2 , pp. 35-46
- Khailany, B.¹ Dally, W.J.² Rixner, S.³ Kapasi, U.J.⁴ Mattson, P.⁵ Namkoong, J.⁶ Owens, J.D.⁷ Towles, B.⁸ Chang, A.⁹

19
- 33845423872
- An Adaptive, Non-uniform cache structure for wire-delay dominated on-chip caches
- Kim, C., Burger, D., and Keckler, S. W. 2002. An Adaptive, Non-uniform cache structure for wire-delay dominated on-chip caches. In Proceedings of ASPLOS-10, 211-222.
- (2002) Proceedings of ASPLOS-10 , pp. 211-222
- Kim, C.¹ Burger, D.² Keckler, S.W.³

20
- 0036292594
- An instruction set and microarchitecture for instruction level distributed processing
- Kim, H.-S. and Smith, J. E. 2002. An instruction set and microarchitecture for instruction level distributed processing. In Proceedings of the 29th International Symposium on Computer Architecture, 71-82.
- (2002) Proceedings of the 29th International Symposium on Computer Architecture , pp. 71-82
- Kim, H.-S.¹ Smith, J.E.²

21
- 0031339427
- Mediabench: A tool for evaluating and synthesizing multimedia and communications systems
- Lee, C., Potkonjak, M., and Mangione-Smith, W. H. 1997. Mediabench: A tool for evaluating and synthesizing multimedia and communications systems. In International Symposium on Microarchitecture, 330-335.
- (1997) International Symposium on Microarchitecture , pp. 330-335
- Lee, C.¹ Potkonjak, M.² Mangione-Smith, W.H.³

22
- 84886709991
- Vsv: L2-miss-driven variable supply-voltage scaling for low power
- Li, H., Cher, C.-Y., Vijaykumar, T., and Roy, K. 2003. Vsv: L2-miss-driven variable supply-voltage scaling for low power. In 36th Annual International Symposium on Microarchitecture, 19-28.
- (2003) 36th Annual International Symposium on Microarchitecture , pp. 19-28
- Li, H.¹ Cher, C.-Y.² Vijaykumar, T.³ Roy, K.⁴

23
- 0002745357
- Effective Compiler Support for Predicated Execution Using the Hyperblock
- Mahlke, S. A., Lin, D. C., Chen, W. Y., Hank, R. E., and Bringmann, R. A. 1992. Effective Compiler Support for Predicated Execution Using the Hyperblock. In Proceedings of MICRO-25, 45-54.
- (1992) Proceedings of MICRO-25 , pp. 45-54
- Mahlke, S.A.¹ Lin, D.C.² Chen, W.Y.³ Hank, R.E.⁴ Bringmann, R.A.⁵

24
- 0033688597
- Smart memories: A modular reconfigurable architecture
- Mai, K., Paaske, T., Jayasena, N., Ho, R., Dally, W. J., and Horowitz, M. 2000. Smart memories: A modular reconfigurable architecture. In Proceedings of ISCA-27, 161-171.
- (2000) Proceedings of ISCA-27 , pp. 161-171
- Mai, K.¹ Paaske, T.² Jayasena, N.³ Ho, R.⁴ Dally, W.J.⁵ Horowitz, M.⁶

25
- 0006639334
- Performance benefits of large execution atomic units in dynamically scheduled machines
- Melvin, S. and Patt, Y. 1989. Performance benefits of large execution atomic units in dynamically scheduled machines. In 3rd International Conference on Supercomputing, 427-432.
- (1989) 3rd International Conference on Supercomputing , pp. 427-432
- Melvin, S.¹ Patt, Y.²

26
- 0035693945
- A design space evaluation of grid processor architectures
- Nagarajan, R., Sankaralingam, K., Burger, D., and Keckler, S. W. 2001. A design space evaluation of grid processor architectures. In Proceedings of MICRO-34, 40-51.
- (2001) Proceedings of MICRO-34 , pp. 40-51
- Nagarajan, R.¹ Sankaralingam, K.² Burger, D.³ Keckler, S.W.⁴

27
- 0031594009
- Active pages: A computation model for intelligent memory
- Oskin, M., Chong, F. T., and Sherwood, T. 1998. Active pages: A computation model for intelligent memory. In Proceedings of the 25th International Symposium on Computer Architecture, 192-203.
- (1998) Proceedings of the 25th International Symposium on Computer Architecture , pp. 192-203
- Oskin, M.¹ Chong, F.T.² Sherwood, T.³

28
- 0032207001
- The pews microarchitecture: Reducing complexity through data dependence based decentralization
- (November)
- Ranganathan, N. and Franklin, M. 1998. The pews microarchitecture: Reducing complexity through data dependence based decentralization. Microprocessors and Microsystems 22, 6 (November), 333-343.
- (1998) Microprocessors and Microsystems , vol.22 , Issue.6 , pp. 333-343
- Ranganathan, N.¹ Franklin, M.²

29
- 4243514480
- Combining hyperblocks and exit prediction to increase front-end bandwidth and performance
- Department of Computer Sciences, The University of Texas at Austin
- Ranganathan, N., Nagarajan, R., Burger, D., and Keckler, S. W. 2002. Combining hyperblocks and exit prediction to increase front-end bandwidth and performance. Tech. Rep. TR-02-41, Department of Computer Sciences, The University of Texas at Austin.
- (2002) Tech. Rep. TR-02-41
- Ranganathan, N.¹ Nagarajan, R.² Burger, D.³ Keckler, S.W.⁴

30
- 0002017307
- Instruction-level parallel processing: History, overview, and perspective
- Rau, B. R. and Fisher, J. A. 1993. Instruction-level parallel processing: History, overview, and perspective. Journal of Supercomputing 7, 9-50.
- (1993) Journal of Supercomputing , vol.7 , pp. 9-50
- Rau, B.R.¹ Fisher, J.A.²

31
- 0032312385
- A bandwidth-efficient architecture for media processing
- Rixner, S., Dally, W. J., Kapasi, U. J., Khailany, B., Lopez-Lagunas, A., Mattson, P. R., and Owens, J. D. 1998. A bandwidth-efficient architecture for media processing. In Proceedings of the 31st International Symposium on Microarchitecture, 3-13.
- (1998) Proceedings of the 31st International Symposium on Microarchitecture , pp. 3-13
- Rixner, S.¹ Dally, W.J.² Kapasi, U.J.³ Khailany, B.⁴ Lopez-Lagunas, A.⁵ Mattson, P.R.⁶ Owens, J.D.⁷

32
- 84944402628
- Universal mechanisms for data-parallel architectures
- Sankaralingam, K., Keckler, S. W., Mark, W. R., and Burger, D. 2003. Universal mechanisms for data-parallel architectures. In Proceedings of MICRO-36, 303-314.
- (2003) Proceedings of MICRO-36 , pp. 303-314
- Sankaralingam, K.¹ Keckler, S.W.² Mark, W.R.³ Burger, D.⁴

33
- 33746585048
- Dynamic frequency and voltage control for a multiple clock domain microarchitecture
- Semeraro, G., Albonesi, D., Dropsho, S., Magklis, G., Dwarkadas, S., and Scott, M. 2002. Dynamic frequency and voltage control for a multiple clock domain microarchitecture. In 35th International Symposium on Microarchitecture, 356-367.
- (2002) 35th International Symposium on Microarchitecture , pp. 356-367
- Semeraro, G.¹ Albonesi, D.² Dropsho, S.³ Magklis, G.⁴ Dwarkadas, S.⁵ Scott, M.⁶

34
- 84944387421
- Scalable memory disambiguation for high ilp processors
- Sethumadhavan, S., Desikan, R., Burger, D., Moore, C. R., and Keckler, S. W. 2003. Scalable memory disambiguation for high ilp processors. In 36th International Symposium on Microarchitecture, 399-410.
- (2003) 36th International Symposium on Microarchitecture , pp. 399-410
- Sethumadhavan, S.¹ Desikan, R.² Burger, D.³ Moore, C.R.⁴ Keckler, S.W.⁵

35
- 33845437061
- Automatically characterizing large scale program behavior
- Sherwood, T., Perelman, E., Hamerly, G., and Calder, B. 2002. Automatically characterizing large scale program behavior. In International Conference on Architectural Support for Programming Languages and Operating Systems, 45-57.
- (2002) International Conference on Architectural Support for Programming Languages and Operating Systems , pp. 45-57
- Sherwood, T.¹ Perelman, E.² Hamerly, G.³ Calder, B.⁴

36
- 0029178210
- Multiscalar processors
- Sohi, G. S., Breach, S. E., and Vijaykumar, T. N. 1995. Multiscalar processors. In Proceedings of the 22nd International Symposium on Computer Architecture, 414-425.
- (1995) Proceedings of the 22nd International Symposium on Computer Architecture , pp. 414-425
- Sohi, G.S.¹ Breach, S.E.² Vijaykumar, T.N.³

37
- 0033703889
- A scalable approach to thread-level speculation
- Steffan, J. G., Colohan, C. B., Zhai, A., and Mowry, T. C. 2000. A scalable approach to thread-level speculation. In Proceedings of the 27th Annual International Symposium on Computer Architecture, 1-12.
- (2000) Proceedings of the 27th Annual International Symposium on Computer Architecture , pp. 1-12
- Steffan, J.G.¹ Colohan, C.B.² Zhai, A.³ Mowry, T.C.⁴

38
- 68749093830
- The gilgamesh MIND processor-in-memory architecture for petaflops-scale computing
- Sterling, T. L. and Zima, H. P. 2002. The gilgamesh MIND processor-in-memory architecture for petaflops-scale computing. In International Symposium on High Performance Computing, 1-5.
- (2002) International Symposium on High Performance Computing , pp. 1-5
- Sterling, T.L.¹ Zima, H.P.²

39
- 84944392428
- Wavescalar
- Swanson, S., Michelson, K., Schwerin, A., and Oskin, M. 2003. Wavescalar. In 36th Annual International Symposium on Microarchitecture, 291-302.
- (2003) 36th Annual International Symposium on Microarchitecture , pp. 291-302
- Swanson, S.¹ Michelson, K.² Schwerin, A.³ Oskin, M.⁴

40
- 0038289667
- Bottlenecks in multimedia processing with SIMD style extensions and architectural enhancements
- Talla, D., John, L., and Burger, D. 2003. Bottlenecks in multimedia processing with SIMD style extensions and architectural enhancements. IEEE Transactions on Computers 52, 8, 35-46.
- (2003) IEEE Transactions on Computers , vol.52 , Issue.8 , pp. 35-46
- Talla, D.¹ John, L.² Burger, D.³

41
- 0036298603
- POWER4 system microarchitecture
- (Jan)
- Tendler, J. M., Dodson, J. S., Fields, J., Le, H., and Sinharoy, B. 2001. POWER4 system microarchitecture. IBM Journal of Research and Development 26, 1 (Jan), 5-26.
- (2001) IBM Journal of Research and Development , vol.26 , Issue.1 , pp. 5-26
- Tendler, J.M.¹ Dodson, J.S.² Fields, J.³ Le, H.⁴ Sinharoy, B.⁵

42
- 85024284553
- Mnemonic Instruction Set
- March 2001.
- TMS320C54X. DSP Reference Set, Volume 2: Mnemonic Instruction Set, Literature Number: SPRU172C, March 2001.
- Literature Number: SPRU172C , vol.2

43
- 0029200683
- Simultaneous multithreading: Maximizing on-chip parallelism
- Tullsen, D. M., Eggers, S. J., and Levy, H. M. 1995. Simultaneous multithreading: Maximizing on-chip parallelism. In Proceedings of ISCA-22, 392-403.
- (1995) Proceedings of ISCA-22 , pp. 392-403
- Tullsen, D.M.¹ Eggers, S.J.² Levy, H.M.³

44
- 0031236158
- Baring it all to software: RAW machines
- (Sept)
- Waingold, E., Taylor, M., Srikrishna, D., Sarkar, V., Lee, W., Lee, V., Kim, J., Frank, M., Finch, P., Barua, R., Babb, J., Amarasinghe, S., and Agarwal, A. 1997. Baring it all to software: RAW machines. IEEE Computer 30, 9 (Sept), 86-93.
- (1997) IEEE Computer , vol.30 , Issue.9 , pp. 86-93
- Waingold, E.¹ Taylor, M.² Srikrishna, D.³ Sarkar, V.⁴ Lee, W.⁵ Lee, V.⁶ Kim, J.⁷ Frank, M.⁸ Finch, P.⁹ Barua, R.¹⁰ Babb, J.¹¹ Amarasinghe, S.¹² Agarwal, A.¹³

45
- 7744223550
- XILINX. 2003 Virtex-II Pro X Platform FPGAs: Introduction and Overview.
- (2003) Virtex-II Pro X Platform FPGAs: Introduction and Overview

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.