[1] ARM. ARM11. http://www.arm.com/products/CPUs/families/ARM11Family.html.
[2] M. Budiu, G. Venkataramani, T. Chelcea, and S. C. Goldstein. Spatial computation. In 12th International Conference on Architectural Support for Programming Languages and Operating Systems, pages 14-26, 2004.
[3] N. Clark et al. Application-specific processing on a general-purpose core via transparent instruction set customization. In Proc. of the 37th Annual International Symposium on Microarchitecture, pages 30-40, Dec. 2004.
[5] W. J. Dally, J. Balfour, D. Black-Shaffer, J. Chen, R. Harting, V. Parikh, J. Park, and D. Sheffield. Efficient embedded computing. IEEE Computer, 41(7):27-32, July 2008.
[6] K. Fan, M. Kudlur, G. Dasika, and S. Mahlke. Bridging the computation gap between programmable processors and hardwired accelerators. In Proc. of the 15th International Symposium on High-Performance Computer Architecture, pages 313-322, Feb. 2009.
[7] K. Fan, M. Kudlur, H. Park, and S. Mahlke. Compiler-directed synthesis of multifunction loop accelerators. In Proc. of the 2005 Workshop on Application Specific Processors, pages 91-98, Sept. 2005.
[10] R. Hameed, W. Qadeer, M. Wachs, O. Azizi, A. Solomatnikov, B. C. Lee, S. Richardson, C. Kozyrakis, and M. Horowitz. Understanding sources of inefficiency in general-purpose chips. In Proc. of the 37th Annual International Symposium on Computer Architecture, pages 37-47, 2010.
[12] Texas Instruments. TMS320C2x User's Guide, Jan. 1993.
[14] H. Kalva. The H.264 video coding standard. IEEE MultiMedia, 13(4):86-90, 2006.
[15] W.-m. W. Hwu, S. A. Mahlke, W. Y. Chen, P. P. Chang, N. J. Warter, R. A. Bringmann, R. G. Ouellette, R. E. Hank, T. Kiyohara, G. E. Haab, J. G. Holm, and D. M. Lavery. The superblock: An effective technique for VLIW and superscalar compilation. Journal of Supercomputing, 7(1):229-248, May 1993.
[16] N. Muralimanohar, R. Balasubramonian, and N. P. Jouppi. Optimizing NUCA organizations and wiring alternatives for large caches with CACTI 6.0. In IEEE Micro, pages 3-14, 2007.
[18] S. J. Patel and S. S. Lumetta. rePLay: A hardware framework for dynamic optimization. IEEE Transactions on Computers, 50(6):590-608, June 2001. DOI 10.1109/12.931895.
[22] R. Schreiber, S. Aditya, S. Mahlke, V. Kathail, B. R. Rau, D. Cronquist, and M. Sivaraman. PICO-NPA: High-level synthesis of nonprogrammable hardware accelerators. Journal of VLSI Signal Processing, 31(2):127-142, 2002. DOI 10.1023/A:1015341305426.
[24] S. Swanson, K. Michelson, A. Schwerin, and M. Oskin. WaveScalar. In Proc. of the 36th Annual International Symposium on Microarchitecture, page 291. IEEE Computer Society, 2003.
[28] G. Venkatesh, J. Sampson, N. Goulding, S. Garcia, V. Bryksin, J. Lugo-Martinez, S. Swanson, and M. B. Taylor. Conservation cores: Reducing the energy of mature computations. In 18th International Conference on Architectural Support for Programming Languages and Operating Systems, pages 205-218, 2010.
[30] Z. A. Ye et al. CHIMAERA: A high-performance architecture with a tightly-coupled reconfigurable functional unit. In Proc. of the 27th Annual International Symposium on Computer Architecture, pages 225-235, 2000.
[32] S. Yehia, S. Girbal, H. Berry, and O. Temam. Reconciling specialization and flexibility through compound circuits. In Proc. of the 15th International Symposium on High-Performance Computer Architecture, pages 277-288, 2009.