-
1
-
-
84871649933
-
Level-3 BLAS on the TI C6678 multi-core DSP
-
Computer Architecture and High Performance Computing
-
M. Ali et al., "Level-3 BLAS on the TI C6678 Multi-Core DSP," Proc. IEEE 24th Int'l Symp. Computer Architecture and High Performance Computing, 2012, pp. 179-186.
-
(2012)
Proc. IEEE 24th Int'l Symp
, pp. 179-186
-
-
Ali, M.1
-
2
-
-
34247349114
-
The potential of the cell processor for scientific computing
-
S. Williams et al., "The Potential of the Cell Processor for Scientific Computing," Proc. 3rd Conf. Computing Frontiers, 2006, pp. 9-20.
-
(2006)
Proc. 3rd Conf. Computing Frontiers
, pp. 9-20
-
-
Williams, S.1
-
3
-
-
84907901421
-
1TOPS/WSoftware programmable media processor
-
D. Moloney et al., "1TOPS/WSoftware Programmable Media Processor," Hot Chips 23, 2011; www.hotchips.org/wp-content /uploads/hc archives/hc23/HC23.19.8-Video /HC23.19.811-1TOPS-Media-Moloney-Movidius.pdf.
-
(2011)
Hot Chips 23
-
-
Moloney, D.1
-
4
-
-
0025402476
-
A set of level 3 basic linear algebra subprograms
-
J.J. Dongarra et al., "A Set of Level 3 Basic Linear Algebra Subprograms," ACM Trans. Mathematical Software, vol. 16, no. 1, 1990, pp. 1-17.
-
(1990)
ACM Trans. Mathematical Software
, vol.16
, Issue.1
, pp. 1-17
-
-
Dongarra, J.J.1
-
6
-
-
0036374185
-
Critical power slope: Understanding the runtime effects of frequency scaling
-
A. Miyoshi et al., "Critical Power Slope: Understanding the Runtime Effects of Frequency Scaling," Proc. 16th Int'l Conf. Supercomputing (ICS 02), 2002, pp. 35-44.
-
(2002)
Proc. 16th Int'l Conf. Supercomputing (ICS 02)
, pp. 35-44
-
-
Miyoshi, A.1
-
7
-
-
0003015894
-
Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing
-
B.R. Rau and C.D. Glaeser, "Some Scheduling Techniques and an Easily Schedulable Horizontal Architecture for High Performance Scientific Computing," Proc. 14th Ann. Workshop Microprogramming (MICRO 14), 1981, pp. 183-198.
-
(1981)
Proc. 14th Ann. Workshop Microprogramming (MICRO 14)
, pp. 183-198
-
-
Rau, B.R.1
Glaeser, C.D.2
-
8
-
-
0032662838
-
An experimental evaluation of tiling and shackling for memory hierarchy management
-
I. Kodukula et al., "An Experimental Evaluation of Tiling and Shackling for Memory Hierarchy Management," Proc. 13th Int'l Conf. Supercomputing (ICS 99), 1999, pp. 482-491.
-
(1999)
Proc. 13th Int'l Conf. Supercomputing (ICS 99)
, pp. 482-491
-
-
Kodukula, I.1
-
9
-
-
84877723101
-
Unleashing the high-performance and low-power of multi-core dsps for general-purpose HPC
-
F.D. Igual et al., "Unleashing the High-Performance and Low-Power of Multi-core DSPs for General-Purpose HPC," Proc. Int'l Conf. High Performance Computing, Networking, Storage and Analysis (SC 12), 2012, pp. 26:1-26:11.
-
(2012)
Proc. Int'l Conf. High Performance Computing, Networking, Storage and Analysis (SC 12)
, pp. 261-2611
-
-
Igual, F.D.1
-
10
-
-
84869388261
-
Codesign tradeoffs for high-performance, low power linear algebra architectures
-
A. Pedram, R. van de Geijn, and A. Gerstlauer, "Codesign Tradeoffs for High-Performance, Low Power Linear Algebra Architectures," IEEE Trans. Computers, vol. 61, no. 12, 2012, pp. 1724-1736.
-
(2012)
IEEE Trans. Computers
, vol.61
, Issue.12
, pp. 1724-1736
-
-
Pedram, A.1
Van De Geijn, R.2
Gerstlauer, A.3
-
12
-
-
85066538250
-
Accelerating exascale: How the end of moore's law scaling is changing the machines you use
-
and the Algorithms You Use, Nvidia, Mar
-
S. Oberlin, Accelerating Exascale: How the End of Moore's Law Scaling is Changing the Machines You Use, the Way You Code, and the Algorithms You Use, Nvidia, Mar. 2014; www.siam.org/meetings/ex14/02-oberlin-slides.pdf.
-
(2014)
The Way You Code
-
-
Oberlin, S.1
|