-
1
-
-
85092762484
-
Realizing a Power Efficient, Easy to Program Many Core: The Tile Processor
-
Presented at
-
AGARWAL, A. Realizing a Power Efficient, Easy to Program Many Core: The Tile Processor. Presented at the Stanford Computer Systems EE380 Colloquium. Available online: http://www.stanford.edu/class/ee380/Abstracts/100203-slides.pdf.
-
the Stanford Computer Systems EE380 Colloquium
-
-
AGARWAL, A.1
-
2
-
-
0034844454
-
Data Prefetching by Dependence Graph Precomputation
-
(Goteborg, Sweden, June), ACM
-
ANNAVARAM, M., PATEL, J. M., AND DAVIDSON, E. S. Data Prefetching by Dependence Graph Precomputation. In Proceedings of the 28th Annual International Symposium on Computer Architecture (Goteborg, Sweden, June 2001), ACM.
-
(2001)
Proceedings of the 28th Annual International Symposium on Computer Architecture
-
-
ANNAVARAM, M.1
PATEL, J. M.2
DAVIDSON, E. S.3
-
3
-
-
85092790265
-
Low-Key Intel 80-Core Intro: The Tip of the Iceberg
-
(April)
-
BARON, M. Low-Key Intel 80-Core Intro: The Tip of the Iceberg. Microprocessor Report (April 2007).
-
(2007)
Microprocessor Report
-
-
BARON, M.1
-
4
-
-
4544256288
-
A transregional CMOS SRAM with single, logic VDD and dynamic power rails
-
BHAVNAGARWALA, A., KOSONOCKY, S., KOWALCZYK, S., JOSHI, R., CHAN, Y., SRINIVASAN, U., AND WADHWA, J. A transregional CMOS SRAM with single, logic VDD and dynamic power rails. In Symp. VLSI Circuits Dig. Tech. Papers (2004), pp. 292-293.
-
(2004)
Symp. VLSI Circuits Dig. Tech. Papers
, pp. 292-293
-
-
BHAVNAGARWALA, A.1
KOSONOCKY, S.2
KOWALCZYK, S.3
JOSHI, R.4
CHAN, Y.5
SRINIVASAN, U.6
WADHWA, J.7
-
6
-
-
70349145946
-
Inside the CORE microarchitecture
-
(Palo Alto, CA, August)
-
DOWECK, J. Inside the CORE microarchitecture. Presented at the 18th IEEE Hot Chips Symp. (Palo Alto, CA, August 2006).
-
(2006)
18th IEEE Hot Chips Symp
-
-
DOWECK, J.1
-
7
-
-
35448978700
-
Energy per instruction trends in Intel microprocessors
-
GROCHOWSKI, E., AND ANNAVARAM, M. Energy per instruction trends in Intel microprocessors. Technology@Intel Magazine 4, 3 (2006), 1-8.
-
(2006)
Technology@Intel Magazine
, vol.4
, Issue.3
, pp. 1-8
-
-
GROCHOWSKI, E.1
ANNAVARAM, M.2
-
8
-
-
77954728886
-
Application heartbeats: a generic interface for specifying program performance and goals in autonomous computing environments
-
HOFFMANN, H., EASTEP, J., SANTAMBROGIO, M. D., MILLER, J. E., AND AGARWAL, A. Application heartbeats: a generic interface for specifying program performance and goals in autonomous computing environments. In ICAC (2010).
-
(2010)
ICAC
-
-
HOFFMANN, H.1
EASTEP, J.2
SANTAMBROGIO, M. D.3
MILLER, J. E.4
AGARWAL, A.5
-
9
-
-
79953120063
-
Dynamic knobs for responsive power-aware computing
-
HOFFMANN, H., SIDIROGLOU, S., CARBIN, M., MISAILOVIC, S., AGARWAL, A., AND RINARD, M. C. Dynamic knobs for responsive power-aware computing. In ASPLOS (2011), pp. 199-212.
-
(2011)
ASPLOS
, pp. 199-212
-
-
HOFFMANN, H.1
SIDIROGLOU, S.2
CARBIN, M.3
MISAILOVIC, S.4
AGARWAL, A.5
RINARD, M. C.6
-
10
-
-
3042569221
-
Physical Experimentation with Prefetching Helper Threads on Intel's Hyper-Threaded Processors
-
(San Jose, CA, March)
-
KIM, D., WEI LIAO, S. S., WANG, P., DEL CUVILLO, J., TIAN, X., ZOU, X., WANG, H., YEUNG, D., GIRKAR, M., AND SHEN, J. Physical Experimentation with Prefetching Helper Threads on Intel's Hyper-Threaded Processors. In Proceedings of the 2004 International Symposium on Code Generation and Optimization with Special Emphasis on Feedback-Directed and Runtime Optimization (San Jose, CA, March 2004).
-
(2004)
Proceedings of the 2004 International Symposium on Code Generation and Optimization with Special Emphasis on Feedback-Directed and Runtime Optimization
-
-
KIM, D.1
WEI LIAO, S. S.2
WANG, P.3
DEL CUVILLO, J.4
TIAN, X.5
ZOU, X.6
WANG, H.7
YEUNG, D.8
GIRKAR, M.9
SHEN, J.10
-
11
-
-
33845414882
-
Design and Evaluation of Compiler Algorithms for Pre-Execution
-
(San Jose, CA, October), ACM
-
KIM, D., AND YEUNG, D. Design and Evaluation of Compiler Algorithms for Pre-Execution. In Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems (San Jose, CA, October 2002), ACM, pp. 159-170.
-
(2002)
Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 159-170
-
-
KIM, D.1
YEUNG, D.2
-
12
-
-
4444263176
-
A Study of Source-Level Compiler Algorithms for Automatic Construction of Pre-Execution Code
-
(August)
-
KIM, D., AND YEUNG, D. A Study of Source-Level Compiler Algorithms for Automatic Construction of Pre-Execution Code. ACM Transactions on Computer Systems 22, 3 (August 2004).
-
(2004)
ACM Transactions on Computer Systems
, vol.22
, Issue.3
-
-
KIM, D.1
YEUNG, D.2
-
13
-
-
21644440721
-
Conjoined-Core Chip Multiprocessing
-
IEEE CS Press
-
KUMAR, R., JOUPPI, N., AND TULLSEN, D. Conjoined-Core Chip Multiprocessing. In Proc. Int'l Symp. Microarchitecture (2004), IEEE CS Press, pp. 195-206.
-
(2004)
Proc. Int'l Symp. Microarchitecture
, pp. 195-206
-
-
KUMAR, R.1
JOUPPI, N.2
TULLSEN, D.3
-
14
-
-
58149234982
-
A 65nm Sub-Vt Microcontroller with Integrated SRAM and Switched Capacitor DC-DC Converter
-
(January)
-
KWONG, J., RAMADASS, Y., VERMA, N., AND CHANDRAKASAN, A. A 65nm Sub-Vt Microcontroller with Integrated SRAM and Switched Capacitor DC-DC Converter. IEEE Journal of Solid-State Circuits 44, 1 (January 2009), 115-125.
-
(2009)
IEEE Journal of Solid-State Circuits
, vol.44
, Issue.1
, pp. 115-125
-
-
KWONG, J.1
RAMADASS, Y.2
VERMA, N.3
CHANDRAKASAN, A.4
-
15
-
-
68849120952
-
Prefetching with Helper Threads for Loosely Coupled Multiprocessor Systems
-
(July)
-
LEE, J., JUNG, C., LIM, D., AND SOLIHIN, Y. Prefetching with Helper Threads for Loosely Coupled Multiprocessor Systems. IEEE Transactions on Parallel and Distributed Systems 20, 9 (July 2009).
-
(2009)
IEEE Transactions on Parallel and Distributed Systems
, vol.20
, Issue.9
-
-
LEE, J.1
JUNG, C.2
LIM, D.3
SOLIHIN, Y.4
-
16
-
-
33749382556
-
Dynamic Helper Threaded Prefetching on the Sun UltraSPARC CMP Processor
-
(November)
-
LU, J., DAS, A., HSU, W.-C., NGUYEN, K., AND ABRAHAM, S. G. Dynamic Helper Threaded Prefetching on the Sun UltraSPARC CMP Processor. In Proceedings of the 38th International Symposium on Microarchitecture (November 2005).
-
(2005)
Proceedings of the 38th International Symposium on Microarchitecture
-
-
LU, J.1
DAS, A.2
HSU, W.-C.3
NGUYEN, K.4
ABRAHAM, S. G.5
-
17
-
-
84982318971
-
GPGPU: general purpose computation on graphics hardware
-
(Los Angeles, CA), ACM
-
LUEBKE, D., HARRIS, M., KRUGER, J., PURCELL, T., GOVINDARAJU, N., BUCK, I., WOOLLEY, C., AND LEFOHN, A. GPGPU: general purpose computation on graphics hardware. In ACM SIGGRAPH 2004 Course Notes SIGGRAPH '04 (Los Angeles, CA, 2004), ACM.
-
(2004)
ACM SIGGRAPH 2004 Course Notes SIGGRAPH '04
-
-
LUEBKE, D.1
HARRIS, M.2
KRUGER, J.3
PURCELL, T.4
GOVINDARAJU, N.5
BUCK, I.6
WOOLLEY, C.7
LEFOHN, A.8
-
18
-
-
77952563226
-
Graphite: A Distributed Parallel Simulator for Multicores
-
(January)
-
MILLER, J., KASTURE, H., KURIAN, G., GRUENWALD III, C., BECKMANN, N., CELIO, C., EASTEP, J., AND AGARWAL, A. Graphite: A Distributed Parallel Simulator for Multicores. In 16th IEEE International Symposium on High-Performance Computer Architecture (HPCA) (January 2010).
-
(2010)
16th IEEE International Symposium on High-Performance Computer Architecture (HPCA)
-
-
MILLER, J.1
KASTURE, H.2
KURIAN, G.3
GRUENWALD III, C.4
BECKMANN, N.5
CELIO, C.6
EASTEP, J.7
AGARWAL, A.8
-
20
-
-
0029273301
-
Supporting Dynamic Data Structures on Distributed Memory Machines
-
(March)
-
ROGERS, A., CARLISLE, M., REPPY, J., AND HENDREN, L. Supporting Dynamic Data Structures on Distributed Memory Machines. ACM Transactions on Programming Languages and Systems 17, 2 (March 1995).
-
(1995)
ACM Transactions on Programming Languages and Systems
, vol.17
, Issue.2
-
-
ROGERS, A.1
CARLISLE, M.2
REPPY, J.3
HENDREN, L.4
-
21
-
-
4644353790
-
Evaluation of the Raw microprocessor: An exposed-wire-delay architecture for ILP and streams
-
(June)
-
TAYLOR, M. B., LEE, W., MILLER, J. E., WENTZLAFF, D., BRATT, I., GREENWALD, B., HOFFMANN, H., JOHNSON, P., KIM, J., PSOTA, J., SARAF, A., SHNIDMAN, N., STRUMPEN, V., FRANK, M., AMARASINGHE, S., AND AGARWAL, A. Evaluation of the Raw microprocessor: An exposed-wire-delay architecture for ILP and streams. In ISCA '04: Proc of the 31st annual International Symposium on Computer Architecture (June 2004), pp. 2-13.
-
(2004)
ISCA '04: Proc of the 31st annual International Symposium on Computer Architecture
, pp. 2-13
-
-
TAYLOR, M. B.1
LEE, W.2
MILLER, J. E.3
WENTZLAFF, D.4
BRATT, I.5
GREENWALD, B.6
HOFFMANN, H.7
JOHNSON, P.8
KIM, J.9
PSOTA, J.10
SARAF, A.11
SHNIDMAN, N.12
STRUMPEN, V.13
FRANK, M.14
AMARASINGHE, S.15
AGARWAL, A.16
-
22
-
-
36849030305
-
On-chip interconnection architecture of the Tile processor
-
(Sept-Oct)
-
WENTZLAFF, D., GRIFFIN, P., HOFFMANN, H., BAO, L., EDWARDS, B., RAMEY, C., MATTINA, M., MIAO, C.-C., BROWN, J. F., AND AGARWAL, A. On-chip interconnection architecture of the Tile processor. IEEE Micro 27, 5 (Sept-Oct 2007), 15-31.
-
(2007)
IEEE Micro
, vol.27
, Issue.5
, pp. 15-31
-
-
WENTZLAFF, D.1
GRIFFIN, P.2
HOFFMANN, H.3
BAO, L.4
EDWARDS, B.5
RAMEY, C.6
MATTINA, M.7
MIAO, C.-C.8
BROWN, J. F.9
AGARWAL, A.10
-
23
-
-
34547375943
-
A 2.60 pJ/Inst subthreshold sensor processor for optimal energy efficiency
-
(June)
-
ZHAI, B., NAZHANDALI, L., OLSON, J., REEVES, A., MINUTH, M., HELFAND, R., PANT, S., BLAAUW, D., AND AUSTIN, T. A 2.60 pJ/Inst subthreshold sensor processor for optimal energy efficiency. In Symp. VLSI Circuits Dig. Tech. Papers (June 2006), pp. 154-155.
-
(2006)
Symp. VLSI Circuits Dig. Tech. Papers
, pp. 154-155
-
-
ZHAI, B.1
NAZHANDALI, L.2
OLSON, J.3
REEVES, A.4
MINUTH, M.5
HELFAND, R.6
PANT, S.7
BLAAUW, D.8
AUSTIN, T.9