-
1
-
-
0030382365
-
Shared memory consistency models: A tutorial
-
Dec
-
ADVE, S. V. AND GHARACHORLOO, K. 1996. Shared memory consistency models: A tutorial. IEEE Computer 29, 12 (Dec), 66-76.
-
(1996)
IEEE Computer
, vol.29
, Issue.12
, pp. 66-76
-
-
ADVE, S.V.1
GHARACHORLOO, K.2
-
2
-
-
0033717865
-
Clock rate versus IPC: The end of the road for conventional microarchitectures
-
AGARWAL, V., HRISHIKESH, M. S., KECKLER, S. W., AND BURGER, D. 2000. Clock rate versus IPC: the end of the road for conventional microarchitectures. In Proceedings of the 27th International Symposium Computer Architecture.
-
(2000)
Proceedings of the 27th International Symposium Computer Architecture
-
-
AGARWAL, V.1
HRISHIKESH, M.S.2
KECKLER, S.W.3
BURGER, D.4
-
6
-
-
33644662621
-
Implementation of h.264 encoder and decoder on personal computers
-
CHEN, Y.-K., LI, E. Q., ZHOU, X., AND GE, S. 2006. Implementation of h.264 encoder and decoder on personal computers. J. Visual Communication and. Image Representation 17, 2, 509-532.
-
(2006)
J. Visual Communication and. Image Representation
, vol.17
, Issue.2
, pp. 509-532
-
-
CHEN, Y.-K.1
LI, E.Q.2
ZHOU, X.3
GE, S.4
-
11
-
-
33847152637
-
-
DRAKE, M., HOFFMANN, H., RABBAH, R., AND AMARASINGHE, S. 2006. Mpeg-2 decoding in a stream programming language. In Proceedings of the 20th IEEE International Parallel & Distributed Processing Symposium, Rhodes Island (IPDPS).
-
DRAKE, M., HOFFMANN, H., RABBAH, R., AND AMARASINGHE, S. 2006. Mpeg-2 decoding in a stream programming language. In Proceedings of the 20th IEEE International Parallel & Distributed Processing Symposium, Rhodes Island (IPDPS).
-
-
-
-
12
-
-
59449105466
-
-
EATHERTON, W. 2005. The push of network processing to the top of the pyramid. Keynote presentation at the Symposium on Architectures for Networking and Communication Systems, Princeton, NJ.
-
EATHERTON, W. 2005. The push of network processing to the top of the pyramid. Keynote presentation at the Symposium on Architectures for Networking and Communication Systems, Princeton, NJ.
-
-
-
-
13
-
-
34548052234
-
-
EREZ, M., AHN, J. H., GUMMAHAJU, J., ROSENBLUM, M., AND DALLY, W. J. 2007. Executing irregular scientific applications on stream architectures. In Proceedings of the 21st Annual International Conference on Supercomputing. 93-104.
-
EREZ, M., AHN, J. H., GUMMAHAJU, J., ROSENBLUM, M., AND DALLY, W. J. 2007. Executing irregular scientific applications on stream architectures. In Proceedings of the 21st Annual International Conference on Supercomputing. 93-104.
-
-
-
-
14
-
-
34548207355
-
-
FATAHALLAN, K., KNIGHT, T. J., HOUSTON, M. et al. 2006. Sequoia: Programming the memory hierarchy. In Proceedings of the 2006 ACM / IEEE Conference on Supercomputing.
-
FATAHALLAN, K., KNIGHT, T. J., HOUSTON, M. et al. 2006. Sequoia: Programming the memory hierarchy. In Proceedings of the 2006 ACM / IEEE Conference on Supercomputing.
-
-
-
-
17
-
-
35348919265
-
A novel SIMD architecture for the cell heterogeneous chipmulti-processor
-
GSCHWIND, M. et al. 2005. A novel SIMD architecture for the cell heterogeneous chipmulti-processor. In Conference Record of Hot Chips 17.
-
(2005)
Conference Record of Hot Chips 17
-
-
GSCHWIND, M.1
-
18
-
-
77957817744
-
-
GUMMARAJU, J., COBURN, J., TURNER, Y., AND ROSENBLUM, M. 2008. Streamware: programming general-purpose multicore processors using streams. In Proceedings of the 13th International Conference on Architectural Support for Programming Languages and Operating Systems. 297-307.
-
GUMMARAJU, J., COBURN, J., TURNER, Y., AND ROSENBLUM, M. 2008. Streamware: programming general-purpose multicore processors using streams. In Proceedings of the 13th International Conference on Architectural Support for Programming Languages and Operating Systems. 297-307.
-
-
-
-
19
-
-
47849087164
-
-
GUMMARAJU, J., EREZ, M., COBURN, J., ROSENBLUM, M., AND DALLY, W. J. 2007. Architectural support for the stream execution model on general-purpose processors. In Proceedings of the 16th International Conference on Parallel Architecture and Compilation Techniques. 3-12.
-
GUMMARAJU, J., EREZ, M., COBURN, J., ROSENBLUM, M., AND DALLY, W. J. 2007. Architectural support for the stream execution model on general-purpose processors. In Proceedings of the 16th International Conference on Parallel Architecture and Compilation Techniques. 3-12.
-
-
-
-
20
-
-
33749377408
-
-
GUMMARAJU, J. AND ROSENBLUM, M. 2005. Stream programming on general-purpose processors. In Proceedings of the 38th International Symposium on Microarchitecture.
-
GUMMARAJU, J. AND ROSENBLUM, M. 2005. Stream programming on general-purpose processors. In Proceedings of the 38th International Symposium on Microarchitecture.
-
-
-
-
22
-
-
84976721461
-
-
HEINLEIN, J., GHARACHORLOO, K., DRESSER, S., AND GUPTA, A. 1994. Integration of message passing and shared memory in the Stanford flash multiprocessor. SIGOPS Oper. Syst. Rev. 28, 5, 38-50.
-
HEINLEIN, J., GHARACHORLOO, K., DRESSER, S., AND GUPTA, A. 1994. Integration of message passing and shared memory in the Stanford flash multiprocessor. SIGOPS Oper. Syst. Rev. 28, 5, 38-50.
-
-
-
-
23
-
-
59449101408
-
-
HO, R., MAI, K., AND HOROWITZ, M. 2001. The Future of wires. Proceedings of the IEEE 89, 4 (Apr.).
-
HO, R., MAI, K., AND HOROWITZ, M. 2001. The Future of wires. Proceedings of the IEEE 89, 4 (Apr.).
-
-
-
-
24
-
-
0141538149
-
-
HO, R., MAI, K., AND HOROWITZ, M. 2003. Efficient on-chip global interconnects. In Symposium on VLSI Circuits. 271-274.
-
HO, R., MAI, K., AND HOROWITZ, M. 2003. Efficient on-chip global interconnects. In Symposium on VLSI Circuits. 271-274.
-
-
-
-
25
-
-
2442653861
-
-
HOROWITZ, M. AND DALLY, W. 2004. How scaling will change processor architecture. In Proceedings of the International Solid-State Circuits Conference. 132-133.
-
HOROWITZ, M. AND DALLY, W. 2004. How scaling will change processor architecture. In Proceedings of the International Solid-State Circuits Conference. 132-133.
-
-
-
-
27
-
-
59449084767
-
-
ITU-T REC. H.264. 2002. ISO/IEC 144496-10 AVC. 2002.
-
ITU-T REC. H.264. 2002. ISO/IEC 144496-10 AVC. 2002.
-
-
-
-
28
-
-
59449105947
-
Long words and wide ports: Reinventing the Configurable Processor
-
Stanford, CA
-
JAM, D., EZER, G., AND KIM, J. 2004. Long words and wide ports: Reinventing the Configurable Processor. In Proceedings of the Conference Record of Hot Chips 16. Stanford, CA.
-
(2004)
Proceedings of the Conference Record of Hot Chips 16
-
-
JAM, D.1
EZER, G.2
KIM, J.3
-
30
-
-
85008020071
-
A programmable 512 gops stream processor for signal, image, and video processing
-
KHAILANY, B., WILLIAMS, T., LIN, J., LONG, E., RYGH, M., TOVEY, D., AND DALLY, W. 2008. A programmable 512 gops stream processor for signal, image, and video processing. IEEE Journal of Solid-State Circuits 43, 1, 202-213.
-
(2008)
IEEE Journal of Solid-State Circuits
, vol.43
, Issue.1
, pp. 202-213
-
-
KHAILANY, B.1
WILLIAMS, T.2
LIN, J.3
LONG, E.4
RYGH, M.5
TOVEY, D.6
DALLY, W.7
-
33
-
-
84872894119
-
-
KRANZ, D., JOHNSON, K., AGARWAL, A., KUBIATOWICZ, J., AND LIM, B.-H. 1993. Integrating messagepassing and shared-memory: early experience. In Proceedings of the 4th ACM SIGPLAN symposium on Principles and Practice of Parallel. Programming. 54-63.
-
KRANZ, D., JOHNSON, K., AGARWAL, A., KUBIATOWICZ, J., AND LIM, B.-H. 1993. Integrating messagepassing and shared-memory: early experience. In Proceedings of the 4th ACM SIGPLAN symposium on Principles and Practice of Parallel. Programming. 54-63.
-
-
-
-
34
-
-
27544456315
-
Interconnections in multi-core architectures: Understanding mechanisms, overheads and scaling
-
KUMAR, R., ZYUBAN, V., AND TULLSEN, D. M. 2005. Interconnections in multi-core architectures: Understanding mechanisms, overheads and scaling. In Proceedings of the 32nd International Symposium, on Computer Architecture.
-
(2005)
Proceedings of the 32nd International Symposium, on Computer Architecture
-
-
KUMAR, R.1
ZYUBAN, V.2
TULLSEN, D.M.3
-
35
-
-
35348861326
-
-
LEVERICH, J., ARAKIDA, H., SOLOMATNIKOV, A., FIROOZSHAHIAN, A., HOROWITZ, M., AND KOZYRAKIS, C. 2007. Comparing memory systems for chip multiprocessors. In Proceedings of the 34th Annual, International Symposium on Computer Architecture. 358-368.
-
LEVERICH, J., ARAKIDA, H., SOLOMATNIKOV, A., FIROOZSHAHIAN, A., HOROWITZ, M., AND KOZYRAKIS, C. 2007. Comparing memory systems for chip multiprocessors. In Proceedings of the 34th Annual, International Symposium on Computer Architecture. 358-368.
-
-
-
-
36
-
-
59449105948
-
-
LEWIS, B. AND BERG, D. J. 1998. Multithreaded. Programming with Pthreads. Upper saddle River. NJ: Prentice Hall.
-
LEWIS, B. AND BERG, D. J. 1998. Multithreaded. Programming with Pthreads. Upper saddle River. NJ: Prentice Hall.
-
-
-
-
37
-
-
59449091948
-
-
LI, M. et al. 2005. ALP: efficient support for all levels of parallelism for complex MEDIA applications. TECH. REP. UIUCDCS-R-2005-2605, UIUC CS. JULY.
-
LI, M. et al. 2005. ALP: efficient support for all levels of parallelism for complex MEDIA applications. TECH. REP. UIUCDCS-R-2005-2605, UIUC CS. JULY.
-
-
-
-
38
-
-
17644395320
-
Blocking and array contraction across arbitrarily nested loops using affine partitioning
-
LIM, A. W., LIAO, S.-W., AND LAM, M. S. 2001. Blocking and array contraction across arbitrarily nested loops using affine partitioning. ACM SIGPLAN Notices 36, 7, 103-112.
-
(2001)
ACM SIGPLAN Notices
, vol.36
, Issue.7
, pp. 103-112
-
-
LIM, A.W.1
LIAO, S.-W.2
LAM, M.S.3
-
39
-
-
59449098153
-
-
LIN, Y. 2004. A programmable Vector coprocessor architecture for wireless applications. In Proceedings of the 3rd Workshop on Application Specific Processors.
-
LIN, Y. 2004. A programmable Vector coprocessor architecture for wireless applications. In Proceedings of the 3rd Workshop on Application Specific Processors.
-
-
-
-
40
-
-
33646932154
-
-
LOGHI, M. AND PNCINO, M. 2005. Exploring energy/performance tradeoffs in shared memory MPSoCs: Snoop-based cache coherence vs. software solutions. In Proceedings of the Design Automation and Test in Europe Conference
-
LOGHI, M. AND PNCINO, M. 2005. Exploring energy/performance tradeoffs in shared memory MPSoCs: Snoop-based cache coherence vs. software solutions. In Proceedings of the Design Automation and Test in Europe Conference
-
-
-
-
41
-
-
59449091947
-
-
MACHNICKI, E. 2005. Ultra high performance scalable DSP family for multimedia. In Proceedings of the Conference Record of Hot Chips 17.
-
MACHNICKI, E. 2005. Ultra high performance scalable DSP family for multimedia. In Proceedings of the Conference Record of Hot Chips 17.
-
-
-
-
42
-
-
59449086506
-
-
MAI, K. et al. 2000. Smart memories: A modular reconfigurable architecture. In Proceedings of the 27th International Symposium on Computer architecture.
-
MAI, K. et al. 2000. Smart memories: A modular reconfigurable architecture. In Proceedings of the 27th International Symposium on Computer architecture.
-
-
-
-
43
-
-
59449088280
-
-
MIPS32 2001. MIPS32 Architecture For Programmers II: The MIPS32 Instruction Set. MIPS Technologies, Inc.
-
MIPS32 2001. MIPS32 Architecture For Programmers Volume II: The MIPS32 Instruction Set. MIPS Technologies, Inc.
-
-
-
-
45
-
-
33749048325
-
-
MPEG SOFTWARE SIMULATION GROUP, Available at
-
MPEG SOFTWARE SIMULATION GROUP. Mssg mpeg2 encoder and decoder. Available at: http://www.mpeg.org/MPEG/MSSG/.
-
Mssg mpeg2 encoder and decoder
-
-
-
46
-
-
84905483003
-
TRIPS: A polymorphous architecture for exploiting ILP, TLP, and DLP
-
SANKARALINGAM, K. 2004. TRIPS: A polymorphous architecture for exploiting ILP, TLP, and DLP. ACM Trans. Archit. Code Optirn. 1, 1, 62-93.
-
(2004)
ACM Trans. Archit. Code Optirn. 1
, vol.1
, pp. 62-93
-
-
SANKARALINGAM, K.1
-
47
-
-
38349050777
-
A performance analysis of PIM, stream processing, and tiled processing on memory-intensive signal processing kernels
-
SUH, J. et al. 2003. A performance analysis of PIM, stream processing, and tiled processing on memory-intensive signal processing kernels. In Proceedings of the 30th International Symposium on Computer Architecture.
-
(2003)
Proceedings of the 30th International Symposium on Computer Architecture
-
-
SUH, J.1
-
48
-
-
59449087566
-
-
TARJAN, D., THOZIYOOR, S., AND JOUPPI, N. P. 2006. CACTI 4.0. Tech. Rep. HPL-2006-86, HP Labs.
-
TARJAN, D., THOZIYOOR, S., AND JOUPPI, N. P. 2006. CACTI 4.0. Tech. Rep. HPL-2006-86, HP Labs.
-
-
-
-
50
-
-
84868867970
-
-
TENSILICA 2007. Tensilica Software Tools, http://www.tensilica. com/products/software.htm.
-
(2007)
Tensilica Software Tools
-
-
-
52
-
-
35348861182
-
DRAMsim: A memory-system simulator
-
WANG, D. et al. 2005. DRAMsim: A memory-system simulator. SIGARCH Computer Architecture News 33, 4.
-
(2005)
SIGARCH Computer Architecture News
, vol.33
, pp. 4
-
-
WANG, D.1
-
55
-
-
59449110123
-
The low-power high-performance architecture of the PWRficient processor family
-
YEH, T.-Y. 2005. The low-power high-performance architecture of the PWRficient processor family. In Proceedings of the Conference Record of Hot Chips 17.
-
(2005)
Proceedings of the Conference Record of Hot Chips 17
-
-
YEH, T.-Y.1
|