-
1
-
-
0012561327
-
Coming challenges in microarchitecture and architecture
-
mar
-
R. Ronen, A. Mendelson, K. Lai, S.-L. Lu, F. Pollack, and J. Shen, "Coming challenges in microarchitecture and architecture," Proceedings of the IEEE, vol. 89, no. 3, pp. 325-340, Mar. 2001.
-
(2001)
Proceedings of the IEEE
, vol.89
, Issue.3
, pp. 325-340
-
-
Ronen, R.1
Mendelson, A.2
Lai, K.3
Lu, S.-L.4
Pollack, F.5
Shen, J.6
-
2
-
-
28444468099
-
Chip multithreading: Opportunities and challenges
-
Proc 11th International Symposium on High-Performance Computer Architecture, ser. IEEE, February
-
L. Spracklen and S. G. Abraham, "Chip multithreading: opportunities and challenges," in Proc. 11th International Symposium on High-Performance Computer Architecture, ser. HPCA'05. IEEE, February 2005, pp. 248-252.
-
(2005)
HPCA'05
, pp. 248-252
-
-
Spracklen, L.1
Abraham, S.G.2
-
3
-
-
13444302326
-
The free lunch is over: A fundamental turn toward concurrency in software
-
H. Sutter, "The free lunch is over: A fundamental turn toward concurrency in software," Dr. Dobb's Journal, vol. 30, no. 3, 2005.
-
(2005)
Dr. Dobb's Journal
, vol.30
, Issue.3
-
-
Sutter, H.1
-
4
-
-
84859717032
-
SPARC T4: A dynamically threaded server-on-a-chip
-
M. Shah, R. Golla, P. Jordan, G. Grohoski, J. Barreh, J. Brooks, M. Greenberg, G. Levinsky, M. Luttrell, C. Olson, Z. Samoail, M. Smittle, and T. Ziaja, "SPARC T4: A dynamically threaded server-on-a-chip," IEEE Micro, vol. PP, no. 99, p. 1, 2012.
-
(2012)
IEEE Micro
, vol.PP
, Issue.99
, pp. 1
-
-
Shah, M.1
Golla, R.2
Jordan, P.3
Grohoski, G.4
Barreh, J.5
Brooks, J.6
Greenberg, M.7
Levinsky, G.8
Luttrell, M.9
Olson, C.10
Samoail, Z.11
Smittle, M.12
Ziaja, T.13
-
5
-
-
84872921861
-
-
[Online]. Available
-
Advanced Micro Devices, Inc., "AMD Fusion APU era begins." [Online]. Available: http://www.amd.com/us/press-releases/Pages/amd-fusion-apu-era-2011jan04.aspx
-
AMD Fusion APU Era Begins
-
-
-
6
-
-
84948696213
-
A network on chip architecture and design methodology
-
S. Kumar, A. Jantsch, J.-P. Soininen, M. Forsell, M. Millberg, J. Oberg, K. Tiensyrja, and A. Hemani, "A network on chip architecture and design methodology," in Proc. IEEE Computer Society Annual Symposium on VLSI, 2002, pp. 105-112.
-
Proc. IEEE Computer Society Annual Symposium on VLSI, 2002
, pp. 105-112
-
-
Kumar, S.1
Jantsch, A.2
Soininen, J.-P.3
Forsell, M.4
Millberg, M.5
Oberg, J.6
Tiensyrja, K.7
Hemani, A.8
-
7
-
-
2342622625
-
On-chip networks: A scalable, communication-centric embedded system design paradigm
-
J. Henkel, W. Wolf, and S. Chakradhar, "On-chip networks: a scalable, communication-centric embedded system design paradigm," in Proc. 17th International Conference on VLSI Design, 2004, pp. 845-851.
-
Proc. 17th International Conference on VLSI Design, 2004
, pp. 845-851
-
-
Henkel, J.1
Wolf, W.2
Chakradhar, S.3
-
8
-
-
35248881549
-
Online resource management in a multiprocessor with a network-on-chip
-
Proc. 2007 ACM symposium on Applied computing, ser. ACM
-
O. Moreira, J. J.-D. Mol, and M. Bekooij, "Online resource management in a multiprocessor with a network-on-chip," in Proc. 2007 ACM symposium on Applied computing, ser. SAC '07. ACM, 2007, pp. 1557-1564.
-
(2007)
SAC '07
, pp. 1557-1564
-
-
Moreira, O.1
Mol, J.J.-D.2
Bekooij, M.3
-
9
-
-
0003158656
-
Hitting the memory wall: Implications of the obvious
-
March
-
W. A. Wulf and S. A. McKee, "Hitting the memory wall: implications of the obvious," SIGARCH Comput. Archit. News, vol. 23, pp. 20-24, March 1995.
-
(1995)
SIGARCH Comput. Archit. News
, vol.23
, pp. 20-24
-
-
Wulf, W.A.1
McKee, S.A.2
-
10
-
-
33747513231
-
Supporting microthread scheduling and synchronisation in CMPs
-
I. Bell, N. Hasasneh, and C. Jesshope, "Supporting microthread scheduling and synchronisation in CMPs," International Journal of Parallel Programming, vol. 34, pp. 343-381, 2006.
-
(2006)
International Journal of Parallel Programming
, vol.34
, pp. 343-381
-
-
Bell, I.1
Hasasneh, N.2
Jesshope, C.3
-
11
-
-
85042605140
-
Parallel operation in the Control Data 6600
-
Proceedings of the October 27-29, 1964, fall joint computer conference, part II: very high speed computer systems, ser. New York, NY, USA: ACM
-
J. E. Thornton, "Parallel operation in the Control Data 6600," in Proceedings of the October 27-29, 1964, fall joint computer conference, part II: very high speed computer systems, ser. AFIPS '64 (Fall, part II). New York, NY, USA: ACM, 1965, pp. 33-40.
-
(1965)
AFIPS '64 (Fall, Part II)
, pp. 33-40
-
-
Thornton, J.E.1
-
12
-
-
0020289466
-
Architecture and applications of the HEP multiprocessor computer system
-
B. Smith, "Architecture and applications of the HEP multiprocessor computer system," Proc. SPIE Int. Soc. Opt. Eng.; (United States), vol. 298, pp. 241-248, 1981.
-
(1981)
Proc. SPIE Int. Soc. Opt. Eng.; (United States)
, vol.298
, pp. 241-248
-
-
Smith, B.1
-
13
-
-
33749935057
-
CRAY T90 vs. Tera MTA: The old champ faces a new challenger
-
411 First Avenue South, Seattle, WA 9810, USA: Cray Inc., June
-
J. Boisseau, L. Carter, A. Snavely, D. Callahan, J. Feo, S. Kahan, and Z. Wu, "CRAY T90 vs. Tera MTA: The old champ faces a new challenger," in Proc. Cray User's Group Conference. 411 First Avenue South, Seattle, WA 9810, USA: Cray Inc., June 1998.
-
(1998)
Proc. Cray User's Group Conference
-
-
Boisseau, J.1
Carter, L.2
Snavely, A.3
Callahan, D.4
Feo, J.5
Kahan, S.6
Wu, Z.7
-
14
-
-
84877021547
-
Multi-processor performance on the Tera MTA
-
Washington, DC, USA: IEEE Computer Society
-
A. Snavely, L. Carter, J. Boisseau, A. Majumdar, K. S. Gatlin, N. Mitchell, J. Feo, and B. Koblenz, "Multi-processor performance on the Tera MTA," in Supercomputing '98: Proceedings of the 1998 ACM/IEEE conference on Supercomputing. Washington, DC, USA: IEEE Computer Society, 1998, pp. 1-8.
-
(1998)
Supercomputing '98: Proceedings of the 1998 ACM/IEEE Conference on Supercomputing
, pp. 1-8
-
-
Snavely, A.1
Carter, L.2
Boisseau, J.3
Majumdar, A.4
Gatlin, K.S.5
Mitchell, N.6
Feo, J.7
Koblenz, B.8
-
15
-
-
0029200683
-
Simultaneous multithreading: Maximizing on-chip parallelism
-
May
-
D. M. Tullsen, S. J. Eggers, and H. M. Levy, "Simultaneous multithreading: maximizing on-chip parallelism," SIGARCH Comput. Archit. News, vol. 23, pp. 392-403, May 1995.
-
(1995)
SIGARCH Comput. Archit. News
, vol.23
, pp. 392-403
-
-
Tullsen, D.M.1
Eggers, S.J.2
Levy, H.M.3
-
16
-
-
0001087280
-
Hyper-Threading technology architecture and microarchitecture
-
[Online]. Available
-
D. T. Marr, F. Binns, D. L. Hill, G. Hinton, D. A. Koufaty, J. A. Miller, and M. Upton, "Hyper-Threading technology architecture and microarchitecture," Intel Technology Journal, vol. 6, no. 1, pp. 1-12, 2002. [Online]. Available: http://www.mendeley.com/research/hyperthreading-technology-architecture-and-microarchitecture/
-
(2002)
Intel Technology Journal
, vol.6
, Issue.1
, pp. 1-12
-
-
Marr, D.T.1
Binns, F.2
Hill, D.L.3
Hinton, G.4
Koufaty, D.A.5
Miller, J.A.6
Upton, M.7
-
17
-
-
0023704057
-
MASA: A multithreaded processor architecture for parallel symbolic computing
-
May [Online]. Available: 10.1145/633625.52449
-
R. H. Halstead, Jr. and T. Fujita, "MASA: a multithreaded processor architecture for parallel symbolic computing," SIGARCH Comput. Archit. News, vol. 16, pp. 443-451, May 1988. [Online]. Available: 10.1145/633625.52449
-
(1988)
SIGARCH Comput. Archit. News
, vol.16
, pp. 443-451
-
-
Halstead Jr., R.H.1
Fujita, T.2
-
18
-
-
0024667829
-
Can dataflow subsume von Neumann computing?
-
R. S. Nikhil and Arvind, "Can dataflow subsume von Neumann computing?" SIGARCH Comput. Archit. News, vol. 17, no. 3, pp. 262-272, 1989.
-
(1989)
SIGARCH Comput. Archit. News
, vol.17
, Issue.3
, pp. 262-272
-
-
Nikhil, R.S.1
Arvind2
-
19
-
-
84872894540
-
Occam and the transputer
-
Advances in Petri Nets 1989, ser. Springer Berlin / Heidelberg
-
D. May and R. Shepherd, "Occam and the transputer," in Advances in Petri Nets 1989, ser. Lecture Notes in Computer Science. Springer Berlin / Heidelberg, 1990, vol. 424, pp. 329-353.
-
(1990)
Lecture Notes in Computer Science
, vol.424
, pp. 329-353
-
-
May, D.1
Shepherd, R.2
-
20
-
-
0026142829
-
Fine-grain parallelism with minimal hardware support: A compiler-controlled threaded abstract machine
-
New York, NY, USA: ACM
-
D. E. Culler, A. Sah, K. E. Schauser, T. von Eicken, and J. Wawrzynek, "Fine-grain parallelism with minimal hardware support: a compiler-controlled threaded abstract machine," in ASPLOS-IV: Proc. 4th international conference on Architectural support for programming languages and operating systems. New York, NY, USA: ACM, 1991, pp. 164-175.
-
(1991)
ASPLOS-IV: Proc. 4th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 164-175
-
-
Culler, D.E.1
Sah, A.2
Schauser, K.E.3
Von Eicken, T.4
Wawrzynek, J.5
-
21
-
-
0024928610
-
The performance implications of thread management alternatives for shared-memory multiprocessors
-
dec
-
T. Anderson, E. Lazowska, and H. Levy, "The performance implications of thread management alternatives for shared-memory multiprocessors," IEEE Trans. Comput., vol. 38, no. 12, pp. 1631-1644, Dec. 1989.
-
(1989)
IEEE Trans. Comput.
, vol.38
, Issue.12
, pp. 1631-1644
-
-
Anderson, T.1
Lazowska, E.2
Levy, H.3
-
22
-
-
41549112893
-
On-Chip COMA Cache-Coherence Protocol for Microgrids of Microthreaded Cores
-
Euro-Par Workshops, ser. Bouge et al., Eds., Springer
-
L. Zhang and C. R. Jesshope, "On-Chip COMA Cache-Coherence Protocol for Microgrids of Microthreaded Cores," in Euro-Par Workshops, ser. LNCS, Bouge et al., Eds., vol. 4854. Springer, 2007, pp. 38-48.
-
(2007)
LNCS
, vol.4854
, pp. 38-48
-
-
Zhang, L.1
Jesshope, C.R.2
-
23
-
-
51049116878
-
The verification of the on-chip COMA cache coherence protocol
-
T. D. Vu, L. Zhang, and C. R. Jesshope, "The verification of the on-chip COMA cache coherence protocol," in International Conference on Algebraic Methodology and Software Technology, 2008, pp. 413-429.
-
International Conference on Algebraic Methodology and Software Technology, 2008
, pp. 413-429
-
-
Vu, T.D.1
Zhang, L.2
Jesshope, C.R.3
-
24
-
-
0025467711
-
A bridging model for parallel computation
-
aug.
-
L. G. Valiant, "A bridging model for parallel computation," Commun. ACM, vol. 33, pp. 103-111, Aug. 1990.
-
(1990)
Commun. ACM
, vol.33
, pp. 103-111
-
-
Valiant, L.G.1
-
25
-
-
33745612838
-
-
version 3.0. [Online]. Available
-
OpenMP Architecture Review Board. (2008) OpenMP application program interface, version 3.0. [Online]. Available: http://www.openmp.org/mp-documents/spec30.pdf
-
(2008)
OpenMP Application Program Interface
-
-
-
26
-
-
70349100958
-
-
Khronos OpenCL Working Group. version 1.0.43
-
Khronos OpenCL Working Group. (2009) The OpenCL specification, version 1.0.43.
-
(2009)
The OpenCL Specification
-
-
-
27
-
-
70350733812
-
The Cilk++ concurrency platform
-
New York, NY, USA: ACM
-
C. E. Leiserson, "The Cilk++ concurrency platform," in DAC '09: Proceedings of the 46th Annual Design Automation Conference. New York, NY, USA: ACM, 2009, pp. 522-527.
-
(2009)
DAC '09: Proceedings of the 46th Annual Design Automation Conference
, pp. 522-527
-
-
Leiserson, C.E.1
-
28
-
-
84976772007
-
Parallel prefix computation
-
October
-
R. E. Ladner and M. J. Fischer, "Parallel prefix computation," J. ACM, vol. 27, no. 4, pp. 831-838, October 1980.
-
(1980)
J. ACM
, vol.27
, Issue.4
, pp. 831-838
-
-
Ladner, R.E.1
Fischer, M.J.2
-
29
-
-
0015195766
-
Hierarchical ordering of sequential processes
-
June
-
E. W. Dijkstra, "Hierarchical ordering of sequential processes," Acta Informatica, vol. 1, no. 2, pp. 115-138, June 1971.
-
(1971)
Acta Informatica
, vol.1
, Issue.2
, pp. 115-138
-
-
Dijkstra, E.W.1
-
30
-
-
84860004847
-
Heterogeneous integration to simplify many-core architecture simulations
-
Proc. 2012 Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools, ser. ACM
-
R. Poss, M. Lankamp, M. I. Uddin, J. Sýkora, and L. Kafka, "Heterogeneous integration to simplify many-core architecture simulations," in Proc. 2012 Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools, ser. RAPIDO '12. ACM, 2012, pp. 17-24.
-
(2012)
RAPIDO '12
, pp. 17-24
-
-
Poss, R.1
Lankamp, M.2
Uddin, M.I.3
Sýkora, J.4
Kafka, L.5
-
31
-
-
78650940223
-
Towards Scalable I/O on a Many-core Architecture
-
IEEE, July
-
M. A. Hicks, M. W. van Tol, and C. R. Jesshope, "Towards Scalable I/O on a Many-core Architecture," in International Conference on Embedded Computer Systems: Architectures, MOdeling and Simulation (SAMOS). IEEE, July 2010, pp. 341-348.
-
(2010)
International Conference on Embedded Computer Systems: Architectures, MOdeling and Simulation (SAMOS)
, pp. 341-348
-
-
Hicks, M.A.1
Van Tol, M.W.2
Jesshope, C.R.3
-
32
-
-
0030149507
-
Cacti: An enhanced cache access and cycle time model
-
may
-
S. Wilton and N. Jouppi, "Cacti: an enhanced cache access and cycle time model," Solid-State Circuits, IEEE Journal of, vol. 31, no. 5, pp. 677-688, May 1996.
-
(1996)
Solid-State Circuits, IEEE Journal of
, vol.31
, Issue.5
, pp. 677-688
-
-
Wilton, S.1
Jouppi, N.2
-
34
-
-
84872970661
-
An operating system strategy for general-purpose parallel computing on many-core architectures
-
vol. High Performance Computing: From Grids and Clouds to Exascale
-
M. W. van Tol and C. R. Jesshope, "An operating system strategy for general-purpose parallel computing on many-core architectures," Advances in Parallel Computing, vol. 20, High Performance Computing: From Grids and Clouds to Exascale, pp. 157-181, 2011.
-
(2011)
Advances in Parallel Computing
, Issue.20
, pp. 157-181
-
-
Van Tol, M.W.1
Jesshope, C.R.2
-
35
-
-
84976797486
-
Sharing and protection in a single-address-space operating system
-
November
-
J. S. Chase, H. M. Levy, M. J. Feeley, and E. D. Lazowska, "Sharing and protection in a single-address-space operating system," ACM Trans. Comput. Syst., vol. 12, pp. 271-307, November 1994.
-
(1994)
ACM Trans. Comput. Syst.
, vol.12
, pp. 271-307
-
-
Chase, J.S.1
Levy, H.M.2
Feeley, M.J.3
Lazowska, E.D.4
-
36
-
-
84872913927
-
Specifying loop transformations for C2μTC source-to-source compiler
-
IBM Research Center, January
-
D. Saougkos, D. Evgenidou, and G. Manis, "Specifying loop transformations for C2μTC source-to-source compiler," in Proc. of 14th Workshop on Compilers for Parallel Computing (CPC'09), Zürich, Switzerland. IBM Research Center, January 2009.
-
(2009)
Proc. of 14th Workshop on Compilers for Parallel Computing (CPC'09), Zürich, Switzerland
-
-
Saougkos, D.1
Evgenidou, D.2
Manis, G.3
-
37
-
-
84872897389
-
Run-time scheduling with the C2uTC parallelizing compiler
-
2nd Workshop on Parallel Programming and Run-Time Management Techniques for Many-Core Architectures, in Workshop Proceedings of the 24th Conference on Computing Systems (ARCS 2011), ser. Springer
-
D. Saougkos and G. Manis, "Run-time scheduling with the C2uTC parallelizing compiler," in 2nd Workshop on Parallel Programming and Run-Time Management Techniques for Many-Core Architectures, in Workshop Proceedings of the 24th Conference on Computing Systems (ARCS 2011), ser. Lecture Notes in Computer Science. Springer, 2011, pp. 151-157.
-
(2011)
Lecture Notes in Computer Science
, pp. 151-157
-
-
Saougkos, D.1
Manis, G.2
-
38
-
-
33747508171
-
SAC: A functional array language for efficient multi-threaded execution
-
Aug
-
C. Grelck and S.-B. Scholz, "SAC: a functional array language for efficient multi-threaded execution," International Journal of Parallel Programming, vol. 34, no. 4, pp. 383-427, Aug 2006.
-
(2006)
International Journal of Parallel Programming
, vol.34
, Issue.4
, pp. 383-427
-
-
Grelck, C.1
Scholz, S.-B.2
-
39
-
-
70350453696
-
Compiling the Functional Data-Parallel Language SaC for Microgrids of Self-Adaptive Virtual Processors
-
C. Grelck, S. Herhut, C. Jesshope, C. Joslin, M. Lankamp, S.-B. Scholz, and A. Shafarenko, "Compiling the Functional Data-Parallel Language SaC for Microgrids of Self-Adaptive Virtual Processors," in 14th Workshop on Compilers for Parallel Computing (CPC'09), IBM Research Center, Zurich, Switzerland, 2009.
-
14th Workshop on Compilers for Parallel Computing (CPC'09), IBM Research Center, Zurich, Switzerland, 2009
-
-
Grelck, C.1
Herhut, S.2
Jesshope, C.3
Joslin, C.4
Lankamp, M.5
Scholz, S.-B.6
Shafarenko, A.7
-
40
-
-
8844226734
-
Optimization and benchmark of cryptographic algorithms on network processors
-
September/October
-
Z. Tan, C. Lin, H. Yin, and B. Li, "Optimization and benchmark of cryptographic algorithms on network processors," IEEE Micro, vol. 24, no. 5, pp. 55-69, September/October 2004.
-
(2004)
IEEE Micro
, vol.24
, Issue.5
, pp. 55-69
-
-
Tan, Z.1
Lin, C.2
Yin, H.3
Li, B.4
-
41
-
-
85015520396
-
NPCryptBench: A cryptographic benchmark suite for network processors
-
September
-
Y. Yue, C. Lin, and Z. Tan, "NPCryptBench: a cryptographic benchmark suite for network processors," SIGARCH Comput. Archit. News, vol. 34, no. 1, pp. 49-56, September 2005.
-
(2005)
SIGARCH Comput. Archit. News
, vol.34
, Issue.1
, pp. 49-56
-
-
Yue, Y.1
Lin, C.2
Tan, Z.3
-
42
-
-
84858791438
-
Clearing the clouds: A study of emerging scale-out workloads on modern hardware
-
Proc. 17th international conference on Architectural Support for Programming Languages and Operating Systems, ser. ACM
-
M. Ferdman, A. Adileh, O. Kocberber, S. Volos, M. Alisafaee, D. Jevdjic, C. Kaynak, A. D. Popescu, A. Ailamaki, and B. Falsafi, "Clearing the clouds: a study of emerging scale-out workloads on modern hardware," in Proc. 17th international conference on Architectural Support for Programming Languages and Operating Systems, ser. ASPLOS '12. ACM, 2012, pp. 37-48.
-
(2012)
ASPLOS '12
, pp. 37-48
-
-
Ferdman, M.1
Adileh, A.2
Kocberber, O.3
Volos, S.4
Alisafaee, M.5
Jevdjic, D.6
Kaynak, C.7
Popescu, A.D.8
Ailamaki, A.9
Falsafi, B.10