-
1
-
-
70349169075
-
Analyzing cuda workloads using a detailed gpu simulator
-
april
-
A. Bakhoda, G. Yuan, W. Fung, H. Wong, and T. Aamodt. Analyzing cuda workloads using a detailed gpu simulator. In Performance Analysis of Systems and Software, 2009. ISPASS 2009. IEEE International Symposium on, pages 163-174, april 2009.
-
(2009)
Performance Analysis of Systems and Software, 2009. ISPASS 2009. IEEE International Symposium on
, pp. 163-174
-
-
Bakhoda, A.1
Yuan, G.2
Fung, W.3
Wong, H.4
Aamodt, T.5
-
2
-
-
33846535493
-
The m5 simulator: Modeling networked systems
-
N. L. Binkert, R. G. Dreslinski, L. R. Hsu, K. T. Lim, A. G. Saidi, and S. K. Reinhardt. The m5 simulator: Modeling networked systems. IEEE Micro, 26:52-60, 2006.
-
(2006)
IEEE Micro
, vol.26
, pp. 52-60
-
-
Binkert, N.L.1
Dreslinski, R.G.2
Hsu, L.R.3
Lim, K.T.4
Saidi, A.G.5
Reinhardt, S.K.6
-
4
-
-
77953096885
-
Phoenixsim: A simulator for physical-layer analysis of chip-scale photonic interconnection networks
-
J. Chan, G. Hendry, A. Biberman, K. Bergman, and L. P. Carloni. Phoenixsim: a simulator for physical-layer analysis of chip-scale photonic interconnection networks. In Proc. of the Conference on Design, Automation and Test in Europe, DATE '10, pages 691-696, 2010.
-
(2010)
Proc. of the Conference on Design, Automation and Test in Europe, DATE '10
, pp. 691-696
-
-
Chan, J.1
Hendry, G.2
Biberman, A.3
Bergman, K.4
Carloni, L.P.5
-
5
-
-
78149233155
-
Ocelot: A dynamic compiler for bulk-synchronous applications in heterogeneo us systems
-
New York, NY, USA, ACM
-
G. Diamos, A. Kerr, S. Yalamanchili, and N. Clark. Ocelot: A dynamic compiler for bulk-synchronous applications in heterogeneo us systems. In PACT-19, pages 353-364, New York, NY, USA, 2010. ACM.
-
(2010)
PACT-19
, pp. 353-364
-
-
Diamos, G.1
Kerr, A.2
Yalamanchili, S.3
Clark, N.4
-
6
-
-
0012441737
-
-
Morgan Kaufmann Publishers Inc., San Franciso, CA, USA
-
J. Duato, S. Yalamanchili, and N. Lionel. Interconnection Networks: An Engineering Approach. Morgan Kaufmann Publishers Inc., San Franciso, CA, USA, 2002.
-
(2002)
Interconnection Networks: An Engineering Approach
-
-
Duato, J.1
Yalamanchili, S.2
Lionel, N.3
-
8
-
-
68949189534
-
Improvement potential and equalization example for multidrop dram memory buses
-
H. Fredriksson and C. Svensson. Improvement potential and equalization example for multidrop dram memory buses. IEEE Transaction On Advanced Packaging, 32(3):675-682, 2009.
-
(2009)
IEEE Transaction on Advanced Packaging
, vol.32
, Issue.3
, pp. 675-682
-
-
Fredriksson, H.1
Svensson, C.2
-
9
-
-
0024932245
-
Parallel discrete event simulation
-
New York, NY, USA, ACM
-
R. M. Fujimoto. Parallel discrete event simulation. In Proceedings of the 21st conference on Winter simulation, WSC '89, pages 19-28, New York, NY, USA, 1989. ACM.
-
(1989)
Proceedings of the 21st Conference on Winter Simulation, WSC '89
, pp. 19-28
-
-
Fujimoto, R.M.1
-
11
-
-
0031175430
-
Compact location problems
-
S. Krumke, M. Marathe, H. Noltemeier, V. Radhakrishnan, S. Ravi, and D. Rosenkrantz. Compact location problems. Theoretical Computer Science, 181(2):379-404, 1997.
-
(1997)
Theoretical Computer Science
, vol.181
, Issue.2
, pp. 379-404
-
-
Krumke, S.1
Marathe, M.2
Noltemeier, H.3
Radhakrishnan, V.4
Ravi, S.5
Rosenkrantz, D.6
-
12
-
-
84948970946
-
Processor allocation on Cplant: Achieving general processor locality using one-dimensional allocation strategies
-
V. Leung, E. Arkin, M. Bender, D. Bunde, J. Johnston, A. Lal, J. Mitchell, C. Phillips, and S. Seiden. Processor allocation on Cplant: Achieving general processor locality using one-dimensional allocation strategies. In Proc. 4th IEEE Intern. Conf. on Cluster Computing, pages 296-304, 2002.
-
(2002)
Proc. 4th IEEE Intern. Conf. on Cluster Computing
, pp. 296-304
-
-
Leung, V.1
Arkin, E.2
Bender, M.3
Bunde, D.4
Johnston, J.5
Lal, A.6
Mitchell, J.7
Phillips, C.8
Seiden, S.9
-
14
-
-
80052390195
-
Backfilling with guarantees granted upon job submission
-
number 6852 in LNCS
-
A. Lindsay, M. Galloway-Carson, C. Johnson, D. Bunde, and V. Leung. Backfilling with guarantees granted upon job submission. In Proc. 17th Intern. Euro-Par Conf. Parallel Processing, number 6852 in LNCS, pages 142-153, 2011.
-
(2011)
Proc. 17th Intern. Euro-Par Conf. Parallel Processing
, pp. 142-153
-
-
Lindsay, A.1
Galloway-Carson, M.2
Johnson, C.3
Bunde, D.4
Leung, V.5
-
15
-
-
0031175459
-
Non-contiguous processor allocation algorithms for mesh-connected multicomputers
-
V. Lo, K. Windisch, W. Liu, and B. Nitzberg. Non-contiguous processor allocation algorithms for mesh-connected multicomputers. IEEE Trans. Parallel and Distributed Systems, 8(7):712-726, 1997.
-
(1997)
IEEE Trans. Parallel and Distributed Systems
, vol.8
, Issue.7
, pp. 712-726
-
-
Lo, V.1
Windisch, K.2
Liu, W.3
Nitzberg, B.4
-
16
-
-
31944440969
-
Pin: Building customized program analysis tools with dynamic instrumentation
-
C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. J. Reddi, and K. Hazelwood. Pin: Building customized program analysis tools with dynamic instrumentation. In PLDI, 2005.
-
(2005)
PLDI
-
-
Luk, C.-K.1
Cohn, R.2
Muth, R.3
Patil, H.4
Klauser, A.5
Lowney, G.6
Wallace, S.7
Reddi, V.J.8
Hazelwood, K.9
-
18
-
-
0035363047
-
Utilization, predictability, workloads, and user runtime estimates in scheduling the IBM SP2 with backfilling
-
A. W. Mu'alem and D. G. Feitelson. Utilization, predictability, workloads, and user runtime estimates in scheduling the IBM SP2 with backfilling. IEEE Trans. Parallel and Distributed Syst., 12(6):529-543, 2001.
-
(2001)
IEEE Trans. Parallel and Distributed Syst
, vol.12
, Issue.6
, pp. 529-543
-
-
Mu'alem, A.W.1
Feitelson, D.G.2
-
20
-
-
78149258297
-
The Portals 4.0 message passing interface
-
April
-
R. E. Riesen, K. T. Pedretti, R. Brightwell, B. W. Barrett, K. D. Underwood, T. B. Hudson, and A. B. Maccabe. The Portals 4.0 message passing interface. Technical Report SAND2008-2639, Sandia National Laboratories, April 2008.
-
(2008)
Technical Report SAND2008-2639, Sandia National Laboratories
-
-
Riesen, R.E.1
Pedretti, K.T.2
Brightwell, R.3
Barrett, B.W.4
Underwood, K.D.5
Hudson, T.B.6
Maccabe, A.B.7
-
21
-
-
80053001343
-
The structural simulation toolkit
-
March
-
A. F. Rodrigues, K. S. Hemmert, B. W. Barrett, C. Kersey, R. Oldfield, M. Weston, R. Risen, J. Cook, P. Rosenfeld, E. CooperBalls, and B. Jacob. The structural simulation toolkit. SIGMETRICS Perform. Eval. Rev., 38:37-42, March 2011.
-
(2011)
SIGMETRICS Perform. Eval. Rev
, vol.38
, pp. 37-42
-
-
Rodrigues, A.F.1
Hemmert, K.S.2
Barrett, B.W.3
Kersey, C.4
Oldfield, R.5
Weston, M.6
Risen, R.7
Cook, J.8
Rosenfeld, P.9
Cooperballs, E.10
Jacob, B.11
-
26
-
-
34548742805
-
Simulating red storm: Challenges and successes in building a system simulation
-
Long Beach, CA, IEEE
-
K. Underwood, M. Levenhagen, and A. Rodrigues. Simulating red storm: Challenges and successes in building a system simulation. In IEEE International Parallel and Distributed Processing Symposium, Long Beach, CA, 2007. IEEE.
-
(2007)
IEEE International Parallel and Distributed Processing Symposium
-
-
Underwood, K.1
Levenhagen, M.2
Rodrigues, A.3
-
27
-
-
80555144336
-
Enabling flexible collective communication offload with triggered operations
-
August
-
K. D. Underwood, J. Coffman, R. Larsen, K. S. Hemmert, B. W. Barrett, R. Brightwell, and M. Levenhagen. Enabling flexible collective communication offload with triggered operations. In Proceedings of 19th Annual Symposium on High-Performance Interconnects (HotI), August 2011.
-
(2011)
Proceedings of 19th Annual Symposium on High-Performance Interconnects (HotI)
-
-
Underwood, K.D.1
Coffman, J.2
Larsen, R.3
Hemmert, K.S.4
Barrett, B.W.5
Brightwell, R.6
Levenhagen, M.7
|