-
2
-
-
84973836157
-
The NAS Parallel Benchmarks
-
Fall
-
D. H. Bailey, E. Barszcz, J. T. Barton, D. S. Browning, R. L. Carter, D. Dagum, R. A. Fatoohi, P. O. Frederickson, T. A. Lasinski, R. S. Schreiber, H. D. Simon, V. Venkatakrishnan, and S. K. Weeratunga. The NAS Parallel Benchmarks. International Journal of Supercomputer Applications, 5(3):63-73, Fall 1991.
-
(1991)
International Journal of Supercomputer Applications
, vol.5
, Issue.3
, pp. 63-73
-
-
Bailey, D.H.1
Barszcz, E.2
Barton, J.T.3
Browning, D.S.4
Carter, R.L.5
Dagum, D.6
Fatoohi, R.A.7
Frederickson, P.O.8
Lasinski, T.A.9
Schreiber, R.S.10
Simon, H.D.11
Venkatakrishnan, V.12
Weeratunga, S.K.13
-
3
-
-
77949706996
-
-
PhD thesis, Department of Computer Science, Princeton University, Princeton, New Jersey, United States, November
-
M. J. Bridges. The VELOCITY Compiler: Extracting Efficient Multicore Execution from Legacy Sequential Codes. PhD thesis, Department of Computer Science, Princeton University, Princeton, New Jersey, United States, November 2008.
-
(2008)
The VELOCITY Compiler: Extracting Efficient Multicore Execution from Legacy Sequential Codes
-
-
Bridges, M.J.1
-
6
-
-
26444605254
-
-
Master's thesis, Department of Computer Science, University of Illinois, Urbana, IL, May
-
J. R. B. Davies. Parallel loop constructs for multiprocessors. Master's thesis, Department of Computer Science, University of Illinois, Urbana, IL, May 1981.
-
(1981)
Parallel Loop Constructs for Multiprocessors
-
-
Davies, J.R.B.1
-
7
-
-
0023385308
-
PROGRAM DEPENDENCE GRAPH and ITS USE in OPTIMIZATION
-
DOI 10.1145/24039.24041
-
J. Ferrante, K. J. Ottenstein, and J. D. Warren. The program dependence graph and its use in optimization. ACM Transactions on Programming Languages and Systems, 9:319-349, July 1987. (Pubitemid 17641083)
-
(1987)
ACM Transactions on Programming Languages and Systems
, vol.9
, Issue.3
, pp. 319-349
-
-
Ferrante, J.1
Ottenstein Karl, J.2
Warren Joe, D.3
-
8
-
-
79959411439
-
FastForward for efficient pipeline parallelism: A cache-optimized concurrent lock-free queue
-
New York, NY, USA, February
-
J. Giacomoni, T. Moseley, and M. Vachharajani. FastForward for efficient pipeline parallelism: a cache-optimized concurrent lock-free queue. In PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 43-52, New York, NY, USA, February 2008.
-
(2008)
PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 43-52
-
-
Giacomoni, J.1
Moseley, T.2
Vachharajani, M.3
-
9
-
-
35048876693
-
Improving parallel irregular reductions using partial array expansion
-
(CDROM), New York, NY, USA, ACM
-
E. Gutiérrez, O. Plata, and E. L. Zapata. Improving parallel irregular reductions using partial array expansion. In Supercomputing '01: Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM), pages 38-38, New York, NY, USA, 2001. ACM.
-
(2001)
Supercomputing '01: Proceedings of the 2001 ACM/IEEE Conference on Supercomputing
, pp. 38-38
-
-
Gutiérrez, E.1
Plata, O.2
Zapata, E.L.3
-
11
-
-
0025550566
-
Loop distribution with arbitrary control flow
-
November
-
K. Kennedy and K. S. McKinley. Loop distribution with arbitrary control flow. In Proceedings of Supercomputing, pages 407-416, November 1990.
-
(1990)
Proceedings of Supercomputing
, pp. 407-416
-
-
Kennedy, K.1
McKinley, K.S.2
-
12
-
-
35448941890
-
Optimistic parallelism requires abstractions
-
New York, NY, USA, ACM
-
M. Kulkarni, K. Pingali, B. Walter, G. Ramanarayanan, K. Bala, and L. P. Chew. Optimistic parallelism requires abstractions. In PLDI '07: Proceedings of the 2007 ACMSIGPLAN Conference on Programming Language Design and Implementation, pages 211-222, New York, NY, USA, 2007. ACM.
-
(2007)
PLDI '07: Proceedings of the 2007 ACMSIGPLAN Conference on Programming Language Design and Implementation
, pp. 211-222
-
-
Kulkarni, M.1
Pingali, K.2
Walter, B.3
Ramanarayanan, G.4
Bala, K.5
Chew, L.P.6
-
13
-
-
47349098275
-
Minebench: A benchmark suite for data mining workloads
-
0
-
R. Narayanan, B. Ozisikyilmaz, J. Zambreno, G. Memik, and A. Choudhary. Minebench: A benchmark suite for data mining workloads. IEEEWorkload Characterization Symposium, 0:182-188, 2006.
-
(2006)
IEEEWorkload Characterization Symposium
, pp. 182-188
-
-
Narayanan, R.1
Ozisikyilmaz, B.2
Zambreno, J.3
Memik, G.4
Choudhary, A.5
-
14
-
-
79959468284
-
Software thread-level speculation: An optimistic library implementation
-
New York, NY, USA, ACM
-
C. E. Oancea and A. Mycroft. Software thread-level speculation: an optimistic library implementation. In IWMSE '08: Proceedings of the 1st International Workshop onMulticore Software Engineering, pages 23-32, New York, NY, USA, 2008. ACM.
-
(2008)
IWMSE '08: Proceedings of the 1st International Workshop OnMulticore Software Engineering
, pp. 23-32
-
-
Oancea, C.E.1
Mycroft, A.2
-
15
-
-
33749375700
-
Automatic thread extraction with decoupled software pipelining
-
November
-
G. Ottoni, R. Rangan, A. Stoler, and D. I. August. Automatic thread extraction with decoupled software pipelining. In Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture, pages 105-116, November 2005.
-
(2005)
Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 105-116
-
-
Ottoni, G.1
Rangan, R.2
Stoler, A.3
August, D.I.4
-
16
-
-
43449113286
-
Parallel-stage decoupled software pipelining
-
E. Raman, G. Ottoni, A. Raman, M. Bridges, and D. I. August. Parallel-stage decoupled software pipelining. In Proceedings of the 2008 International Symposium on Code Generation and Optimization, April 2008.
-
Proceedings of the 2008 International Symposium on Code Generation and Optimization, April 2008
-
-
Raman, E.1
Ottoni, G.2
Raman, A.3
Bridges, M.4
August, D.I.5
-
17
-
-
0033076827
-
The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization
-
L. Rauchwerger and D. A. Padua. The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization. IEEE Transactions on Parallel and Distributed Systems, 10(2):160-180, 1999.
-
(1999)
IEEE Transactions on Parallel and Distributed Systems
, vol.10
, Issue.2
, pp. 160-180
-
-
Rauchwerger, L.1
Padua, D.A.2
-
18
-
-
77954011474
-
Runtime characterisation of irregular accesses applied to parallelisation of irregular reductions
-
D. E. Singh, M. J. Martin, and F. F. Rivera. Runtime characterisation of irregular accesses applied to parallelisation of irregular reductions. Int. J. Comput. Sci. Eng., 1(1):1-14, 2005.
-
(2005)
Int. J. Comput. Sci. Eng.
, vol.1
, Issue.1
, pp. 1-14
-
-
Singh, D.E.1
Martin, M.J.2
Rivera, F.F.3
-
19
-
-
77954008735
-
-
Standard Performance Evaluation Corporation (SPEC). http://www.spec.org.
-
-
-
-
20
-
-
33745198176
-
The STAMPede approach to thread-level speculation
-
February
-
J. G. Steffan, C. Colohan, A. Zhai, and T. C. Mowry. The STAMPede approach to thread-level speculation. ACMTransactions on Computer Systems, 23(3):253-300, February 2005.
-
(2005)
ACMTransactions on Computer Systems
, vol.23
, Issue.3
, pp. 253-300
-
-
Steffan, J.G.1
Colohan, C.2
Zhai, A.3
Mowry, T.C.4
-
22
-
-
41349089872
-
Speculative decoupled software pipelining
-
N. Vachharajani, R. Rangan, E. Raman, M. J. Bridges, G. Ottoni, and D. I. August. Speculative decoupled software pipelining. In Proceedings of the 16th International Conference on Parallel Architectures and Compilation Techniques, September 2007.
-
Proceedings of the 16th International Conference on Parallel Architectures and Compilation Techniques, September 2007
-
-
Vachharajani, N.1
Rangan, R.2
Raman, E.3
Bridges, M.J.4
Ottoni, G.5
August, D.I.6
|