-
1
-
-
34748817436
-
Adaptive work stealing with parallelism feedback
-
DOI 10.1145/1229428.1229448, Proceedings of the 2007 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'07
-
Agrawal, K., He, Y., Leiserson, C. E.: Adaptive work stealing with parallelism feedback. In: Yelick, K. A., Mellor-Crummey, J. M. (eds.) Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (12th PPOPP'2007), pp. 112-120. ACM, New York (2007) (Pubitemid 47479086)
-
(2007)
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP
, pp. 112-120
-
-
Agrawal, K.1
He, Y.2
Leiserson, C.E.3
-
2
-
-
33646421297
-
The Fortress language specification, version 1.0beta
-
Mar
-
Allen, E., Chase, D., Hallett, J., Luchangco, V., Maessen, J.-W., Ryu, S., Steele, G. L. Jr., Tobin-Hochstadt, S.: The Fortress Language Specification, version 1.0beta. Technical report, SUN, Mar. (2007)
-
(2007)
Technical Report, SUN
-
-
Allen, E.1
Chase, D.2
Hallett, J.3
Luchangco, V.4
Maessen, J.-W.5
Ryu, S.6
Steele Jr., G.L.7
Tobin-Hochstadt, S.8
-
3
-
-
0029429935
-
Balancing processor loads and exploiting data locality in n-body simulations
-
ACM, New York, NY, USA
-
Banicescu, I., Hummel, S. F.: Balancing processor loads and exploiting data locality in n-body simulations. In: Supercomputing'95: Proceedings of the 1995 ACM/IEEE Conference on Supercomputing (CDROM), p. 43. ACM, New York, NY, USA (1995)
-
(1995)
Supercomputing'95: Proceedings of the 1995 ACM/IEEE Conference on Supercomputing (CDROM)
, p. 43
-
-
Banicescu, I.1
Hummel, S.F.2
-
4
-
-
19644399842
-
On the scalability of dynamic scheduling scientific applications with adaptive weighted factoring
-
Banicescu, I., Velusamy, V., Devaprasad, J.: On the scalability of dynamic scheduling scientific applications with adaptive weighted factoring. Clust. Comput. J. Netw. Softw. Tools Appl. 6, 215-226 (2003)
-
(2003)
Clust. Comput. J. Netw. Softw. Tools Appl
, vol.6
, pp. 215-226
-
-
Banicescu, I.1
Velusamy, V.2
Devaprasad, J.3
-
5
-
-
34548265764
-
CellSs: A programming model for the Cell BE architecture
-
Bellens, P., Perez, J. M., Badia, R. M., Labarta, J.: CellSs: A programming model for the Cell BE architecture. In: Proceedings of the 2006 ACM/IEEE SC'06 Conference. IEEE (2006)
-
(2006)
Proceedings of the 2006 ACM/IEEE SC'06 Conference. IEEE
-
-
Bellens, P.1
Perez, J.M.2
Badia, R.M.3
Labarta, J.4
-
6
-
-
0030601279
-
Cilk: An efficient multithreaded runtime system
-
DOI 10.1006/jpdc.1996.0107
-
Blumofe, R., Joerg, C., Kuszmaul, B., Leiserson, C., Randall, K., Zhou, Y.: Cilk: An efficient multithreaded runtime system. In: Proceedings of the 5th Symposium on Principles and Practice of Parallel Programming (PPOPP'1995), pp. 55-69. ACM (1995) (Pubitemid 126167766)
-
(1996)
Journal of Parallel and Distributed Computing
, vol.37
, Issue.1
, pp. 55-69
-
-
Blumofe, R.D.1
Joerg, C.F.2
Kuszmaul, B.C.3
Leiserson, C.E.4
Randall, K.H.5
Zhou, Y.6
-
8
-
-
85035595949
-
Executing functional programs on a virtual tree of processors
-
ACM, New York, NY, USA
-
Burton, F. W., Sleep, M. R.: Executing functional programs on a virtual tree of processors. In: FPCA'81: Proceedings of the 1981 Conference on Functional Programming Languages and Computer Architecture, pp. 187-194. ACM, New York, NY, USA. (1981)
-
(1981)
FPCA'81: Proceedings of the 1981 Conference on Functional Programming Languages and Computer Architecture
, pp. 187-194
-
-
Burton, F.W.1
Sleep, M.R.2
-
9
-
-
43449114846
-
Dynamic load balancing with adaptive factoring methods in scientific applications
-
DOI 10.1007/s11227-007-0148-y
-
Cariño, R., Banicescu, I.: Dynamic load balancing with adaptive factoring methods in scientific applications. J. Supercomput. 44(1), 41-63 (2008) (Pubitemid 351665234)
-
(2008)
Journal of Supercomputing
, vol.44
, Issue.1
, pp. 41-63
-
-
Carino, R.L.1
Banicescu, I.2
-
10
-
-
33745200313
-
X10: An object-oriented approach to non-uniform cluster computing
-
Johnson, R., Gabriel, R. P. eds., Languages, and Applications OOPSLA, ACM, New York
-
Charles, P., Grothoff, C., Saraswat, V. A., Donawa, C., Kielstra, A., Ebcioglu, K., von Praun, C., Sarkar, V.: X10: An object-oriented approach to non-uniform cluster computing. In: Johnson, R., Gabriel, R. P. (eds.) Proceedings of the 20th Annual ACM SIGPLAN Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA), pp. 519-538. ACM, New York (2005)
-
(2005)
Proceedings of the 20th Annual ACM SIGPLAN Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA)
, pp. 519-538
-
-
Charles, P.1
Grothoff, C.2
Saraswat, V.A.3
Donawa, C.4
Kielstra, A.5
Ebcioglu, K.6
Von Praun, C.7
Sarkar, V.8
-
11
-
-
7444229864
-
The Cascade high productivity language
-
IEEE
-
Callahan, D., Chamberlain, B. L., Zima, H. P.: The Cascade high productivity language. In: 9th International Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS'04), pp. 52-60. IEEE (2004)
-
(2004)
9th International Workshop on High-level Parallel Programming Models and Supportive Environments (HIPS'04)
, pp. 52-60
-
-
Callahan, D.1
Chamberlain, B.L.2
Zima, H.P.3
-
12
-
-
74049140383
-
Scalable work stealing
-
Storage and Analysis, ACM
-
Dinan, J., Larkins, D., Sadayappan, P., Krishnamoorthy, S., Nieplocha, J.: Scalable work stealing. In: SC'09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, pp. 1-11. ACM (2009)
-
(2009)
SC'09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
, pp. 1-11
-
-
Dinan, J.1
Larkins, D.2
Sadayappan, P.3
Krishnamoorthy, S.4
Nieplocha, J.5
-
13
-
-
70350786554
-
An adaptive cut-off for task parallelism
-
Austin, TX, Nov. 2008. Universitat Politecnica de Catalunya
-
Duran, A., Corbalan, J., Ayguade, E.: An adaptive cut-off for task parallelism. In: SC'08 USB Key. ACM/IEEE, Austin, TX, Nov. 2008. Universitat Politecnica de Catalunya (2008)
-
(2008)
SC'08 USB Key. ACM/IEEE
-
-
Duran, A.1
Corbalan, J.2
Ayguade, E.3
-
14
-
-
0021658497
-
Implementation of Multilisp: Lisp on a multiprocessor
-
ACM, New York, NY, USA
-
Halstead, R. H. Jr.: Implementation of Multilisp: Lisp on a multiprocessor. In: LFP'84: Proceedings of the 1984 ACM Symposium on LISP and Functional Programming, pp. 9-17. ACM, New York, NY, USA (1984)
-
(1984)
LFP'84: Proceedings of the 1984 ACM Symposium on LISP and Functional Programming
, pp. 9-17
-
-
Halstead Jr., R.H.1
-
15
-
-
33751098538
-
A rapid hierarchical radiosity algorithm
-
Hanrahan, P., Salzman, D., Aupperle, L.: A rapid hierarchical radiosity algorithm. ACM SIGGRAPH Comput. Graph. 25(4), 197-206 (1991)
-
(1991)
ACM SIGGRAPH Comput. Graph
, vol.25
, Issue.4
, pp. 197-206
-
-
Hanrahan, P.1
Salzman, D.2
Aupperle, L.3
-
17
-
-
84947273880
-
Task pool teams for implementing irregular algorithms on clusters of SMPs
-
CD-ROM
-
Hippold, J., Rünger, G.: Task pool teams for implementing irregular algorithms on clusters of SMPs. In: Proceedings of IPDPS. Nice, France, CD-ROM (2003)
-
(2003)
Proceedings of IPDPS. Nice, France
-
-
Hippold, J.1
Rünger, G.2
-
18
-
-
0002217386
-
Quicksort
-
Hoare, C. A. R.: Quicksort. Comput. J. 5(4), 10-15 (1962)
-
(1962)
Comput. J
, vol.5
, Issue.4
, pp. 10-15
-
-
Hoare, C.A.R.1
-
19
-
-
51849123300
-
Fine-grained task scheduling using adaptive data structures
-
of LNCS, Springer
-
Hoffmann, R., Rauber, T.: Fine-grained task scheduling using adaptive data structures. In: Proceedings of Euro-Par 2008, vol. 5168 of LNCS, pp. 253-262. Springer (2008)
-
(2008)
Proceedings of Euro-par 2008
, vol.5168
, pp. 253-262
-
-
Hoffmann, R.1
Rauber, T.2
-
20
-
-
0002479236
-
CHARM++
-
Wilson, G. V., Lu, P. eds., chap. 5, MIT Press, Cambridge, MA
-
Kalé, L. V., Krishnan, S.: CHARM++. In: Wilson, G. V., Lu, P. (eds.) Parallel Programming in C++, chap. 5, pp. 175-214. MIT Press, Cambridge, MA (1996)
-
(1996)
Parallel Programming in C++
, pp. 175-214
-
-
Kalé, L.V.1
Krishnan, S.2
-
21
-
-
35348855586
-
Carbon: Architectural support for fine-grained parallelism on chip multiprocessors
-
DOI 10.1145/1250662.1250683, ISCA'07: 34th Annual International Symposium on Computer Architecture, Conference Proceedings
-
Kumar, S., Hughes, C. J., Nguyen, A.: Carbon: Architectural support for fine-grained parallelism on chip multiprocessors. ACM SIGARCH Comput. Arch. News 35(2), 162-173 (2007) (Pubitemid 47582100)
-
(2007)
Proceedings - International Symposium on Computer Architecture
, pp. 162-173
-
-
Kumar, S.1
Hughes, C.J.2
Nguyen, A.3
-
22
-
-
0002088086
-
Scalable load balancing techniques for parallel computers
-
Kumar, V., Grama, A., Vempaty, N.: Scalable load balancing techniques for parallel computers. J. Parallel Distrib. Comput. 22(1), 60-79 (1994)
-
(1994)
J. Parallel Distrib. Comput
, vol.22
, Issue.1
, pp. 60-79
-
-
Kumar, V.1
Grama, A.2
Vempaty, N.3
-
23
-
-
0023535689
-
Guided self-scheduling: A practical scheduling scheme for parallel supercomputers
-
Polychronopoulos, C., Kuck, D.: Guided self-scheduling: A practical scheduling scheme for parallel supercomputers. IEEE Trans. Comput. C-36(12), 1425-1439 (1987) (Pubitemid 18537642)
-
(1987)
IEEE Transactions on Computers
, vol.C-36
, Issue.12
, pp. 1425-1439
-
-
Polychronopoulos, C.1
Kuck, D.J.2
-
26
-
-
0347810322
-
A Unified algorithm for load-balancing adaptive scientific simulations
-
IEEE
-
Schloegel, K., Karypis, G., Kumar, V.: A Unified algorithm for load-balancing adaptive scientific simulations. In: Proceedings of Supercomputing'2000, pp. 75-75. IEEE (2000)
-
(2000)
Proceedings of Supercomputing'2000
, pp. 75-75
-
-
Schloegel, K.1
Karypis, G.2
Kumar, V.3
-
28
-
-
0028466452
-
Parallel visualization algorithms: Performance and architectural implications
-
Singh, J. P., Gupta, A., Levoy, M.: Parallel visualization algorithms: Performance and architectural implications. IEEE Comput. 27(7), 45-55 (1994)
-
(1994)
IEEE Comput
, vol.27
, Issue.7
, pp. 45-55
-
-
Singh, J.P.1
Gupta, A.2
Levoy, M.3
-
29
-
-
0043005053
-
Load balancing and data locality in adaptive hierarchical n-body methods: Barnes-Hut, fast multipole, and radiosity
-
Singh, J. P., Holt, C., Totsuka, T., Gupta, A., Hennessy, J. L.: Load balancing and data locality in adaptive hierarchical n-body methods: Barnes-Hut, fast multipole, and radiosity. J. Parallel Distrib. Comput. 27(2), 118-141 (1995)
-
(1995)
J. Parallel Distrib. Comput
, vol.27
, Issue.2
, pp. 118-141
-
-
Singh, J.P.1
Holt, C.2
Totsuka, T.3
Gupta, A.4
Hennessy, J.L.5
-
30
-
-
0029179077
-
The SPLASH-2 programs: Characterization and methodological considerations
-
ACM, Santa Margherita Ligure, Italy
-
Woo, S. C., Ohara, M., Torrie, E., Singh, J. P., Gupta, A.: The SPLASH-2 programs: characterization and methodological considerations. In: Proceedings of the 22nd International Symposium on Computer Architecture, pp. 24-36. ACM, Santa Margherita Ligure, Italy (1995)
-
(1995)
Proceedings of the 22nd International Symposium on Computer Architecture
, pp. 24-36
-
-
Woo, S.C.1
Ohara, M.2
Torrie, E.3
Singh, J.P.4
Gupta, A.5