-
1
-
-
0033703889
-
A scalable approach to thread-level speculation
-
Steffan, J.G., Colohan, C.B., Zhai, A., Mowry, T.C.: A scalable approach to thread-level speculation. In: Proceedings of the 27th Annual International Symposium on Computer Architecture (2000)
-
(2000)
Proceedings of the 27th Annual International Symposium on Computer Architecture
-
-
Steffan, J.G.1
Colohan, C.B.2
Zhai, A.3
Mowry, T.C.4
-
2
-
-
0031599505
-
Hardware for speculative run-time parallelization in distributed shared memory multiprocessors
-
Rauchwerger, L., Zhan, Y., Torrellas, J.: Hardware for speculative run-time parallelization in distributed shared memory multiprocessors. In: Proceedings of the 4th International Symposium on High-Performance Computer Architecture, p. 162 (1998)
-
(1998)
Proceedings of the 4th International Symposium on High-Performance Computer Architecture
, pp. 162
-
-
Rauchwerger, L.1
Zhan, Y.2
Torrellas, J.3
-
3
-
-
42549111870
-
Optimistic parallelism requires abstractions
-
Kulkarni, M., Pingali, K., Walter, B., Ramanarayanan, G., Bala, K., Chew, P.: Optimistic parallelism requires abstractions. In: Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation, pp. 211-222 (2007)
-
(2007)
Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation
, pp. 211-222
-
-
Kulkarni, M.1
Pingali, K.2
Walter, B.3
Ramanarayanan, G.4
Bala, K.5
Chew, P.6
-
4
-
-
35348812496
-
Synchronization state buffer: Supporting efficient fine-grain synchronization on many-core architectures
-
Zhu, W., Sreedhar, V.C., Hu, Z., Gao, G.R.: Synchronization state buffer: Supporting efficient fine-grain synchronization on many-core architectures. In: The 34th International Symposium on Computer Architecture (2007)
-
(2007)
The 34th International Symposium on Computer Architecture
-
-
Zhu, W.1
Sreedhar, V.C.2
Hu, Z.3
Gao, G.R.4
-
5
-
-
34547423880
-
Exploiting coarse-grained task, data, and pipeline parallelism in stream programs
-
San Jose, CA
-
Gordon, M., Thies, W., Amarasinghe, S.: Exploiting coarse-grained task, data, and pipeline parallelism in stream programs. In: International Conference on Architectural Support for Programming Languages and Operating Systems, San Jose, CA (2006)
-
(2006)
International Conference on Architectural Support for Programming Languages and Operating Systems
-
-
Gordon, M.1
Thies, W.2
Amarasinghe, S.3
-
7
-
-
2342641297
-
-
Addison Wesley, Reading
-
Grama. A., Gupta, A., Karypis, G., Kumar, V.: Introduction to Parallel Computing. Addison Wesley, Reading (2003)
-
(2003)
Introduction to Parallel Computing
-
-
Grama, A.1
Gupta, A.2
Karypis, G.3
Kumar, V.4
-
8
-
-
0004052742
-
-
Kluwer Academic Publishers, Dordrecht
-
Zuker, M., Mathews, D.H., Turner, D.H.: Algorithms and Thermodynamics for RNA Secondary Structure Prediction: A Practical Guide. Kluwer Academic Publishers, Dordrecht (1999)
-
(1999)
Algorithms and Thermodynamics for RNA Secondary Structure Prediction: A Practical Guide
-
-
Zuker, M.1
Mathews, D.H.2
Turner, D.H.3
-
9
-
-
34548211484
-
Locality and parallelism optimization for dynamic programming algorithm in bioinformatics
-
ACM, New York
-
Tan, G., Feng, S., Sun, N.: Locality and parallelism optimization for dynamic programming algorithm in bioinformatics. In: SC 2006: Proceedings of the, ACM/IEEE conference on Supercomputing, p. 78. ACM, New York (2006)
-
(2006)
SC 2006: Proceedings of the, ACM/IEEE conference on Supercomputing
, pp. 78
-
-
Tan, G.1
Feng, S.2
Sun, N.3
-
10
-
-
33746317085
-
Tiny threads: Athread virtual machine for the cyclops-64 cellular architecture
-
Cuvillo, J., Zhu, W., Hu, Z., Gao, G.R.: Tiny threads: athread virtual machine for the cyclops-64 cellular architecture. In: Fifth Workshop on Massively Parallel Processing (WMPP), held in conjunction with the 19th rnational Parallel and Distributed Processing System (2005)
-
(2005)
Fifth Workshop on Massively Parallel Processing (WMPP), held in conjunction with the 19th rnational Parallel and Distributed Processing System
-
-
Cuvillo, J.1
Zhu, W.2
Hu, Z.3
Gao, G.R.4
-
11
-
-
34247379126
-
Landing openmp on cyclops-64: An efficient mapping of openmp to a many-core system-on-a-chip
-
Ischia, Italy
-
Cuvillo, J., Zhu, W., Gao, G.R.: Landing openmp on cyclops-64: An efficient mapping of openmp to a many-core system-on-a-chip. In: The 3rd ACM International Conference on Computing Frontiers, Ischia, Italy (2005)
-
(2005)
The 3rd ACM International Conference on Computing Frontiers
-
-
Cuvillo, J.1
Zhu, W.2
Gao, G.R.3
-
12
-
-
0030385545
-
Hybrid technology multi-threaded architecture
-
Gao, G.R., Likharev, K.K., Messina, P.C., Sterling, T.L.: Hybrid technology multi-threaded architecture. In: Proceedings of Frontiers 1996: The Sixth Symposium on the Frontiers of Massively Parallel Computation, pp. 98-105 (1996)
-
(1996)
Proceedings of Frontiers 1996: The Sixth Symposium on the Frontiers of Massively Parallel Computation
, pp. 98-105
-
-
Gao, G.R.1
Likharev, K.K.2
Messina, P.C.3
Sterling, T.L.4
-
13
-
-
58449116190
-
-
Amaral, J.N., Gao, G.R., Merkey, P., Sterling, T., Ruiz, Z., Ryan, S.: Performance prediction for the htmt: A programming example. In: TFP3 1999 (1999)
-
Amaral, J.N., Gao, G.R., Merkey, P., Sterling, T., Ruiz, Z., Ryan, S.: Performance prediction for the htmt: A programming example. In: TFP3 1999 (1999)
-
-
-
-
14
-
-
58449115323
-
A refinement of the "htmt" program execution model
-
Technical report, CAPSL, University of Delaware
-
Gao, G., Amaral, J.N., Marquez, A., Theobald, K.: A refinement of the "htmt" program execution model. Technical report, CAPSL, University of Delaware (1998)
-
(1998)
-
-
Gao, G.1
Amaral, J.N.2
Marquez, A.3
Theobald, K.4
-
16
-
-
0029199163
-
Speeding up irregular applicaitons in shared-memory multiprocessors: Memory binding and group prefetching
-
Zhang, Z., Torrellas, J.: Speeding up irregular applicaitons in shared-memory multiprocessors: Memory binding and group prefetching. In: 22nd International Symposium on Computer Architecture (1995)
-
(1995)
22nd International Symposium on Computer Architecture
-
-
Zhang, Z.1
Torrellas, J.2
-
17
-
-
0002031606
-
Tolerating latency through software-controlled prefetching in shared-memory multiprocessors
-
Mowry, T., Gupta, A.: Tolerating latency through software-controlled prefetching in shared-memory multiprocessors. Journal of Parallel and Distributed Computing 12, 87-106 (1991)
-
(1991)
Journal of Parallel and Distributed Computing
, vol.12
, pp. 87-106
-
-
Mowry, T.1
Gupta, A.2
-
18
-
-
0035691709
-
Dynamic speculative preeomputation
-
Collins, J.D., Tullsen, D.M., Wang, H., Shen, J.P.: Dynamic speculative preeomputation. In: The 34th Annual International Symposium on Microarchitecture (2001)
-
(2001)
The 34th Annual International Symposium on Microarchitecture
-
-
Collins, J.D.1
Tullsen, D.M.2
Wang, H.3
Shen, J.P.4
-
20
-
-
34548052234
-
Executing irregular scientific applications on stream architectures
-
ACM, New York
-
Erez, M., Ahn, J.H., Gummaraju, J., Rosenblum, M., Dally, W.J.: Executing irregular scientific applications on stream architectures. In: ICS 2007: Proceedings of the 21st annual international conference on Supercomputing, pp. 93-104. ACM, New York (2007)
-
(2007)
ICS 2007: Proceedings of the 21st annual international conference on Supercomputing
, pp. 93-104
-
-
Erez, M.1
Ahn, J.H.2
Gummaraju, J.3
Rosenblum, M.4
Dally, W.J.5
|