-
1
-
-
79955370378
-
The future of microprocessors
-
S. Borkar and A. Chien, "The future of microprocessors," Communications of the ACM, vol. 54, no. 5, pp. 67-77, 2011.
-
(2011)
Communications of the ACM
, vol.54
, Issue.5
, pp. 67-77
-
-
Borkar, S.1
Chien, A.2
-
3
-
-
34547261834
-
Thousand core chips: A technology perspective
-
S. Borkar, "Thousand core chips: a technology perspective," in the Design Automation Conf., 2007, pp. 746-749.
-
(2007)
The Design Automation Conf.
, pp. 746-749
-
-
Borkar, S.1
-
4
-
-
85024275309
-
Software and the concurrency revolution
-
H. Sutter and J. Larus, "Software and the concurrency revolution," Queue, vol. 3, no. 7, pp. 54-62, 2005.
-
(2005)
Queue
, vol.3
, Issue.7
, pp. 54-62
-
-
Sutter, H.1
Larus, J.2
-
5
-
-
0042078549
-
A survey of rollback-recovery protocols in message-passing systems
-
E. N. M. Elnozahy, L. Alvisi, Y.-M. Wang, and D. B. Johnson, "A survey of rollback-recovery protocols in message-passing systems," ACM Computing Surveys, vol. 34, pp. 375-408, 2002.
-
(2002)
ACM Computing Surveys
, vol.34
, pp. 375-408
-
-
Elnozahy, E.N.M.1
Alvisi, L.2
Wang, Y.-M.3
Johnson, D.B.4
-
7
-
-
79951689916
-
Minimal multi-threading: Finding and removing redundant instructions in multi-threaded processors
-
G. Long, D. Franklin, S. Biswas, P. Ortiz, J. Oberg, D. Fan, and F. T. Chong, "Minimal multi-threading: Finding and removing redundant instructions in multi-threaded processors," in IEEE/ACM Int'l Symposium on Microarchitecture, 2010, pp. 337-348.
-
(2010)
IEEE/ACM Int'l Symposium on Microarchitecture
, pp. 337-348
-
-
Long, G.1
Franklin, D.2
Biswas, S.3
Ortiz, P.4
Oberg, J.5
Fan, D.6
Chong, F.T.7
-
8
-
-
48449104482
-
Characterization of error-tolerant applications when protecting control data
-
D. Thaker, D. Franklin, J. Oliver, S. Biswas, D. Lockhart, T. Metodi, and F. Chong, "Characterization of error-tolerant applications when protecting control data," in IEEE Int'l Symposium on Workload Characterization, 2006, pp. 142-149.
-
(2006)
IEEE Int'l Symposium on Workload Characterization
, pp. 142-149
-
-
Thaker, D.1
Franklin, D.2
Oliver, J.3
Biswas, S.4
Lockhart, D.5
Metodi, T.6
Chong, F.7
-
9
-
-
33646829087
-
SWIFT: Software implemented fault tolerance
-
G. Reis, J. Chang, N. Vachharajani, R. Rangan, and D. August, "SWIFT: Software implemented fault tolerance," in the Int'l Symposium on Code Generation and Optimization, 2005, pp. 243-254.
-
(2005)
The Int'l Symposium on Code Generation and Optimization
, pp. 243-254
-
-
Reis, G.1
Chang, J.2
Vachharajani, N.3
Rangan, R.4
August, D.5
-
10
-
-
78149269828
-
DAFT: Decoupled acyclic fault tolerance
-
Y. Zhang, J. Lee, N. Johnson, and D. August, "DAFT: decoupled acyclic fault tolerance," in the Int'l Conf. on Parallel Architectures and Compilation Techniques, 2010, pp. 87-98.
-
(2010)
The Int'l Conf. on Parallel Architectures and Compilation Techniques
, pp. 87-98
-
-
Zhang, Y.1
Lee, J.2
Johnson, N.3
August, D.4
-
11
-
-
77956600750
-
AutomaDeD: Automata-based debugging for dissimilar parallel tasks
-
G. Bronevetsky, I. Laguna, S. Bagchi, B. de Supinski, D. Ahn, and M. Schulz, "AutomaDeD: Automata-based debugging for dissimilar parallel tasks," in IEEE/IFIP Int'l Conf. on Dependable Systems and Networks, 2010, pp. 231-240.
-
(2010)
IEEE/IFIP Int'l Conf. on Dependable Systems and Networks
, pp. 231-240
-
-
Bronevetsky, G.1
Laguna, I.2
Bagchi, S.3
De Supinski, B.4
Ahn, D.5
Schulz, M.6
-
12
-
-
34548294322
-
Problem diagnosis in large-scale computing environments
-
A. Mirgorodskiy, N. Maruyama, and B. Miller, "Problem diagnosis in large-scale computing environments," in ACM/IEEE Conf. on Supercomputing, 2006, pp. 88-100.
-
(2006)
ACM/IEEE Conf. on Supercomputing
, pp. 88-100
-
-
Mirgorodskiy, A.1
Maruyama, N.2
Miller, B.3
-
13
-
-
0029179077
-
The SPLASH-2 programs: Characterization and methodological considerations
-
S. Woo, M. Ohara, E. Torrie, J. Singh, and A. Gupta, "The SPLASH-2 programs: Characterization and methodological considerations," in ACM SIGARCH Computer Architecture News, vol. 23, no. 2, 1995, pp. 24-36.
-
(1995)
ACM SIGARCH Computer Architecture News
, vol.23
, Issue.2
, pp. 24-36
-
-
Woo, S.1
Ohara, M.2
Torrie, E.3
Singh, J.4
Gupta, A.5
-
14
-
-
0026243790
-
Efficiently computing static single assignment form and the control dependence graph
-
R. Cytron, J. Ferrante, B. Rosen, M. Wegman, and F. Zadeck, "Efficiently computing static single assignment form and the control dependence graph," ACM Trans. on Programming Languages and Systems, vol. 13, no. 4, pp. 451-490, 1991.
-
(1991)
ACM Trans. on Programming Languages and Systems
, vol.13
, Issue.4
, pp. 451-490
-
-
Cytron, R.1
Ferrante, J.2
Rosen, B.3
Wegman, M.4
Zadeck, F.5
-
15
-
-
84976663837
-
Specifying concurrent program modules
-
L. Lamport, "Specifying concurrent program modules," ACM Trans. on Programming Languages and Systems, vol. 5, no. 2, pp. 190-222, 1983.
-
(1983)
ACM Trans. on Programming Languages and Systems
, vol.5
, Issue.2
, pp. 190-222
-
-
Lamport, L.1
-
19
-
-
84897584233
-
PIN: A binary instrumentation tool for computer architecture research and education
-
V. Reddi, A. Settle, D. Connors, and R. Cohn, "PIN: a binary instrumentation tool for computer architecture research and education," in the Workshop on Computer Architecture Education, 2004.
-
(2004)
The Workshop on Computer Architecture Education
-
-
Reddi, V.1
Settle, A.2
Connors, D.3
Cohn, R.4
-
21
-
-
0036443168
-
Loose synchronization of multithreaded replicas
-
C. Basile, K. Whisnant, Z. Kalbarczyk, and R. Iyer, "Loose synchronization of multithreaded replicas," in IEEE Symposium on Reliable Distributed Systems, 2002, pp. 250-255.
-
(2002)
IEEE Symposium on Reliable Distributed Systems
, pp. 250-255
-
-
Basile, C.1
Whisnant, K.2
Kalbarczyk, Z.3
Iyer, R.4
-
22
-
-
67650834931
-
Kendo: Efficient deterministic multithreading in software
-
M. Olszewski, J. Ansel, and S. Amarasinghe, "Kendo: efficient deterministic multithreading in software," in ACM SIGPLAN Notices, vol. 44, no. 3, 2009, pp. 97-108.
-
(2009)
ACM SIGPLAN Notices
, vol.44
, Issue.3
, pp. 97-108
-
-
Olszewski, M.1
Ansel, J.2
Amarasinghe, S.3
-
23
-
-
0032674982
-
Design and evaluation of system-level checks for on-line control flow error detection
-
Z. Alkhalifa, V. Nair, N. Krishnamurthy, and J. Abraham, "Design and evaluation of system-level checks for on-line control flow error detection," IEEE Trans. on Parallel and Distributed Systems, vol. 10, no. 6, pp. 627-641, 1999.
-
(1999)
IEEE Trans. on Parallel and Distributed Systems
, vol.10
, Issue.6
, pp. 627-641
-
-
Alkhalifa, Z.1
Nair, V.2
Krishnamurthy, N.3
Abraham, J.4
-
24
-
-
38149112346
-
Design and evaluation of preemptive control signature (PECOS) checking
-
S. Bagchi, Z. Kalbarczyk, R. Iyer, and Y. Levendel, "Design and evaluation of preemptive control signature (PECOS) checking," IEEE Trans. on Computers, 2003.
-
(2003)
IEEE Trans. on Computers
-
-
Bagchi, S.1
Kalbarczyk, Z.2
Iyer, R.3
Levendel, Y.4
-
25
-
-
0036507891
-
Control-flow checking by software signatures
-
N. Oh, P. Shirvani, and E. McCluskey, "Control-flow checking by software signatures," IEEE Transactions on Reliability, vol. 51, no. 1, pp. 111-122, 2002.
-
(2002)
IEEE Transactions on Reliability
, vol.51
, Issue.1
, pp. 111-122
-
-
Oh, N.1
Shirvani, P.2
McCluskey, E.3
-
26
-
-
56749097000
-
DMTracker: Finding bugs in large-scale parallel programs by detecting anomaly in data movements
-
Q. Gao, F. Qin, and D. Panda, "DMTracker: finding bugs in large-scale parallel programs by detecting anomaly in data movements," in ACM/IEEE Conf. on Supercomputing, 2007, pp. 1-12.
-
(2007)
ACM/IEEE Conf. on Supercomputing
, pp. 1-12
-
-
Gao, Q.1
Qin, F.2
Panda, D.3
-
27
-
-
78650817530
-
FlowChecker: Detecting bugs in MPI libraries via message flow checking
-
Z. Chen, Q. Gao, W. Zhang, and F. Qin, "FlowChecker: Detecting bugs in MPI libraries via message flow checking," in ACM/IEEE Int'l Conf. for High Performance Computing, Networking, Storage and Analysis, 2010, pp. 1-11.
-
(2010)
ACM/IEEE Int'l Conf. for High Performance Computing, Networking, Storage and Analysis
, pp. 1-11
-
-
Chen, Z.1
Gao, Q.2
Zhang, W.3
Qin, F.4
-
28
-
-
33746060520
-
Effective static race detection for Java
-
M. Naik, A. Aiken, and J. Whaley, "Effective static race detection for Java," ACM SIGPLAN Conf. on Programming Language Design and Implementation, vol. 41, no. 6, pp. 308-319, 2006.
-
(2006)
ACM SIGPLAN Conf. on Programming Language Design and Implementation
, vol.41
, Issue.6
, pp. 308-319
-
-
Naik, M.1
Aiken, A.2
Whaley, J.3
-
29
-
-
78650808116
-
A scalable and distributed dynamic formal verifier for MPI programs
-
A. Vo, S. Aananthakrishnan, G. Gopalakrishnan, B. de Supinski, M. Schulz, and G. Bronevetsky, "A scalable and distributed dynamic formal verifier for MPI programs," in ACM/IEEE Int'l Conf. for High Performance Computing, Networking, Storage and Analysis, 2010, pp. 1-10.
-
(2010)
ACM/IEEE Int'l Conf. for High Performance Computing, Networking, Storage and Analysis
, pp. 1-10
-
-
Vo, A.1
Aananthakrishnan, S.2
Gopalakrishnan, G.3
De Supinski, B.4
Schulz, M.5
Bronevetsky, G.6
-
30
-
-
46749109635
-
Automated derivation of application-aware error detectors using static analysis
-
K. Pattabiraman, Z. Kalbarczyk, and R. Iyer, "Automated derivation of application-aware error detectors using static analysis," in IEEE Int'l On-Line Testing Symposium, 2007, pp. 211-216.
-
(2007)
IEEE Int'l On-Line Testing Symposium
, pp. 211-216
-
-
Pattabiraman, K.1
Kalbarczyk, Z.2
Iyer, R.3
-
31
-
-
0036930706
-
On the placement of software mechanisms for detection of data errors
-
M. Hiller, A. Jhumka, and N. Suri, "On the placement of software mechanisms for detection of data errors," in IEEE/IFIP Int'l Conf. on Dependable Systems and Networks, 2002, pp. 135-144.
-
(2002)
IEEE/IFIP Int'l Conf. on Dependable Systems and Networks
, pp. 135-144
-
-
Hiller, M.1
Jhumka, A.2
Suri, N.3
-
32
-
-
0036038345
-
Tracking down software bugs using automatic anomaly detection
-
S. Hangal and M. Lam, "Tracking down software bugs using automatic anomaly detection," in the Int'l Conf. on Software Engineering, 2002, pp. 291-301.
-
(2002)
The Int'l Conf. on Software Engineering
, pp. 291-301
-
-
Hangal, S.1
Lam, M.2
-
33
-
-
53349128424
-
Using likely program invariants to detect hardware errors
-
S. Sahoo, M. Li, P. Ramachandran, S. Adve, V. Adve, and Y. Zhou, "Using likely program invariants to detect hardware errors," in IEEE/IFIP Int'l Conf. on Dependable Systems and Networks, 2008, pp. 70-79.
-
(2008)
IEEE/IFIP Int'l Conf. on Dependable Systems and Networks
, pp. 70-79
-
-
Sahoo, S.1
Li, M.2
Ramachandran, P.3
Adve, S.4
Adve, V.5
Zhou, Y.6
-
34
-
-
80053254113
-
Hauberk: Lightweight silent data corruption error detector for GPGPU
-
K. Yim, C. Pham, M. Saleheen, Z. Kalbarczyk, and R. Iyer, "Hauberk: Lightweight silent data corruption error detector for GPGPU," in IEEE Parallel & Distributed Processing Symposium, 2011, pp. 287-300.
-
(2011)
IEEE Parallel & Distributed Processing Symposium
, pp. 287-300
-
-
Yim, K.1
Pham, C.2
Saleheen, M.3
Kalbarczyk, Z.4
Iyer, R.5
-
35
-
-
0021439162
-
Algorithm-based fault tolerance for matrix operations
-
K. Huang and J. Abraham, "Algorithm-based fault tolerance for matrix operations," IEEE Trans. on Computers, pp. 518-528, 1984.
-
(1984)
IEEE Trans. on Computers
, pp. 518-528
-
-
Huang, K.1
Abraham, J.2
-
36
-
-
0028994249
-
Algorithm-based diskless checkpointing for fault tolerant matrix operations
-
J. Plank, Y. Kim, and J. Dongarra, "Algorithm-based diskless checkpointing for fault tolerant matrix operations," in the Int'l Symposium on Fault-Tolerant Computing, 1995, pp. 351-360.
-
(1995)
The Int'l Symposium on Fault-Tolerant Computing
, pp. 351-360
-
-
Plank, J.1
Kim, Y.2
Dongarra, J.3
-
37
-
-
77956607557
-
A numerical optimization-based methodology for application robustification: Transforming applications for error tolerance
-
J. Sloan, D. Kesler, R. Kumar, and A. Rahimi, "A numerical optimization-based methodology for application robustification: Transforming applications for error tolerance," in IEEE/IFIP Int'l Conf. on Dependable Systems and Networks, 2010, pp. 161-170.
-
(2010)
IEEE/IFIP Int'l Conf. on Dependable Systems and Networks
, pp. 161-170
-
-
Sloan, J.1
Kesler, D.2
Kumar, R.3
Rahimi, A.4
-
38
-
-
12444258147
-
Development of naturally fault tolerant algorithms for computing on 100,000 processors
-
A. Geist and C. Engelmann, "Development of naturally fault tolerant algorithms for computing on 100,000 processors," Journal of Parallel and Distributed Computing, 2002.
-
(2002)
Journal of Parallel and Distributed Computing
-
-
Geist, A.1
Engelmann, C.2