-
1
-
-
35348962567
-
Exploring large-scale CMP architectures using manySim
-
DOI 10.1109/MM.2007.66
-
L. Zhao, R.R. Iyer, J. Moses, R. Illikkal, S. Makineni, and D. Newell, "Exploring Large-Scale CMP Architectures Using ManySim," IEEE Micro, vol. 27, no. 4, pp. 21-33, July/Aug. 2007. (Pubitemid 47595641)
-
(2007)
IEEE Micro
, vol.27
, Issue.4
, pp. 21-33
-
-
Zhao, L.1
Iyer, R.2
Moses, J.3
Illikkal, R.4
Makineni, S.5
Newell, D.6
-
2
-
-
10444238444
-
Fair cache sharing and partitioning in a chip multiprocessor architecture
-
S. Kim, D. Chandra, and Y. Solihin, "Fair Cache Sharing and Partitioning in a Chip Multiprocessor Architecture," Proc. 13th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT), pp. 111-122, 2004.
-
(2004)
Proc. 13th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT)
, pp. 111-122
-
-
Kim, S.1
Chandra, D.2
Solihin, Y.3
-
3
-
-
47849108985
-
Improving performance isolation on chip multiprocessors via an operating system scheduler
-
A. Fedorova, M. Seltzer, and M.D. Smith, "Improving Performance Isolation on Chip Multiprocessors via an Operating System Scheduler," Proc. 16th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT), pp. 25-38, 2007.
-
(2007)
Proc. 16th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT)
, pp. 25-38
-
-
Fedorova, A.1
Seltzer, M.2
Smith, M.D.3
-
4
-
-
63549085110
-
Analysis and approximation of optimal co-scheduling on chip multiprocessors
-
Y. Jiang, X. Shen, J. Chen, and R. Tripathi, "Analysis and Approximation of Optimal Co-Scheduling on Chip Multiprocessors," Proc. 17th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT), pp. 220-229, 2008.
-
(2008)
Proc. 17th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT)
, pp. 220-229
-
-
Jiang, Y.1
Shen, X.2
Chen, J.3
Tripathi, R.4
-
5
-
-
70449652924
-
Soft-Olp: Improving hardware cache performance through software-controlled object-level partitioning
-
Q. Lu, J. Lin, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan, "Soft-Olp: Improving Hardware Cache Performance through Software-Controlled Object-Level Partitioning," Proc. 18th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT), pp. 246-257, 2009.
-
(2009)
Proc. 18th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT)
, pp. 246-257
-
-
Lu, Q.1
Lin, J.2
Ding, X.3
Zhang, Z.4
Zhang, X.5
Sadayappan, P.6
-
6
-
-
21244474546
-
Predicting inter-thread cache contention on a chip multi-processor architecture
-
Proceedings - 11th International Symposium on High-Performance Computer Architecture, HPCA-11 2005
-
D. Chandra, F. Guo, S. Kim, and Y. Solihin, "Predicting Inter-Thread Cache Contention on a Chip Multi-Processor Architecture," Proc. 11th Int'l Symp. High-Performance Computer Architecture (HPCA-11), pp. 340-351, 2005. (Pubitemid 41731513)
-
(2005)
Proceedings - International Symposium on High-Performance Computer Architecture
, pp. 340-351
-
-
Chandra, D.1
Guo, F.2
Kim, S.3
Solihin, Y.4
-
8
-
-
0024656760
-
An analytical cache model
-
A. Agarwal, M. Horowitz, and J. Hennessy, "An Analytical Cache Model," ACM Trans. Computer Systems, vol. 7, no. 2, pp. 184-215, 1989.
-
(1989)
ACM Trans. Computer Systems
, vol.7
, Issue.2
, pp. 184-215
-
-
Agarwal, A.1
Horowitz, M.2
Hennessy, J.3
-
9
-
-
0023456387
-
Footprints in the cache
-
D. Thiebaut and H.S. Stone, "Footprints in the Cache," ACM Trans. Computer Systems, vol. 5, no. 4, pp. 305-329, 1987.
-
(1987)
ACM Trans. Computer Systems
, vol.5
, Issue.4
, pp. 305-329
-
-
Thiebaut, D.1
Stone, H.S.2
-
10
-
-
2442517698
-
Parallel program performance prediction using deterministic task graph analysis
-
V.S. Adve and M.K. Vernon, "Parallel Program Performance Prediction Using Deterministic Task Graph Analysis," ACM Trans. Computer Systems, vol. 22, no. 1, pp. 94-136, 2004.
-
(2004)
ACM Trans. Computer Systems
, vol.22
, Issue.1
, pp. 94-136
-
-
Adve, V.S.1
Vernon, M.K.2
-
14
-
-
20344374162
-
Niagara: A 32-way multithreaded sparc processor
-
DOI 10.1109/MM.2005.35
-
P. Kongetira, K. Aingaran, and K. Olukotun, "Niagara: A 32-Way Multithreaded Sparc Processor," IEEE Micro, vol. 25, no. 2, pp. 21-29, Mar./Apr. 2005. (Pubitemid 40784326)
-
(2005)
IEEE Micro
, vol.25
, Issue.2
, pp. 21-29
-
-
Kongetira, P.1
Aingaran, K.2
Olukotun, K.3
-
15
-
-
0029200683
-
Simultaneous multithreading: Maximizing on-chip parallelism
-
D.M. Tullsen, S.J. Eggers, and H.M. Levy, "Simultaneous Multithreading: Maximizing On-Chip Parallelism," Proc. 22nd Ann. Int'l Symp. Computer Architecture (ISCA-22), pp. 392-403, 1995.
-
(1995)
Proc. 22nd Ann. Int'l Symp. Computer Architecture (ISCA-22)
, pp. 392-403
-
-
Tullsen, D.M.1
Eggers, S.J.2
Levy, H.M.3
-
17
-
-
47249094055
-
System-level performance metrics for multiprogram workloads
-
S. Eyerman and L. Eeckhout, "System-Level Performance Metrics for Multiprogram Workloads," IEEE Micro, vol. 28, no. 3, pp. 42-53, May/June 2008.
-
(2008)
IEEE Micro
, vol.28
, Issue.3
, pp. 42-53
-
-
Eyerman, S.1
Eeckhout, L.2
-
18
-
-
84861822447
-
-
Standard Performance Evaluation Corporation
-
Standard Performance Evaluation Corporation, "SPEC CPU2000 Benchmarks," http://www.spec.org, 2011.
-
(2011)
SPEC CPU2000 Benchmarks
-
-
-
19
-
-
33846547030
-
On the effectiveness of set associative page mapping and its application to main memory management
-
A.J. Smith, "On the Effectiveness of Set Associative Page Mapping and Its Application to Main Memory Management," ICSE '76: Proc. Second Int'l Conf. Software Eng., pp. 286-292, 1976.
-
(1976)
ICSE '76: Proc. Second Int'l Conf. Software Eng.
, pp. 286-292
-
-
Smith, A.J.1
-
20
-
-
0014701246
-
Evaluation techniques for storage hierarchies
-
R.L. Mattson, J. Gecsei, D.R. Slutz, and I.L. Traiger, "Evaluation Techniques for Storage Hierarchies," IBM Systems J., vol. 9, no. 2, pp. 78-117, 1970.
-
(1970)
IBM Systems J.
, vol.9
, Issue.2
, pp. 78-117
-
-
Mattson, R.L.1
Gecsei, J.2
Slutz, D.R.3
Traiger, I.L.4
-
21
-
-
0024903997
-
Evaluating associativity in CPU caches
-
DOI 10.1109/12.40842
-
M. Hill and A. Smith, "Evaluating Associativity in CPU Caches," IEEE Trans. Computers, vol. 38, no. 12, pp. 1612-1630, Dec. 1989. (Pubitemid 20642724)
-
(1989)
IEEE Transactions on Computers
, vol.38
, Issue.12
, pp. 1612-1630
-
-
Hill, M.D.1
Smith, A.J.2
-
22
-
-
33750363816
-
An analytical model for cache replacement policy performance
-
DOI 10.1145/1140103.1140304, SIGMETRICS 2006/Performance 2006 - Joint International Conference on Measurement and Modeling of Computer Systems, Proceedings
-
F. Guo and Y. Solihin, "An Analytical Model for Cache Replacement Policy Performance," Proc. Int'l Conf. Measurement and Modeling of Computer Systems (SIGMETRICS), pp. 228-239, 2006. (Pubitemid 44619094)
-
(2006)
Performance Evaluation Review
, vol.34
, Issue.1
, pp. 228-239
-
-
Guo, F.1
Solihin, Y.2
-
23
-
-
33244462442
-
Fast data-locality profiling of native execution
-
SIGMETRICS 2005: International Conference on Measurement and Modeling of Computer Systems - Proceedings
-
E. Berg and E. Hagersten, "Fast Data-Locality Profiling of Native Execution," Proc. Int'l Conf. Measurement and Modeling of Computer Systems (SIGMETRICS), pp. 169-180, 2005. (Pubitemid 43275419)
-
(2005)
Performance Evaluation Review
, vol.33
, Issue.1
, pp. 169-180
-
-
Berg, E.1
Hagersten, E.2
-
24
-
-
34548285855
-
Locality approximation using time
-
X. Shen, J. Shaw, B. Meeker, and C. Ding, "Locality Approximation Using Time," Proc. 34th Symp. Principles of Programming Languages (POPL), pp. 55-61, 2007.
-
(2007)
Proc. 34th Symp. Principles of Programming Languages (POPL)
, pp. 55-61
-
-
Shen, X.1
Shaw, J.2
Meeker, B.3
Ding, C.4
-
25
-
-
0030712794
-
Modeling cost/performance of a parallel computer simulator
-
B. Falsafi and D.A. Wood, "Modeling Cost/Performance of a Parallel Computer Simulator," ACM Trans. Modeling and Computer Simulation, vol. 7, no. 1, pp. 104-130, 1997.
-
(1997)
ACM Trans. Modeling and Computer Simulation
, vol.7
, Issue.1
, pp. 104-130
-
-
Falsafi, B.1
Wood, D.A.2
-
26
-
-
63549130253
-
Characterizing and modeling the behavior of context switch misses
-
F. Liu, F. Guo, Y. Solihin, S. Kim, and A. Eker, "Characterizing and Modeling the Behavior of Context Switch Misses," Proc. 17th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT), pp. 91-101, 2008.
-
(2008)
Proc. 17th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT)
, pp. 91-101
-
-
Liu, F.1
Guo, F.2
Solihin, Y.3
Kim, S.4
Eker, A.5
-
27
-
-
0034826142
-
Analytical cache models with applications to cache partitioning
-
G.E. Suh, S. Devadas, and L. Rudolph, "Analytical Cache Models with Applications to Cache Partitioning," Proc. 15th Int'l Conf. Supercomputing (ICS), pp. 1-12, 2001. (Pubitemid 32865298)
-
(2001)
Proceedings of the International Conference on Supercomputing
, pp. 1-12
-
-
Suh, G.E.1
Devadas, S.2
Rudolph, L.3
-
28
-
-
0028087519
-
Performance estimation of multistreamed, superscalar processors
-
W. Yamamoto, M. Serrano, A. Talcott, R. Wood, and M. Nemirovsky, "Performance Estimation of Multistreamed, Superscalar Processors," Proc. 27th Hawaii Int'l Conf. System Sciences, Jan. 1994.
-
(1994)
Proc. 27th Hawaii Int'l Conf. System Sciences
-
-
Yamamoto, W.1
Serrano, M.2
Talcott, A.3
Wood, R.4
Nemirovsky, M.5
-
29
-
-
64949125875
-
Toward a multicore architecture for real-time ray-tracing
-
V. Govindaraju, P. Djeu, K. Sankaralingam, M. Vernon, and W.R. Mark, "Toward a Multicore Architecture for Real-Time Ray-Tracing," Proc. IEEE/ACM 41st Int'l Symp. Microarchitecture (MICRO-41), pp. 176-187, 2008.
-
(2008)
Proc. IEEE/ACM 41st Int'l Symp. Microarchitecture (MICRO-41)
, pp. 176-187
-
-
Govindaraju, V.1
Djeu, P.2
Sankaralingam, K.3
Vernon, M.4
Mark, W.R.5