메뉴 건너뛰기




Volumn 23, Issue 3, 2005, Pages 253-300

The STAMPede approach to thread-level speculation

Author keywords

Automatic parallelization; Cache coherence; Chip multiprocessing; Thread level speculation

Indexed keywords

AUTOMATIC PARALLELIZATION; CACHE COHERENCE; CHIP-MULTIPROCESSING; THREAD-LEVEL SPECULATION;

EID: 33745198176     PISSN: 07342071     EISSN: 07342071     Source Type: Journal    
DOI: 10.1145/1082469.1082471     Document Type: Article
Times cited : (121)

References (54)
  • 1
    • 0033717865 scopus 로고    scopus 로고
    • Clock rate versus IPC: The end of the road for conventional microarchitectures
    • AGARWAL, W., HRISHIKESH, M., KECKLER, S., AND BURGER, D. 2000. Clock rate versus IPC: the end of the road for conventional microarchitectures. In Proceedings of ISCA 27.
    • (2000) Proceedings of ISCA , vol.27
    • Agarwal, W.1    Hrishikesh, M.2    Keckler, S.3    Burger, D.4
  • 3
    • 0012527549 scopus 로고    scopus 로고
    • A dynamic multithreading processor
    • AKKARY, H. AND DHISCOLL, M. 1998. A dynamic multithreading processor. In MICRO-31.
    • (1998) MICRO-31
    • Akkary, H.1    Dhiscoll, M.2
  • 4
    • 33745207042 scopus 로고    scopus 로고
    • Data memory alternatives for multiscalar processors
    • Computer Sciences Department, University of Wisconsin-Madison
    • BREACH, S. E., VIJAYKUMAR, T. N., GOPAL, S., SMITH, J. E., AND SOHI, G. S. 1996. Data memory alternatives for multiscalar processors. Tech. Rep. CS-TR-1997-1344, Computer Sciences Department, University of Wisconsin-Madison.
    • (1996) Tech. Rep. , vol.CS-TR-1997-1344
    • Breach, S.E.1    Vijaykumar, T.N.2    Gopal, S.3    Smith, J.E.4    Sohi, G.S.5
  • 7
    • 0033689702 scopus 로고    scopus 로고
    • Architectural support for scalable speculative parallelization in shared-memory multiprocessors
    • CINTRA, M., MARTÍNEZ, J. F., AND TORRELLAS, J. 2000. Architectural support for scalable speculative parallelization in shared-memory multiprocessors. In Proceedings of ISCA 27.
    • (2000) Proceedings of ISCA , vol.27
    • Cintra, M.1    Martínez, J.F.2    Torrellas, J.3
  • 8
    • 0012529668 scopus 로고    scopus 로고
    • Learning cross-thread violations in speculative parallelization for multiprocessors
    • CINTRA, M. AND TORRELLAS, J. 2002. Learning cross-thread violations in speculative parallelization for multiprocessors. In Proceedings of the 8th HPCA.
    • (2002) Proceedings of the 8th HPCA
    • Cintra, M.1    Torrellas, J.2
  • 10
    • 0028345337 scopus 로고
    • A study of single-chip processor/cache organizations for large number of transistors
    • FARRENS, M., TYSON, G., AND PLESZKUN, A. 1994. A study of single-chip processor/cache organizations for large number of transistors. In Proceedings of ISCA 21. pp. 338-347.
    • (1994) Proceedings of ISCA , vol.21 , pp. 338-347
    • Farrens, M.1    Tyson, G.2    Pleszkun, A.3
  • 12
    • 0007997616 scopus 로고    scopus 로고
    • ARB: A hardware mechanism for dynamic reordering of memory references
    • FRANKLIN, M. AND SOHI, G. S. 1996. ARB: A hardware mechanism for dynamic reordering of memory references. IEEE Trans. Comput. 45, 5 (May).
    • (1996) IEEE Trans. Comput. , vol.45 , Issue.5 MAY
    • Franklin, M.1    Sohi, G.S.2
  • 16
    • 78650416004 scopus 로고    scopus 로고
    • Techniques for speculative run-time parallelization of loops
    • GUPTA, M. AND NIM, R. 1998. Techniques for speculative run-time parallelization of loops. In Proceedings of Supercomputing 1998.
    • (1998) Proceedings of Supercomputing 1998
    • Gupta, M.1    Nim, R.2
  • 22
    • 0030685588 scopus 로고    scopus 로고
    • The SGI origin: A ccNUMA highly scalable server
    • LAUDON, J. AND LENOSKI, D. 1997. The SGI Origin: A ccNUMA highly scalable server. In Proceedings of the 24th ISCA. 241-251.
    • (1997) Proceedings of the 24th ISCA , pp. 241-251
    • Laudon, J.1    Lenoski, D.2
  • 29
    • 0003926726 scopus 로고    scopus 로고
    • Quantifying the complexity of superscalar processors
    • University of Wisconsin-Madison
    • PALACHARLA, S., JOUPPI, N. P., AND SMITH, J. E. 1996. Quantifying the complexity of superscalar processors. Tech. Rep. CS-TR-1996-1328, University of Wisconsin-Madison.
    • (1996) Tech. Rep. , vol.CS-TR-1996-1328
    • Palacharla, S.1    Jouppi, N.P.2    Smith, J.E.3
  • 33
    • 84976823223 scopus 로고
    • The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization
    • RAUCHWEHGER, L. AND PADUA, D. 1995. The LRPD Test: Speculative run-time parallelization of loops with privatization and reduction parallelization. In Proceedings of PLDI'95. 218-232.
    • (1995) Proceedings of PLDI'95 , pp. 218-232
    • Rauchwehger, L.1    Padua, D.2
  • 38
    • 84858891974 scopus 로고    scopus 로고
    • The SPEC benchmark suite
    • Standard Performance Evaluation Corporation
    • SPEC. 2000. The SPEC Benchmark Suite. Tech. rep., Standard Performance Evaluation Corporation. http://www.spechbench.org.
    • (2000) Tech. Rep.
  • 39
    • 4544233320 scopus 로고    scopus 로고
    • Hardware support for thread-level speculation
    • Ph.D. thesis
    • STEFFAN, J. G. 2003. Hardware Support for Thread-Level Speculation. Ph.D. thesis, Carnegie Mellon University. Tech. Rep. CMU-CS-03-122.
    • (2003) Carnegie Mellon University. Tech. Rep. , vol.CMU-CS-03-122
    • Steffan, J.G.1
  • 40
    • 0003564841 scopus 로고    scopus 로고
    • Architectural support for thread-level data speculation
    • School of Computer Science, Carnegie Mellon University. November
    • STEFFAN, J. G., COLOHAN, C. B., AND MOWRY, T. C. 1997. Architectural Support for Thread-Level Data Speculation. Tech. Rep. CMU-CS-97-188, School of Computer Science, Carnegie Mellon University. November.
    • (1997) Tech. Rep. , vol.CMU-CS-97-188
    • Steffan, J.G.1    Colohan, C.B.2    Mowry, T.C.3
  • 44
    • 0012619993 scopus 로고    scopus 로고
    • MAJC: Microprocessor architecture for Java computing
    • TREMBLAY, M. 1999. MAJC: Microprocessor Architecture for Java Computing. HotChips '99.
    • (1999) HotChips '99
    • Tremblay, M.1
  • 45
    • 0029200683 scopus 로고
    • Simultaneous multithreading: Maximizing on-chip parallelism
    • TULLSEN, D. M., EGGERS, S. J., AND LEVY, H. M. 1995. Simultaneous multithreading: Maximizing on-chip parallelism. In Proceedings of ISCA 22. 392-403.
    • (1995) Proceedings of ISCA , vol.22 , pp. 392-403
    • Tullsen, D.M.1    Eggers, S.J.2    Levy, H.M.3
  • 47
    • 0012583882 scopus 로고    scopus 로고
    • Ph.D. thesis, Computer Sciences Department, University of Wisconsin-Madison
    • VIJAYKUMAR, T. 1998. Compiling for the multiscalar architecture. Ph.D. thesis, Computer Sciences Department, University of Wisconsin-Madison.
    • (1998) Compiling for the Multiscalar Architecture
    • Vijaykumar, T.1
  • 48
    • 0030129806 scopus 로고    scopus 로고
    • The MIPS R10000 superscalar microprocessor
    • YEAGER, K. 1996. The MIPS R10000 superscalar microprocessor. IEEE Micro.
    • (1996) IEEE Micro
    • Yeager, K.1
  • 49


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.