메뉴 건너뛰기




Volumn 3, Issue , 2007, Pages 1-149

Chip multiprocessor architecture: Techniques to improve throughput and latency

Author keywords

Application classes: throughput sensitive applications; Basic terms: chip multiprocessors (CMPs); Latency sensitive applications; Microprocessor power; Multi core microprocessors; Parallel processing; Server applications; Threaded execution

Indexed keywords

COMPUTER SOFTWARE; ENERGY DISSIPATION; ERROR ANALYSIS; MICROPROCESSOR CHIPS; PARALLEL PROCESSING SYSTEMS; PROBLEM SOLVING; SERVERS;

EID: 35748948044     PISSN: 19353235     EISSN: 19353243     Source Type: Book Series    
DOI: 10.2200/s00093ed1v01y200707cac003     Document Type: Review
Times cited : (17)

References (76)
  • 1
    • 0000793139 scopus 로고
    • Cramming more components onto integrated circuits
    • Apr. 19
    • G. E. Moore, "Cramming more components onto integrated circuits," Electronics, vol. 38, no. 8, pp. 114-117, Apr. 19, 1965.
    • (1965) Electronics , vol.38 , Issue.8 , pp. 114-117
    • Moore, G.E.1
  • 3
    • 0003886621 scopus 로고
    • Limits of instruction-level parallelism
    • 93/6, Digital Western Research Laboratory, Palo Alto, CA
    • D. W. Wall, "Limits of instruction-level parallelism," WRL Research Report 93/6, Digital Western Research Laboratory, Palo Alto, CA, 1993.
    • (1993) WRL Research Report
    • Wall, D.W.1
  • 4
    • 84888693930 scopus 로고    scopus 로고
    • K. Yeager et al., R10 000 superscalar microprocessor, presented at Hot Chips VII, Stanford, CA, 1995.
    • K. Yeager et al., "R10 000 superscalar microprocessor," presented at Hot Chips VII, Stanford, CA, 1995.
  • 6
    • 2842561465 scopus 로고
    • Hot compilers for future hot chips
    • Stanford, CA, Aug
    • S. Amarasinghe et al., "Hot compilers for future hot chips," in Hot Chips VII, Stanford, CA, Aug. 1995. http://www.hotchips.org/ archives/
    • (1995) Hot Chips VII
    • Amarasinghe, S.1
  • 7
    • 0002532551 scopus 로고
    • Why aren't operating systems getting faster as fast as hardware?
    • June
    • J. Ousterhout, "Why aren't operating systems getting faster as fast as hardware?" in Summer 1990 USENIX Conference, June 1990, pp. 247-256.
    • (1990) Summer 1990 USENIX Conference , pp. 247-256
    • Ousterhout, J.1
  • 8
    • 84888657671 scopus 로고    scopus 로고
    • Standard Performance Evaluation Corporation, SPEC, VA
    • Standard Performance Evaluation Corporation, SPEC, http://www.spec.org, Warrenton, VA.
    • Warrenton
  • 9
    • 0037619265 scopus 로고    scopus 로고
    • Web search for a planet: The architecture of the Google cluster
    • Mar-Apr, doi:/10.1109/MM.2003.1196112
    • L. Barroso, J. Dean, and U. Hoezle, "Web search for a planet: the architecture of the Google cluster," IEEE Micro., Vol. 23, No. 2, pp. 22-28, Mar-Apr. 2003. doi:/10.1109/MM.2003.1196112
    • (2003) IEEE Micro , vol.23 , Issue.2 , pp. 22-28
    • Barroso, L.1    Dean, J.2    Hoezle, U.3
  • 11
    • 28444496784 scopus 로고    scopus 로고
    • UltraSPARC Gemini: Dual CPU processor
    • Stanford, CA, Aug
    • S. Kapil, "UltraSPARC Gemini: dual CPU processor," in Hot Chips 15, Stanford, CA, Aug. 2003. http://www.hotchips.org/archives/
    • (2003) Hot Chips 15
    • Kapil, S.1
  • 12
    • 28444448453 scopus 로고    scopus 로고
    • SPARC64 VI: Fujitsu's Next Generation Processor
    • San Jose, CA, Oct
    • T. Maruyama, "SPARC64 VI: Fujitsu's Next Generation Processor," in Microprocessor Forum, San Jose, CA, Oct. 2003.
    • (2003) Microprocessor Forum
    • Maruyama, T.1
  • 13
    • 28444488425 scopus 로고    scopus 로고
    • Montecito: The next product in the Itanium Processor Family
    • Stanford, CA, Aug
    • C. McNairy and R. Bhatia, "Montecito: the next product in the Itanium Processor Family," in Hot Chips 16, Stanford, CA, Aug. 2004. http://www.hotchips.org/archives/
    • (2004) Hot Chips 16
    • McNairy, C.1    Bhatia, R.2
  • 14
    • 4544256648 scopus 로고    scopus 로고
    • POWER4 system microarchitecture
    • San Jose, CA, Oct
    • C. Moore, "POWER4 system microarchitecture," in Microprocessor Forum, San Jose, CA, Oct. 2000.
    • (2000) Microprocessor Forum
    • Moore, C.1
  • 17
    • 0034312339 scopus 로고    scopus 로고
    • A performance methodology for commercial servers
    • Nov
    • S. Kunkel, R. Eickemeyer, M. Lip, and T. Mullins, "A performance methodology for commercial servers," IBM J. Res. Dev., Vol. 44, No. 6, Nov. 2000, pp. 851-872.
    • (2000) IBM J. Res. Dev , vol.44 , Issue.6 , pp. 851-872
    • Kunkel, S.1    Eickemeyer, R.2    Lip, M.3    Mullins, T.4
  • 18
    • 22944441274 scopus 로고    scopus 로고
    • Computer architecture: Challenges and opportunities for the next decade
    • May/June, doi:10.1109/MM.2005.45
    • T. Agerwala and S. Chatterjee, "Computer architecture: challenges and opportunities for the next decade," IEEE Micro, Vol. 25, No. 3, pp. 58-69, May/June 2005. doi:10.1109/MM.2005.45
    • (2005) IEEE Micro , vol.25 , Issue.3 , pp. 58-69
    • Agerwala, T.1    Chatterjee, S.2
  • 19
    • 20344374162 scopus 로고    scopus 로고
    • Niagara: A 32-way multithreaded SPARC processor
    • Mar./Apr, doi:10.1109/MM.2005.35
    • P. Kongetira, K. Aingaran, and K. Olukotun, "Niagara: a 32-way multithreaded SPARC processor," IEEE Micro, Vol. 25, No. 2, Mar./Apr. 2005, pp. 21-29. doi:10.1109/MM.2005.35
    • (2005) IEEE Micro , vol.25 , Issue.2 , pp. 21-29
    • Kongetira, P.1    Aingaran, K.2    Olukotun, K.3
  • 20
    • 2442653868 scopus 로고    scopus 로고
    • Design and implementation of the POWER5™ Microprocessor
    • San Francisco, CA, Feb
    • J. Clabes, J. Friedrich, and M. Sweet, "Design and implementation of the POWER5™ Microprocessor," in ISSCC Dig. Tech. Papers, San Francisco, CA, Feb. 2004, pp. 56-57.
    • (2004) ISSCC Dig. Tech. Papers , pp. 56-57
    • Clabes, J.1    Friedrich, J.2    Sweet, M.3
  • 22
    • 4043091754 scopus 로고    scopus 로고
    • Hyper-threading technology in the Netburst Microarchitecture
    • Stanford, CA, Aug
    • D. Marr, "Hyper-threading technology in the Netburst Microarchitecture," in Hot Chips XIV, Stanford, CA, Aug. 2002. http://www.hotchips.org/archives/
    • (2002) Hot Chips XIV
    • Marr, D.1
  • 23
    • 84945714902 scopus 로고
    • Sparcle: An evolutionary processor design for large-scale multiprocessors
    • June, doi:10.1109/40.216748
    • A. Agarwal, J. Kubiatowicz, D. Kranz, B.-H. Lim, D. Yeung, G. D'Souza, and M. Parkin, "Sparcle: an evolutionary processor design for large-scale multiprocessors," IEEE Micro, Vol. 13, No. 3, June 1993, pp. 48-61. doi:10.1109/40.216748
    • (1993) IEEE Micro , vol.13 , Issue.3 , pp. 48-61
    • Agarwal, A.1    Kubiatowicz, J.2    Kranz, D.3    Lim, B.-H.4    Yeung, D.5    D'Souza, G.6    Parkin, M.7
  • 32
    • 0030653560 scopus 로고    scopus 로고
    • Using the SimOS machine simulator to study complex computer systems
    • Jan, doi: 10.1145/244804.244807
    • M. Rosenblum, E. Bugnion, S. Herrod, and S. Devine, "Using the SimOS machine simulator to study complex computer systems," ACM Trans. Model. Comput. Simul., Vol. 7, No. 1, pp. 78-103, Jan. 1997. doi: 10.1145/244804.244807
    • (1997) ACM Trans. Model. Comput. Simul , vol.7 , Issue.1 , pp. 78-103
    • Rosenblum, M.1    Bugnion, E.2    Herrod, S.3    Devine, S.4
  • 35
    • 84888657671 scopus 로고    scopus 로고
    • Standard Performance Evaluation Corporation, SPEC, VA
    • Standard Performance Evaluation Corporation, SPEC, http://www.spec.org, Warrenton, VA.
    • Warrenton
  • 36
    • 84888745829 scopus 로고    scopus 로고
    • Transaction Processing Performance Council, TPC, San Francisco, CA
    • Transaction Processing Performance Council, TPC, http://www.tpc.org, San Francisco, CA.
  • 37
    • 84858934403 scopus 로고    scopus 로고
    • XML Processing Performance in Java and .Net
    • "XML Processing Performance in Java and .Net," http://java.sun.com/performance/ reference/whitepapers/XML.Test-1_0.pdf
  • 43
    • 0034839064 scopus 로고    scopus 로고
    • Tolerating memory latency through software-controlled pre-execution in simultaneous multithreading processors
    • Göteborg, Sweden, June
    • C.-K. Luk, "Tolerating memory latency through software-controlled pre-execution in simultaneous multithreading processors," in Proc. 28th Annual Int. Symp. Computer Architecture (ISCA-28), Göteborg, Sweden, June 2001, pp. 40-51.
    • (2001) Proc. 28th Annual Int. Symp. Computer Architecture (ISCA-28) , pp. 40-51
    • Luk, C.-K.1
  • 44
    • 84888657671 scopus 로고    scopus 로고
    • Standard Performance Evaluation Corporation, SPEC, VA
    • Standard Performance Evaluation Corporation, SPEC, http://www.spec.org, Warrenton, VA.
    • Warrenton
  • 47
    • 0007997616 scopus 로고    scopus 로고
    • ARB: A hardware mechanism for dynamic reordering of memory references
    • May, doi: 10.1109/12.509907
    • M. Franklin and G. Sohi, "ARB: a hardware mechanism for dynamic reordering of memory references," IEEE Trans. Comput., Vol. 45, No. 5, May 1996, pp. 552-571. doi: 10.1109/12.509907
    • (1996) IEEE Trans. Comput , vol.45 , Issue.5 , pp. 552-571
    • Franklin, M.1    Sohi, G.2
  • 49
    • 0025429331 scopus 로고
    • Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
    • Seattle, WA, June
    • N. P. Jouppi, "Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers," in Proc. 17th Annu. Int. Symp. Computer Architecture, Seattle, WA, June 1990, pp. 364-373.
    • (1990) Proc. 17th Annu. Int. Symp. Computer Architecture , pp. 364-373
    • Jouppi, N.P.1
  • 51
    • 0032685104 scopus 로고    scopus 로고
    • Improving the performance of speculatively parallel applications on the hydra CMP
    • Rhodes, Greece, June
    • K. Olukotun, L. Hammond, and M. Willey, "Improving the performance of speculatively parallel applications on the hydra CMP," in Proc. 1999 Int. Conf Supercomputing, Rhodes, Greece, June 1999, pp. 21-30.
    • (1999) Proc. 1999 Int. Conf Supercomputing , pp. 21-30
    • Olukotun, K.1    Hammond, L.2    Willey, M.3
  • 53
    • 84888721452 scopus 로고    scopus 로고
    • M. Tremblay, MAJC™: an architecture for the new millennium, in Hot Chips XI, Stanford, CA, Aug. 1999, pp. 275-288. http://www.hotchips.org/archives/. Also reported in B. Case, Sun makes MAJC with mirrors, Microprocessor Report, Oct. 25, 1999, pp. 18-21.
    • M. Tremblay, "MAJC™: an architecture for the new millennium," in Hot Chips XI, Stanford, CA, Aug. 1999, pp. 275-288. http://www.hotchips.org/archives/. Also reported in B. Case, "Sun makes MAJC with mirrors," Microprocessor Report, Oct. 25, 1999, pp. 18-21.
  • 57
    • 84888709728 scopus 로고    scopus 로고
    • http://www.byte.com
  • 58
    • 84888698096 scopus 로고    scopus 로고
    • http://www.specbench.org/jvm98/
  • 59
    • 84888677765 scopus 로고    scopus 로고
    • http://www.epcc.ed.ac.uk/javagrande/
  • 60
    • 15044358803 scopus 로고    scopus 로고
    • Transactional coherence and consistency: Simplifying parallel hardware and software
    • Nov.-Dec, doi:10.1109/MM.2004.91
    • L. Hammond, B. D. Carlstrom, V. Wong, M. Chen, C. Kozyrakis, and K. Olukotun, "Transactional coherence and consistency: simplifying parallel hardware and software," IEEE Micro, Vol. 24, No. 6, Nov.-Dec. 2004, pp. 92-103. doi:10.1109/MM.2004.91
    • (2004) IEEE Micro , vol.24 , Issue.6 , pp. 92-103
    • Hammond, L.1    Carlstrom, B.D.2    Wong, V.3    Chen, M.4    Kozyrakis, C.5    Olukotun, K.6
  • 61
    • 0033880036 scopus 로고    scopus 로고
    • The Stanford Hydra CMP
    • Mar-Apr, doi: 10.1109/40.848474
    • L. Hammond, B. Hubbert, M. Siu, M. Prabhu, M. Chen, and K. Olukotun, "The Stanford Hydra CMP," IEEE Micro., Vol. 20, No. 2, Mar-Apr. 2000, pp. 71-84. doi: 10.1109/40.848474
    • (2000) IEEE Micro , vol.20 , Issue.2 , pp. 71-84
    • Hammond, L.1    Hubbert, B.2    Siu, M.3    Prabhu, M.4    Chen, M.5    Olukotun, K.6
  • 62
    • 0033348795 scopus 로고    scopus 로고
    • A chip multiprocessor architecture with speculative multithreading
    • Sept, doi:10.1109/12.795218
    • V. Krishnan and J. Torrellas, "A chip multiprocessor architecture with speculative multithreading," IEEE Trans. Comput., Vol. 48, No. 9, Sept. 1999, pp. 866-880. doi:10.1109/12.795218
    • (1999) IEEE Trans. Comput , vol.48 , Issue.9 , pp. 866-880
    • Krishnan, V.1    Torrellas, J.2
  • 64
    • 0031605348 scopus 로고    scopus 로고
    • The potential for using thread-level data speculation to facilitate automatic parallelization
    • Las Vegas, NV, Feb
    • J. G. Steffan and T. Mowry, "The potential for using thread-level data speculation to facilitate automatic parallelization," in Proc. 4th Int. Symp. High-Performance Computer Architecture (HPCA-4), Las Vegas, NV, Feb. 1998, pp. 2-13.
    • (1998) Proc. 4th Int. Symp. High-Performance Computer Architecture (HPCA-4) , pp. 2-13
    • Steffan, J.G.1    Mowry, T.2
  • 67
    • 0003557978 scopus 로고    scopus 로고
    • Time varying behavior of programs
    • Dept. of Computer Science and Eng, UCSD, Tech. Rep. No. CS99-630, Aug
    • T. Sherwood and B. Calder, "Time varying behavior of programs," Dept. of Computer Science and Eng., UCSD, Tech. Rep. No. CS99-630, Aug. 1999.
    • (1999)
    • Sherwood, T.1    Calder, B.2
  • 70
  • 72
    • 84888760967 scopus 로고    scopus 로고
    • D. O'Hallaron, Spark98: sparse matrix kernels for shared memory and message passing systems, School of Computer Science, Carnegie Mellon University, Tech. Rep. CMUCS-97-178, Oct. 1997.
    • D. O'Hallaron, "Spark98: sparse matrix kernels for shared memory and message passing systems," School of Computer Science, Carnegie Mellon University, Tech. Rep. CMUCS-97-178, Oct. 1997.
  • 73
  • 75
    • 28244437702 scopus 로고    scopus 로고
    • Heterogeneous chip multiprocessors
    • Nov
    • R. Kumar, D. Tullsen, N. Jouppi, and P. Ranganathan, "Heterogeneous chip multiprocessors," IEEE Computer, Vol. 38, No. 11, Nov. 2005, pp. 32-38.
    • (2005) IEEE Computer , vol.38 , Issue.11 , pp. 32-38
    • Kumar, R.1    Tullsen, D.2    Jouppi, N.3    Ranganathan, P.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.