메뉴 건너뛰기




Volumn 5, Issue 3, 2008, Pages

Comparative evaluation of memory models for chip multiprocessors

Author keywords

Cache coherence; Chip multiprocessors; Locality optimizations; Parallel programming; Streaming memory

Indexed keywords

BANDWIDTH OPTIMIZATIONS; BANDWIDTH REQUIREMENTS; BEST EFFORTS; CACHE COHERENCE; CHIP MULTIPROCESSORS; COMPARATIVE EVALUATIONS; COMPETING MODELS; COMPUTATIONAL CAPABILITIES; DATA PARALLELS; DATA RE USE; ENERGY CONSUMPTION; IN BUILDINGS; LATENCY TOLERANCES; LOCALITY OPTIMIZATIONS; MEMORY MODELS; ON-CHIP MEMORIES; PRE-FETCHING; PROGRAMMING MODELS; STREAM PROGRAMMING; STREAMING MEMORY; STREAMING MODELS;

EID: 59449085862     PISSN: 15443566     EISSN: 15443973     Source Type: Journal    
DOI: 10.1145/1455650.1455651     Document Type: Article
Times cited : (11)

References (55)
  • 1
    • 0030382365 scopus 로고    scopus 로고
    • Shared memory consistency models: A tutorial
    • Dec
    • ADVE, S. V. AND GHARACHORLOO, K. 1996. Shared memory consistency models: A tutorial. IEEE Computer 29, 12 (Dec), 66-76.
    • (1996) IEEE Computer , vol.29 , Issue.12 , pp. 66-76
    • ADVE, S.V.1    GHARACHORLOO, K.2
  • 11
    • 33847152637 scopus 로고    scopus 로고
    • DRAKE, M., HOFFMANN, H., RABBAH, R., AND AMARASINGHE, S. 2006. Mpeg-2 decoding in a stream programming language. In Proceedings of the 20th IEEE International Parallel & Distributed Processing Symposium, Rhodes Island (IPDPS).
    • DRAKE, M., HOFFMANN, H., RABBAH, R., AND AMARASINGHE, S. 2006. Mpeg-2 decoding in a stream programming language. In Proceedings of the 20th IEEE International Parallel & Distributed Processing Symposium, Rhodes Island (IPDPS).
  • 12
    • 59449105466 scopus 로고    scopus 로고
    • EATHERTON, W. 2005. The push of network processing to the top of the pyramid. Keynote presentation at the Symposium on Architectures for Networking and Communication Systems, Princeton, NJ.
    • EATHERTON, W. 2005. The push of network processing to the top of the pyramid. Keynote presentation at the Symposium on Architectures for Networking and Communication Systems, Princeton, NJ.
  • 13
    • 34548052234 scopus 로고    scopus 로고
    • EREZ, M., AHN, J. H., GUMMAHAJU, J., ROSENBLUM, M., AND DALLY, W. J. 2007. Executing irregular scientific applications on stream architectures. In Proceedings of the 21st Annual International Conference on Supercomputing. 93-104.
    • EREZ, M., AHN, J. H., GUMMAHAJU, J., ROSENBLUM, M., AND DALLY, W. J. 2007. Executing irregular scientific applications on stream architectures. In Proceedings of the 21st Annual International Conference on Supercomputing. 93-104.
  • 14
    • 34548207355 scopus 로고    scopus 로고
    • FATAHALLAN, K., KNIGHT, T. J., HOUSTON, M. et al. 2006. Sequoia: Programming the memory hierarchy. In Proceedings of the 2006 ACM / IEEE Conference on Supercomputing.
    • FATAHALLAN, K., KNIGHT, T. J., HOUSTON, M. et al. 2006. Sequoia: Programming the memory hierarchy. In Proceedings of the 2006 ACM / IEEE Conference on Supercomputing.
  • 17
    • 35348919265 scopus 로고    scopus 로고
    • A novel SIMD architecture for the cell heterogeneous chipmulti-processor
    • GSCHWIND, M. et al. 2005. A novel SIMD architecture for the cell heterogeneous chipmulti-processor. In Conference Record of Hot Chips 17.
    • (2005) Conference Record of Hot Chips 17
    • GSCHWIND, M.1
  • 18
    • 77957817744 scopus 로고    scopus 로고
    • GUMMARAJU, J., COBURN, J., TURNER, Y., AND ROSENBLUM, M. 2008. Streamware: programming general-purpose multicore processors using streams. In Proceedings of the 13th International Conference on Architectural Support for Programming Languages and Operating Systems. 297-307.
    • GUMMARAJU, J., COBURN, J., TURNER, Y., AND ROSENBLUM, M. 2008. Streamware: programming general-purpose multicore processors using streams. In Proceedings of the 13th International Conference on Architectural Support for Programming Languages and Operating Systems. 297-307.
  • 19
    • 47849087164 scopus 로고    scopus 로고
    • GUMMARAJU, J., EREZ, M., COBURN, J., ROSENBLUM, M., AND DALLY, W. J. 2007. Architectural support for the stream execution model on general-purpose processors. In Proceedings of the 16th International Conference on Parallel Architecture and Compilation Techniques. 3-12.
    • GUMMARAJU, J., EREZ, M., COBURN, J., ROSENBLUM, M., AND DALLY, W. J. 2007. Architectural support for the stream execution model on general-purpose processors. In Proceedings of the 16th International Conference on Parallel Architecture and Compilation Techniques. 3-12.
  • 20
    • 33749377408 scopus 로고    scopus 로고
    • GUMMARAJU, J. AND ROSENBLUM, M. 2005. Stream programming on general-purpose processors. In Proceedings of the 38th International Symposium on Microarchitecture.
    • GUMMARAJU, J. AND ROSENBLUM, M. 2005. Stream programming on general-purpose processors. In Proceedings of the 38th International Symposium on Microarchitecture.
  • 22
    • 84976721461 scopus 로고    scopus 로고
    • HEINLEIN, J., GHARACHORLOO, K., DRESSER, S., AND GUPTA, A. 1994. Integration of message passing and shared memory in the Stanford flash multiprocessor. SIGOPS Oper. Syst. Rev. 28, 5, 38-50.
    • HEINLEIN, J., GHARACHORLOO, K., DRESSER, S., AND GUPTA, A. 1994. Integration of message passing and shared memory in the Stanford flash multiprocessor. SIGOPS Oper. Syst. Rev. 28, 5, 38-50.
  • 23
    • 59449101408 scopus 로고    scopus 로고
    • HO, R., MAI, K., AND HOROWITZ, M. 2001. The Future of wires. Proceedings of the IEEE 89, 4 (Apr.).
    • HO, R., MAI, K., AND HOROWITZ, M. 2001. The Future of wires. Proceedings of the IEEE 89, 4 (Apr.).
  • 24
    • 0141538149 scopus 로고    scopus 로고
    • HO, R., MAI, K., AND HOROWITZ, M. 2003. Efficient on-chip global interconnects. In Symposium on VLSI Circuits. 271-274.
    • HO, R., MAI, K., AND HOROWITZ, M. 2003. Efficient on-chip global interconnects. In Symposium on VLSI Circuits. 271-274.
  • 25
    • 2442653861 scopus 로고    scopus 로고
    • HOROWITZ, M. AND DALLY, W. 2004. How scaling will change processor architecture. In Proceedings of the International Solid-State Circuits Conference. 132-133.
    • HOROWITZ, M. AND DALLY, W. 2004. How scaling will change processor architecture. In Proceedings of the International Solid-State Circuits Conference. 132-133.
  • 27
    • 59449084767 scopus 로고    scopus 로고
    • ITU-T REC. H.264. 2002. ISO/IEC 144496-10 AVC. 2002.
    • ITU-T REC. H.264. 2002. ISO/IEC 144496-10 AVC. 2002.
  • 33
    • 84872894119 scopus 로고    scopus 로고
    • KRANZ, D., JOHNSON, K., AGARWAL, A., KUBIATOWICZ, J., AND LIM, B.-H. 1993. Integrating messagepassing and shared-memory: early experience. In Proceedings of the 4th ACM SIGPLAN symposium on Principles and Practice of Parallel. Programming. 54-63.
    • KRANZ, D., JOHNSON, K., AGARWAL, A., KUBIATOWICZ, J., AND LIM, B.-H. 1993. Integrating messagepassing and shared-memory: early experience. In Proceedings of the 4th ACM SIGPLAN symposium on Principles and Practice of Parallel. Programming. 54-63.
  • 35
    • 35348861326 scopus 로고    scopus 로고
    • LEVERICH, J., ARAKIDA, H., SOLOMATNIKOV, A., FIROOZSHAHIAN, A., HOROWITZ, M., AND KOZYRAKIS, C. 2007. Comparing memory systems for chip multiprocessors. In Proceedings of the 34th Annual, International Symposium on Computer Architecture. 358-368.
    • LEVERICH, J., ARAKIDA, H., SOLOMATNIKOV, A., FIROOZSHAHIAN, A., HOROWITZ, M., AND KOZYRAKIS, C. 2007. Comparing memory systems for chip multiprocessors. In Proceedings of the 34th Annual, International Symposium on Computer Architecture. 358-368.
  • 36
    • 59449105948 scopus 로고    scopus 로고
    • LEWIS, B. AND BERG, D. J. 1998. Multithreaded. Programming with Pthreads. Upper saddle River. NJ: Prentice Hall.
    • LEWIS, B. AND BERG, D. J. 1998. Multithreaded. Programming with Pthreads. Upper saddle River. NJ: Prentice Hall.
  • 37
    • 59449091948 scopus 로고    scopus 로고
    • LI, M. et al. 2005. ALP: efficient support for all levels of parallelism for complex MEDIA applications. TECH. REP. UIUCDCS-R-2005-2605, UIUC CS. JULY.
    • LI, M. et al. 2005. ALP: efficient support for all levels of parallelism for complex MEDIA applications. TECH. REP. UIUCDCS-R-2005-2605, UIUC CS. JULY.
  • 38
    • 17644395320 scopus 로고    scopus 로고
    • Blocking and array contraction across arbitrarily nested loops using affine partitioning
    • LIM, A. W., LIAO, S.-W., AND LAM, M. S. 2001. Blocking and array contraction across arbitrarily nested loops using affine partitioning. ACM SIGPLAN Notices 36, 7, 103-112.
    • (2001) ACM SIGPLAN Notices , vol.36 , Issue.7 , pp. 103-112
    • LIM, A.W.1    LIAO, S.-W.2    LAM, M.S.3
  • 39
    • 59449098153 scopus 로고    scopus 로고
    • LIN, Y. 2004. A programmable Vector coprocessor architecture for wireless applications. In Proceedings of the 3rd Workshop on Application Specific Processors.
    • LIN, Y. 2004. A programmable Vector coprocessor architecture for wireless applications. In Proceedings of the 3rd Workshop on Application Specific Processors.
  • 40
    • 33646932154 scopus 로고    scopus 로고
    • LOGHI, M. AND PNCINO, M. 2005. Exploring energy/performance tradeoffs in shared memory MPSoCs: Snoop-based cache coherence vs. software solutions. In Proceedings of the Design Automation and Test in Europe Conference
    • LOGHI, M. AND PNCINO, M. 2005. Exploring energy/performance tradeoffs in shared memory MPSoCs: Snoop-based cache coherence vs. software solutions. In Proceedings of the Design Automation and Test in Europe Conference
  • 41
    • 59449091947 scopus 로고    scopus 로고
    • MACHNICKI, E. 2005. Ultra high performance scalable DSP family for multimedia. In Proceedings of the Conference Record of Hot Chips 17.
    • MACHNICKI, E. 2005. Ultra high performance scalable DSP family for multimedia. In Proceedings of the Conference Record of Hot Chips 17.
  • 42
    • 59449086506 scopus 로고    scopus 로고
    • MAI, K. et al. 2000. Smart memories: A modular reconfigurable architecture. In Proceedings of the 27th International Symposium on Computer architecture.
    • MAI, K. et al. 2000. Smart memories: A modular reconfigurable architecture. In Proceedings of the 27th International Symposium on Computer architecture.
  • 43
    • 59449088280 scopus 로고    scopus 로고
    • MIPS32 2001. MIPS32 Architecture For Programmers II: The MIPS32 Instruction Set. MIPS Technologies, Inc.
    • MIPS32 2001. MIPS32 Architecture For Programmers Volume II: The MIPS32 Instruction Set. MIPS Technologies, Inc.
  • 45
    • 33749048325 scopus 로고    scopus 로고
    • MPEG SOFTWARE SIMULATION GROUP, Available at
    • MPEG SOFTWARE SIMULATION GROUP. Mssg mpeg2 encoder and decoder. Available at: http://www.mpeg.org/MPEG/MSSG/.
    • Mssg mpeg2 encoder and decoder
  • 46
    • 84905483003 scopus 로고    scopus 로고
    • TRIPS: A polymorphous architecture for exploiting ILP, TLP, and DLP
    • SANKARALINGAM, K. 2004. TRIPS: A polymorphous architecture for exploiting ILP, TLP, and DLP. ACM Trans. Archit. Code Optirn. 1, 1, 62-93.
    • (2004) ACM Trans. Archit. Code Optirn. 1 , vol.1 , pp. 62-93
    • SANKARALINGAM, K.1
  • 47
    • 38349050777 scopus 로고    scopus 로고
    • A performance analysis of PIM, stream processing, and tiled processing on memory-intensive signal processing kernels
    • SUH, J. et al. 2003. A performance analysis of PIM, stream processing, and tiled processing on memory-intensive signal processing kernels. In Proceedings of the 30th International Symposium on Computer Architecture.
    • (2003) Proceedings of the 30th International Symposium on Computer Architecture
    • SUH, J.1
  • 48
    • 59449087566 scopus 로고    scopus 로고
    • TARJAN, D., THOZIYOOR, S., AND JOUPPI, N. P. 2006. CACTI 4.0. Tech. Rep. HPL-2006-86, HP Labs.
    • TARJAN, D., THOZIYOOR, S., AND JOUPPI, N. P. 2006. CACTI 4.0. Tech. Rep. HPL-2006-86, HP Labs.
  • 50
    • 84868867970 scopus 로고    scopus 로고
    • TENSILICA 2007. Tensilica Software Tools, http://www.tensilica. com/products/software.htm.
    • (2007) Tensilica Software Tools
  • 52
  • 55


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.