메뉴 건너뛰기




Volumn , Issue 2008 PROCEEDINGS, 2008, Pages 164-175

Tradeoffs in designing accelerator architectures for visual computing

Author keywords

[No Author keywords available]

Indexed keywords

ACCELERATOR ARCHITECTURES; AREA RATIOS; BENCHMARK SUITES; CACHE BANDWIDTH; DATA-COMMUNICATION; DESIGN SPACES; GENERAL PURPOSE; GRAPHICS RENDERING; HIGH THROUGHPUT; MEMORY HIERARCHY; MIMD ARCHITECTURE; MULTI-THREADING; PERFORMANCE TRADE-OFF; POOR PERFORMANCE; UNIPROCESSOR; VIDEO ENCODING; VISUAL COMPUTING;

EID: 66749170578     PISSN: 10724451     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/MICRO.2008.4771788     Document Type: Conference Paper
Times cited : (35)

References (38)
  • 1
    • 34948848984 scopus 로고    scopus 로고
    • AGEIA PhysX. http://www.ageia.com.
    • AGEIA PhysX
  • 2
    • 66749087557 scopus 로고    scopus 로고
    • MIPS32 74K. http://www.mips.com/products/cores/32-bit-cores/mips32-74k/ index.cfm.
    • MIPS32 74K. http://www.mips.com/products/cores/32-bit-cores/mips32-74k/ index.cfm.
  • 3
    • 84868983129 scopus 로고    scopus 로고
    • Tensilica Diamond 570T. http://www.tensilica.com/dia-mond/di-570t.htm.
    • , vol.570 T
  • 5
    • 36849014883 scopus 로고    scopus 로고
    • The International Technology Roadmap for Semiconductors 2005 Edition
    • The International Technology Roadmap for Semiconductors 2005 Edition, System Drivers, 2005.
    • (2005) System Drivers
  • 6
    • 84868969839 scopus 로고    scopus 로고
    • ATI CTM Guide, 2007. http://ati.amd.com/companyinfo/researcher/documents/ ATI-CTM-Guide.pdf.
    • (2007) ATI CTM Guide
  • 8
    • 66749135283 scopus 로고    scopus 로고
    • Tradeoffs in designing accelerator architectures for visual computing
    • Technical Report UILU-ENG-08-2208, University of Illinois, May
    • Aqeel Mahesri et al. Tradeoffs in designing accelerator architectures for visual computing. Technical Report UILU-ENG-08-2208, University of Illinois, May 2008.
    • (2008)
    • Mahesri, A.1
  • 10
    • 34547471544 scopus 로고    scopus 로고
    • J. Balfour and W. J. Dally. Design tradeoffs for tiled CMP on-chip networks. In ICS-20, pages 187-198, 2006.
    • J. Balfour and W. J. Dally. Design tradeoffs for tiled CMP on-chip networks. In ICS-20, pages 187-198, 2006.
  • 11
    • 51549095074 scopus 로고    scopus 로고
    • The PARSEC Benchmark Suite: Characterization and Architectural Implications
    • Technical report, Princeton University, January
    • C. Bienia, S. Kumar, J. P. Singh, and K. Li. The PARSEC Benchmark Suite: Characterization and Architectural Implications. Technical report, Princeton University, January 2008.
    • (2008)
    • Bienia, C.1    Kumar, S.2    Singh, J.P.3    Li, K.4
  • 14
    • 0027678189 scopus 로고
    • NETRA: A Hierarchical and Partitionable Architecture for Computer Vision Systems
    • A. N. Choudhary, J. H. Patel, and N. Ahuja. NETRA: A Hierarchical and Partitionable Architecture for Computer Vision Systems. IEEE Trans. Parallel Distrib. Syst., 4(10):1092-1104, 1993.
    • (1993) IEEE Trans. Parallel Distrib. Syst , vol.4 , Issue.10 , pp. 1092-1104
    • Choudhary, A.N.1    Patel, J.H.2    Ahuja, N.3
  • 17
    • 34548858682 scopus 로고    scopus 로고
    • S. V. et. al. An 80-Tile 1.28TFLOPS Network-on-Chip in 65 nm CMOS. In ISSCC Digest of Technical Papers., February 2007.
    • S. V. et. al. An 80-Tile 1.28TFLOPS Network-on-Chip in 65 nm CMOS. In ISSCC Digest of Technical Papers., February 2007.
  • 18
    • 47349104432 scopus 로고    scopus 로고
    • Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow
    • December
    • W. W. Fung, I. Sham, G. Yuan, and T. M. Aamodt. Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow. In Micro-40, December 2007.
    • (2007) Micro-40
    • Fung, W.W.1    Sham, I.2    Yuan, G.3    Aamodt, T.M.4
  • 22
    • 0035187053 scopus 로고    scopus 로고
    • Exploring the design space of future CMPs
    • J. Huh, D. Burger, and S. Keckler. Exploring the design space of future CMPs. In PACT2001, pages 199-210, 2001.
    • (2001) PACT2001 , pp. 199-210
    • Huh, J.1    Burger, D.2    Keckler, S.3
  • 23
    • 34247174509 scopus 로고    scopus 로고
    • Core architecture optimization for heterogeneous chip multiprocessors
    • New York, NY, USA, ACM
    • R. Kumar, D. M. Tullsen, and N. P. Jouppi. Core architecture optimization for heterogeneous chip multiprocessors. In PACT '06, pages 23-32, New York, NY, USA, 2006. ACM.
    • (2006) PACT '06 , pp. 23-32
    • Kumar, R.1    Tullsen, D.M.2    Jouppi, N.P.3
  • 24
    • 27544456315 scopus 로고    scopus 로고
    • Interconnections in Multi-Core Architectures: Understanding Mechanisms, Overheads, and Scaling
    • R. Kumar, V. Zyuban, and D. M. Tullsen. Interconnections in Multi-Core Architectures: Understanding Mechanisms, Overheads, and Scaling. In ISCA-32, 2005.
    • (2005) ISCA-32
    • Kumar, R.1    Zyuban, V.2    Tullsen, D.M.3
  • 25
    • 54249127091 scopus 로고    scopus 로고
    • Design tradeoffs in floating-point unit implementation for embedded and processing-in-memory systems
    • May
    • T.-J. Kwon, J. Sondeen, and J. Draper. Design tradeoffs in floating-point unit implementation for embedded and processing-in-memory systems. In IEEE International Symposium on Circuits and Systems, volume 4, May 2005.
    • (2005) IEEE International Symposium on Circuits and Systems , vol.4
    • Kwon, T.-J.1    Sondeen, J.2    Draper, J.3
  • 27
    • 0031339427 scopus 로고    scopus 로고
    • Media-Bench: A Tool for Evaluating and Synthesizing Multimedia and Communications Systems
    • C. Lee, M. Potkonjak, and W. H. Mangione-Smith. Media-Bench: A Tool for Evaluating and Synthesizing Multimedia and Communications Systems. In Micro-30, 1997.
    • (1997) Micro-30
    • Lee, C.1    Potkonjak, M.2    Mangione-Smith, W.H.3
  • 28
    • 33748857902 scopus 로고    scopus 로고
    • CMP Design Space Exploration Subject to Physical Constraints
    • Y. Li, B. Lee, D. Brooks, Z. Hu, and K. Skadron. CMP Design Space Exploration Subject to Physical Constraints. In HPCA-12, 2006.
    • (2006) HPCA-12
    • Li, Y.1    Lee, B.2    Brooks, D.3    Hu, Z.4    Skadron, K.5
  • 30
    • 34547425357 scopus 로고    scopus 로고
    • Design space exploration for multicore architectures: A power/performance/thermal view
    • M. Monchiero, R. Canal, and A. Gonzlez. Design space exploration for multicore architectures: a power/performance/thermal view. In ICS-20, pages 178-186, 2006.
    • (2006) ICS-20 , pp. 178-186
    • Monchiero, M.1    Canal, R.2    Gonzlez, A.3
  • 31
    • 47349084021 scopus 로고    scopus 로고
    • Optimizing NUCA Organizations and Wiring Alternatives for Large Caches With CACTI 6.0
    • December
    • N. Muralimanohar, R. Balasubramonian, and N. Jouppi. Optimizing NUCA Organizations and Wiring Alternatives for Large Caches With CACTI 6.0. In Micro-40, December 2007.
    • (2007) Micro-40
    • Muralimanohar, N.1    Balasubramonian, R.2    Jouppi, N.3
  • 32
    • 66749163608 scopus 로고    scopus 로고
    • S. S. Stone, H. Yi, W. mei W. Hwu, J. P. Haldar, B. P. Sutton, and Z.-P. Liang. How GPUs Can Improve the Quality of Magnetic Resonance Imaging. The 1st Workshop on GPGPU, 2007.
    • S. S. Stone, H. Yi, W. mei W. Hwu, J. P. Haldar, B. P. Sutton, and Z.-P. Liang. How GPUs Can Improve the Quality of Magnetic Resonance Imaging. The 1st Workshop on GPGPU, 2007.
  • 33
    • 56649117084 scopus 로고    scopus 로고
    • SIMD Ray Stream Tracing - SIMD Ray Traversal with Generalized Ray Packets and On-the-fly Re-Ordering
    • Technical Report UUSCI-2007-012
    • I. Wald, C. P. Gribble, S. Boulos, and A. Kensler. SIMD Ray Stream Tracing - SIMD Ray Traversal with Generalized Ray Packets and On-the-fly Re-Ordering. Technical Report UUSCI-2007-012, 2007.
    • (2007)
    • Wald, I.1    Gribble, C.P.2    Boulos, S.3    Kensler, A.4
  • 35
    • 0029179077 scopus 로고
    • The SPLASH-2 Programs: Characterization and Methodological Considerations
    • S. C. Woo, M. Ohara, E. Torrie, J. P. Singh, and A. Gupta. The SPLASH-2 Programs: Characterization and Methodological Considerations. In ISCA-22, pages 24-6, 1995.
    • (1995) ISCA-22 , pp. 24-26
    • Woo, S.C.1    Ohara, M.2    Torrie, E.3    Singh, J.P.4    Gupta, A.5
  • 36
    • 30744459395 scopus 로고    scopus 로고
    • RPU: A programmable ray processing unit for realtime ray tracing
    • 434-444
    • S. Woop, J. Schmittler, and P. Slusallek. RPU: a programmable ray processing unit for realtime ray tracing. ACM Trans. Graph., 24(3):434-444, 2005.
    • (2005) ACM Trans. Graph , vol.24 , Issue.3
    • Woop, S.1    Schmittler, J.2    Slusallek, P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.