SCOPUS Information Retrieval Platform

Proceedings of the Annual International Symposium on Microarchitecture, MICRO

Volume , Issue 2008 PROCEEDINGS, 2008, Pages 164-175

Tradeoffs in designing accelerator architectures for visual computing

(4) Mahesri, Aqeel (a); Johnson, Daniel (a); Crago, Neal (a); Patel, Sanjay J. (a)

(a) University of Illinois at Urbana-Champaign (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACCELERATOR ARCHITECTURES; AREA RATIOS; BENCHMARK SUITES; CACHE BANDWIDTH; DATA-COMMUNICATION; DESIGN SPACES; GENERAL PURPOSE; GRAPHICS RENDERING; HIGH THROUGHPUT; MEMORY HIERARCHY; MIMD ARCHITECTURE; MULTI-THREADING; PERFORMANCE TRADE-OFF; POOR PERFORMANCE; UNIPROCESSOR; VIDEO ENCODING; VISUAL COMPUTING;

BENCHMARKING; CACHE MEMORY; COMPUTER VISION; MULTITASKING;

ARCHITECTURAL DESIGN;

EID: 66749170578 PISSN: 10724451 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/MICRO.2008.4771788 Document Type: Conference Paper

Times cited : (35)

References (38)

1
- 34948848984
- AGEIA PhysX. http://www.ageia.com.
- AGEIA PhysX

2
- 66749087557
- MIPS32 74K. http://www.mips.com/products/cores/32-bit-cores/mips32-74k/index.cfm.

3
- 84868983129
- Tensilica Diamond 570T. http://www.tensilica.com/diamond/di-570t.htm.
- Tensilica Diamond 570T

4
- 84868984135
- Tilera TILE64 Processor Overview. http://www.tilera.com/pdf/Pro-Brief-Tile64-Web.pdf.
- Tilera TILE64 Processor Overview

5
- 36849014883
- The International Technology Roadmap for Semiconductors 2005 Edition
- The International Technology Roadmap for Semiconductors 2005 Edition, System Drivers, 2005.
- (2005) System Drivers

6
- 84868969839
- ATI CTM Guide, 2007. http://ati.amd.com/companyinfo/researcher/documents/ATI-CTM-Guide.pdf.
- (2007) ATI CTM Guide

7
- 34547309668
- CUDA Programming Guide 1.0, 2007. http://developer.nvidia.com/object/cuda.html.
- (2007) CUDA Programming Guide 1.0

8
- 66749135283
- Tradeoffs in designing accelerator architectures for visual computing
- Technical Report UILU-ENG-08-2208, University of Illinois, May
- Aqeel Mahesri et al. Tradeoffs in designing accelerator architectures for visual computing. Technical Report UILU-ENG-08-2208, University of Illinois, May 2008.
- (2008)
- Mahesri, A.¹

9
- 84900342836
- SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance
- V. Aslot, M. Domeika, R. Eigenmann, G. Gaertner, W. B. Jones, and B. Parady. SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance. Lecture Notes in Computer Science, 2104, 2001.
- (2001) Lecture Notes in Computer Science , vol.2104
- Aslot, V.¹ Domeika, M.² Eigenmann, R.³ Gaertner, G.⁴ Jones, W.B.⁵ Parady, B.⁶

10
- 34547471544
- J. Balfour and W. J. Dally. Design tradeoffs for tiled CMP on-chip networks. In ICS-20, pages 187-198, 2006.

11
- 51549095074
- The PARSEC Benchmark Suite: Characterization and Architectural Implications
- Technical report, Princeton University, January
- C. Bienia, S. Kumar, J. P. Singh, and K. Li. The PARSEC Benchmark Suite: Characterization and Architectural Implications. Technical report, Princeton University, January 2008.
- (2008)
- Bienia, C.¹ Kumar, S.² Singh, J.P.³ Li, K.⁴

12
- 84868968927
- Blender.org. Blender, http://www.blender.org.
- Blender.org. Blender

13
- 47349119875
- Faceperf: Benchmarks for face recognition algorithms
- 27-29 Sept
- D. Bolme, M. Strout, and J. Beveridge. Faceperf: Benchmarks for face recognition algorithms. Workload Characterization, 2007. IISWC2007, pages 114-119, 27-29 Sept. 2007.
- (2007) Workload Characterization, 2007. IISWC2007 , pp. 114-119
- Bolme, D.¹ Strout, M.² Beveridge, J.³

14
- 0027678189
- NETRA: A Hierarchical and Partitionable Architecture for Computer Vision Systems
- A. N. Choudhary, J. H. Patel, and N. Ahuja. NETRA: A Hierarchical and Partitionable Architecture for Computer Vision Systems. IEEE Trans. Parallel Distrib. Syst., 4(10):1092-1104, 1993.
- (1993) IEEE Trans. Parallel Distrib. Syst , vol.4 , Issue.10 , pp. 1092-1104
- Choudhary, A.N.¹ Patel, J.H.² Ahuja, N.³

15
- 84971454014
- The REYES image rendering architecture
- July
- R. L. Cook, L. Carpenter, and E. Catmull. The REYES image rendering architecture. In ACM SIGGRAPH, July 1987.
- (1987) ACM SIGGRAPH
- Cook, R.L.¹ Carpenter, L.² Catmull, E.³

16
- 84871292516
- EEMBC
- EEMBC. Embedded Microprocessor Benchmark Consortium. http://www.eembc.org.
- Embedded Microprocessor Benchmark Consortium

17
- 34548858682
- S. V. et al. An 80-Tile 1.28TFLOPS Network-on-Chip in 65 nm CMOS. In ISSCC Digest of Technical Papers, February 2007.

18
- 47349104432
- Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow
- December
- W. W. Fung, I. Sham, G. Yuan, and T. M. Aamodt. Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow. In Micro-40, December 2007.
- (2007) Micro-40
- Fung, W.W.¹ Sham, I.² Yuan, G.³ Aamodt, T.M.⁴

19
- 33646015987
- Synergistic Processing in Cell's Multicore Architecture
- M. Gschwind, H. P. Hofstee, B. Flachs, M. Hopkins, Y. Watanabe, and T. Yamazaki. Synergistic Processing in Cell's Multicore Architecture. IEEE Micro, 26(2):10-24, 2006.
- (2006) IEEE Micro , vol.26 , Issue.2 , pp. 10-24
- Gschwind, M.¹ Hofstee, H.P.² Flachs, B.³ Hopkins, M.⁴ Watanabe, Y.⁵ Yamazaki, T.⁶

20
- 66749128524
- Multi-Core and Beyond: Evolving the x86 Architecture
- P. Hester. Multi-Core and Beyond: Evolving the x86 Architecture. AMD, Aug 2007. HotChips presentation.
- AMD, Aug 2007. HotChips presentation
- Hester, P.¹

21
- 42549168687
- Exploring the cache design space for large scale CMPs
- L. Hsu, R. Iyer, S. Makineni, S. Reinhardt, and D. Newell. Exploring the cache design space for large scale CMPs. ACM SIGARCH Computer Architecture News, 33(4):24-33, 2005.
- (2005) ACM SIGARCH Computer Architecture News , vol.33 , Issue.4 , pp. 24-33
- Hsu, L.¹ Iyer, R.² Makineni, S.³ Reinhardt, S.⁴ Newell, D.⁵

22
- 0035187053
- Exploring the design space of future CMPs
- J. Huh, D. Burger, and S. Keckler. Exploring the design space of future CMPs. In PACT2001, pages 199-210, 2001.
- (2001) PACT2001 , pp. 199-210
- Huh, J.¹ Burger, D.² Keckler, S.³

23
- 34247174509
- Core architecture optimization for heterogeneous chip multiprocessors
- New York, NY, USA, ACM
- R. Kumar, D. M. Tullsen, and N. P. Jouppi. Core architecture optimization for heterogeneous chip multiprocessors. In PACT '06, pages 23-32, New York, NY, USA, 2006. ACM.
- (2006) PACT '06 , pp. 23-32
- Kumar, R.¹ Tullsen, D.M.² Jouppi, N.P.³

24
- 27544456315
- Interconnections in Multi-Core Architectures: Understanding Mechanisms, Overheads, and Scaling
- R. Kumar, V. Zyuban, and D. M. Tullsen. Interconnections in Multi-Core Architectures: Understanding Mechanisms, Overheads, and Scaling. In ISCA-32, 2005.
- (2005) ISCA-32
- Kumar, R.¹ Zyuban, V.² Tullsen, D.M.³

25
- 54249127091
- Design tradeoffs in floating-point unit implementation for embedded and processing-in-memory systems
- May
- T.-J. Kwon, J. Sondeen, and J. Draper. Design tradeoffs in floating-point unit implementation for embedded and processing-in-memory systems. In IEEE International Symposium on Circuits and Systems, volume 4, May 2005.
- (2005) IEEE International Symposium on Circuits and Systems , vol.4
- Kwon, T.-J.¹ Sondeen, J.² Draper, J.³

26
- 66749149994
- Visualizing the Behavior of Logic Synthesis Algorithms
- H. A. Landman. Visualizing the Behavior of Logic Synthesis Algorithms. In SNUG 98: Proceedings of the Synopsys User Group Conference, 1998.
- (1998) SNUG 98: Proceedings of the Synopsys User Group Conference
- Landman, H.A.¹

27
- 0031339427
- MediaBench: A Tool for Evaluating and Synthesizing Multimedia and Communications Systems
- C. Lee, M. Potkonjak, and W. H. Mangione-Smith. MediaBench: A Tool for Evaluating and Synthesizing Multimedia and Communications Systems. In Micro-30, 1997.
- (1997) Micro-30
- Lee, C.¹ Potkonjak, M.² Mangione-Smith, W.H.³

28
- 33748857902
- CMP Design Space Exploration Subject to Physical Constraints
- Y. Li, B. Lee, D. Brooks, Z. Hu, and K. Skadron. CMP Design Space Exploration Subject to Physical Constraints. In HPCA-12, 2006.
- (2006) HPCA-12
- Li, Y.¹ Lee, B.² Brooks, D.³ Hu, Z.⁴ Skadron, K.⁵

29
- 84982318971
- GPGPU: General purpose computation on graphics hardware
- August
- D. Luebke, M. Harris, J. Krüger, T. Purcell, N. Govindaraju, I. Buck, C. Woolley, and A. Lefohn. GPGPU: general purpose computation on graphics hardware. In ACM SIGGRAPH, August 2004.
- (2004) ACM SIGGRAPH
- Luebke, D.¹ Harris, M.² Krüger, J.³ Purcell, T.⁴ Govindaraju, N.⁵ Buck, I.⁶ Woolley, C.⁷ Lefohn, A.⁸

30
- 34547425357
- Design space exploration for multicore architectures: A power/performance/thermal view
- M. Monchiero, R. Canal, and A. González. Design space exploration for multicore architectures: a power/performance/thermal view. In ICS-20, pages 178-186, 2006.
- (2006) ICS-20 , pp. 178-186
- Monchiero, M.¹ Canal, R.² González, A.³

31
- 47349084021
- Optimizing NUCA Organizations and Wiring Alternatives for Large Caches With CACTI 6.0
- December
- N. Muralimanohar, R. Balasubramonian, and N. Jouppi. Optimizing NUCA Organizations and Wiring Alternatives for Large Caches With CACTI 6.0. In Micro-40, December 2007.
- (2007) Micro-40
- Muralimanohar, N.¹ Balasubramonian, R.² Jouppi, N.³

32
- 66749163608
- S. S. Stone, H. Yi, W.-M. W. Hwu, J. P. Haldar, B. P. Sutton, and Z.-P. Liang. How GPUs Can Improve the Quality of Magnetic Resonance Imaging. The 1st Workshop on GPGPU, 2007.

33
- 56649117084
- SIMD Ray Stream Tracing - SIMD Ray Traversal with Generalized Ray Packets and On-the-fly Re-Ordering
- Technical Report UUSCI-2007-012
- I. Wald, C. P. Gribble, S. Boulos, and A. Kensler. SIMD Ray Stream Tracing - SIMD Ray Traversal with Generalized Ray Packets and On-the-fly Re-Ordering. Technical Report UUSCI-2007-012, 2007.
- (2007)
- Wald, I.¹ Gribble, C.P.² Boulos, S.³ Kensler, A.⁴

34
- 2342646021
- Addison Wesley
- N. Weste and D. Harris. CMOS VLSI Design: A Circuits and Systems Perspective. Addison Wesley, 2005.
- (2005) CMOS VLSI Design: A Circuits and Systems Perspective
- Weste, N.¹ Harris, D.²

35
- 0029179077
- The SPLASH-2 Programs: Characterization and Methodological Considerations
- S. C. Woo, M. Ohara, E. Torrie, J. P. Singh, and A. Gupta. The SPLASH-2 Programs: Characterization and Methodological Considerations. In ISCA-22, pages 24-36, 1995.
- (1995) ISCA-22 , pp. 24-36
- Woo, S.C.¹ Ohara, M.² Torrie, E.³ Singh, J.P.⁴ Gupta, A.⁵

36
- 30744459395
- RPU: A programmable ray processing unit for realtime ray tracing
- 434-444
- S. Woop, J. Schmittler, and P. Slusallek. RPU: a programmable ray processing unit for realtime ray tracing. ACM Trans. Graph., 24(3):434-444, 2005.
- (2005) ACM Trans. Graph , vol.24 , Issue.3
- Woop, S.¹ Schmittler, J.² Slusallek, P.³

37
- 33646810855
- Prediction-based Directional Fractional Pixel Motion Estimation for H.264 Video Coding
- L. Yang, K. Yu, J. Li, and S. Li. Prediction-based Directional Fractional Pixel Motion Estimation for H.264 Video Coding. IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
- (2005) IEEE International Conference on Acoustics, Speech, and Signal Processing
- Yang, L.¹ Yu, K.² Li, J.³ Li, S.⁴

38
- 35348814920
- ParallAX: An Architecture for Real-Time Physics
- T. Y. Yeh, P. Faloutsos, S. J. Patel, and G. Reinman. ParallAX: An Architecture for Real-Time Physics. In ISCA-34, 2007.
- (2007) ISCA-34
- Yeh, T.Y.¹ Faloutsos, P.² Patel, S.J.³ Reinman, G.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.