SCOPUS 정보 검색 플랫폼

Proceedings - International Symposium on Computer Architecture

Volumn , Issue , 2012, Pages 84-93

Boosting mobile GPU performance with a decoupled access/execute fragment processor

(3) Arnau, José María a Parcerisa, Joan Manuel a Xekalakis, Polychronis b

a UNIVERSITAT POLITÈCNICA DE CATALUNYA (Spain)

b INTEL CORPORATION (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BANDWIDTH USAGE; DATA SHARING; DECOUPLED ARCHITECTURE; GRAPHICAL APPLICATIONS; GRAPHICS RENDERING; GROWING MARKETS; HARDWARE/SOFTWARE; HIDING MEMORY LATENCY; HIGH-ENERGY COSTS; MULTI-THREADING; MULTITHREADED; OPERATING TIME; POWER BUDGETS; PREFETCHING; TEXTURE CACHE; TRADITIONAL TECHNIQUES;

COMPUTER ARCHITECTURE; COMPUTER GRAPHICS; ENERGY EFFICIENCY; MULTITASKING;

COMPUTER GRAPHICS EQUIPMENT;

EID: 84864858885 PISSN: 10636897 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ISCA.2012.6237008 Document Type: Conference Paper

Times cited : (23)

References (34)

1
- 78149272413
- NoCaware cache design for chip multiprocessors
- Sept.
- A. K. Abousamra, R. G. Melhem, A. K. Jones. "NoCaware cache design for chip multiprocessors". In Proc. of PACT, pp. 565-566, Sept. 2010.
- (2010) Proc. of PACT , pp. 565-566
- Abousamra, A.K.¹ Melhem, R.G.² Jones, A.K.³

2
- 77954016821
- Graphics for the masses: A hardware rasterization architecture for mobile phones
- July
- T. Akenine-Möller and J. Strom. "Graphics for the masses: a hardware rasterization architecture for mobile phones". In Proc. of SIGGRAPH, pp. 801-808, July 2003.
- (2003) Proc. of SIGGRAPH , pp. 801-808
- Akenine-Möller, T.¹ Strom, J.²

3
- 79953106704
- An analysis of power consumption in a smartphone
- June
- A. Carroll and G. Heiser. "An analysis of power consumption in a smartphone". In Proc. of USENIXATC, pp. 21-34, June 2010.
- (2010) Proc. of USENIXATC , pp. 21-34
- Carroll, A.¹ Heiser, G.²

4
- 80052534264
- OUTRIDER : Efficient memory latency tolerance with decoupled strands
- June
- N. C. Crago and S. J. Patel. "OUTRIDER : efficient memory latency tolerance with decoupled strands". In Proc. of ISCA, pp. 117-128, June 2011.
- (2011) Proc. of ISCA , pp. 117-128
- Crago, N.C.¹ Patel, S.J.²

5
- 0026962180
- Stride directed prefetching in scalar processors
- Dec.
- John W. C. Fu, Janak H. Patel, and Bob L. Janssens. "Stride directed prefetching in scalar processors". SIGMICRO Newsl., pp. 102-110, Dec. 1992
- (1992) SIGMICRO Newsl. , pp. 102-110
- Fu, J.W.C.¹ Patel, J.H.² Janssens, B.L.³

6
- 47349104432
- Dynamic warp formation and scheduling for efficient GPU control flow
- Dec.
- Wilson W. L. Fung, I. Sham, G. Yuan and T. M. Aamodt. "Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow". In Proc. of MICRO, pp. 407-420, Dec. 2007.
- (2007) Proc. of MICRO , pp. 407-420
- Fung, W.W.L.¹ Sham, I.² Yuan, G.³ Aamodt, T.M.⁴

7
- 84863419336
- DAPSCO: Distance-aware partially shared cache organization
- Jan.
- A. García-Guirado, R. Fernández-Pascual, A. Ros and J. M. Garcí. "DAPSCO: Distance-aware partially shared cache organization". In ACM Trans. on Arch. and Code Optimization, 8 (4), Jan. 2012.
- (2012) ACM Trans. on Arch. and Code Optimization , vol.8 , Issue.4
- García-Guirado, A.¹ Fernández-Pascual, R.² Ros, A.³ Garcí, J.M.⁴

8
- 80052533471
- Energy-efficient mechanisms for managing thread context in throughput processors
- June
- M. Gebhart, D. R. Johnson, D. Taljan, S. W. Keckler, W. J. Dally, E. Lindholm and K. Skadron. "Energy-efficient mechanisms for managing thread context in throughput processors". In Proc. of ISCA, pp. 235-246, June 2011.
- (2011) Proc. of ISCA , pp. 235-246
- Gebhart, M.¹ Johnson, D.R.² Taljan, D.³ Keckler, S.W.⁴ Dally, W.J.⁵ Lindholm, E.⁶ Skadron, K.⁷

9
- 70350601187
- Reactive NUCA: Near-optimal block placement and replication in distributed caches
- June
- N. Hardavellas, M. Ferdman, B. Falsafi, A. Ailamaki. "Reactive NUCA: near-optimal block placement and replication in distributed caches". In Proc. of ISCA, pp. 184-195, June 2009.
- (2009) Proc. of ISCA , pp. 184-195
- Hardavellas, N.¹ Ferdman, M.² Falsafi, B.³ Ailamaki, A.⁴

10
- 84864858569
- Hewlett-Packard
- Hewlett-Packard. "HP ProBook 5330m Notebook PC Overview". http://h18000.wwwl.hp.com/products/quickspecs/14018-na/14018-na.HTML.
- HP ProBook 5330m Notebook PC Overview

11
- 77954994853
- An int rated GPU power and performance model
- June
- S.Hong and H.Kim."An int rated GPU power and performance model".In Proc. of ISCA,pp.280-289,June 2010.
- (2010) Proc. of ISCA , pp. 280-289
- Hong, S.¹ Kim, H.²

12
- 0031606564
- Prefetching in a texture cache architecture
- Aug.
- H. Igehy, M. Eldridge and K. Proudfoot. "Prefetching in a texture cache architecture". In Proc. of SIGGRAPHI 93 EUROGRAPHICS workshop on Graphics Hardware, pp. 133-142, Aug. 1998.
- (1998) Proc. of SIGGRAPHI 93 EUROGRAPHICS Workshop on Graphics Hardware , pp. 133-142
- Igehy, H.¹ Eldridge, M.² Proudfoot, K.³

13
- 0030677583
- Prefetching using markov predictors
- June
- D. Joseph and D. Grunwald. "Prefetching using markov predictors". In Proc. of ISCA, pp. 252-263, June 1997.
- (1997) Proc. of ISCA , pp. 252-263
- Joseph, D.¹ Grunwald, D.²

14
- 0036287598
- Going the distance for tlb prefetching: An application-driven study
- G. B. Kandiraju and A. Sivasubramaniam. "Going the distance for tlb prefetching: an application-driven study". In Proc. of ISCA, pp.195-206, 2002
- (2002) Proc. of ISCA , pp. 195-206
- Kandiraju, G.B.¹ Sivasubramaniam, A.²

15
- 79951719035
- Many-thread aware prefetching mechanisms for GPGPU applications
- December
- J. Lee, N. B. Lakshminarayana, H. Kim and R, Vuduc. "Many-Thread Aware Prefetching Mechanisms for GPGPU Applications". In Proc. of MICRO, pp. 213-224, December 2010.
- (2010) Proc. of MICRO , pp. 213-224
- Lee, J.¹ Lakshminarayana, N.B.² Kim, H.³ Vuduc, R.⁴

16
- 34047094473
- Power analysis of mobile 3D graphics
- March
- B. Mochocki, K. Lahiri and S. Cadambi. "Power analysis of mobile 3D graphics". In Proc. of DATE, pp. 502-507, March 2006.
- (2006) Proc. of DATE , pp. 502-507
- Mochocki, B.¹ Lahiri, K.² Cadambi, S.³

17
- 2342644731
- Data cache prefetching using a global history buffer
- February
- K. J. Nesbit and J. E. Smith. "Data Cache Prefetching Using a Global History Buffer". In Proc. of HPCA, pp. 96-105, February 2004.
- (2004) Proc. of HPCA , pp. 96-105
- Nesbit, K.J.¹ Smith, J.E.²

18
- 84864856361
- NVIDIA. Bringing High-End Graphics to Handheld Devices. 2011. http://www.nvidia.com/content/PDF/tra-white-papers/Bringing-HighEnd-Graphics-to- Handheld-Devices.pdf.
- (2011) Bringing High-End Graphics to Handheld Devices

19
- 0035481610
- Improving Latency Tolerance of Multithreading through Decoupling
- October
- J.-M.Parcerisa and A. González. "Improving Latency Tolerance of Multithreading through Decoupling". IEEE Transactions on Computers, vol. 50, no. 10, pp. 1084- 1094, October 2001.
- (2001) IEEE Transactions on Computers , vol.50 , Issue.10 , pp. 1084-1094
- Parcerisa, J.-M.¹ González, A.²

20
- 28244466233
- Designing graphics programming interfaces for mobile devices
- Nov.
- K. Pulli, T. Aarnio, K. Roimela and J. Vaarala. "Designing Graphics Programming Interfaces for Mobile Devices". In Proc. of IEEE Computer Graphics and Applications, pp. 66-75, Nov. 2005.
- (2005) Proc. of IEEE Computer Graphics and Applications , pp. 66-75
- Pulli, K.¹ Aarnio, T.² Roimela, K.³ Vaarala, J.⁴

21
- 84864857662
- Qualcomm. "Two-Headed Snapdragon Takes Flight". http://www.qualcomm.com/documents/files/linley-report-dual-core-snapdragon.pdf.
- Two-Headed Snapdragon Takes Flight

22
- 78650725832
- A flexible simulation framework for graphics architectures
- Aug.
- J. W. Sheaffer, D. Luebke and K. Skadron. "A flexible simulation framework for graphics architectures". In Proc. of SIGGRAPH/EUROGRAPHICS Conf on Graphics Hardware, pp. 85-94, Aug. 2004.
- (2004) Proc. of SIGGRAPH/EUROGRAPHICS Conf on Graphics Hardware , pp. 85-94
- Sheaffer, J.W.¹ Luebke, D.² Skadron, K.³

23
- 77949460181
- S. Thoziyoor, N. Muralimanohar, J.H. Ahn, and N.P. Jouppi. CACTI 5.1. Tech. report, HP Laboratories. 2008.
- (2008) CACTI 5.1. Tech. Report HP Laboratories
- Thoziyoor, S.¹ Muralimanohar, N.² Ahn, J.H.³ Jouppi, N.P.⁴

24
- 84976822030
- Decoupled access/execute computer architectures
- November
- J.E. Smith. "Decoupled accesslExecute Computer Architectures". In ACM Trans. Computer Systems, vol. 2, no. 4, pp. 289-308, November 1984.
- (1984) ACM Trans. Computer Systems , vol.2 , Issue.4 , pp. 289-308
- Smith, J.E.¹

25
- 84908086655
- R. M. Soneira. "Smartphone "Super" LCD-OLED Display Technology Shoot-Out". http://www.displaymate.com/Smartphone-ShootOut-l. htm.
- Smartphone " Super" LCD-OLED Display Technology Shoot-Out
- Soneira, R.M.¹

26
- 84864858573
- Increasing memory miss tolerance for SIMD cores
- Nov.
- D. Taljan, J. Meng and K. Skadron. "Increasing memory miss tolerance for SIMD cores". In Proc. of SC'09, pp. 22:1-22:11, Nov. 2009.
- (2009) Proc. of SC'09 , pp. 221-2211
- Taljan, D.¹ Meng, J.² Skadron, K.³

27
- 84864861610
- The sharing tracker: Using ideas from cache coherence hardware to reduce offchip memory traffic with non-coherent caches
- Nov.
- D. Taljan, K. Skadron. "The Sharing Tracker: Using Ideas from Cache Coherence Hardware to Reduce OffChip Memory Traffic with Non-Coherent Caches". In Proc. ofSC'IO, pp. 1-10, Nov. 2010.
- (2010) Proc. of SC'IO , pp. 1-10
- Taljan, D.¹ Skadron, K.²

28
- 80052539481
- SR AM-DRAM hybrid memory with applications to efficient r ister files in fine-grained multithreading
- June
- W.-k. S. Yu, R. Huang, S. Q. Xu, S.-E. Wang, E. Kan and G. E. Suh. "SR AM-DRAM hybrid memory with applications to efficient r ister files in fine-grained multithreading". In Proc. of ISCA, pp. 247-258, June 2011.
- (2011) Proc. of ISCA , pp. 247-258
- Yu, W.-K.S.¹ Huang, R.² Xu, S.Q.³ Wang, S.-E.⁴ Kan, E.⁵ Suh, G.E.⁶

29
- 77952040409
- COMPASS: A programmable data prefetcher using idle GPU shaders
- March
- D. H. Woo and Hsien-Hsin S. Lee. "COMPASS: a programmable data prefetcher using idle GPU shaders". In Proc. of ASPLOS, pp. 297-310, March 2010.
- (2010) Proc. of ASPLOS , pp. 297-310
- Woo, D.H.¹ Lee, H.-H.S.²

30
- 84864857664
- "2G GPRS vs. 3G UMTS connection battery usage on mobile phones". http://blog.famzah.netl2010/0S/24/2g-gprs-vs-3g-umts-connection- battery-usage-onmobile-phones I.
- 2G GPRS Vs. 3G UMTS Connection Battery Usage on Mobile Phones

31
- 84864858572
- http://en.wikipedia.org/wiki/Neo-FreeR unner

32
- 84864836528
- http://en.wikipedia.org/wiki/Samsung-eternity

33
- 84864836527
- http://en.wikipedia.org/wiki/Samsung-Galaxy-S

34
- 84864848414
- http://en.wikipedia.org/wiki/Samsung-Galaxy-S-II

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.