메뉴 건너뛰기




Volumn , Issue , 2012, Pages 84-93

Boosting mobile GPU performance with a decoupled access/execute fragment processor

Author keywords

[No Author keywords available]

Indexed keywords

BANDWIDTH USAGE; DATA SHARING; DECOUPLED ARCHITECTURE; GRAPHICAL APPLICATIONS; GRAPHICS RENDERING; GROWING MARKETS; HARDWARE/SOFTWARE; HIDING MEMORY LATENCY; HIGH-ENERGY COSTS; MULTI-THREADING; MULTITHREADED; OPERATING TIME; POWER BUDGETS; PREFETCHING; TEXTURE CACHE; TRADITIONAL TECHNIQUES;

EID: 84864858885     PISSN: 10636897     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ISCA.2012.6237008     Document Type: Conference Paper
Times cited : (23)

References (34)
  • 1
    • 78149272413 scopus 로고    scopus 로고
    • NoCaware cache design for chip multiprocessors
    • Sept.
    • A. K. Abousamra, R. G. Melhem, A. K. Jones. "NoCaware cache design for chip multiprocessors". In Proc. of PACT, pp. 565-566, Sept. 2010.
    • (2010) Proc. of PACT , pp. 565-566
    • Abousamra, A.K.1    Melhem, R.G.2    Jones, A.K.3
  • 2
    • 77954016821 scopus 로고    scopus 로고
    • Graphics for the masses: A hardware rasterization architecture for mobile phones
    • July
    • T. Akenine-Möller and J. Strom. "Graphics for the masses: a hardware rasterization architecture for mobile phones". In Proc. of SIGGRAPH, pp. 801-808, July 2003.
    • (2003) Proc. of SIGGRAPH , pp. 801-808
    • Akenine-Möller, T.1    Strom, J.2
  • 3
    • 79953106704 scopus 로고    scopus 로고
    • An analysis of power consumption in a smartphone
    • June
    • A. Carroll and G. Heiser. "An analysis of power consumption in a smartphone". In Proc. of USENIXATC, pp. 21-34, June 2010.
    • (2010) Proc. of USENIXATC , pp. 21-34
    • Carroll, A.1    Heiser, G.2
  • 4
    • 80052534264 scopus 로고    scopus 로고
    • OUTRIDER : Efficient memory latency tolerance with decoupled strands
    • June
    • N. C. Crago and S. J. Patel. "OUTRIDER : efficient memory latency tolerance with decoupled strands". In Proc. of ISCA, pp. 117-128, June 2011.
    • (2011) Proc. of ISCA , pp. 117-128
    • Crago, N.C.1    Patel, S.J.2
  • 5
    • 0026962180 scopus 로고
    • Stride directed prefetching in scalar processors
    • Dec.
    • John W. C. Fu, Janak H. Patel, and Bob L. Janssens. "Stride directed prefetching in scalar processors". SIGMICRO Newsl., pp. 102-110, Dec. 1992
    • (1992) SIGMICRO Newsl. , pp. 102-110
    • Fu, J.W.C.1    Patel, J.H.2    Janssens, B.L.3
  • 6
    • 47349104432 scopus 로고    scopus 로고
    • Dynamic warp formation and scheduling for efficient GPU control flow
    • Dec.
    • Wilson W. L. Fung, I. Sham, G. Yuan and T. M. Aamodt. "Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow". In Proc. of MICRO, pp. 407-420, Dec. 2007.
    • (2007) Proc. of MICRO , pp. 407-420
    • Fung, W.W.L.1    Sham, I.2    Yuan, G.3    Aamodt, T.M.4
  • 9
    • 70350601187 scopus 로고    scopus 로고
    • Reactive NUCA: Near-optimal block placement and replication in distributed caches
    • June
    • N. Hardavellas, M. Ferdman, B. Falsafi, A. Ailamaki. "Reactive NUCA: near-optimal block placement and replication in distributed caches". In Proc. of ISCA, pp. 184-195, June 2009.
    • (2009) Proc. of ISCA , pp. 184-195
    • Hardavellas, N.1    Ferdman, M.2    Falsafi, B.3    Ailamaki, A.4
  • 10
    • 84864858569 scopus 로고    scopus 로고
    • Hewlett-Packard
    • Hewlett-Packard. "HP ProBook 5330m Notebook PC Overview". http://h18000.wwwl.hp.com/products/quickspecs/14018-na/14018-na.HTML.
    • HP ProBook 5330m Notebook PC Overview
  • 11
    • 77954994853 scopus 로고    scopus 로고
    • An int rated GPU power and performance model
    • June
    • S.Hong and H.Kim."An int rated GPU power and performance model".In Proc. of ISCA,pp.280-289,June 2010.
    • (2010) Proc. of ISCA , pp. 280-289
    • Hong, S.1    Kim, H.2
  • 13
    • 0030677583 scopus 로고    scopus 로고
    • Prefetching using markov predictors
    • June
    • D. Joseph and D. Grunwald. "Prefetching using markov predictors". In Proc. of ISCA, pp. 252-263, June 1997.
    • (1997) Proc. of ISCA , pp. 252-263
    • Joseph, D.1    Grunwald, D.2
  • 14
    • 0036287598 scopus 로고    scopus 로고
    • Going the distance for tlb prefetching: An application-driven study
    • G. B. Kandiraju and A. Sivasubramaniam. "Going the distance for tlb prefetching: an application-driven study". In Proc. of ISCA, pp.195-206, 2002
    • (2002) Proc. of ISCA , pp. 195-206
    • Kandiraju, G.B.1    Sivasubramaniam, A.2
  • 15
    • 79951719035 scopus 로고    scopus 로고
    • Many-thread aware prefetching mechanisms for GPGPU applications
    • December
    • J. Lee, N. B. Lakshminarayana, H. Kim and R, Vuduc. "Many-Thread Aware Prefetching Mechanisms for GPGPU Applications". In Proc. of MICRO, pp. 213-224, December 2010.
    • (2010) Proc. of MICRO , pp. 213-224
    • Lee, J.1    Lakshminarayana, N.B.2    Kim, H.3    Vuduc, R.4
  • 16
    • 34047094473 scopus 로고    scopus 로고
    • Power analysis of mobile 3D graphics
    • March
    • B. Mochocki, K. Lahiri and S. Cadambi. "Power analysis of mobile 3D graphics". In Proc. of DATE, pp. 502-507, March 2006.
    • (2006) Proc. of DATE , pp. 502-507
    • Mochocki, B.1    Lahiri, K.2    Cadambi, S.3
  • 17
    • 2342644731 scopus 로고    scopus 로고
    • Data cache prefetching using a global history buffer
    • February
    • K. J. Nesbit and J. E. Smith. "Data Cache Prefetching Using a Global History Buffer". In Proc. of HPCA, pp. 96-105, February 2004.
    • (2004) Proc. of HPCA , pp. 96-105
    • Nesbit, K.J.1    Smith, J.E.2
  • 18
  • 19
    • 0035481610 scopus 로고    scopus 로고
    • Improving Latency Tolerance of Multithreading through Decoupling
    • October
    • J.-M.Parcerisa and A. González. "Improving Latency Tolerance of Multithreading through Decoupling". IEEE Transactions on Computers, vol. 50, no. 10, pp. 1084- 1094, October 2001.
    • (2001) IEEE Transactions on Computers , vol.50 , Issue.10 , pp. 1084-1094
    • Parcerisa, J.-M.1    González, A.2
  • 21
    • 84864857662 scopus 로고    scopus 로고
    • Qualcomm. "Two-Headed Snapdragon Takes Flight". http://www.qualcomm.com/documents/files/linley-report-dual-core-snapdragon.pdf.
    • Two-Headed Snapdragon Takes Flight
  • 24
    • 84976822030 scopus 로고
    • Decoupled access/execute computer architectures
    • November
    • J.E. Smith. "Decoupled accesslExecute Computer Architectures". In ACM Trans. Computer Systems, vol. 2, no. 4, pp. 289-308, November 1984.
    • (1984) ACM Trans. Computer Systems , vol.2 , Issue.4 , pp. 289-308
    • Smith, J.E.1
  • 26
    • 84864858573 scopus 로고    scopus 로고
    • Increasing memory miss tolerance for SIMD cores
    • Nov.
    • D. Taljan, J. Meng and K. Skadron. "Increasing memory miss tolerance for SIMD cores". In Proc. of SC'09, pp. 22:1-22:11, Nov. 2009.
    • (2009) Proc. of SC'09 , pp. 221-2211
    • Taljan, D.1    Meng, J.2    Skadron, K.3
  • 27
    • 84864861610 scopus 로고    scopus 로고
    • The sharing tracker: Using ideas from cache coherence hardware to reduce offchip memory traffic with non-coherent caches
    • Nov.
    • D. Taljan, K. Skadron. "The Sharing Tracker: Using Ideas from Cache Coherence Hardware to Reduce OffChip Memory Traffic with Non-Coherent Caches". In Proc. ofSC'IO, pp. 1-10, Nov. 2010.
    • (2010) Proc. of SC'IO , pp. 1-10
    • Taljan, D.1    Skadron, K.2
  • 28
    • 80052539481 scopus 로고    scopus 로고
    • SR AM-DRAM hybrid memory with applications to efficient r ister files in fine-grained multithreading
    • June
    • W.-k. S. Yu, R. Huang, S. Q. Xu, S.-E. Wang, E. Kan and G. E. Suh. "SR AM-DRAM hybrid memory with applications to efficient r ister files in fine-grained multithreading". In Proc. of ISCA, pp. 247-258, June 2011.
    • (2011) Proc. of ISCA , pp. 247-258
    • Yu, W.-K.S.1    Huang, R.2    Xu, S.Q.3    Wang, S.-E.4    Kan, E.5    Suh, G.E.6
  • 29
    • 77952040409 scopus 로고    scopus 로고
    • COMPASS: A programmable data prefetcher using idle GPU shaders
    • March
    • D. H. Woo and Hsien-Hsin S. Lee. "COMPASS: a programmable data prefetcher using idle GPU shaders". In Proc. of ASPLOS, pp. 297-310, March 2010.
    • (2010) Proc. of ASPLOS , pp. 297-310
    • Woo, D.H.1    Lee, H.-H.S.2
  • 31
    • 84864858572 scopus 로고    scopus 로고
    • http://en.wikipedia.org/wiki/Neo-FreeR unner
  • 32
    • 84864836528 scopus 로고    scopus 로고
    • http://en.wikipedia.org/wiki/Samsung-eternity
  • 33
    • 84864836527 scopus 로고    scopus 로고
    • http://en.wikipedia.org/wiki/Samsung-Galaxy-S
  • 34
    • 84864848414 scopus 로고    scopus 로고
    • http://en.wikipedia.org/wiki/Samsung-Galaxy-S-II


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.