메뉴 건너뛰기




Volumn 40, Issue 4, 2010, Pages 195-206

PacketShader: A GPU-accelerated software router

Author keywords

CUDA; GPU; Software router

Indexed keywords

CUDA; EVALUATION RESULTS; GPU; GPU-ACCELERATED; GRAPHICS PROCESSING UNIT; IPSEC TUNNELING; OPENFLOW; PACKET PROCESSING; PROCESSING POWER; SOFTWARE ROUTERS;

EID: 79956014740     PISSN: 01464833     EISSN: 19435819     Source Type: Conference Proceeding    
DOI: 10.1145/1851275.1851207     Document Type: Conference Paper
Times cited : (265)

References (55)
  • 1
    • 84860345475 scopus 로고    scopus 로고
    • AMD Fusion. http://fusion.amd.com.
    • AMD Fusion
  • 4
    • 84874681337 scopus 로고    scopus 로고
    • Cisco QuantumFlow Processors. http://www.cisco.com/en/US/prod/collateral/ routers/ps9343/solution-overview-c22-448936.html.
    • Cisco QuantumFlow Processors
  • 6
    • 84874703488 scopus 로고    scopus 로고
    • GNU Zebra project. http://www.zebra.org.
  • 10
    • 84870515842 scopus 로고    scopus 로고
    • Version 0.8.9
    • OpenFlow Switch Specification, Version 0.8.9. http://www.openflowswitch. org/documents/openflow-spec-v0.8.9.pdf.
    • OpenFlow Switch Specification
  • 11
    • 84874739540 scopus 로고    scopus 로고
    • Quagga project. http://www.quagga.net.
  • 13
    • 84874743129 scopus 로고    scopus 로고
    • The OpenFlow Switch Consortium. http://www.openflowswitch.org.
  • 14
    • 84874707663 scopus 로고    scopus 로고
    • University of Oregon RouteViews project. http://www.routeviews.org/.
  • 15
    • 65249154060 scopus 로고    scopus 로고
    • PC-based software routers: High performance and application service support
    • R. Bolla and R. Bruschi. PC-based software routers: High performance and application service support. In ACM PRESTO, 2008.
    • (2008) ACM PRESTO
    • Bolla, R.1    Bruschi, R.2
  • 18
    • 34748875272 scopus 로고    scopus 로고
    • Evaluating network processing efficiency with processor partitioning and asynchronous i/o
    • T. Brecht, G. J. Janakiraman, B. Lynn, V. Saletore, and Y. Turner. Evaluating network processing efficiency with processor partitioning and asynchronous i/o. SIGOPS Oper. Syst. Rev., 40(4):265-278, 2006.
    • (2006) SIGOPS Oper. Syst. Rev. , vol.40 , Issue.4 , pp. 265-278
    • Brecht, T.1    Janakiraman, G.J.2    Lynn, B.3    Saletore, V.4    Turner, Y.5
  • 21
    • 21644458839 scopus 로고    scopus 로고
    • An in-depth analysis of the impact of processor affinity on network performance
    • A. Foong, J. Fung, and D. Newell. An in-depth analysis of the impact of processor affinity on network performance. In IEEE ICON, 2004.
    • (2004) IEEE ICON
    • Foong, A.1    Fung, J.2    Newell, D.3
  • 22
    • 0031679714 scopus 로고    scopus 로고
    • Routing lookups in hardware at memory access speeds
    • P. Gupta, S. Lin, and N. McKeown. Routing lookups in hardware at memory access speeds. In IEEE INFOCOM, 1998.
    • (1998) IEEE INFOCOM
    • Gupta, P.1    Lin, S.2    McKeown, N.3
  • 24
    • 68949216895 scopus 로고    scopus 로고
    • Practical Symmetric Key Cryptography on Modern Graphics Hardware
    • O. Harrison and J. Waldron. Practical Symmetric Key Cryptography on Modern Graphics Hardware. In USENIX Security, 2008.
    • (2008) USENIX Security
    • Harrison, O.1    Waldron, J.2
  • 25
    • 70450231944 scopus 로고    scopus 로고
    • An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness
    • S. Hong and H. Kim. An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness. In ISCA, 2009.
    • (2009) ISCA
    • Hong, S.1    Kim, H.2
  • 26
    • 14944350490 scopus 로고    scopus 로고
    • Lawrence Berkeley Laboratory, Berkeley, CA
    • V. Jacobson, C. Leres, and S. McCanne. libpcap, Lawrence Berkeley Laboratory, Berkeley, CA. http://www.tcpdump.org.
    • Libpcap
    • Jacobson, V.1    Leres, C.2    McCanne, S.3
  • 28
    • 19544389325 scopus 로고    scopus 로고
    • System capability effects on algorithms for network bandwidth measurement
    • G. Jin and B. L. Tierney. System capability effects on algorithms for network bandwidth measurement. In IMC, 2003.
    • (2003) IMC
    • Jin, G.1    Tierney, B.L.2
  • 29
    • 71949099451 scopus 로고    scopus 로고
    • HPCCD: Hybrid Parallel Continuous Collision Detection using CPUs and GPUs
    • John Wiley & Sons
    • D. Kim, J. Heo, J. Huh, J. Kim, and S. Yoon. HPCCD: Hybrid Parallel Continuous Collision Detection using CPUs and GPUs. In Computer Graphics Forum, volume 28, pages 1791-1800. John Wiley & Sons, 2009.
    • (2009) Computer Graphics Forum , vol.28 , pp. 1791-1800
    • Kim, D.1    Heo, J.2    Huh, J.3    Kim, J.4    Yoon, S.5
  • 31
    • 78149305038 scopus 로고    scopus 로고
    • PdP: Parallelizing data plane in virtual network substrate
    • Y. Liao, D. Yin, and L. Gao. PdP: parallelizing data plane in virtual network substrate. In ACM VISA, 2009.
    • (2009) ACM VISA
    • Liao, Y.1    Yin, D.2    Gao, L.3
  • 32
    • 51049111938 scopus 로고    scopus 로고
    • CUDA compatible GPU as an efficient hardware accelerator for AES cryptography
    • S. Manavski. CUDA compatible GPU as an efficient hardware accelerator for AES cryptography. In IEEE Signal Processing and Communications, 2007.
    • (2007) IEEE Signal Processing and Communications
    • Manavski, S.1
  • 34
    • 0031199999 scopus 로고    scopus 로고
    • Eliminating Receive Livelock in an Interrupt-Driven Kernel
    • J. Mogul and K. Ramarkishnan. Eliminating Receive Livelock in an Interrupt-Driven Kernel. ACM TOCS, 15(3):217-252, 1997.
    • (1997) ACM TOCS , vol.15 , Issue.3 , pp. 217-252
    • Mogul, J.1    Ramarkishnan, K.2
  • 37
    • 78651550268 scopus 로고    scopus 로고
    • Scalable parallel programming with CUDA
    • J. Nickolls, I. Buck, M. Garland, and K. Skadron. Scalable parallel programming with CUDA. Queue, 6(2):40-53, 2008.
    • (2008) Queue , vol.6 , Issue.2 , pp. 40-53
    • Nickolls, J.1    Buck, I.2    Garland, M.3    Skadron, K.4
  • 40
    • 35948991669 scopus 로고    scopus 로고
    • NVIDIA Corporation Version 3.0
    • NVIDIA Corporation. NVIDIA CUDA Programming Guide, Version 3.0, 2009.
    • (2009) NVIDIA CUDA Programming Guide
  • 43
    • 79959466764 scopus 로고    scopus 로고
    • Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
    • S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W.-m. W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In ACM PPoPP, 2008.
    • (2008) ACM PPoPP
    • Ryoo, S.1    Rodrigues, C.I.2    Baghsorkhi, S.S.3    Stone, S.S.4    Kirk, D.B.5    Hwu, W.-M.W.6
  • 46
    • 8844226032 scopus 로고    scopus 로고
    • Np-click: A productive software development approach for network processors
    • N. Shah, W. Plishker, K. Ravindran, and K. Keutzer. Np-click: A productive software development approach for network processors. IEEE Micro, 24(5):45-54, 2004.
    • (2004) IEEE Micro , vol.24 , Issue.5 , pp. 45-54
    • Shah, N.1    Plishker, W.2    Ravindran, K.3    Keutzer, K.4
  • 47
    • 70349653470 scopus 로고    scopus 로고
    • Nuclei: GPU-accelerated many-core network coding
    • H. Shojania, B. Li, and X. Wang. Nuclei: GPU-accelerated many-core network coding. In IEEE INFOCOM, 2009.
    • (2009) IEEE INFOCOM
    • Shojania, H.1    Li, B.2    Wang, X.3
  • 50
    • 0028446907 scopus 로고
    • False Sharing and Spatial Locality in Multiprocessor Caches
    • J. Torrellas, H. S. Lam, and J. L. Hennessy. False Sharing and Spatial Locality in Multiprocessor Caches. IEEE Trans. on Computers, 43(6):651-663, 1994.
    • (1994) IEEE Trans. on Computers , vol.43 , Issue.6 , pp. 651-663
    • Torrellas, J.1    Lam, H.S.2    Hennessy, J.L.3
  • 54
    • 55849110386 scopus 로고    scopus 로고
    • Performance Scalability of a Multi-Core Web Server
    • B. Veal and A. Foong. Performance Scalability of a Multi-Core Web Server. In ANCS, 2007.
    • (2007) ANCS
    • Veal, B.1    Foong, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.