Reading Fast Packet Processing A Survey
- COMST 2018
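Main content
- This is a survey of fast packet forwarding. It first covers the basics and background of packet forwarding, then presents the mainstream approaches, compares them in detail, and finally introduces the latest approaches and future research directions.
- Packet processing consists of a fast path and a slow path: the former handles packet forwarding and header processing, the latter mainly handles management, error handling, and maintenance.
- There are three main approaches: pure software, pure hardware, and a combination of software and hardware.
- Pure-software approaches optimize performance at the software level (zero-copy, batching, parallelism, user/kernel space); their performance shortfall comes mainly from the architecture of the network protocol stack.
- Pure-hardware approaches offer high performance but poor flexibility.
- Combined software/hardware approaches offload part of the functionality to high-performance hardware and so get the advantages of both.
- In the combined approaches the processing hardware is mainly of three kinds: CPU, GPU, and FPGA. Part of the functionality is offloaded to the hardware, whose high performance is used to process packets.
- CPUs maximize instruction-level parallelism, GPUs maximize thread-level parallelism, and FPGAs offer massive parallelism but are hard to program because they require hardware description languages (HDLs).
- The latest approaches include FD.io, VPP, ODP, OFP, P4, OpenState, and BESS.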
Introduction
High demand
- increased performance of network interfaces:
- high-speed router
- multi-terabit IP router
- datacenter switch.
Main methods
- Software
- Hardware
- Combination of the two
General purpose device
Problem
- The architectural design of network stacks introduces high overhead
Solution
- Many techniques (zero-copy, batching, parallelism, user-space processing)
Programmable aspect
- OpenFlow: take high-end specialized equipment and make it programmable
- Use general-purpose computers in a well-known environment
- "Semi-specialized" solution: use network processors to offload packet processing; the rest is done on conventional processors
- Reconfigurable hardware
Based on three kinds of hardware
- CPU: lower latency, and programs last longer
- GPU: extreme thread-level parallelism
- FPGA: energy efficient but not easy to program
Based on virtualized environments
- Wired and wireless
Background
Terminology
- Time-critical and non-time critical
- Fast path and slow path
Time-critical router
- Corresponding to fast path
- Performed on the majority of packets that pass through the router
- Have higher priority.
- Bypass the router processing card.
- Include forwarding and header processing.
Forwarding
- Bypass, ASICs
- Forwarding table, classification, queueing
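The forwarding-table step can be illustrated with a longest-prefix-match lookup. This is only a minimal sketch: the `ForwardingTable` class, the routes, and the port names are made up for illustration, and a linear scan stands in for the tries or TCAMs a real fast path would use.

```python
import ipaddress


class ForwardingTable:
    """Toy forwarding table: longest-prefix match over a sorted route list."""

    def __init__(self):
        self._routes = []  # list of (network, next_hop)

    def add_route(self, prefix, next_hop):
        self._routes.append((ipaddress.ip_network(prefix), next_hop))
        # Keep the longest prefixes first so the first match is the most specific.
        self._routes.sort(key=lambda route: route[0].prefixlen, reverse=True)

    def lookup(self, dst):
        addr = ipaddress.ip_address(dst)
        for network, next_hop in self._routes:
            if addr in network:
                return next_hop
        return None  # no matching route


# Hypothetical routes and output ports.
table = ForwardingTable()
table.add_route("10.0.0.0/8", "port1")
table.add_route("10.1.0.0/16", "port2")
table.add_route("0.0.0.0/0", "default")

print(table.lookup("10.1.2.3"))  # matched by the /16 route
```

The more specific `/16` route wins over the `/8` route for addresses that fall in both, which is the behaviour a forwarding table has to guarantee.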
Header processing
- version -> length -> checksum -> TTL
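The version, length, checksum, and TTL checks can be sketched for an IPv4 header as follows. This is an illustrative sketch, not the survey's code: the function names are hypothetical, the checksum is the usual one's-complement sum over 16-bit words, and returning `None` simply means "do not forward on the fast path".

```python
import struct


def checksum16(data: bytes) -> int:
    """One's-complement checksum over 16-bit big-endian words."""
    if len(data) % 2:
        data += b"\x00"
    total = sum(struct.unpack(f"!{len(data) // 2}H", data))
    while total >> 16:
        total = (total & 0xFFFF) + (total >> 16)
    return ~total & 0xFFFF


def process_ipv4_header(packet: bytes):
    """Return the packet with TTL decremented, or None if a check fails."""
    if len(packet) < 20:
        return None
    version, ihl = packet[0] >> 4, packet[0] & 0x0F
    if version != 4 or ihl < 5:            # version check
        return None
    header_len = ihl * 4
    total_len = struct.unpack("!H", packet[2:4])[0]
    if total_len < header_len or total_len > len(packet):  # length check
        return None
    if checksum16(packet[:header_len]) != 0:  # a valid header sums to zero
        return None
    ttl = packet[8]
    if ttl <= 1:                           # TTL would expire at this hop
        return None
    header = bytearray(packet[:header_len])
    header[8] = ttl - 1
    header[10:12] = b"\x00\x00"            # clear, then recompute the checksum
    header[10:12] = struct.pack("!H", checksum16(bytes(header)))
    return bytes(header) + packet[header_len:]


def build_demo_header(ttl: int) -> bytes:
    """Build a 20-byte header with made-up addresses, for demonstration only."""
    header = bytearray(struct.pack(
        "!BBHHHBBH4s4s", 0x45, 0, 20, 0, 0, ttl, 17, 0,
        bytes([10, 0, 0, 1]), bytes([10, 0, 0, 2])))
    header[10:12] = struct.pack("!H", checksum16(bytes(header)))
    return bytes(header)


forwarded = process_ipv4_header(build_demo_header(64))
```

Because the TTL changes at every hop, the checksum has to be updated as well; the sketch recomputes it from scratch, which is the simplest correct option.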
Non-time critical router
- Corresponding to slow path
- Mainly used for management, error handling and maintenance.
- ICMP, SNMP, ARP, packet fragmentation (IPv4/IPv6)
Background on Packet Processing
- Target:maximizing the utilization of available resource and providing the fastest possible service.
- Supporting high throughput implies a sufficiently low packet processing latency.
Steps of Packet Processing
- NIC -> (by DMA) memory -> CPU -> ring buffer -> NIC register -> (by DMA) NIC
- Multiqueue NICs are designed for multi-core systems.
- RSS (Receive-Side Scaling) distributes incoming packets among different cores.
- Memory-mapping techniques reduce the cost of moving packets from kernel space to user space, which otherwise goes through system calls.
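The multiqueue and RSS points above can be sketched as a toy model with one bounded receive ring per core and a flow hash that picks the ring. Everything here is an assumption made for illustration: the class and function names are hypothetical, and CRC32 is only a stand-in for the hash a real NIC computes in hardware.

```python
import zlib
from collections import deque


def rss_queue(flow, num_queues):
    """Map a flow tuple to a queue index (CRC32 as a stand-in hash)."""
    return zlib.crc32(repr(flow).encode()) % num_queues


class MultiQueueNic:
    """Toy multiqueue NIC: one bounded receive ring per core."""

    def __init__(self, num_queues, ring_size=256):
        self.ring_size = ring_size
        self.rings = [deque() for _ in range(num_queues)]
        self.dropped = 0

    def receive(self, flow, payload):
        """Place a packet on the ring chosen by the flow hash."""
        index = rss_queue(flow, len(self.rings))
        ring = self.rings[index]
        if len(ring) >= self.ring_size:
            self.dropped += 1  # ring full: the packet is dropped
            return None
        ring.append(payload)
        return index

    def poll(self, queue_index, budget):
        """A core drains up to `budget` packets from its own ring."""
        ring = self.rings[queue_index]
        batch = []
        while ring and len(batch) < budget:
            batch.append(ring.popleft())
        return batch


nic = MultiQueueNic(num_queues=4, ring_size=2)
flow = ("10.0.0.1", 1234, "10.0.0.2", 80)  # made-up flow tuple
first = nic.receive(flow, b"a")
second = nic.receive(flow, b"b")
```

All packets of one flow land on the same ring, so a single core sees them in order, while different flows spread across cores.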
CPU/GPU/FPGA
- CPUs maximize instruction-level parallelism.
- GPUs maximize thread-level parallelism.
- FPGAs have a massive amount of parallelism built in.
Software Implementations
Click-based solutions
- Problem: inflexible closed designs, difficult to extend, limited routing configuration.
- Solution: building blocks, or fine-grained components, which are called elements.
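The idea of composing a router from small elements can be sketched as below. Real Click is written in C++ and has its own configuration language; the Python classes here (`Element`, `CheckLength`, `Counter`, `Sink`) are hypothetical and only mimic the push-style chaining.

```python
class Element:
    """Base element: process a packet and push the result downstream."""

    def __init__(self):
        self.next = None

    def connect(self, other):
        self.next = other
        return other  # returning the target allows chained connect() calls

    def push(self, packet):
        out = self.process(packet)
        if out is not None and self.next is not None:
            self.next.push(out)

    def process(self, packet):
        return packet


class CheckLength(Element):
    """Drop packets shorter than min_len."""

    def __init__(self, min_len):
        super().__init__()
        self.min_len = min_len

    def process(self, packet):
        return packet if len(packet) >= self.min_len else None


class Counter(Element):
    """Count the packets that pass through."""

    def __init__(self):
        super().__init__()
        self.count = 0

    def process(self, packet):
        self.count += 1
        return packet


class Sink(Element):
    """Collect packets at the end of the pipeline."""

    def __init__(self):
        super().__init__()
        self.packets = []

    def process(self, packet):
        self.packets.append(packet)
        return None


source = CheckLength(min_len=4)
counter = Counter()
sink = Sink()
source.connect(counter).connect(sink)
for pkt in [b"abcd", b"ab", b"abcdef"]:
    source.push(pkt)
```

Adding a new function means writing one more element and connecting it, which is how this design addresses the extensibility problem of closed router designs.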
RouteBricks
- RouteBricks is a software router architecture which can run on multiple cores of a single server in order to leverage the performance of a software router.
FastClick
- FastClick is a solution which integrates both DPDK and Netmap in Click.
Netmap
- Netmap is a framework which allows commodity hardware (without modifying applications or adding custom hardware) to handle the millions of packets per second that go over 1-10 Gbit/s links.
- It builds a fast path between the NIC and the applications.
- Regular mode is the standard mode, in which the NIC exchanges packets with the host stack.
- In netmap mode, NIC rings are disconnected from the host network stack and packets are transferred through the netmap API.
NetSlices
- NetSlices is an operating system abstraction which processes packets in user space and enables performance to increase linearly with the number of cores.
PF_RING
- ~ is a high-speed packet capture library that allows a commodity PC to perform efficient network measurement, supporting both packet and active traffic analysis and manipulation.
DPDK
- ~ is a set of data plane libraries and drivers which are used for fast packet processing.
GPU-based solutions
Snap
- Snap is a packet processing system based on Click which offloads some of the computation load to GPUs.
PacketShader
- ~ is a software router framework which uses Graphics Processing Units (GPUs).
APUNet
- ~ is an APU-accelerated network packet processing system that exploits the power of integrated GPUs for parallel packet processing while using a CPU for scalable packet I/O.
GASPP
- ~ is a programmable network traffic processing framework built for modern GPUs.
FPGA-based solutions
ClickNP
- ~ is an FPGA-accelerated platform for high-performance and highly flexible network function (NF) processing on commodity servers.
GRIP
- The authors point out that transmitting or receiving data at gigabit speeds already fully monopolizes the CPU; therefore, no additional processing of the data is possible without degrading throughput.
SwitchBlade
- ~ is a platform for rapid prototyping and deployment of custom protocols on programmable hardware.
Chimpp
- ~ is a development environment for reconfigurable networking hardware that is based on the Click modular router and targets the NetFPGA platform.
Comparison




Integration Possibilities in Virtualized Environments
- One solution for packet processing in virtualized environments: process packets in groups (batches) instead of individually.
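Why grouping helps can be shown with a small cost model: each call into the virtual port pays a fixed overhead (think of a system call or VM exit), and batching pays it once per group. The `VirtualPort` class and both helper functions are hypothetical names used only for this sketch.

```python
class VirtualPort:
    """Counts how often the expensive per-call overhead is paid."""

    def __init__(self):
        self.crossings = 0
        self.delivered = []

    def send(self, packets):
        self.crossings += 1  # one fixed-cost crossing per call
        self.delivered.extend(packets)


def send_individually(port, packets):
    """One crossing per packet."""
    for packet in packets:
        port.send([packet])


def send_batched(port, packets, batch_size):
    """One crossing per batch of up to batch_size packets."""
    for start in range(0, len(packets), batch_size):
        port.send(packets[start:start + batch_size])


packets = list(range(100))
single = VirtualPort()
batched = VirtualPort()
send_individually(single, packets)
send_batched(batched, packets, batch_size=32)
print(single.crossings, batched.crossings)
```

Both ports deliver the same packets in the same order; only the number of crossings differs, which is where the saving comes from. The trade-off is that a packet may wait for its batch to fill, so batch size also affects latency.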
VALE
- ~ is a system based on the netmap API which implements a high-performance Virtual Local Ethernet that can be used to interconnect virtual machines by providing access ports to multiple clients.
ptnetmap
- ~ is a virtual passthrough solution based on the netmap framework, which is used as the "device" model exported to VMs.
NetVM
- ~ is a high-speed network packet processing platform built on top of KVM and the DPDK library.
OVS+DPDK
- An open-source virtual switch with high performance.
- It consumes too many CPU resources.
Summary of the constraints

Latest approaches and Future Directions
- FD.io
- VPP
- ODP: a set of open-source, cross-platform APIs for the networking software-defined data plane.
- OFP: a project that provides an open-source implementation of a high-performance TCP/IP stack.
- P4: a high-level language used to program protocol-independent packet processors.
- OpenState: an approach which allows stateful control functionality to be performed directly inside a switch, without the need for intervention by the external controller.
- BESS: concerned with building a programmable platform called SoftNIC that augments hardware NICs with software.
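To make the OpenState item concrete, here is a sketch of per-flow state kept inside the switch, using a port-knocking style example chosen for illustration. It does not use the OpenState API; the class name, the knock sequence, and the port numbers are all made up, and the point is only that every decision is taken from local state, with no controller involved.

```python
DEFAULT = 0
KNOCK_SEQUENCE = [5123, 6234, 7345]  # arbitrary ports, assumed for the sketch
OPEN = len(KNOCK_SEQUENCE)           # state reached after the full sequence
PROTECTED_PORT = 22


class StatefulSwitch:
    """Toy switch that keeps a state per source and updates it per packet."""

    def __init__(self):
        self.state_table = {}  # source address -> current state

    def handle(self, src, dst_port):
        state = self.state_table.get(src, DEFAULT)
        if state == OPEN:
            if dst_port == PROTECTED_PORT:
                return "forward"
            self.state_table[src] = DEFAULT  # anything else closes the port
            return "drop"
        if dst_port == KNOCK_SEQUENCE[state]:
            self.state_table[src] = state + 1  # correct knock: advance
        else:
            self.state_table[src] = DEFAULT    # wrong knock: start over
        return "drop"


switch = StatefulSwitch()
results = [switch.handle("h1", port) for port in KNOCK_SEQUENCE]
results.append(switch.handle("h1", PROTECTED_PORT))
```

A stateless match-action table would need the controller to install a new rule after each knock; here the state transition happens in the data path itself.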