Skip to content

The repository for the 2024 IEEE Cloud Submission of OS4C

License

Notifications You must be signed in to change notification settings

UIUC-ChenLab/OS4C

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OS4C: An Open-Source SR-IOV System for SmartNIC-based Cloud Platforms

Introduction

This is the repository associated with our 2024 IEEE Cloud paper. OS4C is the first open-source 100 Gbps NIC with support for virtualization (SR-IOV). We took Corundum (https://github.com/corundum/corundum/) and modified it to support virtualization. This entailed adding a new scheduler, SR-IOV support, and updating the testing/simulation tools. If you leverage this work, please cite both our paper and the original Corundum paper.

Documentation

For detailed documentation on our modifications to Corundum, please see our IEEE Cloud 2024 paper.

For documentation on Corundum's original design, see https://docs.corundum.io/

OS4C Key Features

  • OS4C Supports SR-IOV with up to 252 VFs. We are limited by the PCIe IP.
  • OS4C Supports a two-stage weighted round-robin transmit scheduler that integrates with the virtualization logic
  • We currently offically support the Alveo U280 - but other Alveo boards should work with minimal modifications (they are as of yet untried)

Usage

Instructions are for Linux machines. We tested with Ubuntu 20.04. Other Ubuntu versions may not work - if they do not, please reach out and we will try to fix it.

Source your Vivado tools (We tried 2022.1, 2022.2, and 2021.1 - they all worked).

Navigate to fpga/mqnic/Alveo/fpga_100g/fpga_AU280 and type make to build. You will need the 100G Xilinx license.

You can change the synthesis parameters in the Makefile located in fpga/mqnic/Alveo/fpga_100g/fpga_AU28O/config.tcl

Note: you need to make sure there are more of each type of resource than there are functions - otherwise synthesis will fail. Also, we assume everything is a power of 2 even though we have 1 PF and 252 VFs (we round up to 256). So for FUNC_ID_WIDTH we would put 8.

Testbench

We need the same tools as Corundum to run the testbench. See below.

"Running the included testbenches requires cocotb, cocotbext-axi, cocotbext-eth, cocotbext-pcie, scapy, and Icarus Verilog. The testbenches can be run with pytest directly (requires cocotb-test), pytest via tox, or via cocotb makefiles." - from the original Corundum repository.

Many of the testbenches for modules are from the original Corundum repository.

Once you install the dependencies (see above) you can run them using pytest. Navigate to the corresponding directory and run pytest. We created a new testbench for the resource translator module. Regarding full-system tests, we have currently only released a slightly modified version of the single-tenant testbench from Corundum. The multi-tenant testbench mentioned in the publication required us to make modifications to the above cocotb-pcie library. We are working on seeing if those modifications can be added to the repository as a new release. If not, we will release patches here. Expect more information and/or the code in the next few weeks (target mid August 2024).

Example: pytest -n auto --log-file=log.txt

The above will use pytest to launch a series of tests. The -n auto parameter tells pytest to automatically determine how many cores to use (more cores should enable faster completion of the set of tests). The --log-file=log.txt parameter lets you print lots of useful information to a log file.

Notes and Future Work

  • We plan to discuss with the existing Corundum developers possible integration of our work into their projcect.
  • We are working to improve documentation over the next few weeks as we fully release the simulation/testbench tools.
  • If you encounter any bugs, please let us know and we will work hard to fix them. Our goal is to continuously support this project and extend it with further enhancements/features.

The OS4C Publication

  • S. Smith, Y. Ma, M. Lanz, B. Dai, M. Ohmacht, B. Sukhwani, H. Franke, V. Kindratenko, D. Chen, OS4C: An Open-Source SR-IOV System for SmartNIC-based Cloud Platforms, in IEEE Cloud'2024

Corundum Publications

  • A. Forencich, A. C. Snoeren, G. Porter, G. Papen, Corundum: An Open-Source 100-Gbps NIC, in FCCM'20. (FCCM Paper, FCCM Presentation)

  • J. A. Forencich, System-Level Considerations for Optical Switching in Data Center Networks. (Thesis)

Dependencies

We build upon the following repositories (Corundum + the libraries Corundum relies on)