This repository contains public releases of Huawei Cloud production serverless FaaS traces made available to the research and academic community by Huawei's Systems Infrastructure Research (SIR) lab in Edinburgh, UK.
These public traces are hashed versions of our raw production logs of tens of billions of user requests over multiple data centers and many months. We release them to enable researchers to conduct realistic simulations and train machine learning models to improve scheduling and resource allocation in cloud platforms.
The data is analyzed in two papers:
- EuroSys 2025: Serverless Cold Starts and Where to Find Them
- ACM SoCC 2023: How Does It Function? Characterizing Long-term Trends in Production Serverless Workloads
The datasets used in our papers can be downloaded at the links below.
- Huawei Public cold start traces 2025 contains 85 billion raw user requests and 11.9 million cold start events in 5 regions. There are 19 metrics per function over 31 days, as well as aggregated time series formats for convenience.
- Huawei Private and Public traces 2023 contains 1.4 trillion function requests in time series format. There are 8 metrics per function over 235 days at per-minute and per-second granularity for two serverless platforms.
In some cases, you may not need every file in a zip archive. With 7zip, you can open the archive and drag and drop only the files or directories you want, without extracting the whole thing.
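If you prefer to script the selective extraction rather than use a GUI, the same idea can be sketched with Python's standard `zipfile` module. This is a minimal sketch, not part of the repository: the helper name `extract_matching` and the glob pattern are illustrative assumptions, and the actual member names depend on the archive you downloaded.

```python
import fnmatch
import zipfile
from pathlib import Path


def extract_matching(archive_path, pattern, dest_dir):
    """Extract only the archive members whose names match a glob pattern.

    Hypothetical helper for illustration; returns the extracted member names.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    extracted = []
    with zipfile.ZipFile(archive_path) as archive:
        for name in archive.namelist():
            # Skip directory entries and anything that does not match.
            if name.endswith("/") or not fnmatch.fnmatch(name, pattern):
                continue
            archive.extract(name, path=dest)
            extracted.append(name)
    return extracted
```

For example, `extract_matching("traces.zip", "*.csv", "data")` would unpack only the CSV members of a hypothetical `traces.zip` into `data/`. Calling `zipfile.ZipFile(...).namelist()` first is a cheap way to see what an archive contains before choosing a pattern.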
To get started using the datasets, look at our notebooks for tips on how to load files and visualize the data.
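As a rough illustration of working with the time series files outside the notebooks, the sketch below sums a per-minute count column into hourly buckets using only the standard library. The column names `minute` and `requests` and the function `hourly_totals` are assumptions made for this example; check the notebooks and the file headers for the real layout before adapting it.

```python
import csv
from collections import defaultdict


def hourly_totals(csv_path, time_col="minute", value_col="requests"):
    """Sum a per-minute count column into per-hour buckets.

    Assumes (hypothetically) that `time_col` holds an integer minute index
    and `value_col` holds a numeric count; empty cells are skipped.
    """
    totals = defaultdict(float)
    with open(csv_path, newline="") as handle:
        for row in csv.DictReader(handle):
            if not row[value_col]:
                continue  # treat empty cells as missing data
            hour = int(row[time_col]) // 60  # minute index -> hour index
            totals[hour] += float(row[value_col])
    return dict(sorted(totals.items()))
```

Coarser buckets like this are often enough for a first look at daily patterns, and they keep memory use low because the file is streamed row by row.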
To run our notebooks with the required packages installed, you can install our conda environment as follows:
    conda env create -f environment.yml
    conda activate trace-analysis
We welcome feedback, collaboration, or questions. Feel free to open an Issue.
These traces and associated research result from a collaboration between the Systems Infrastructure Research (SIR) lab in Edinburgh (part of Huawei Research UK) and Huawei's YuanRong serverless cloud platform team.