🐍 Easy data processing with Azure functions and Python


This repository contains the tutorial for the Microsoft Azure sponsored workshop, as well as the solutions to each of its sections.

✨ You can access the tutorial (self-paced) at

✨ The solutions are located in this repository in the solutions directory.

NEW 📹 Tutorial video! Go to the PyCon US YouTube channel:

NEW ✨ Accompanying slides are on SpeakerDeck

Table of Contents

📝 Description

Serverless computing (also known as function as a service, FaaS) is a design pattern in which applications are hosted by a third-party service (e.g. Azure), eliminating the need for the developer to manage server software and hardware.

Serverless can be an excellent alternative for Pythonistas interested in data processing, as it lets them focus on their code rather than on cloud infrastructure. This workshop introduces attendees to Azure Functions for data processing scenarios, including data acquisition, cleaning, transformation, and storage for subsequent use.

After this tutorial, attendees will have practical experience with Azure Functions for data processing scenarios. They will also leave the workshop with a basic data processing function that they can modify or extend to suit their own requirements.

🔖 Outline

  1. Introduction to serverless and Azure Functions
  2. Creating your first Azure function:
    • Create a simple scheduled function using the VS Code extension
    • Familiarise yourself with Functions projects and their structure
    • Running and debugging locally
  3. Functions deployment
    • Deploy your function to Azure
    • Familiarise yourself with the Azure portal
  4. Data processing use case
    • Updating your function to collect data
    • Data cleaning, aggregation and storage
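
As a rough illustration of the "data cleaning, aggregation and storage" step in the outline, the sketch below shows the kind of plain Python helper a scheduled function might call once it has collected some records. It is not taken from the workshop solutions: the record fields (`category`, `value`) and the function names are hypothetical, and it uses only the standard library so it can be run outside Azure.

```python
from collections import defaultdict
from statistics import mean


def clean_records(raw_records):
    """Normalise category names and drop records that cannot be used.

    The "category" and "value" keys are hypothetical field names,
    chosen only for this illustration.
    """
    cleaned = []
    for record in raw_records:
        category = str(record.get("category", "")).strip().lower()
        try:
            value = float(record.get("value"))
        except (TypeError, ValueError):
            continue  # skip rows whose value is missing or not numeric
        if not category:
            continue  # skip rows without a category
        cleaned.append({"category": category, "value": value})
    return cleaned


def aggregate_by_category(records):
    """Return the mean value per category for already-cleaned records."""
    groups = defaultdict(list)
    for record in records:
        groups[record["category"]].append(record["value"])
    return {category: mean(values) for category, values in groups.items()}


def process(raw_records):
    """Clean, then aggregate: the part a function's entry point would call."""
    return aggregate_by_category(clean_records(raw_records))
```

Keeping the processing logic in ordinary functions like these, separate from the trigger code, makes it possible to test it locally before deploying anything.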

💻 Pre-requisites

This workshop is aimed at folks interested in data processing, data engineering or data science. The goal is to provide a practical introduction to serverless for data processing scenarios.

We assume that you:

  • Have intermediate Python knowledge:

    • Have a good understanding of how to write and call functions
    • Have a good understanding of how Python modules and scripts work
  • Have some experience with data wrangling and/or data processing (extensive experience is not required, but you should have used, for example, libraries like pandas and requests for data wrangling and API access)

  • Are comfortable using the command line/terminal (you do not need to be an expert, but you should be able to navigate file systems and perform the necessary Git tasks)

👀 Solutions

The solutions can be found in the solutions directory in this repository.

ARM templates included:

🔑 License

The contents in this repo are licensed under the OSI license.

The icons used in the tutorial are from Smashicons from Flaticon.


⚡️🙇🏻‍♀️ Sponsored tutorial content for PyCon 2020






