Skip to content

A unified framework for privacy-friendly data science collaboration

Notifications You must be signed in to change notification settings

aadityaverma/bastionlab

 
 

Repository files navigation

BastionLab

Mithril Security – BastionLab


👋 Welcome to BastionLab!

Where data owners and data scientists can securely collaborate without exposing data - opening the way to projects that were too risky to consider.

What is BastionLab?

BastionLab is a data science framework to perform remote and secure Exploratory Data Analysis.

  • Data scientists can remotely run queries on data frames without seeing the original data or intermediary results.
  • Data owners can let external or internal data scientists explore and extract values from their datasets, according to the strict privacy policies they defined.

This wasn’t possible until now for highly regulated fields like health, finance, or advertising. When collaborating remotely, data owners had to open their whole dataset, often through a Jupyter notebook. This was dangerous because too many operations were allowed and the data scientist had numerous ways to extract information from the remote infrastructure (print the whole database, save the dataset in the weights, etc).

BastionLab solves this problem by ensuring that only privacy-friendly operations are allowed on the data and anonymized outputs are shown to the data scientist.

BastionLab is an open-source project. Our solution is coded in Rust and uses Polars, a pandas-like library for data exploration.

Disclaimer: BastionAI, our fortified learning framework using TEEs will be incorporated in the broader offers of BastionLab, a holistic secure data science toolkit. While waiting for the merge, you can still use BastionAI under the folder bastionai.

Getting Started

Getting Help

Who made BastionLab?

Mithril Security is a startup aiming to make privacy-friendly data science easy so data scientists and data owners can collaborate without friction. Our solutions apply Privacy Enhancing Technologies and security best practices, like Remote Data Science and Confidential Computing.

About

A unified framework for privacy-friendly data science collaboration

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 38.0%
  • Rust 32.1%
  • Python 23.1%
  • Shell 3.3%
  • Mako 1.5%
  • Dockerfile 1.0%
  • Other 1.0%