Starred repositories
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
Code for the paper "Language Models are Unsupervised Multitask Learners"
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
Open source platform for the machine learning lifecycle
Production-Grade Container Scheduling and Management
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Curated list of resources about Apache Airflow
Automatic SQL injection and database takeover tool
Azure Data SQL Samples - Official Microsoft GitHub Repository containing code samples for SQL Server, Azure SQL, Azure Synapse, and Azure SQL Edge
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Apache Spark - A unified analytics engine for large-scale data processing
Flume - Ingestion, an Apache Flume distribution
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
A toolkit for developing and comparing reinforcement learning algorithms.
Open-source JavaScript charting library behind Plotly and Dash
The interactive graphing library for Python ✨
Extremely simple yet powerful header-only C++ plotting library built on the popular matplotlib