The Open-source Framework for Validating DataFrame-like Objects

📊 🔎 ✅

Data validation for scientists, engineers, and analysts seeking correctness.

Pandera is a Union.ai open source project that provides a flexible and expressive API for performing data validation on dataframe-like objects. The goal of Pandera is to make data processing pipelines more readable and robust with statistically typed dataframes.

Install

Pandera supports multiple dataframe libraries, including pandas, polars, pyspark, and more. To validate pandas DataFrames, install Pandera with the pandas extra:

With pip:

pip install 'pandera[pandas]'

With uv:

uv pip install 'pandera[pandas]'

With conda:

conda install -c conda-forge pandera-pandas

Get started

First, create a dataframe:

import pandas as pd
import pandera.pandas as pa

# data to validate
df = pd.DataFrame({
    "column1": [1, 2, 3],
    "column2": [1.1, 1.2, 1.3],
    "column3": ["a", "b", "c"],
})

Validate the data using the object-based API:

# define a schema
schema = pa.DataFrameSchema({
    "column1": pa.Column(int, pa.Check.ge(0)),
    "column2": pa.Column(float, pa.Check.lt(10)),
    "column3": pa.Column(
        str,
        [
            pa.Check.isin([*"abc"]),
            pa.Check(lambda series: series.str.len() == 1),
        ]
    ),
})

print(schema.validate(df))
#    column1  column2 column3
# 0        1      1.1       a
# 1        2      1.2       b
# 2        3      1.3       c

Or validate the data using the class-based API:

# define a schema
class Schema(pa.DataFrameModel):
    column1: int = pa.Field(ge=0)
    column2: float = pa.Field(lt=10)
    column3: str = pa.Field(isin=[*"abc"])

    @pa.check("column3")
    def custom_check(cls, series: pd.Series) -> pd.Series:
        return series.str.len() == 1

print(Schema.validate(df))
#    column1  column2 column3
# 0        1      1.1       a
# 1        2      1.2       b
# 2        3      1.3       c

Next steps

See the official documentation to learn more.

Name	Name	Last commit message	Last commit date
Latest commit cosmicBboy bugfix: custom parser runs before getting column_info (#1978 ) Apr 22, 2025 5a39bc1 · Apr 22, 2025 History 847 Commits
.github	.github	Pandas dependency deprecation future warning, add `pandera[pandas]` e…	Apr 8, 2025
asv_bench	asv_bench	add import and future warning for top-level pandera module (#1969 )	Apr 22, 2025
docs	docs	add import and future warning for top-level pandera module (#1969 )	Apr 22, 2025
pandera	pandera	bugfix: custom parser runs before getting column_info (#1978 )	Apr 22, 2025
scripts	scripts	Pandas dependency deprecation future warning, add `pandera[pandas]` e…	Apr 8, 2025
tests	tests	bugfix: custom parser runs before getting column_info (#1978 )	Apr 22, 2025
.coveragerc	.coveragerc	update mypy plugin and tests (#1007 )	Nov 14, 2022
.gitignore	.gitignore	Use uv in noxfile and ci-tests, migrate to pyproject.toml (#1916 )	Feb 28, 2025
.pre-commit-config.yaml	.pre-commit-config.yaml	update pylint version (#1945 )	Mar 20, 2025
.pylintrc	.pylintrc	update pylint version (#1945 )	Mar 20, 2025
.readthedocs.yml	.readthedocs.yml	Pandas dependency deprecation future warning, add `pandera[pandas]` e…	Apr 8, 2025
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md	Add code of conduct	Mar 4, 2020
LICENSE.txt	LICENSE.txt	add license to pypi distribution	Nov 11, 2019
Makefile	Makefile	Use uv in noxfile and ci-tests, migrate to pyproject.toml (#1916 )	Feb 28, 2025
README.md	README.md	Update imports from pandera to pandera.pandas (#1965 )	Apr 15, 2025
environment.yml	environment.yml	add import and future warning for top-level pandera module (#1969 )	Apr 22, 2025
mypy.ini	mypy.ini	enh: enable mypy in more polars places (#1976 ) (#1977 )	Apr 22, 2025
new_example.py	new_example.py	Update imports from pandera to pandera.pandas (#1965 )	Apr 15, 2025
noxfile.py	noxfile.py	Pandas dependency deprecation future warning, add `pandera[pandas]` e…	Apr 8, 2025
pyproject.toml	pyproject.toml	add import and future warning for top-level pandera module (#1969 )	Apr 22, 2025
requirements.txt	requirements.txt	add import and future warning for top-level pandera module (#1969 )	Apr 22, 2025
setup.cfg	setup.cfg	fix mypy extra unit tests, pin pandas-stubs for dev env (#1056 )	Dec 15, 2022
setup.py	setup.py	Use uv in noxfile and ci-tests, migrate to pyproject.toml (#1916 )	Feb 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Open-source Framework for Validating DataFrame-like Objects

Install

Get started

Next steps

About

Releases 102

Sponsor this project

Packages

Used by 2.4k

Contributors 161

Languages

License

unionai-oss/pandera

Folders and files

Latest commit

History

Repository files navigation

The Open-source Framework for Validating DataFrame-like Objects

Install

Get started

Next steps

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 102

Sponsor this project

Packages 0

Used by 2.4k

Contributors 161

Languages

Packages