Great Expectations vs Pandera #598

Answered by cosmicBboy
Veganveins asked this question in Q&A

Hi @Veganveins thanks for your question!

One big caveat: I haven't used GE (Great Expectations) very extensively, but I'll do my best to summarize the similarities and differences.

Overlap

The main overlap is that both libraries aim to solve the same problem: ensuring data quality. That said, I think the approach pandera takes is closer in spirit to pydantic or dataclasses, in that it's a lightweight package that focuses on one thing: parsing and validating in-memory dataframes. Think of it as run-time enforced type annotations for your dataframes.

Differences

GE provides data validation, profiling, and documentation, and is closer to a declarative tool that you'd integrate with your v…

Replies: 6 comments

Answer selected by Veganveins
Category: Q&A
Labels: question
Participants: 4
This discussion was converted from issue #590 on September 06, 2021 15:58.