Releases: unionai-oss/pandera
Releases · unionai-oss/pandera
0.6.1: coercion and required column bugfixes
0.6.0: Data Synthesis Strategies, Schema Enhancements
🎉🎉🎉 Thanks to @jeffzi, @ktroutman, @m1so for your contributions! 🎉🎉🎉
Enhancements
- Improve memory efficiency of validation process (#360)
- Add column order validation (#352)
- Implement data synthesis strategies using hypothesis (#344)
- Add support for aliases in
SchemaModel
(#329) - Add support for optional name validation of single-index (#326)
- Move columns to multiindex: add
reset_index
,set_index
method toDataFrameSchema
(#319) - Add support for Python 3.9 (#307)
Bugfixes
- typing.DataFrame should expect annotation input (#318)
Deprecations
SchemaErrors.schema_errors
has been changed tofailure_cases
, and theschema_errors
attribute now contains a list of dicts containing schema errors and reason codes. This is a breaking change, but is a minor part of the API and is fairly straightforward to fix (#360).
Documentation Improvements
- Add required columns documentation for schema models (#362)
- Fix docs: schema examples (#347)
- Add documentation for dataframeschema transformations (#333)
- Fix deprecated SchemaErrorReport references in docs (#310)
- Fix SchemaModel dtype example (#309)
Repo Improvements
0.5.1: bugfix - add packaging dependency
pandera relied on the packaging package to get version information to determine pandas legacy status. This was an implicit sub-dependency of one of pandera's dependencies, which was apparently dropped and led to a bug: #335. This bugfix version explicitly adds packaging.
0.5.0: Class-based API for DataFrame Typing
Enhancements
- Implement class-based API for pydantic-style schema definitions 786b504. Big thanks to @jeffzi 🎉
- Add
inplace=False
argument toschema.validate
method to prevent mutation of original dataframe 586ebf3. - Make pandera optional extensions
[hypothesis]
,[io]
,[all]
available c4716a0. Thanks @amitripshtos and @jeffzi 🎉 - Add support for complex number data types 50e86e4 thanks @ferhah 🎉
- Add support for numpy scalar types a519db5
- Add
check_io
decorator for check inputs and outputs of a function 913cbd7 - Throw SchemaError with column name instead of ValueError for nulls in int series f7b03e3 thanks @TheCleric 🎉
Bugfixes
- Bugfix io.to_script and to_yaml: Ignoring serializing Checks with lambda functions da9c3a5 thanks @ferhah 🎉
Deprecations
Documentation Improvements
Repo Improvements
0.4.5: additional type support, SeriesSchema index support, built-in check Aliases, bugfixes
Enhancements
- improve failure case reporting more intuitive #232
- rename internal decorator for setting check statistics #235 thanks @Aditya1001001
- from_yaml supports all column properties #240 thanks @d33bs
- support for nullable integer string aliases and dtypes #244
- add
check_output
to theCheckResult
namedtuple #251 - built-in python scalar types are supported: int, float, str, bool #263
- Use Check.name in Check.repr #265 thanks @JacobHayes
- add comparison operator aliases to built-in checks #269
- add support for
SeriesSchema
index specification #270
Bugfixes
- io serialization can handle Index.name = None #248
- pandas_dtype can be correctly set in
Column
object #256 - fix
check_input
decorator when df passed in kwargs #257 thanks @vshulyak
Documentation Improvements
0.4.4: bugfixes in yaml serialization, error reporting, refactor internals
New Features
DataFrameSchema
providesrename_columns
method #226 @baskervilski- Failure case reporting is more intuitive as a tidy dataframe #232
Bugfixes
- yaml schema files support null index #225 @baskervilski
- add details to error reporting in type coercion #193 @c3-Anthony-Truchet
- fix column coerce warning in
check_input/output
decorators #228 - renamed internal decorator for check statistics #233 @Aditya1001001
0.4.3: bugfixes handle scalar False check_fn, yaml schema supports strict kwarg
bugfix: conda build failure, use version.py file
package uses version.py
file for single source of truth of package version.
add check ignore_na arg, bugfixes
New Feature
ignore_na
keyword argument toCheck
class 180072e
drops null columns within the check function before passing tocheck_fn
. The
SeriesSchemaBase.validate
method no longer does this.