Whistling Duck is a Python library for data quality exploration and validation, designed for small to medium-sized datasets. It is built on DuckDB.
- Handles datasets larger than available RAM efficiently with DuckDB.
- All quality checks are implemented in DuckDB SQL for speed.
- Minimal dependencies: requires DuckDB & Pydantic.
- No additional infrastructure needed; can be installed like any standard Python library.
pip install whistlingduck
The goal is to develop a Python library for data quality exploration and validation of small to medium-sized datasets. The library uses DuckDB and Pydantic and has no other external dependencies. Contributions are welcome from anyone interested in enhancing the library.
Email: whistlingducklib@gmail.com