Release 0.2.0
Release 0.2.0
This release was automatically created from tag v0.2.0.
Installation
pip install baselinr==0.2.0What's Changed
Major feature: pattern-based table selection
- Pattern-based table selection: use wildcards (
user_*,*_staging) or regex patterns to profile multiple tables - Schema-level profiling: profile all tables in a schema with
select_schema: true - Database-level profiling: profile all tables across all schemas with
select_all_schemas: true - Tag-based selection: filter tables using database metadata tags (Snowflake object tags, BigQuery labels, PostgreSQL comments, etc.) or dbt tags from manifest.json
- Advanced filtering: exclude patterns, filter by table types, row counts, required columns, and modification dates
- Precedence override system: use specific table configurations to override broader pattern matches with priority-based resolution
- Multi-database tag support: tag-based selection across PostgreSQL, Snowflake, SQLite, MySQL, BigQuery, and Redshift
- Performance safeguards: configurable limits for large schemas with caching and discovery limits
Developer SDK improvements
- Initial developer SDK released: programmatic access to Baselinr functionality via Python SDK
- Enhanced programmatic interface for integration into custom workflows
Documentation and website updates
- Migrated documentation to Docusaurus: improved documentation site and navigation
- Updated website structure for better discoverability
Bug fixes
- Fixed pattern expansion bug preventing proper table discovery in profile commands
- Fixed Dagster integration to correctly handle pattern-based configurations
- Improved error handling and validation throughout the codebase
Other changes
- License update: Updated to Apache 2.0 with commercial distribution restrictions
- CI/CD improvements: Fixed Windows emoji handling and Python 3.12 compatibility issues
See the full changelog for details.