dada
is an add-on package for dbt
that helps
you, the heroic data analyst or engineer, rapidly explore and profile the data in
your data warehouse.
It can be used interactively for exploration, or on a schedule to track the evolution of data.
- PostgreSQL
Other databases may work as much of the code is standard SQL. Please contribute by testing dada on your database, and raising an issue if it doesn't work as expected.
- R's
summary()
andglimpse()
functions - Python's
pandas_profiling
module - data-mie's
dbt_profiler
package - csvkit's
csvstat
command - Adam Aspin's article Data Profiling with T-SQL