Releases: dlt-hub/dlt
0.2.0a20
What's Changed
- groups all config and secret values used by all sources in a single Python module into a single section (https://github.com/dlt-hub/dlt/blob/devel/docs/technical/secrets_and_config.md#the-sources-section)
- allows providing alternative credentials for sources with multiple auth methods; see the sketch after this list (https://github.com/dlt-hub/dlt/blob/devel/docs/technical/secrets_and_config.md#working-with-alternatives-of-credentials-union-types)
- gives write access to schemas in the source-decorated function, allowing settings and tables to be generated or modified (https://github.com/dlt-hub/dlt/blob/devel/docs/technical/working_with_schemas.md#schema-is-modified-in-the-source-function-body)
- all generators are closed if there's an exception in the pipe during extraction
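A hedged sketch of the union-typed credentials feature. The credential classes are assumed to be the specs that recent dlt exposes; the module path and class names may differ in this alpha:

```python
from typing import Union

import dlt
from dlt.sources.credentials import GcpOAuthCredentials, GcpServiceAccountCredentials


@dlt.source
def google_data(
    credentials: Union[GcpServiceAccountCredentials, GcpOAuthCredentials] = dlt.secrets.value,
):
    # dlt tries each member of the union and injects the first credential
    # type that the provided secret value resolves into
    @dlt.resource
    def accounts():
        yield {"project_id": credentials.project_id}

    return accounts
```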
Full Changelog: 0.2.0a19...0.2.0a20
0.2.0a19
What's Changed
- update 2023-02-16-dlthub-mission.md by @matthauskrzykowski in #131
- adding blog + mission post stub by @TyDunn in #130
- json/normalizer performance improvements and bugfixes by @rudolfix in #133
- adds technical documentation for contributors by @rudolfix in #139
New Contributors
- @matthauskrzykowski made their first contribution in #131
`dlt` library changes
- `orjson` is used instead of `simplejson` by default (with a fallback for PyPy)
- the `spawn` method is now supported to run the normalizer stage in parallel (so it works on macOS and Windows)
- indexes are not created by default on `duckdb` (they slowed down inserts), but remain enabled on `postgres`
- bumped `duckdb` to 0.7.0 and the newest `dbt-duckdb`
- bugfixes and additional tests
Full Changelog: 0.2.0a18...0.2.0a19
0.2.0a18
What's Changed
- small improvements to duckdb docs by @TyDunn in #128
- Fix current columns and new columns referencing the same object by @steinitzu in #127
- tests file rotation on schema changes + bumps duckdb to 0.7 by @rudolfix in #129
New Contributors
- @steinitzu made their first contribution in #127
0.2.0a17
What's Changed
- adds duckdb destination and dbt support by @rudolfix in #124
  The 🦆db destination is added and may be used like any other destination. The multithreaded loading is quite fast thanks to `duckdb` dropping the GIL when called. See more in our docs, and the sketch after this list.
  We also support the `dbt-duckdb` adapter; the jaffle shop example was converted to `duckdb` to showcase this ability.
- transaction support was added to `sql_client`, and BigQuery got multi-statement transactions via Sessions
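A minimal, hedged sketch of loading into the new `duckdb` destination; the pipeline, dataset, and table names are made up:

```python
import dlt

# a pipeline that loads into a local duckdb database file
pipeline = dlt.pipeline(
    pipeline_name="quack_demo", destination="duckdb", dataset_name="demo_data"
)

# load a couple of rows; dlt infers the schema of the "items" table
load_info = pipeline.run(
    [{"id": 1, "name": "alice"}, {"id": 2, "name": "bob"}], table_name="items"
)
print(load_info)
```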
Full Changelog: 0.2.0a16...0.2.0a17
0.2.0a16
What's Changed
- uses structured data types to store json by @rudolfix in #121
- brings back the functionality to run dbt packages by @rudolfix in #122
This update changes how the `complex` data type is stored. Most often the `complex` data type is generated to hold `json` data when `nesting_level` is limited in the source. The complex types are stored as JSONB (postgres), SUPER (redshift), and JSON (BigQuery).
This update also makes it easy to run `dbt` packages together with `dlt` pipelines. Most of the work went into making the feature user friendly and avoiding dependency conflicts. It is also quite well tested. Take a look at these examples:
https://github.com/dlt-hub/dlt/blob/devel/docs/examples/dbt_run_jaffle.py
https://github.com/dlt-hub/dlt/blob/devel/docs/examples/chess/chess_dbt.py
https://github.com/dlt-hub/dlt/tree/devel/docs/examples/chess/dbt_transform
Running a `dbt` package takes just two lines, and it gives you more control than the CLI; see the sketch below.
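A hedged sketch of those two lines, following the linked jaffle shop example (the package URL comes from that example; the exact runner API may differ across versions):

```python
import dlt

pipeline = dlt.pipeline(
    pipeline_name="jaffle", destination="duckdb", dataset_name="jaffle_data"
)

# the two lines: bind a dbt package to the pipeline's destination credentials, then run it
dbt = dlt.dbt.package(pipeline, "https://github.com/dbt-labs/jaffle_shop.git")
models = dbt.run_all()

# each result reports the materialized model and its status message
for m in models:
    print(f"{m.model_name}: {m.message}")
```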
0.2.0a15
What's Changed
- improving docs UX by @TyDunn in #110
- docstrings and CLI improvements by @rudolfix in #114
- allows hash mismatch when loading a schema that needs migration
- makes the `full_refresh` flag configurable
- fixes config/secrets injection when a union with a base configuration is used; fixes secret hint detection
- properly restores the pipeline state when the extract step fails
0.2.0a10
What's Changed
- only specific default config values are generated by `dlt init`, e.g. the BigQuery location
- the correct postgres port is generated
- sends traces to Sentry if `RUNTIME__SENTRY_DSN` is present
- sends a Slack notification if `RUNTIME__SLACK_INCOMING_HOOK` is present (see the sketch after this list)
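A hedged sketch of enabling both integrations through dlt's environment config provider; the variable names come from the notes above, while the DSN and hook URLs are placeholders:

```python
import os

import dlt

# placeholders; use your real Sentry DSN and Slack incoming-webhook URL
os.environ["RUNTIME__SENTRY_DSN"] = "https://<key>@sentry.io/<project>"
os.environ["RUNTIME__SLACK_INCOMING_HOOK"] = "https://hooks.slack.com/services/<hook>"

# pipelines created after the vars are set send traces and notifications
pipeline = dlt.pipeline(pipeline_name="traced_demo", destination="duckdb")
```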
0.2.0a9
0.2.0a8
What's Changed
- deletes all pipeline state (schemas, state, intermediate files) if destination dataset is dropped
- synchronizes state with the destination in the `run` method; you can opt out from state sync with `restore_from_destination=false` in e.g. `config.toml` (see the sketch after this list)
- loads all schemas/sources into a single dataset by default. This simplifies the experience for less advanced users; you can switch back to the old behavior (each source/schema gets a separate dataset) with the `use_single_dataset=false` config option
- enables CTRL-C when running user code
- commits all files extracted from several sources after all user code has run
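A hedged sketch of opting out of both behaviors via environment variables instead of `config.toml`; the upper-cased names are an assumption based on how dlt maps config keys to env vars:

```python
import os

# assumption: top-level config.toml keys map to upper-cased env var names
os.environ["RESTORE_FROM_DESTINATION"] = "false"  # skip state sync with the destination
os.environ["USE_SINGLE_DATASET"] = "false"  # old behavior: one dataset per source/schema
```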
0.2.0a7
What's Changed
- adds the `max_table_nesting` argument to `dlt.source` to control the depth of parent-child table nesting (see the sketch after this list)
- fixes the `pipeline_name` when runtime configuration is embedded
- reacts to signals (e.g. CTRL-C) during extraction and other signal improvements
- passes GitHub vars to loggers/tracers
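A hedged sketch of capping nesting depth; the source and sample data are made up:

```python
import dlt


@dlt.source(max_table_nesting=1)
def events():
    @dlt.resource
    def raw():
        # "tags" can still unnest into a child table; anything nested deeper
        # than the limit is kept serialized as a complex/json value instead
        yield {"id": 1, "tags": ["a", "b"], "meta": {"history": [{"v": [1, 2]}]}}

    return raw
```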