Releases: dlt-hub/dlt

0.3.6

30 Jul 20:54
6a9c322

Core Library

  • fixes lost data and incorrect handling of child tables during truncate-and-insert replace by @sh-rp in #499
    This is an important improvement that closes several holes in the truncate-and-insert replace mode (present since the beginning of dlt). All tables are now truncated before the multithreaded append process starts. Child tables, which could previously be left holding stale data, are truncated as well.
    Details: #263, #271

  • fixes deploy airflow secrets and makes toml the default layout by @rudolfix in #513

  • check the required verified source dlt version during dlt init and warn users by @steinitzu in #514

  • add schema version to _dlt_loads table by @codingcyclist in #466
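
The ordering described in the truncate-and-insert fix above can be pictured with a small sketch. It is not dlt's implementation: the in-memory `tables` dict, the table names, and the helpers `truncate_all` and `append_job` are hypothetical and only show why truncating every parent and child table before the parallel appends avoids leftover rows.

```python
from concurrent.futures import ThreadPoolExecutor
from threading import Lock

# Hypothetical in-memory "destination": table name -> list of rows.
tables = {
    "orders": [{"id": 0}],         # stale parent rows from a previous load
    "orders__items": [{"id": 0}],  # stale child rows from a previous load
}
lock = Lock()


def truncate_all(names):
    """Empty every listed table up front, before any append job runs."""
    for name in names:
        tables[name].clear()


def append_job(job):
    """Append one batch of rows; guarded by a lock so threads do not interleave."""
    name, rows = job
    with lock:
        tables[name].extend(rows)


# The new load only produces rows for the parent table.
jobs = [("orders", [{"id": 1}]), ("orders", [{"id": 2}])]

# Truncate parent and child tables first, then append in parallel.
truncate_all(["orders", "orders__items"])
with ThreadPoolExecutor(max_workers=4) as pool:
    list(pool.map(append_job, jobs))

loaded_ids = sorted(row["id"] for row in tables["orders"])
```

Because the child table is truncated even though no job writes to it, it ends up empty instead of keeping rows from the earlier load.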

Full Changelog: 0.3.5...0.3.6

0.3.5

23 Jul 22:11
95644ec

Core Library

  • Fix incremental hitting end_value throwing out whole batches by @steinitzu in #495

  • replace with staging tables by @sh-rp in #488
    A staging dataset may now be used to replace tables. You can choose from several replace strategies (https://dlthub.com/docs/general-usage/full-loading), including a fully transactional, atomic replace of the parent and all child tables, and an optimized strategy that uses, for example, table cloning and copy-on-write in BigQuery and Snowflake.

  • detect serverless aws_lambda by @muppinesh in #490

  • fixes credentials instantiation by @rudolfix in #507

  • fixes extract schema update by @rudolfix in #509

  • adds destination fingerprint by @rudolfix in #511
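
As a rough illustration of replacing a table through staging, the sketch below uses SQLite from the Python standard library. It is an assumption-laden analogy, not dlt's code: the table names `items` and `items_staging` and the helper `replace_from_staging` are hypothetical, and a real destination may use cloning or other mechanisms instead.

```python
import sqlite3

# isolation_level=None puts the connection in autocommit mode,
# so the transaction below is controlled explicitly with BEGIN/COMMIT.
conn = sqlite3.connect(":memory:", isolation_level=None)
conn.execute("CREATE TABLE items (id INTEGER)")
conn.execute("CREATE TABLE items_staging (id INTEGER)")
conn.execute("INSERT INTO items VALUES (1), (2)")  # data from an earlier load

# Step 1: new data is loaded into the staging table only.
conn.executemany("INSERT INTO items_staging VALUES (?)", [(10,), (20,), (30,)])


def replace_from_staging(connection):
    """Swap the staged rows into the final table inside one transaction."""
    connection.execute("BEGIN")
    try:
        connection.execute("DELETE FROM items")
        connection.execute("INSERT INTO items SELECT id FROM items_staging")
        connection.execute("COMMIT")
    except sqlite3.Error:
        # On failure the final table keeps its previous contents.
        connection.execute("ROLLBACK")
        raise


# Step 2: readers see either the old rows or the new rows, never a mix.
replace_from_staging(conn)
ids = [row[0] for row in conn.execute("SELECT id FROM items ORDER BY id")]
```

The point of the design is that a failed load leaves the destination table untouched, because nothing is deleted until the staged data is complete.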

Full Changelog: 0.3.4...0.3.5

0.3.4

17 Jul 05:30
15c32da

Core Library

  • staging for loader files implemented by @sh-rp in #451
  • staging for redshift on s3 bucket and json + parquet by @sh-rp in #451
  • staging for bigquery on gs bucket and json + parquet by @sh-rp in #451
  • staging for snowflake on s3+gs buckets and json + parquet by @sh-rp in #451
  • improvements and bugfixes for parquet generation by @rudolfix in #451
  • tracks helpers usage and source names by @rudolfix in #497
  • Fix: use sets to prevent unnecessary truncate calls by @z3z1ma in #481
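
The last item, using sets to avoid repeated truncation, can be sketched as follows. The job list and the `truncate` stand-in are hypothetical and only illustrate the idea of collapsing duplicate table names before issuing calls.

```python
truncate_calls = []


def truncate(table):
    """Stand-in for a destination call; records which tables were truncated."""
    truncate_calls.append(table)


# Several load jobs may target the same table.
job_tables = ["events", "events", "users", "events"]

# A set keeps each table name once, so each table is truncated a single time.
# Sorting only makes the order deterministic for this example.
for table in sorted(set(job_tables)):
    truncate(table)
```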

Full Changelog: 0.3.3...0.3.4

0.3.3

10 Jul 06:44
306baf1

Core Library

  • supports motherduck as a destination by @rudolfix in #460
  • dbt 1.5 compatibility, enabled motherduck dbt support by @sh-rp in #475
  • adds more retry conditions and makes timeouts configurable in the dlt drop-in replacement for requests by @steinitzu in #477
  • end_value support to incremental: backloading in parallel chunks now possible by @steinitzu in #467
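
The idea behind backloading in parallel chunks can be sketched in plain Python. This is not dlt code: `ROWS`, `make_chunks`, and `load_range` are hypothetical, and treating each range as half-open (start included, end excluded) is an assumption of this sketch so that neighbouring chunks never overlap.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical source data with a monotonically increasing cursor field.
ROWS = [{"id": i} for i in range(10)]


def make_chunks(start, stop, size):
    """Split [start, stop) into consecutive (initial_value, end_value) pairs."""
    return [(lo, min(lo + size, stop)) for lo in range(start, stop, size)]


def load_range(bounds):
    """Load only the rows whose cursor falls inside one bounded range."""
    initial_value, end_value = bounds
    return [row for row in ROWS if initial_value <= row["id"] < end_value]


chunks = make_chunks(0, 10, 4)

# Each bounded range is independent, so the chunks can be loaded in parallel.
with ThreadPoolExecutor() as pool:
    batches = list(pool.map(load_range, chunks))

loaded = [row["id"] for batch in batches for row in batch]
```

Without an upper bound every worker would read to the end of the data, which is why a closed range is what makes parallel backfills possible.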

Full Changelog: 0.3.2...0.3.3

0.3.3a0

28 Jun 21:34
e7fafaf
Pre-release

Core Library

This is a pre-release version that adds the motherduck destination. For details, please see our docs.

Full Changelog: 0.3.2...0.3.3a0

0.3.2

25 Jun 22:41
408e2f1

Core Library

  • snowflake destination: we support loading via PUT stage (parquet and jsonl) and password and key pair authentication by @steinitzu in #414
  • parquet files in load packages are supported with pyarrow; the following destinations accept them when loading: bigquery, duckdb, snowflake and filesystem (by @sh-rp in #403)
  • dbt-snowflake supported by dbt wrapper by @steinitzu in #448

Full Changelog: 0.3.1...0.3.2

0.3.1

18 Jun 20:00

What's Changed

  • add computed exhausted property by @sh-rp in #380
  • removes the unpicklable lambdas from destination caps and updates tests by @rudolfix in #404
  • add secrets format option to dlt deploy by @sh-rp in #401
  • Feat: Use compression to maximize network and disk space efficiency by @z3z1ma in #415
  • 379 round robin pipe iterator by @sh-rp in #421
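
For readers unfamiliar with the term in the last item, round-robin iteration takes one item from each source in turn instead of draining the first source before touching the next. The generator below is a generic sketch of that technique and is not taken from the dlt code base; the name `round_robin` is hypothetical.

```python
from collections import deque


def round_robin(*iterables):
    """Yield one item from each iterable in turn until all are exhausted."""
    queue = deque(iter(it) for it in iterables)
    while queue:
        current = queue.popleft()
        try:
            item = next(current)
        except StopIteration:
            continue  # this source is finished; do not put it back in the queue
        yield item
        queue.append(current)  # the source goes to the back of the line


result = list(round_robin([1, 2, 3], "ab", [10]))
```

Interleaving sources this way keeps a slow or very long source from starving the others.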

Full Changelog: 0.3.0...0.3.1

0.3.0

11 Jun 18:51

Core Library

  • renames Pipelines to Verified Sources by @rudolfix in #382
  • adds tests to build containers, removes psutil by @rudolfix in #373
  • finalizes where the resource state is stored in pipeline state by @rudolfix in #374
  • accepts explicit values for unions if type of value is one of types by @rudolfix in #377
  • add quotes to missing dependency exception output by @sh-rp in #387
  • Feat/Add transaction management for filesystem operations using fsspec by @z3z1ma in #384

Minor Version Changes

  • The source name is now the key in pipeline state under which all source and resource state is stored. Previously the source section (the name of the Python module where the source was defined) was used. This change affects already deployed pipelines whose source name differs from the module name: they will not see the previously stored state and may, for example, load some data twice. The only verified source affected is zendesk.
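
The effect of this key change can be shown with a plain dictionary. The state layout, the module name `zendesk_module`, and the helper `read_source_state` are hypothetical and chosen only for illustration; they are not dlt's actual state schema.

```python
# Hypothetical state written by an older version, keyed by the module name.
stored_state = {"sources": {"zendesk_module": {"last_ticket_id": 42}}}


def read_source_state(state, key):
    """Return the state stored under a key, or an empty dict if none exists."""
    return state["sources"].get(key, {})


# Old lookup: keyed by the module (section) name, so the state is found.
old_lookup = read_source_state(stored_state, "zendesk_module")

# New lookup: keyed by the source name, which differs from the module name,
# so the pipeline sees no state and would start loading from scratch.
new_lookup = read_source_state(stored_state, "zendesk")
```

An empty result on the new lookup is what leads to the duplicate loads described above.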

Docs

  • rewrites the sections on source, resource and pipeline state by @rudolfix in #376
  • minor changes to schema evolution doc by @rahuljo in #372
  • pushing experiment 4 blog by @rahuljo in #371
  • update docusaurus and fix gtag by @sh-rp in #385
  • add section landing pages to docusaurus by @sh-rp in #386

Full Changelog: 0.2.9...0.3.0

0.2.9

29 May 18:39
f22963c

Core Library

  • dlt source decomposition into Airflow DAG by @rudolfix in #352
  • airflow dlt wrapper to run dlt pipelines as DAGs by @rudolfix in #357
  • dlt deploy airflow-composer by @AstrakhantsevaAA in #356
  • new destination: filesystem/bucket with fsspec by @steinitzu in #342
  • Update deprecated GitHub action by @tungbq in #345
  • A base class for vault config providers with two implementations: Google Secrets config provider and Airflow config provider

Full Changelog: 0.2.8...0.2.9

0.2.8

21 May 19:09
5cae979

Core Library

  • fixes various airflow deployment issues, including non-atomic renames on buckets mapped with FUSE, by @rudolfix in #334
  • bumps duckdb dependency to include 0.8.0
  • Fix/incremental with timezone naive datetime by @steinitzu in #330
  • splits schema migration script to fit in max query length by @rudolfix in #339
  • resource_state got final interface and is now exposed in dlt.current.resource_state #350
  • adds transformer overload that may be used when creating transformers dynamically to pass the decorated function
  • source.with_resources now creates a clone of the source and selects the resources in the clone. Previously the source was modified in place
  • you can write back the secrets and configuration using dlt.config and dlt.secrets indexers
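
Splitting a migration script to respect a maximum query length, as mentioned in the list above, can be sketched generically. The function `split_script`, its parameters, and the sample statements are hypothetical and do not reflect dlt's actual implementation or any destination's real limit.

```python
def split_script(statements, max_length, sep=";\n"):
    """Group statements into scripts whose joined length stays within max_length.

    A single statement longer than max_length is emitted alone, unchanged.
    """
    chunks, current, current_len = [], [], 0
    for stmt in statements:
        # The separator only counts when the statement joins an existing group.
        added = len(stmt) + (len(sep) if current else 0)
        if current and current_len + added > max_length:
            chunks.append(sep.join(current))
            current, current_len = [stmt], len(stmt)
        else:
            current.append(stmt)
            current_len += added
    if current:
        chunks.append(sep.join(current))
    return chunks


# Hypothetical migration: three column additions, limit of 80 characters.
migration = [
    "ALTER TABLE events ADD COLUMN country TEXT",
    "ALTER TABLE events ADD COLUMN city TEXT",
    "ALTER TABLE events ADD COLUMN zip TEXT",
]
scripts = split_script(migration, max_length=80)
```

Each returned script can then be executed as a separate query, in order, so that no single query exceeds the limit.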

Docs

  • improve spaces on code samples by @TyDunn in #325
  • Incremental loading image by @adrianbr in #318
  • Adding matomo docs by @AmanGuptAnalytics in #331
  • Adding asana dlt setup guide by @AmanGuptAnalytics in #319
  • adding experiment 2 blog by @rahuljo in #336
  • Fix Broken image in Docs > Pipelines > Google Analytics by @tungbq in #328
  • Adding shopify docs by @AmanGuptAnalytics in #335
  • Fixed the broken image link on zendesk page by @MirrorCraze in #337
  • Fixed capitalization in docs by @burnash in #341
  • Fix typo in docs/pipelines/asana.md by @tungbq in #344
  • Fix analytics-engineer.md broken links by @tungbq in #349
  • Correct typo in run pipeline guide of shopify.md by @tungbq in #347

Full Changelog: 0.2.6...0.2.8