Releases: dlt-hub/dlt
Releases · dlt-hub/dlt
0.3.4
Core Library
- staging for loader files implemented by @sh-rp in #451
- staging for redshift on s3 bucket and json + parquet by @sh-rp in #451
- staging for bigquery on gs bucket and json + parquet by @sh-rp in #451
- staging for snowflake on s3+gs buckets and json + parquet by @sh-rp in #451
- improvements and bugfixes for parquet generation by @rudolfix in #451
- tracks helpers usage and source names by @rudolfix in #497
- Fix: use sets to prevent unnecessary truncate calls by @z3z1ma in #481
Docs
- staging docs update by @sh-rp in #485
- rewritten documentation for destinations @rudolfix @AstrakhantsevaAA @dat-a-man
- adds category pages for sources and destinations by @rudolfix in #486
- Clarifies create-a-pipeline docs by @willi-mueller in #493
New Contributors
- @willi-mueller made their first contribution in #493
Full Changelog: 0.3.3...0.3.4
0.3.3
Core Library
- supports motherduck as a destination by @rudolfix in #460
- dbt 1.5 compatibility, enabled motherduck dbt support by @sh-rp in #475
- add more retry conditions and makes timeouts configurable in dlt requests drop-in replacement by @steinitzu in #477
- end_value support to incremental: backloading in parallel chunks now possible by @steinitzu in #467
Docs
- deploy cloud function as webhook by @dat-a-man in #449
- several key sections were updated and refactored by @AstrakhantsevaAA
- destination documentation refactor by @rudolfix in #478
Full Changelog: 0.3.2...0.3.3
0.3.3a0
0.3.2
Core Library
- snowflake destination: we support loading via PUT stage (
parquet
andjsonl
) and password and key pair authentication by @steinitzu in #414 - parquet files in load packages are supported with pyarrow. following destinations accept those when loading: bigquery, duckdb, snowflake and filesystem, by @sh-rp in #403
dbt-snowflake
supported by dbt wrapper by @steinitzu in #448
Docs
- Docs: polished reference's docs by @AstrakhantsevaAA in #430
dhelp
(AI assistant in docs) enabled by @burnash in #390- Added deploy with google cloud functions by @dat-a-man in #426
- train-gpt-q&a-blog by @TongHere in #438
- adding the open api spec article by @rahuljo in #442
- Docs/user guide data scientists by @AstrakhantsevaAA in #436
- Docs: airflow intro by @AstrakhantsevaAA in #444
- documents snowflake destination by @rudolfix in #447
- add file formats and fill out the parquet page in docs by @sh-rp in #439
- Added filesystem destination docs by @dat-a-man in #440
Full Changelog: 0.3.1...0.3.2
0.3.1
What's Changed
- add computed exhausted property by @sh-rp in #380
- removes the unpickable lambdas from destination caps and updates tests by @rudolfix in #404
- add secrets format option to dlt deploy by @sh-rp in #401
- Feat: Use compression to maximize network and disk space efficiency by @z3z1ma in #415
- 379 round robin pipe iterator by @sh-rp in #421
Docs
- adding article by @TongHere in #411
- GPT Training fix link by @TongHere in #417
- Docs: deploy airflow by @AstrakhantsevaAA in #410
- restructured docs: new Getting Started and dlt Ecosystem @rahuljo in #398 @adrianbr in #408
- Added Jira Docs by @dat-a-man in #425
- add structured data lake, fix titles by @adrianbr in #419
- adds duckdb->bigquery walkthrough by @rudolfix in #392
- Added sql_database pipeline by @dat-a-man in #396
- Added stripe setup guide by @dat-a-man in #394
- Added Workable pipeline docs by @dat-a-man in #395
- Added salesforce docs by @dat-a-man in #413
- Added Notion Docs by @dat-a-man in #409
- Added Mux docs by @dat-a-man in #412
New Contributors
Full Changelog: 0.3.0...0.3.1
0.3.0
Core Library
- renames Pipelines to Verified Sources by @rudolfix in #382
- adds tests to build containers, removes psutil by @rudolfix in #373
- finalizes where the resource state is stored in pipeline state by @rudolfix in #374
- accepts explicit values for unions if type of value is one of types by @rudolfix in #377
- add quotes to missing dependency exception output by @sh-rp in #387
- Feat/Add transaction management for filesystem operations using fsspec by @z3z1ma in #384
Minor Version Changes
- source name is now the key in pipeline state that stores all the source and resource state. previously the source section (which was the name of python module where source was defined) was used. this change will affect the already deployed pipelines that had name of the source different from the name of the module. they will not see the already stored state and may, for example, load some data twice. the only verified source affected by this is zendesk.
Docs
- rewrites the sections on source, resource and pipeline state by @rudolfix in #376
- minor changes to schema evolution doc by @rahuljo in #372
- pushing experiment 4 blog by @rahuljo in #371
- update docusaurus and fix gtag by @sh-rp in #385
- add section landing pages to docusaurus by @sh-rp in #386
New Contributors
Full Changelog: 0.2.9...0.3.0
0.2.9
Core Library
- dlt source decomposition into Airflow DAG by @rudolfix in #352
- airflow dlt wrapper to run dlt pipelines as DAGs by @rudolfix in #357
- dlt deploy airflow-composer by @AstrakhantsevaAA in #356
- new destination: filesystem/bucket with fsspec by @steinitzu in #342
- Update deprecated GitHub action by @tungbq in #345
- A base class for vault config providers with two implementations Google Secrets config provider and Airflow config provider
Docs
- pushing experiment 3 blog post by @rahuljo in #361
- structured data lakes post by @adrianbr in #362
- Several fixes and improvements by @tungbq
New Contributors
- @AstrakhantsevaAA made their first contribution in #356
Full Changelog: 0.2.8...0.2.9
0.2.8
Core Library
- fixes various airflow deployment issues by @rudolfix in #334 that include on-atomic renames on bucket mapped with fuse
- bumps duckdb dependency to include 0.8.0
- Fix/incremental with timezone naive datetime by @steinitzu in #330
- splits schema migration script to fit in max query length by @rudolfix in #339
resource_state
got final interface and is now exposed indlt.current.resource_state
#350- adds transformer overload that may be used when creating transformers dynamically to pass the decorated function
source.with_resources
creates a clone of resource and selects in the clone. previously source was modified in place- you can write back the secrets and configuration using
dlt.config
anddlt.secrets
indexers
Docs
- improve spaces on code samples by @TyDunn in #325
- Incremental loading image by @adrianbr in #318
- Adding matomo docs by @AmanGuptAnalytics in #331
- Adding asana dlt setup guide by @AmanGuptAnalytics in #319
- adding experiment 2 blog by @rahuljo in #336
- Fix Broken image in Docs > Pipelines > Google Analytics by @tungbq in #328
- Adding shopify docs by @AmanGuptAnalytics in #335
- Fixed the broken image link on zendesk page by @MirrorCraze in #337
- Fixed capitalization in docs by @burnash in #341
- Fix typo in docs/pipelines/asana.md by @tungbq in #344
- Fix analytics-engineer.md broken links by @tungbq in #349
- Correct typo in run pipeline guide of shopify.md by @tungbq in #347
New Contributors
- @tungbq made their first contribution in #328
- @MirrorCraze made their first contribution in #337
Full Changelog: 0.2.6...0.2.8
0.2.6
Core Library
- An experimental google secrets config provider #292 (actively used on our CI, goes to GA after adding more tests)
- Several bug fixes for
dlt init
anddlt pipeline
CLI commands - We are shifting from pre-releases to patch versions with post-releases for bugfixes and quick iterations to allow upgrades with
pip install -U dlt
Building Blocks
- add
dlt.sources.credentials
module with reusable credentials by @rudolfix in #315 (google service account, oauth2, database connection string and their base classes available to be used in pipelines)
Docs
- fix redshift docs by @TyDunn in #313
- User guides by @adrianbr in #301
- Updating landing page code snippet by @rahuljo in #314
- Updated README in
pipelines
repo with more building blocks examples and a guide on sharing community pipelines (https://github.com/dlt-hub/pipelines/blob/master/README.md#read-the-docs-on-building-blocks)
Full Changelog: 0.2.6a1...0.2.6
0.2.6a1
Core library
- Feat/pipeline drop command by @steinitzu in #285
- collectors and progress bars by @rudolfix in #302
Customizations
- Feat/new
add_limit
method for resources by @z3z1ma in #298 - Same method added to sources. overall you can now quickly sample large sources to ie. create example data sets, test your transformations etc. without the need to load everything
Docs
- explains how to set logging level and format by @rudolfix in #297
- ga4 internal dashboard demo blog post by @TyDunn in #299
- Added google_analytics docs by @AmanGuptAnalytics in #305
- Update README, add contributor's guide by @burnash in #311
- progress bars docs by @rudolfix in #312
New Contributors
- @z3z1ma made their first contribution in #298
- @ashish-weblianz made their first contribution in #306
Full Changelog: 0.2.6a0...0.2.6a1