0.10.1 (2024-02-05)
github-actions
released this
05 Feb 09:16
·
171 commits
to develop
since this release
Features
-
Add support of
Incremental Strategies
forKafka
connection:reader = DBReader( connection=Kafka(...), source="topic_name", hwm=AutoDetectHWM(name="some_hwm_name", expression="offset"), ) with IncrementalStrategy(): df = reader.run()
This lets you resume reading data from a Kafka topic starting at the last committed offset from your previous run. (#202)
-
Add
has_data
,raise_if_no_data
methods toDBReader
class. (#203) -
Updade VMware Greenplum connector from
2.1.4
to2.3.0
. This implies:- Greenplum 7.x support
- Kubernetes support
- New read option gpdb.matchDistributionPolicy which allows to match each Spark executor with specific Greenplum segment, avoiding redundant data transfer between Greenplum segments
- Allows overriding Greenplum optimizer parameters in read/write operations (#208)
-
Greenplum.get_packages()
method now accepts optional argpackage_version
which allows to override version of Greenplum connector package. (#208)