Skip to content

Latest commit

 

History

History
66 lines (46 loc) · 2.46 KB

HISTORY.md

File metadata and controls

66 lines (46 loc) · 2.46 KB

History

0.3.3 (XXXX-XX-XX)

  • Fix perf_logger missing in pandas expressions

0.3.2 (2024-01-08)

  • Add option to skip rows from the top of a csv file for the csv read rule
  • Add support for reading/writing compressed csv files for the polars backend
  • Support reading csv files via http and https
  • Read csv files in one block in dask

0.3.1 (2023-12-22)

  • Remove polars-business dependency and implement vectorized datetime operations for weekdays offsets
  • Add tests to run the examples to make sure they don't get broken by future changes
  • Optimize pandas business_offset and date_offset using values from a different columns to be vectorized operations
  • Add a perf logger which will log a warning when using operations which are not vectorized

0.3.0 (2023-12-11)

  • Add support for dask backend
  • Ability to deserialize custom rules (ie not part of the etlrules package) to support users implementing their own rules

0.2.3 (2023-11-28)

  • Fix to apply substitution in the WriteSQLTableRule sql_engine parameter
  • Apply substitution in the Read/Write rules for csv and parquet files for the file_name and file_dir parameters
  • Add a cli runner which allows users to run a yml file and parameterize with cli args the plan context
  • Add the csv2db plan/yml example
  • Add the db2csv plan/yml example
  • Remove poetry

0.2.2 (2023-11-26)

  • Support environment variables substitution in the sql_engine string for SQL rules
  • Add support for the Boolean type
  • Introduce the concept of a plan context, consisting of a key-value mapping of string to int/float/str/bool values which will act as the args into the plan. They can be used in expressions when adding new columns, ifthenelse and filter rules.
  • Add env and context substitution feature to sql queries

0.2.1 (2023-11-20)

  • Add support for unsigned int types
  • Add AddRowNumbersRule, a rule to add row numbers
  • Add ExplodeValuesRule which explode lists of values into individual scalar values as additional rows
  • Add support for reading/writing to/from DBs via sqlalchemy under etlrules.backends.common.io.db

0.2.0 (2023-11-14)

  • Add support for polars as a backend

0.1.1 (2023-10-29)

  • Fix bug in RuleEngine validate affecting pipeline mode
  • Export the main classes from etlrules init
  • More typing annotations
  • More documentation

0.1.0 (2023-10-28)

  • First release on PyPI.
  • Rule engine supporting running pipeline mode and graph mode plans
  • Support for an initial set of etl rules, pandas backend only