Releases: edanalytics/earthmover

v0.3.1

26 Apr 15:37
6094c1b

What's Changed

  • allow any ordering of Transformations during graph-building in compile by @jayckaiser
  • only create a /packages dir when earthmover deps succeeds by @jayckaiser
  • explain earthmover_compiled.yaml in README.md by @sleblanc23

Full Changelog: v0.3.0...v0.3.1

earthmover 0.3.0

17 Apr 15:59
983a2f8

What's Changed

  • feature: add project composition using packages keyword in template file (see README)
  • feature: add installation extras for optional libraries, and improve error logging to notify which is missing
  • feature: GroupByWithRankOperation cumulatively sums record counts by group-by columns
  • feature: setting log_level: DEBUG in template configs or setting debug: True for a node displays the head of the node mid-run
  • feature: add optional_fields key to all Sources to add optional empty columns when missing from schema
  • feature: add optional ignore_errors and exact_match boolean flags to DateFormatOperation
  • internal: remove attempted directory-hashing when a source is a directory (i.e., Parquet)
  • internal: remove unused group_by_with_count and group_by_with_agg operations

Full Changelog: v0.2.1...v0.3.0

Note: This version has slightly different packaging requirements than v0.2.1. Please make sure to re-install the package if using locally.

v0.2.1 (and v0.2.2)

08 Apr 23:38
6edc19c

What's Changed

Full Changelog: v0.2.0...v0.2.1

Note: An error in the release of v0.2.1 to PyPI required releasing a v0.2.2 there which is identical in content to v0.2.1 here on GitHub. GitHub releases and PyPI versions will sync back up in the upcoming 0.3.0 release.

v0.2.0

11 Oct 23:00
29af4c3

What's Changed

New Contributors

Full Changelog: v0.1.6...v0.2.0

v0.1.6

11 Jul 15:30
de408fd

v0.1.5

13 Jun 22:11
3759510
  • fixing a bug to skip hashing missing optional source files
  • adding a tmp_dir config so we can tell Dask where to store data it spills to disk
  • adding a --results-file option to produce structured run metadata
  • adding a skip exit code

v0.1.4

12 May 20:10

This release fixes several bugs:

  • bugfix: config.state_file was being ignored when specified
  • bugfix: further issues with multi-line config.macros
  • bugfix: charset issues when reading / writing non-UTF8 files

v0.1.3

05 May 22:02

This release implements the ability to call {{ md5(column) }} in Jinja throughout earthmover, and adds a framework for other Python functions to be added in the future. It also fixes a bug with multi-line macros.
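As a rough illustration of what a call like {{ md5(column) }} might produce, the sketch below hashes a single column value with Python's standard hashlib module. The helper name md5, the string conversion, the UTF-8 encoding, and the sample row are all assumptions made for this example; earthmover's own implementation may differ.

```python
import hashlib


def md5(value) -> str:
    """Hypothetical helper: hex MD5 digest of a value's string form."""
    # Assumption: the value is stringified and encoded as UTF-8 before hashing.
    return hashlib.md5(str(value).encode("utf-8")).hexdigest()


# Hypothetical row with a single column, standing in for a record in a template.
row = {"student_id": "abc"}
print(md5(row["student_id"]))
```

The digest is a 32-character hexadecimal string, which makes it convenient as a deterministic surrogate key derived from a column.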

v0.1.2

02 May 19:38

This release fixes bugs with environment variable expansion under Windows and with multi-line macro definitions in YAML config.

v0.1.1

27 Mar 21:20

This release fixes a bug where a single quote in the config YAML could prevent environment variable expansion from working. The cause is that, under Windows, Python's os.path.expandvars() does not expand variables within single quotes.
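The behavior behind this bug can be reproduced on any platform by calling the Windows and POSIX implementations of expandvars() directly through the standard ntpath and posixpath modules. This is a minimal sketch, not earthmover's code: the variable name EM_DEMO_DIR and the sample config lines are made up for the demonstration.

```python
import ntpath
import os
import posixpath

# Hypothetical variable name and value, set only for this demonstration.
os.environ["EM_DEMO_DIR"] = "data"

unquoted = "path: $EM_DEMO_DIR/in.csv"
quoted = "note: it's in $EM_DEMO_DIR"  # contains a lone single quote

# Windows rules (ntpath): text following a single quote is not expanded.
win_unquoted = ntpath.expandvars(unquoted)
win_quoted = ntpath.expandvars(quoted)

# POSIX rules (posixpath): single quotes have no special meaning.
posix_quoted = posixpath.expandvars(quoted)

print(win_unquoted)
print(win_quoted)
print(posix_quoted)
```

Under the Windows rules the line containing the apostrophe comes back unchanged, while the POSIX rules expand the variable as usual, which is why the problem only appeared on Windows.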