Skip to content
This repository was archived by the owner on Jul 16, 2024. It is now read-only.

Merge test changes and data read post-processing into MAIN#20

Merged
DavidJBianco merged 4 commits intomasterfrom
dev
Mar 23, 2021
Merged

Merge test changes and data read post-processing into MAIN#20
DavidJBianco merged 4 commits intomasterfrom
dev

Conversation

@DavidJBianco
Copy link
Copy Markdown
Contributor

No description provided.

* Added 'test-infra.sh' script to start/stop the docker containers and load the data needed to
  run the unit tests. This makes it easier (and faster) to perform tests during development, since
  you can leave the test infra running and just execute the tests and save a few minutes during
  each iteration.
* Modified 'tox.ini' to use the new test-infra.sh script instead of having the commands embedded
  directly.
* Changed _read_multi's 'func' parameter to be 'read_function' since we now have both read and
  post-processing function parameters.
* Added a new 'post_function' parameter to read_csv, read_json and the backend _read_multi
  functions.  As each individual file is read, _read_multi can optionally call a post-processing
  function to manipulate the data before it's added to the resulting DataFrame. As a beneficial
  side effect, the new mechanism for reading multiple files appends each chunk as it's read, rather
  than keeping them all in memory until the end.  This should make things faster and use less memory.
* Added documentation about the new post-processing function
* Updated the version to 0.5.3
@DavidJBianco DavidJBianco merged commit daf9cf7 into master Mar 23, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant