This is a -lightweight- library for defining and processing data(sets). The goals of the library are to a) uniformize dataset classes, b) speed up experiments where the data does not fit into memory, c) offer a general way of processing data independent of the data source, d) seamlessly combine different datasets.
The reader is advised to first go through EXAMPLES/introduction_notebook/Readme1/2/3_* and then check the EXAMPLE implementation. For more information on each function check the docstrings or https://dabstract.readthedocs.io/.
- clone to a folder of choice (should be easily accessible, e.g. among other utilities)
- go to folder in terminal
- "pip install --editable ."
The --editable flag makes sure that the package uses the files you used for installing as source, such that you can adjust and push updates to this repository when you have a new feature or for a bug fix.