This repository has been archived by the owner on Jun 22, 2022. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 32
planning 2018.05.14
Kamil A. Kaczmarek edited this page May 14, 2018
·
13 revisions
- library name
gradibus
- pip package and readthedocs
- notebooks
- tasks
- cache/save output
- Why do we have to define a caching directory for each and every transformer? Is there a better way to do it (Per project? Context handler? Something else?)
- How do we handle partitioning into training and test datasets? (just use separate data nodes?)
- Input has complicated notation: nested dicts in input -> simplify interface -> DataStep should merge input_step and input_data into one API piece.
data = {'input':
{
'X': X_train,
'y': y_train,
}
}
- You add input that is never used -> you see it on the graph.
- cache_transformer -> persisted_transformer
- Do not transform twice!