Filtering for train/val/test splits #7

RemyLau · 2021-08-23T15:23:50Z

https://github.com/krishnanlab/NetworkLearningEval/blob/1a75b5b1e4525dce81e261d33d13c151cf595135/src/NLEval/label/LabelsetCollection.py#L344-L351

There is a potential issue with the current scheme that can lead to unnecessary removal of label sets. For example, in a single iteration the LabelsetRangeFilterTrainTestPos filter may mark multiple label sets for removal, even though some of them would satisfy the criterion once the train/val/test splits are reassigned.

One potential solution would be to reassign the train/val/test splits after every removal of a label set. However, this has drawbacks: runtime is slower, and the result depends on the order of removal, so different orderings might lead to different final solutions.
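
To make the trade-off concrete, here is a minimal toy sketch, not the NLEval implementation. Everything in it is an assumption made for illustration: label sets are plain dicts of entity IDs, `split_entities` is a hypothetical deterministic train/test split (lower half of the sorted IDs goes to train), and the criterion is simply "at least `min_pos` positives in both train and test". It compares removing all failing label sets per iteration against removing one at a time and re-splitting after each removal.

```python
from typing import Dict, List, Set, Tuple

LabelSets = Dict[str, Set[int]]


def split_entities(labelsets: LabelSets) -> Tuple[Set[int], Set[int]]:
    """Hypothetical split: lower half of sorted entity IDs -> train, rest -> test."""
    entities = sorted(set().union(*labelsets.values()))
    mid = len(entities) // 2
    return set(entities[:mid]), set(entities[mid:])


def failing(labelsets: LabelSets, min_pos: int = 1) -> List[str]:
    """Names of label sets with fewer than min_pos positives in train or test."""
    train, test = split_entities(labelsets)
    return [
        name
        for name, pos in labelsets.items()
        if len(pos & train) < min_pos or len(pos & test) < min_pos
    ]


def filter_batch(labelsets: LabelSets, min_pos: int = 1) -> LabelSets:
    """Remove every failing label set in each iteration, then re-split."""
    current = dict(labelsets)
    bad = failing(current, min_pos)
    while bad:
        for name in bad:
            del current[name]
        bad = failing(current, min_pos)
    return current


def filter_one_at_a_time(
    labelsets: LabelSets, order: List[str], min_pos: int = 1
) -> LabelSets:
    """Remove one failing label set (earliest in `order`), re-split, repeat."""
    current = dict(labelsets)
    bad = failing(current, min_pos)
    while bad:
        del current[min(bad, key=order.index)]
        bad = failing(current, min_pos)  # one extra re-split per removal
    return current


labelsets = {"X": {1, 2}, "Z": {3, 4}, "W": {5, 6, 7, 8}}

print(list(filter_batch(labelsets)))                             # []
print(list(filter_one_at_a_time(labelsets, ["X", "Z", "W"])))    # ['W']
print(list(filter_one_at_a_time(labelsets, ["W", "X", "Z"])))    # ['Z']
```

In this toy case all three label sets fail under the initial split, so the batch scheme removes everything. Removing one at a time keeps a label set, but which one survives depends on the removal order, and every removal costs an extra re-split.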

RemyLau · 2022-01-08T01:55:36Z

The optimal solution (removing the least amount of data so that the split fulfills the criterion) might not be trivial to obtain. Also, given that SplitLSC will be deprecated soon (see #72), this won't be needed anyway.

RemyLau added the wontfix ("This will not be worked on") label Jan 8, 2022

RemyLau closed this as completed Jan 8, 2022
