Made dataset splits possible during export #1162

Arthemide · 2023-04-19T07:51:28Z

I am working on a project using Kili and exporting my datasets with different types of annotations.

I think it would be useful to add a new feature that allows me to split the datasets into train/validation/test sets directly during the export process.
Currently, I have to create another script after the Kili export, and I believe that every other data scientist has to do the same.

Is this feature being developed internally, and do you think it would be a valuable addition?

Jonas1312 · 2023-04-19T08:18:33Z

Hi,

If you want to create three separate exports for your three folds, you can try to use the asset_ids, external_ids or asset_filter_kwargs parameters of the kili.export_labels() method.

For example, you can get the asset ids using kili.assets(), then split those ids into three folds, and call the kili.export_labels() method thrice using the asset_ids parameter and the different folds.

You could also add metadata to your assets, as described in this tutorial.

Would this solution solve your issue?

Jonas1312 closed this as completed May 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Made dataset splits possible during export #1162

Made dataset splits possible during export #1162

Arthemide commented Apr 19, 2023

Jonas1312 commented Apr 19, 2023 •

edited

Made dataset splits possible during export #1162

Made dataset splits possible during export #1162

Comments

Arthemide commented Apr 19, 2023

Jonas1312 commented Apr 19, 2023 • edited

Jonas1312 commented Apr 19, 2023 •

edited