Custom dataset training notebook ? #152
Comments
Hi @jeromen7 - I've been working on one, but work has delayed it. Do you have a good example dataset in mind? That would help, as I can't use my work datasets and haven't found a good public one yet.
Re: dataset format - I've been using COCO-format JSON with the boxes in xyxy format (converted from the COCO box format). Some others have had luck with cxcy.
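For readers unsure what the box formats above mean, here is a minimal sketch of the conversions. It assumes COCO annotations store each box as `[x, y, width, height]` in absolute pixels, and that "cxcy" refers to a center-based `(cx, cy, w, h)` layout normalized by image size. The function names are illustrative and do not come from the DETR code base.

```python
from typing import List, Sequence


def coco_xywh_to_xyxy(box: Sequence[float]) -> List[float]:
    """Convert [x, y, width, height] (top-left corner) to [x0, y0, x1, y1]."""
    x, y, w, h = box
    return [x, y, x + w, y + h]


def xyxy_to_normalized_cxcywh(
    box: Sequence[float], img_w: float, img_h: float
) -> List[float]:
    """Convert [x0, y0, x1, y1] in pixels to [cx, cy, w, h] scaled to [0, 1].

    Assumption: "cxcy" in the comment above means this center-based layout.
    """
    x0, y0, x1, y1 = box
    return [
        (x0 + x1) / 2 / img_w,  # box center, x
        (y0 + y1) / 2 / img_h,  # box center, y
        (x1 - x0) / img_w,      # box width
        (y1 - y0) / img_h,      # box height
    ]


if __name__ == "__main__":
    xyxy = coco_xywh_to_xyxy([10, 20, 30, 40])
    print(xyxy)  # [10, 20, 40, 60]
    print(xyxy_to_normalized_cxcywh(xyxy, img_w=100, img_h=200))
```

Which of these layouts a given training script expects depends on its data-loading code, so it is worth checking there before converting annotations.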
What about one of these? https://www.kaggle.com/andrewmvd/face-mask-detection https://www.kaggle.com/mbkinaci/fruit-images-for-object-detection https://www.kaggle.com/wobotintelligence/face-mask-detection-dataset
Thank you for your answer @lessw2020! Thank you again for using some of your time to help us all! 🙏
Great, thanks @woctezuma and @jeromen7
Yes, you are right @lessw2020, the 'no mask' class is for faces with no mask, and the 'mask' class is for faces with a mask.
@lessw2020 Thank you for your work so far - I am looking forward to seeing the rest of the custom training dataset Colab notebook. I took the course on creating COCO datasets (https://www.udemy.com/course/creating-coco-datasets/) by Adam Kelly. Now that I have my own image data and annotations in the COCO format, I am looking to start training with DETR. I haven't yet found anyone who has posted an example of training DETR on a custom dataset.
You can finetune DETR either directly:
Or with the "detectron2" wrapper:
Hello,
I was wondering if someone managed to write a notebook for training DETR on a custom dataset.
I saw issue #9, but there are a lot of messages and nobody provided a complete solution for what I am looking for.
The Kaggle solution seems to work well, but I don't know how to generalize it to multiple classes (rather than only one for the wheat), or how to run it without the k-fold cross-validation, which is great but adds a lot of computation time.
Moreover, the README in this repository says that the dataset should be in COCO format with JSON annotation files, but the Kaggle solution never uses JSON files... So I don't know what dataset format DETR expects for training. CSV file? JSON file? xyxy format, xywh format, or both?
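To make the format question concrete, here is a rough sketch of a COCO-style annotation structure with more than one class. The top-level keys (`images`, `annotations`, `categories`) and the per-annotation fields follow the COCO detection format; the helper name `build_coco_dict`, the file name, and the choice to start category ids at 0 are assumptions for illustration only, and a given training script may expect a different id numbering.

```python
import json


def build_coco_dict(class_names, images, boxes):
    """Build a COCO-style dict.

    class_names: list of class name strings
    images: list of (file_name, width, height)
    boxes: list of (image_index, class_name, [x, y, w, h]) in pixels
    """
    # Assumption: ids start at 0 here; adjust to whatever the trainer expects.
    categories = [{"id": i, "name": n} for i, n in enumerate(class_names)]
    name_to_id = {c["name"]: c["id"] for c in categories}

    image_entries = [
        {"id": i, "file_name": f, "width": w, "height": h}
        for i, (f, w, h) in enumerate(images)
    ]

    annotations = []
    for ann_id, (img_idx, cls, bbox) in enumerate(boxes):
        x, y, w, h = bbox
        annotations.append({
            "id": ann_id,
            "image_id": img_idx,
            "category_id": name_to_id[cls],
            "bbox": [x, y, w, h],  # COCO boxes are [x, y, width, height]
            "area": w * h,
            "iscrowd": 0,
        })

    return {
        "images": image_entries,
        "annotations": annotations,
        "categories": categories,
    }


if __name__ == "__main__":
    coco = build_coco_dict(
        class_names=["mask", "no mask"],
        images=[("img0.png", 640, 480)],  # hypothetical file
        boxes=[(0, "no mask", [10, 20, 30, 40])],
    )
    print(json.dumps(coco, indent=2))
```

One such JSON file per split (train and validation) is the usual arrangement for COCO-style datasets; the exact folder and file names a training script looks for should be checked in its dataset-loading code.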
To sum up, I am looking for a simple and well-structured notebook that works on a dataset split into 2 folders (train and validation) like for example :
I know I am asking a lot, but any assistance would be much appreciated!
Thank you all very much for your help 😄