
Training (or Fine-Tuning) the Model #64

Open
martholomew opened this issue Oct 14, 2023 · 1 comment
Comments

@martholomew

martholomew commented Oct 14, 2023

I would like to fine-tune the model towards the data that I will be feeding it. My pipeline would be to binarize the images using sbb_binarize, then manually edit them to be high-quality ground-truth, then feed a large amount of these images back into the model.

  1. Would the end result be better binarization on my dataset?
  2. How would this be accomplished?

A link to point me in the right direction would be a great help.

@vahidrezanezhad
Member

Dear @martholomew,

Of course. Pseudo-labeling can be effective, and we have used this technique ourselves to improve our models. You can use https://github.com/qurator-spk/sbb_pixelwise_segmentation for training. First, binarize your dataset with our models, then select the documents with satisfactory results to build your custom training dataset. Sometimes a prediction is only locally good; in such cases you can crop those regions to prepare your ground truth (GT).
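The cropping step described above can be sketched as follows. This is a minimal illustration, not part of sbb_pixelwise_segmentation itself: the toy image, label values, and the chosen crop window are all placeholders. The key point is that the image and its pseudo-label must be cropped with the same window so the pair stays pixel-aligned for training.

```python
# Hedged sketch: extracting an aligned image/pseudo-label patch for GT preparation.
# The data, window coordinates, and patch size here are hypothetical examples.

def crop_patch(grid, top, left, height, width):
    """Extract a rectangular patch from a 2D pixel grid (list of rows)."""
    return [row[left:left + width] for row in grid[top:top + height]]

# Toy 4x4 grayscale "image" and the binary pseudo-label the model produced for it.
image = [[10,  20,  30,  40],
         [50,  60,  70,  80],
         [90, 100, 110, 120],
         [130, 140, 150, 160]]
label = [[0, 0, 1, 1],
         [0, 0, 1, 1],
         [1, 1, 0, 0],
         [1, 1, 0, 0]]

# Suppose only the top-left 2x2 region was predicted well. Crop the SAME
# window from both grids so image and ground truth remain aligned.
img_patch = crop_patch(image, 0, 0, 2, 2)  # [[10, 20], [50, 60]]
gt_patch = crop_patch(label, 0, 0, 2, 2)   # [[0, 0], [0, 0]]
```

In practice you would do this with an image library on the real page scans and save each patch pair in whatever directory layout your training setup expects.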
