chore: Added CI jobs to check classification training #457

fg-mindee · 2021-09-03T14:23:11Z

As per #429, this PR introduces the following modifications:

added option to change the font family in the character classification script
added an option to change the number of samples used during training & validation
added CI jobs to run the classification training for a single short epoch
increased the sleep in the CI job before checking whether the demo is up
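
As a rough illustration of the first two options, here is how such flags could be exposed with argparse. The flag names (`--font`, `--train-samples`, `--val-samples`, `--epochs`) and their defaults are assumptions made for this sketch, not taken from the PR diff.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a CLI parser with hypothetical options for the training script."""
    parser = argparse.ArgumentParser(description="Character classification training (sketch)")
    # Hypothetical flag: font file used to render the synthetic characters
    parser.add_argument("--font", type=str, default="FreeMono.ttf", help="font family to render characters with")
    # Hypothetical flags: number of generated samples for each split
    parser.add_argument("--train-samples", type=int, default=1000, help="number of generated training samples")
    parser.add_argument("--val-samples", type=int, default=20, help="number of generated validation samples")
    # Hypothetical flag: a CI job would set this to 1 to keep the run short
    parser.add_argument("--epochs", type=int, default=10, help="number of training epochs")
    return parser
```

A CI job could then keep the run short by passing small values, for example `build_parser().parse_args(["--train-samples", "100", "--val-samples", "10", "--epochs", "1"])`.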

Any feedback is welcome!

codecov · 2021-09-03T14:40:32Z

Codecov Report

Merging #457 (2949965) into main (64c7864) will increase coverage by 0.02%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##             main     #457      +/-   ##
==========================================
+ Coverage   95.83%   95.86%   +0.02%     
==========================================
  Files          96       96              
  Lines        4013     4013              
==========================================
+ Hits         3846     3847       +1     
+ Misses        167      166       -1

Flag	Coverage Δ
unittests	`95.86% <ø> (+0.02%)`	⬆️

Flags with carried forward coverage won't be shown.

Impacted Files	Coverage Δ
doctr/models/core.py	`95.08% <0.00%> (+0.81%)`	⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 64c7864...2949965.

charlesmindee

Thanks!

fg-mindee · 2021-09-03T14:56:49Z

I've checked on my end, and there is a seriously obscure issue with TF backprop:
the script works perfectly on GPU, but on CPU it throws a depth mismatch error in the convolution backward pass:

Traceback (most recent call last):
  File "references/classification/train_tensorflow.py", line 261, in <module>
    main(args)
  File "references/classification/train_tensorflow.py", line 198, in main
    fit_one_epoch(model, train_loader, batch_transforms, optimizer, mb)
  File "references/classification/train_tensorflow.py", line 41, in fit_one_epoch
    grads = tape.gradient(train_loss, model.trainable_weights)
  File "/home/fg/miniconda3/lib/python3.8/site-packages/tensorflow/python/eager/backprop.py", line 1074, in gradient
    flat_grad = imperative_grad.imperative_grad(
  File "/home/fg/miniconda3/lib/python3.8/site-packages/tensorflow/python/eager/imperative_grad.py", line 71, in imperative_grad
    return pywrap_tfe.TFE_Py_TapeGradient(
  File "/home/fg/miniconda3/lib/python3.8/site-packages/tensorflow/python/eager/backprop.py", line 159, in _gradient_function
    return grad_fn(mock_op, *out_grads)
  File "/home/fg/miniconda3/lib/python3.8/site-packages/tensorflow/python/ops/nn_grad.py", line 581, in _Conv2DGrad
    gen_nn_ops.conv2d_backprop_input(
  File "/home/fg/miniconda3/lib/python3.8/site-packages/tensorflow/python/ops/gen_nn_ops.py", line 1247, in conv2d_backprop_input
    _ops.raise_from_not_ok_status(e, name)
  File "/home/fg/miniconda3/lib/python3.8/site-packages/tensorflow/python/framework/ops.py", line 6897, in raise_from_not_ok_status
    six.raise_from(core._status_to_exception(e.code, message), None)
  File "<string>", line 3, in raise_from
tensorflow.python.framework.errors_impl.InvalidArgumentError: Computed input depth 960 doesn't match filter input depth 1 [Op:Conv2DBackpropInput]

fg-mindee · 2021-09-03T15:24:48Z

I can confirm this is a TF-level issue caused by grouped convolutions. We cannot do much about it right now.

For reference: tensorflow/tensorflow#51825
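
The numbers in the error message are consistent with a depthwise-style grouped convolution. Below is a minimal sketch of the arithmetic, assuming the failing layer has 960 input channels split into 960 groups (the traceback does not state the layer configuration, so this is a guess). Each filter of a grouped convolution only sees `in_channels // groups` input channels, so the kernel's input depth would be 1 while the activation's depth is 960. Both helper names are hypothetical.

```python
def filter_input_depth(in_channels: int, groups: int) -> int:
    """Input depth seen by each filter of a grouped convolution."""
    if groups <= 0 or in_channels % groups != 0:
        raise ValueError("in_channels must be divisible by a positive number of groups")
    return in_channels // groups


def naive_depth_check(in_channels: int, groups: int) -> bool:
    """Shape check that ignores groups: it only passes when groups == 1."""
    return in_channels == filter_input_depth(in_channels, groups)
```

With these assumed values, `filter_input_depth(960, 960)` gives 1 and `naive_depth_check(960, 960)` fails, which mirrors "Computed input depth 960 doesn't match filter input depth 1". Whether the TF CPU kernel performs exactly this check is not confirmed here.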

fg-mindee added 4 commits September 3, 2021 16:17

feat: Added options in classification script

ba0f34f

Added options to change the font family and the number of generated samples

chore: Added CI job to check the classification training script

ad0fb46

chore: Increased sleep for CI job checking the demo

70d9560

chore: Ensured the training is run in CI for only a single epoch

cd9df1e

fg-mindee added the labels topic: ci (Related to CI), ext: references (Related to references folder), and topic: character classification (Related to the task of character classification) Sep 3, 2021

fg-mindee added this to the 0.4.0 milestone Sep 3, 2021

fg-mindee self-assigned this Sep 3, 2021

fg-mindee mentioned this pull request Sep 3, 2021

[tests] Add CI jobs to check all scripts outside of the core codebase #429

Closed

7 tasks

fg-mindee added 2 commits September 3, 2021 16:31

chore: Installed free fonts before CI checks

2d2d54d

chore: Making sure to avoid permission denied

2949965

charlesmindee approved these changes Sep 3, 2021

fg-mindee merged commit 3e7e9de into main Sep 3, 2021

fg-mindee deleted the ci-ref branch September 3, 2021 15:25
