Avoid that the OMR processes finishes prematurely #53

liebharc · 2023-11-30T08:53:10Z

Hi,

This is just a collection of small fixes for various exceptions and errors I ran into while running oemer on a range of images. They are mainly bounds checks. The changes also try to filter out invalid results instead of raising errors, on the assumption that it's better to extract as much as possible.

liebharc · 2023-11-30T08:59:49Z

The ruff error in the CI pipeline should disappear as soon as #52 is merged

BreezeWhite

Hi, sorry for the late reply. I've been on a vacation recently.

I have to admit that I've forgotten many of the implementation details, so I may not be able to judge correctly whether the modifications are proper. Instead, I choose to trust that the refactoring makes sense. Just a few comments need to be confirmed.

Thanks again for making this PR.

BreezeWhite · 2023-12-20T09:20:55Z

oemer/rhythm_extraction.py

+        try:
+            cur_scan_line = note_id_map[int(start_y):int(bbox[3]), int(right_bound)]
+            ids = set(np.unique(cur_scan_line))
+            if -1 in ids:
+                ids.remove(-1)
+            if len(ids) > 0:
+                break
+            right_bound += 1
+            if right_bound >= bbox[2] + unit_size:
+                break
+        except IndexError as e:
+            print(e)


Maybe just this scope is enough

        try:
            cur_scan_line = note_id_map[int(start_y):int(bbox[3]), int(right_bound)]
        except IndexError as e:
            ...

True and good idea
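
For readers following along, here is a minimal sketch of what the narrowed scope could look like. It is not the code from oemer/rhythm_extraction.py: the function name `scan_right_for_ids`, its parameters, and the use of a plain list of lists instead of a NumPy array are assumptions made to keep the example self-contained. Only the indexing step sits inside the `try`, so an `IndexError` raised by any other statement would still surface.

```python
def scan_right_for_ids(note_id_map, start_y, end_y, right_bound, max_right):
    """Scan columns to the right until one contains a note id (hypothetical helper).

    note_id_map is assumed to be a list of rows; -1 marks "no note here".
    Returns the ids found and the column where the scan stopped.
    """
    while right_bound < max_right:
        try:
            # Only the indexing can run out of bounds, so only it is guarded.
            column = [row[int(right_bound)]
                      for row in note_id_map[int(start_y):int(end_y)]]
        except IndexError as e:
            print(e)
            # Assumption for this sketch: stop scanning at the image border.
            return set(), right_bound
        ids = set(column)
        ids.discard(-1)  # drop the background marker
        if ids:
            return ids, right_bound
        right_bound += 1
    return set(), right_bound


grid = [
    [-1, -1, 3],
    [-1, -1, -1],
]
found, column_index = scan_right_for_ids(grid, 0, 2, 0, 10)
```

Stopping the scan at the first out-of-bounds column is a choice made for this sketch; the actual loop may handle that case differently.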

BreezeWhite · 2023-12-20T09:23:53Z

oemer/staffline_extraction.py

-            raise E.StafflineCountInconsistent(
-                f"Some of the stafflines contains less or more than 5 lines: {line_num}")
+            print(f"Some of the stafflines contains less or more than 5 lines: {line_num}")
+            continue


I'm not sure if ignoring these exceptions is good or not, since the later flow relies heavily on strong stafflines assumptions.

In the cases I saw, the change gives you e.g. 4 out of 5 staffs. The results then weren't great. I think what the change mainly allows is a better understanding of which staff (or part of the image) caused the problems: you can see in the teaser which staffs have been detected and which ones haven't. With the exception, that's much harder to understand, as you also don't get a teaser image.

But that said, I'm also fine to revert this change.

Maybe you can run some tests on those cases with and without these modifications to see whether removing the exception really affects the later processing?
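
To make the trade-off in this thread concrete, here is a small sketch of the "filter instead of raise" pattern. It is an illustration only, not the code in oemer/staffline_extraction.py: `filter_staffs`, `staff_rows`, and `expected_lines` are hypothetical names, and each staff is assumed to be a plain list of line positions.

```python
def filter_staffs(staff_rows, expected_lines=5):
    """Keep staffs with the expected number of lines, report the rest.

    Returns (valid, skipped): the staffs that passed the check and the
    indices of those that were dropped, so a caller can still see which
    part of the image caused trouble.
    """
    valid = []
    skipped = []
    for idx, lines in enumerate(staff_rows):
        if len(lines) != expected_lines:
            # Previously this would have been a raised exception.
            print(f"Staff {idx} has {len(lines)} lines, expected {expected_lines}")
            skipped.append(idx)
            continue
        valid.append(lines)
    return valid, skipped


valid, skipped = filter_staffs([
    [10, 20, 30, 40, 50],
    [110, 120, 130, 140],      # one line missing
    [210, 220, 230, 240, 250],
])
```

Returning the skipped indices alongside the valid staffs is one way to keep the diagnostic value that the exception message used to carry.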

BreezeWhite · 2023-12-20T09:24:02Z

oemer/staffline_extraction.py

-            raise E.StafflineNotAligned(
-                f"Centers of staff parts at the same row not aligned (Th: {horizontal_diff_th}): {norm(centers)}")
+            print(f"Centers of staff parts at the same row not aligned (Th: {horizontal_diff_th}): {norm(centers)}")
+            continue


Same as above

BreezeWhite · 2023-12-20T09:24:39Z

oemer/staffline_extraction.py

-        if not np.all(norm(unit_size) < unit_size_diff_th):
-            raise E.StafflineUnitSizeInconsistent(
-                f"Unit sizes not consistent (th: {unit_size_diff_th}): {norm(unit_size)}")
+        if not np.all(norm(unit_size) < unit_size_diff_th):          
+            print(f"Unit sizes not consistent (th: {unit_size_diff_th}): {norm(unit_size)}")
+            continue
+        valid_staffs.append(staffs)


Same as above

BreezeWhite · 2023-12-20T09:26:22Z

oemer/symbol_extraction.py

-                raise E.SfnNoteTrackMismatch(f"Track of sfn and note not mismatch: {ss}\n{note}")
-            if ss.group != note.group:
-                raise E.SfnNoteGroupMismatch(f"Group of sfn and note not mismatch: {ss}\n{note}")
-            notes[ss.note_id].sfn = ss.label
-            ss.is_key = False
+                print(f"Track of sfn and note not mismatch: {ss}\n{note}") 
+                notes[ss.note_id].invalid = True
+            elif ss.group != note.group:
+                print(f"Group of sfn and note not mismatch: {ss}\n{note}")
+                notes[ss.note_id].invalid = True
+            else:
+                notes[ss.note_id].sfn = ss.label
+                ss.is_key = False


Same as above.

And also sorry for the typo 😂 Should be "Group of sfn and note mismatch", the "not" is redundant.
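
A minimal sketch of the "mark as invalid" approach from the diff above, for illustration. The `Note` and `Sfn` dataclasses, the `attach_sfn` function, the `ss.track != note.track` condition (the diff does not show it), and the final filtering step are all assumptions; only the branch structure mirrors the hunk.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Note:  # hypothetical stand-in for oemer's note representation
    track: int
    group: int
    sfn: Optional[str] = None
    invalid: bool = False


@dataclass
class Sfn:  # hypothetical stand-in for a sharp/flat/natural symbol
    note_id: int
    track: int
    group: int
    label: str
    is_key: bool = True


def attach_sfn(notes, sfns):
    """Attach each sfn to its note, flagging mismatches instead of raising."""
    for ss in sfns:
        note = notes[ss.note_id]
        if ss.track != note.track:  # assumed condition, not shown in the diff
            print(f"Track of sfn and note mismatch: {ss}\n{note}")
            note.invalid = True
        elif ss.group != note.group:
            print(f"Group of sfn and note mismatch: {ss}\n{note}")
            note.invalid = True
        else:
            note.sfn = ss.label
            ss.is_key = False
    # Assumption: later stages simply skip notes flagged as invalid.
    return [n for n in notes if not n.invalid]


notes = [Note(track=0, group=0), Note(track=1, group=0)]
sfns = [Sfn(0, 0, 0, "sharp"), Sfn(1, 0, 0, "flat")]
kept = attach_sfn(notes, sfns)
```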

liebharc · 2024-01-05T10:01:16Z

Happy new year! This is an example for the staffline exceptions:

On the main branch it should give you this error: "oemer.exceptions.StafflineNotAligned: Centers of staff parts at the same row not aligned (Th: 0.1): [0.14311853 0.14315237 0.14318621 0.14334265 0.14335062 0.1434011 0.85955148]"

The issue here is the background which is dark and noisy.

On this branch you instead get a result which looks like this in MuseScore:

While there are plenty of issues in the result, I still think it's useable. The teaser looks quite good:

Let me know what you think and if you would prefer to keep the exceptions or keep the PR as it is.

BreezeWhite · 2024-01-23T06:47:25Z

Well... although the result still has lots of room for improvement, I'm surprised that oemer can work to some extent with the weaker assumptions. I think replacing the exceptions is worth keeping, compared to the user experience of easily running into errors. Nice job 👍🏻

liebharc · 2024-01-23T11:02:33Z

Thanks. I was mostly using oemer with pictures taken with phone cameras. It really did an impressive job when I compared the results to other tools.

BreezeWhite · 2024-01-26T06:19:28Z

Thanks for the appreciation. So is it good to merge this PR? Is there anything that still needs to be modified?

liebharc · 2024-01-26T09:24:02Z

Yes, I believe the PR can be merged.

* Fixed typos in comments
* IndexError while scanning for a dot should not abort the whole process
* Bound check while getting the note label
* Added check if label is in the note_type_map
* Filter staffs instead of aborting with an exception
* Bound check during symbol extraction
* Marking notes as invalid instead of aborting with an exception
* Bound check
* Fixed type error
* Fixed TypeError at start of unet or segnet training (BreezeWhite#52)
* Fixed 'TypeError: Cannot convert 4.999899999999999e-07 to EagerTensor of dtype int64' in training, fixes BreezeWhite#39 https://stackoverflow.com/questions/76511182/tensorflow-custom-learning-rate-scheduler-gives-unexpected-eagertensor-type-erro
* --format was deprecated in ruff and replaced with --output-format
* HoughLinesP can return None if no lines are found
* Fixed error which happens if no rest bboxes were found
* Limited try/except block
* Fixed typo
* Use fixed versions for the linter dependencies to avoid that results are different for the same source code level on different test runs due to update of the dependencies
* Fixed type errors which came up with the recent version of cv2
* Going back to the newest version of ruff and mypy as the type errors were introduced by cv2

liebharc added 9 commits November 30, 2023 09:16

Fixed typos in comments

bb64377

IndexError while scanning for a dot should not abort the whole process

107ec50

Bound check while getting the note label

d70d78b

Added check if label is in the note_type_map

88fd4df

Filter staffs instead of aborting with an exception

d30ba59

Bound check during symbol extraction

b0351ae

Marking notes as invalid instead of aborting with an exception

0563cbc

Bound check

a30005f

Fixed type error

a764104

liebharc and others added 3 commits December 1, 2023 15:31

HoughLinesP can return None if no lines are found

49a52ec

Fixed error which happens if no rest bboxes were found

267b35e

BreezeWhite reviewed Dec 20, 2023

liebharc added 4 commits January 5, 2024 10:20

Limited try/except block

df9981c

Fixed typo

c8e5c19

Use fixed versions for the linter dependencies to avoid that results are different for the same source code level on different test runs due to update of the dependencies

5ee7f9b

Merge branch 'main' of https://github.com/liebharc/oemer into stability

a37c53b

liebharc added 2 commits January 5, 2024 11:29

Fixed type errors which came up with the recent version of cv2

c53ca1a

Going back to the newest version of ruff and mypy as the type errors were introduced by cv2

81155a7

BreezeWhite merged commit 49cef64 into BreezeWhite:main Jan 29, 2024
1 check passed
