[MRG] IS float #1720

darcymason · 2022-10-26T19:54:28Z

Describe the changes

Closes #1661. Introduces new ISfloat class which is used if a non-integer float is passed to IS and settings allow it.

Tasks

Unit tests added that reproduce the issue or prove feature is working
Fix or feature added
Code typed and mypy shows no errors
Documentation updated (if relevant)
- No warnings during build
- Preview link (CircleCI -> Artifacts -> doc/_build/html/index.html)
Unit tests passing and overall coverage the same or better

codecov · 2022-10-26T20:01:54Z

Codecov Report

Merging #1720 (7e09eac) into master (a8be738) will increase coverage by 0.01%.
The diff coverage is 97.22%.

❗ Current head 7e09eac differs from pull request most recent head cb29fc4. Consider uploading reports for the commit cb29fc4 to get more accurate results

@@            Coverage Diff             @@
##           master    #1720      +/-   ##
==========================================
+ Coverage   97.58%   97.60%   +0.01%     
==========================================
  Files          66       66              
  Lines       10744    10769      +25     
==========================================
+ Hits        10485    10511      +26     
+ Misses        259      258       -1

Impacted Files	Coverage Δ
pydicom/valuerep.py	`99.28% <94.11%> (-0.14%)`	⬇️
pydicom/config.py	`98.03% <100.00%> (+0.09%)`	⬆️
pydicom/dataset.py	`99.06% <100.00%> (+<0.01%)`	⬆️
pydicom/filebase.py	`99.19% <0.00%> (+1.61%)`	⬆️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

darcymason · 2022-10-26T20:35:16Z

pydicom/dataset.py

+            config.strict_reading() if suppress_invalid_tags
+            else nullcontext()
+        )
+        with context:


Allowing ISfloat by default caused some json tests to fail. Added this strict context to try to keep previous behavior.

I have some trouble understanding the logic here: if suppress_invalid_tags is set, we raise an exception, and will be cught and ignored here? I'll probably have another look tomorrow with a fresh brain...

I checked the failing test (test_suppress_invalid_tags_with_failed_dataelement) - I wrote that, and I think it just should be changed to use another invalid value that would still raise, for example:

ds[0x00082128] = RawDataElement( Tag(0x00082128), 'IS', 4, b'5:25', 0, True, True ) with pytest.warns(UserWarning): ds_json = ds.to_json_dict(suppress_invalid_tags=True) assert "00082128" not in ds_json

and revert the change here.

...I think it just should be changed to use another invalid value that would still raise

I thought about that too, but that would still change existing behavior for IS - ones that did not get written before would now be accepted and written out. Probably not a big deal, but still something a bit different. I'm not sure just skipping values in the output is the best idea anyway. I'll think about this a bit before this is merged. I'd still like to have a "bigger picture" in mind for how to handle settings in v3.X... I'll start an issue and post some thoughts on that in the coming days.

but that would still change existing behavior for IS - ones that did not get written before would now be accepted and written out

That is true, but I assumed that this is a change we want to make. Though maybe I haven't thought this through... I will wait your thoughts then.

Having a look at this again ... This change has no effect (nullcontext()) for the default case where suppress_invalid_tags is False. So I think I was right, if suppress... is True (as seen only in tests probably at the moment), then we will catch the exception and ignore it, as is currently done (the new ISFloat is not allowed with strict_reading() turned on).

It is still messy, but I'd rather push real fixes to this messy config (and all the other messy config) to v3.0.

Well, I leave the decision to you. I find it a bit ugly, but you are probably right - better defer the real fix to 3.0, where behavior changes are expected.

darcymason · 2022-10-26T20:36:10Z

pydicom/tests/test_valuerep.py

+        bin_elem = b"\x18\x00\x52\x11\x04\x00\x00\x0014.5"
+        with BytesIO(bin_elem) as bio:
+            ds = read_dataset(bio, True, True)
+        assert isinstance(ds.Exposure, ISfloat)


Not strictly necessary to go back to binary like this rather than just creating an IS, but I thought it a good idea in case of future hooks into reading invalid values.

darcymason · 2022-10-26T20:54:32Z

pydicom/tests/test_valuerep.py

+            _ = IS("14.5", validation_mode=config.RAISE)
+        with pytest.raises(TypeError):
+            _ = IS(14.5, validation_mode=config.RAISE)
+


This seems a bit odd to me to have two different Errors for the same value, but that is the way it is done currently. Perhaps to be revisited for the grand redesign of validation settings?

I remember that I was not happy about this either, though forgot why I didn't change it...

In part I guess there are two branches, so to speak, because some are by string processing (regex) and others are by value. Another thing to think about in harmonizing validation.

pydicom/config.py

mrbean-bremen · 2022-10-28T19:48:45Z

pydicom/dataset.py

+            config.strict_reading() if suppress_invalid_tags
+            else nullcontext()
+        )
+        with context:


I have some trouble understanding the logic here: if suppress_invalid_tags is set, we raise an exception, and will be cught and ignored here? I'll probably have another look tomorrow with a fresh brain...

pydicom/tests/test_json.py

mrbean-bremen · 2022-10-28T19:50:55Z

pydicom/tests/test_valuerep.py

+        bin_elem = b"\x18\x00\x52\x11\x04\x00\x00\x0014.5"
+        with BytesIO(bin_elem) as bio:
+            ds = read_dataset(bio, True, True)
+        assert isinstance(ds.Exposure, ISfloat)


mrbean-bremen · 2022-10-28T19:52:20Z

pydicom/tests/test_valuerep.py

+            _ = IS("14.5", validation_mode=config.RAISE)
+        with pytest.raises(TypeError):
+            _ = IS(14.5, validation_mode=config.RAISE)
+


I remember that I was not happy about this either, though forgot why I didn't change it...

mrbean-bremen · 2022-10-28T19:55:25Z

pydicom/valuerep.py

+        # If a string passed, then store it
+        if isinstance(val, str):
+            self.original_string = val.strip()
+        elif isinstance(val, (IS, ISfloat)) and hasattr(val, 'original_string'):


I recently learned that black uses 88 instead of 79 characters as a default maximum line length, and I tend to agree with it... I think we had this discussion before, and it is not really important, though.

I was going to raise this point as well. I find 79 very limiting, it would really help to have even 9 more characters, and useful to just let black do its thing on new code before posting a PR.

We can change to 88 in a separate PR maybe, and update the documentation for contributors.

Yes, of course - this was not meant for this PR, just wanted to mention it.

darcymason added 3 commits October 25, 2022 16:36

Add failing test for IS float in a file

8c6ad1d

Test float IS with/without strict checking

d1fb0a9

Change JSON conversion to strict reading

5dcda34

darcymason added 2 commits October 26, 2022 16:09

mypy fixes

80d8d3c

Update docs and release notes

7e09eac

darcymason changed the title ~~[WIP] IS float~~ [MRG] IS float Oct 26, 2022

darcymason commented Oct 26, 2022

View reviewed changes

Add test coverage of original_string

cb29fc4

darcymason commented Oct 26, 2022

View reviewed changes

mrbean-bremen reviewed Oct 28, 2022

View reviewed changes

mrbean-bremen approved these changes Nov 14, 2022

View reviewed changes

darcymason merged commit 01ae3fc into master Nov 14, 2022

darcymason deleted the ISFloat branch November 14, 2022 19:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MRG] IS float #1720

[MRG] IS float #1720

darcymason commented Oct 26, 2022 •

edited

codecov bot commented Oct 26, 2022 •

edited

darcymason Oct 26, 2022

mrbean-bremen Oct 28, 2022

mrbean-bremen Oct 29, 2022

darcymason Oct 29, 2022

mrbean-bremen Oct 29, 2022

darcymason Nov 14, 2022

mrbean-bremen Nov 14, 2022

darcymason Oct 26, 2022 •

edited

mrbean-bremen Oct 28, 2022

darcymason Oct 26, 2022

mrbean-bremen Oct 28, 2022

darcymason Oct 29, 2022

mrbean-bremen Oct 28, 2022

mrbean-bremen Oct 28, 2022

mrbean-bremen Oct 28, 2022

mrbean-bremen Oct 28, 2022

darcymason Oct 29, 2022

mrbean-bremen Nov 14, 2022

[MRG] IS float #1720

[MRG] IS float #1720

Conversation

darcymason commented Oct 26, 2022 • edited

Describe the changes

Tasks

codecov bot commented Oct 26, 2022 • edited

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

darcymason Oct 26, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

darcymason commented Oct 26, 2022 •

edited

codecov bot commented Oct 26, 2022 •

edited

darcymason Oct 26, 2022 •

edited