Do not explicitly call gc.collect in `CompImageHDU.compressed_data` #14576

Cadair · 2023-03-24T15:01:30Z

I have a nice pathological benchmark where I load 150 files and read all the data:

Before:

$ python run_timeit.py dkist_visp_tiled_post
Using DKIST dataset (BEJZP) with shape: (4, 18, 150, 996, 2545)
FITS tile shape is (1, 256, 256) (numpy order).
Ran dkist_visp_tiled_post 5 times, average time 27.755185409799743 s

After:

$ python run_timeit.py dkist_visp_tiled_post
Using DKIST dataset (BEJZP) with shape: (4, 18, 150, 996, 2545)
FITS tile shape is (1, 256, 256) (numpy order).
Ran dkist_visp_tiled_post 5 times, average time 4.998688907800533 s

We already discuss the reason why this line exists pretty comprehensively in the docs here: https://docs.astropy.org/en/stable/io/fits/appendix/faq.html#i-am-opening-many-fits-files-in-a-loop-and-getting-oserror-too-many-open-files

I am not really sure I see the benefit of this still being here?

github-actions · 2023-03-24T15:02:10Z

github-actions · 2023-03-24T15:02:16Z

👋 Thank you for your draft pull request! Do you know that you can use [ci skip] or [skip ci] in your commit messages to skip running continuous integration tests until you are ready?

astrofrog · 2023-03-24T15:05:09Z

I agree that users who run into issues should simply call gc.collect themselves - we shouldn't call it by default as this can have a large performance penalty.

pllim · 2023-03-24T15:05:57Z

But people been relying on this indirectly for so long, so now suddenly they have to manually call gc.collect() to get back the same behavior?

astrofrog · 2023-03-24T15:08:45Z

@pllim - I'm not sure it would actually change anything for most users, we shouldn't actually have to call garbage collection explicitly and it's surprising that the code even includes this. It might have been added as a 'just in case' but the main use case it might have been intended for is opening many files in a small amount of time which is what @Cadair is doing but it seems to be hindering not helping.

pllim · 2023-03-24T15:12:28Z

Is asking over at astropy-dev mailing list necessary, just in case?

Cadair · 2023-03-24T15:21:08Z

I can't imagine that even if anyone is relying on this they know it exists, so asking wont help.

pllim · 2023-03-24T15:27:01Z

Should at least add a change log then?

Cadair · 2023-03-24T15:33:58Z

Should at least add a change log then?

Sure. I was waiting to see if people a) rejected it outright and b) that all the tests passed 😀

Cadair · 2023-03-24T15:49:39Z

For what it's worth, I just loaded 10800 FITS files with memmap enabled without this line and I didn't hit any issues. I am not sure when dask would have dropped the references to them, so I don't know how many were open at once, but it certainly wasn't keeping them around forever.

saimn · 2023-03-24T16:31:28Z

I agree we should not call gc.collect since it will have a performance penalty when processing a lot of files.
For reference, this was added in #3283 / cf51894.

For what it's worth, I just loaded 10800 FITS files with memmap enabled without this line and I didn't hit any issues

Interesting, though problems may arise only in specific configurations, e.g. did you take care of not keeping references to .data ? What the value of ulimit -n on your system ?

I guess the other case to check, where people could be impacted by this change, is reading one or a few big files, keeping / not keeping references to .data, to see if memory if released or not.

Cadair · 2023-03-24T17:30:05Z

did you take care of not keeping references to .data ?

I didn't but dask might have done.

What the value of ulimit -n on your system ?

$ ulimit -n
1024

I am not suggesting this is a comprehensive test.

astrofrog · 2023-03-24T19:10:09Z

docs/changes/io.fits/14576.feature.rst

@@ -0,0 +1,2 @@
+Do not call ``gc.collect()`` in ``CompImageHDU.compressed_data`` as it has


Specifically in the deleter right? Maybe rephrase to say when closing a ConpImageHDU since that is when people are most likely to call this indirectly?

Cadair · 2023-04-19T09:04:52Z

@astrofrog @saimn I think this is good to go?

saimn · 2023-04-19T10:43:24Z

I did a few quick tests and I'm confused, it seems we have a memory leak with the new code ?
https://github.com/saimn/misc-astro/blob/master/astropy/FITS%20compression.ipynb
Note, for dev I get the same memory usage with gc.collect or without.
Also the amount of memory that leaks seems to vary with the tiling, it's bigger with the default tiling.

pllim · 2023-04-19T11:26:27Z

I am guessing this needs to be rebased on top of #14649 and re-reviewed after that one is merged first?

saimn · 2023-04-19T16:19:25Z

Running again some tests after #14649, I don't see a difference with or without gc.collect(), so sounds good to me.

saimn

Thanks @Cadair !

github-actions bot added the io.fits label Mar 24, 2023

pllim added this to the v5.3 milestone Mar 24, 2023

pllim added the Performance label Mar 24, 2023

Cadair marked this pull request as ready for review March 24, 2023 17:33

Cadair requested a review from saimn as a code owner March 24, 2023 17:33

astrofrog reviewed Mar 24, 2023

View reviewed changes

Cadair force-pushed the no_gc_in_tiled_data branch from c0de06b to 0dbf5d5 Compare April 19, 2023 09:04

saimn mentioned this pull request Apr 19, 2023

Fix memory leak in tile decompression #14649

Merged

Cadair force-pushed the no_gc_in_tiled_data branch from 0dbf5d5 to 23420f1 Compare April 19, 2023 13:15

Do not manually call gc

7e310fc

Cadair force-pushed the no_gc_in_tiled_data branch from 23420f1 to 7e310fc Compare April 19, 2023 13:16

saimn approved these changes Apr 19, 2023

View reviewed changes

saimn merged commit 80c3854 into astropy:main Apr 19, 2023
25 checks passed

Cadair deleted the no_gc_in_tiled_data branch April 19, 2023 16:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do not explicitly call gc.collect in `CompImageHDU.compressed_data` #14576

Do not explicitly call gc.collect in `CompImageHDU.compressed_data` #14576

Cadair commented Mar 24, 2023 •

edited

github-actions bot commented Mar 24, 2023

github-actions bot commented Mar 24, 2023

astrofrog commented Mar 24, 2023

pllim commented Mar 24, 2023

astrofrog commented Mar 24, 2023

pllim commented Mar 24, 2023

Cadair commented Mar 24, 2023

pllim commented Mar 24, 2023

Cadair commented Mar 24, 2023

Cadair commented Mar 24, 2023

saimn commented Mar 24, 2023

Cadair commented Mar 24, 2023

astrofrog Mar 24, 2023

Cadair commented Apr 19, 2023

saimn commented Apr 19, 2023

pllim commented Apr 19, 2023

saimn commented Apr 19, 2023

saimn left a comment

		@@ -0,0 +1,2 @@
		Do not call ``gc.collect()`` in ``CompImageHDU.compressed_data`` as it has

Do not explicitly call gc.collect in CompImageHDU.compressed_data #14576

Do not explicitly call gc.collect in CompImageHDU.compressed_data #14576

Conversation

Cadair commented Mar 24, 2023 • edited

github-actions bot commented Mar 24, 2023

github-actions bot commented Mar 24, 2023

astrofrog commented Mar 24, 2023

pllim commented Mar 24, 2023

astrofrog commented Mar 24, 2023

pllim commented Mar 24, 2023

Cadair commented Mar 24, 2023

pllim commented Mar 24, 2023

Cadair commented Mar 24, 2023

Cadair commented Mar 24, 2023

saimn commented Mar 24, 2023

Cadair commented Mar 24, 2023

astrofrog Mar 24, 2023

Choose a reason for hiding this comment

Cadair commented Apr 19, 2023

saimn commented Apr 19, 2023

pllim commented Apr 19, 2023

saimn commented Apr 19, 2023

saimn left a comment

Choose a reason for hiding this comment

Do not explicitly call gc.collect in `CompImageHDU.compressed_data` #14576

Do not explicitly call gc.collect in `CompImageHDU.compressed_data` #14576

Cadair commented Mar 24, 2023 •

edited