BUG: fix an issue where missing files would be indexed without verification #3816

neutrinoceros · 2022-02-18T14:06:32Z

PR Summary

I think this fixes #2819
However I do not have a clear enough picture of what the code in the initial report is supposed to accomplish.

would it be preferable to log missing files
should an error be raised instead when files are missing ?

pinging @brittonsmith and @matthewturk for review

neutrinoceros · 2022-02-18T14:23:09Z

For reference I've been testing this locally with yt_astro_analysis 1.1.1 using this script (which is a self-contained version of OP's script)

import yt
from yt.extensions.astro_analysis.halo_analysis import HaloCatalog

data_ds = yt.load('snap_000.11')
hc = HaloCatalog(data_ds=data_ds, finder_method='hop')
hc.create()

as I'm writing this, the script has been running on one CPU for 30min and counting, I have no idea wether that's expected for a 500Mb dataset or if it means I'm actually stuck in an infinite loop.

edit: I used a continue instead of a break at first... so yeah I had an infinite loop. This is fixed now, and the script takes less than 5min to run

neutrinoceros · 2022-02-18T15:42:40Z

this is clearly broken. Switching to draft for now

neutrinoceros · 2022-02-18T16:16:38Z

yt/geometry/particle_geometry_handler.py

+                    df = cls(
+                        self.dataset, self.io, template % {"num": i}, fi, (start, end)
+                    )
+                except FileNotFoundError:


note that other frontends will directly benefit from this without changes because they call open or h5py.File, which both raise FileNotFoundError naturally.

neutrinoceros · 2022-02-18T16:50:09Z

@matthewturk I think the codetour watch workflow is broken (and probably always was)
Seeing that it's been directly copied from their documentation, my guess is that we never set up the required repo secret properly.

brittonsmith · 2022-02-21T15:23:20Z

I don't think this is the right solution. If we are breaking out of this loop because a file is not found, then we are ignoring data files (and hence, particles) associated with this dataset. For Issue #2819, it looks like the directory of the data files is getting lost somehow. I think that's the thing that needs to be fixed.

neutrinoceros · 2022-02-21T15:41:57Z

So the way I see it there are two solutions:

tag the issue as wontfix and close (the error message is already clear enough ?)
change this PR to add a warning that data is missing.

Personally I would still favour the second approach because I see no harm in allowing informed users to work with partial datasets. Your call :)

brittonsmith · 2022-02-23T16:02:04Z

I think I'm only now fully understanding the original issue and I'm coming around to agreeing with your solution. I agree we should allow users to operate on incomplete datasets if they want to and if they understand that that is what is happening. I'm happy with your second option of adding a warning message.

…cation

neutrinoceros · 2022-02-23T16:16:11Z

@brittonsmith there you go !

BUG: fix an issue where missing files would be indexed without verification (cherry picked from commit ec79686)

neutrinoceros · 2022-04-01T07:54:50Z

backported as #3881

Manual backport #3816 to yt-4.0.x (BUG: fix an issue where missing files would be indexed without verification)

neutrinoceros added the bug label Feb 18, 2022

neutrinoceros requested review from matthewturk and brittonsmith February 18, 2022 14:06

neutrinoceros force-pushed the hotfix_2819 branch from d46f937 to c179206 Compare February 18, 2022 15:12

neutrinoceros marked this pull request as draft February 18, 2022 15:42

neutrinoceros force-pushed the hotfix_2819 branch from c179206 to f9c9a79 Compare February 18, 2022 15:48

neutrinoceros added the code frontends Things related to specific frontends label Feb 18, 2022

neutrinoceros commented Feb 18, 2022

View reviewed changes

neutrinoceros marked this pull request as ready for review February 18, 2022 16:28

neutrinoceros force-pushed the hotfix_2819 branch from a62c662 to 1c7df4e Compare February 18, 2022 16:48

neutrinoceros mentioned this pull request Feb 21, 2022

bug in Gadget frontend #2819

Closed

neutrinoceros added 2 commits February 23, 2022 17:15

BUG: fix an issue where missing files would be indexed without verifi…

5b01b63

…cation

MNT: update codetour watch GH action

cf356a9

neutrinoceros force-pushed the hotfix_2819 branch from 1c7df4e to cf356a9 Compare February 23, 2022 16:15

brittonsmith approved these changes Feb 23, 2022

View reviewed changes

jzuhone approved these changes Feb 23, 2022

View reviewed changes

neutrinoceros removed the code frontends Things related to specific frontends label Feb 24, 2022

matthewturk merged commit ec79686 into yt-project:main Mar 31, 2022

neutrinoceros deleted the hotfix_2819 branch March 31, 2022 13:28

neutrinoceros mentioned this pull request Mar 31, 2022

BUG: fix particle indexing code tour #3880

Merged

neutrinoceros added this to the 4.0.3 milestone Mar 31, 2022

neutrinoceros pushed a commit to neutrinoceros/yt that referenced this pull request Mar 31, 2022

Merge pull request yt-project#3816 from neutrinoceros/hotfix_2819

65a3f30

BUG: fix an issue where missing files would be indexed without verification (cherry picked from commit ec79686)

matthewturk added a commit that referenced this pull request Apr 13, 2022

Merge pull request #3881 from neutrinoceros/manual_bp_3816

96a67dc

Manual backport #3816 to yt-4.0.x (BUG: fix an issue where missing files would be indexed without verification)

neutrinoceros mentioned this pull request Apr 19, 2022

BUG: fix an UnboundLocalError #3898

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: fix an issue where missing files would be indexed without verification #3816

BUG: fix an issue where missing files would be indexed without verification #3816

neutrinoceros commented Feb 18, 2022

neutrinoceros commented Feb 18, 2022 •

edited

Loading

neutrinoceros commented Feb 18, 2022

neutrinoceros Feb 18, 2022

neutrinoceros commented Feb 18, 2022

brittonsmith commented Feb 21, 2022

neutrinoceros commented Feb 21, 2022

brittonsmith commented Feb 23, 2022

neutrinoceros commented Feb 23, 2022

neutrinoceros commented Apr 1, 2022

BUG: fix an issue where missing files would be indexed without verification #3816

BUG: fix an issue where missing files would be indexed without verification #3816

Conversation

neutrinoceros commented Feb 18, 2022

PR Summary

neutrinoceros commented Feb 18, 2022 • edited Loading

neutrinoceros commented Feb 18, 2022

neutrinoceros Feb 18, 2022

Choose a reason for hiding this comment

neutrinoceros commented Feb 18, 2022

brittonsmith commented Feb 21, 2022

neutrinoceros commented Feb 21, 2022

brittonsmith commented Feb 23, 2022

neutrinoceros commented Feb 23, 2022

neutrinoceros commented Apr 1, 2022

neutrinoceros commented Feb 18, 2022 •

edited

Loading