Strange square holes in slices and projections of Arepo data #3672

jzuhone · 2021-11-16T21:25:40Z

Bug report

Bug summary

Often, but not always, making slices and/or projections of Arepo data creates square/rectangle-shaped gaps in the image. See the code below for examples.

A couple of weeks ago @matthewturk and I worked on this but were not able to find the source of the gaps. We've convincingly ruled out the pixelization routines themselves for slices and projections, leading us to suspect that something has gone wrong in data selection. As you can see below, the behavior is quite general, applying to SlicePlot, ProjectionPlot, and ParticleProjectionPlot, and it can also be seen if you examine the underlying objects themselves (e.g, make a scatter plot of the particle positions in the YTSlice data).

Oddly, sometimes the gaps go away if you change the center or the width of the image (sometimes only very slightly).

Code for reproduction

The dataset used in this example can be downloaded via curl -JO http://use.yt/upload/165c65a1, but the problem appears in many of these datasets.

The code can be found in this notebook:

https://gist.github.com/jzuhone/c4b41eb06947f28e220a364713947995

Version Information

Operating System: macOS, Linux
Python Version: 3.9
yt version: 4.0.1, installed from source
Other Libraries (if applicable): N/A

The text was updated successfully, but these errors were encountered:

neutrinoceros · 2021-11-16T21:38:36Z

I'm going to triage this as a "viz" bug because it's clear that visualisations are affected, but don't hesitate to apply any more appropriate labels instead.

jzuhone · 2021-11-16T21:57:19Z

Even simpler script showing that this is entirely about data selection:

https://gist.github.com/jzuhone/c2cc2621347b2f188b3d84296970af0b

neutrinoceros · 2021-11-16T22:00:56Z

Indeed it's pretty clearly not viz that's the problem. Any idea if this bug is frontend specific yet ?

matthewturk · 2021-11-16T22:03:38Z

Yes, likely as a result of the computed/estimated smoothing length versus actual nearest-neighbor distance, is my guess.

…

On Tue, Nov 16, 2021 at 4:01 PM Clément Robert ***@***.***> wrote: Indeed it's pretty clearly not viz that's the problem. Any idea if this bug is frontend specific yet ? — You are receiving this because you were assigned. Reply to this email directly, view it on GitHub <#3672 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAAVXO6QRIFHX5FVXLGPLWDUMLICFANCNFSM5IFJLR7Q> .

jzuhone · 2021-11-16T22:06:32Z

@matthewturk but why would it manifest itself in squares like that?

matthewturk · 2021-11-16T22:07:24Z

OK, so maybe that's not it.

…

On Tue, Nov 16, 2021 at 4:06 PM John ZuHone ***@***.***> wrote: @matthewturk <https://github.com/matthewturk> but why would it manifest itself in squares like that? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#3672 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAAVXO6ENIL4ULNJWNKZEDTUMLIXFANCNFSM5IFJLR7Q> .

jzuhone · 2021-11-16T22:42:36Z

So Arepo computes the "smoothing length" by taking the equivalent sphere volume of each Voronoi cell, computing a radius from that sphere, and multiplying by a small factor. IOW, it's not a standard SPH smoothing length.

Is there anything in the selection or chunking code that assumes SPH things like kernels or number of nearest neighbors, instead of just using the snoothing length that it's given by the frontend to select on?

matthewturk · 2021-11-17T11:28:49Z

That's a good question. I looked into it and I was unable to find anything like that. I'll try to recreate my checks again today to really dive into this.

jzuhone · 2021-11-17T13:13:36Z

I plan on playing around with a couple of things to see if I can help narrow this down.

matthewturk · 2021-11-22T22:18:22Z

I used this script, which in a very uncool fashion accepts arguments via int(sys.argv[-1]) to update the index order and compare; additionally, I disabled guessing the mi1 and mi2 values by inserting a return right after self.regions.find_collisions_coarse() in particle_geometry_handler.py around line 227.

import sys
import os
import glob
import yt

for fn in sorted(glob.glob("*.ewah*")):
    print(fn)
    os.unlink(fn)

index_order = (int(sys.argv[-2]), int(sys.argv[-1]))

ds = yt.load("snapshot_250.hdf5", index_order=index_order)

_, c = ds.find_min(("PartType1","Potential"))

slc = yt.SlicePlot(ds, "z", ("gas","density"), center=c, width=(1500.0,"kpc"))
print(slc.save(f"slice_{index_order[0]}_{index_order[1]}"))

I generated these plots, which I've alt-texted with their values for index_order.

The upshot here is that I am not really sure what's happening. I will continue to investigate.

matthewturk · 2021-11-23T01:57:16Z

I have realized that one thing we have not looked at is the potential asymmetry of the particles we are missing. Specifically, do the "missing" particles tend to exist all on the positive side, all on the negative side, etc, and are they out of the slice based on just the slicing coordinate?

matthewturk · 2022-01-24T16:38:14Z

I've spent a good amount of time on this, and here is what I found to work, although I have to admit that I think it may be masking a different problem.

Here is a script, with some inline comments, that I ran:

https://gist.github.com/555124c31c389acfc95966cf1c073741

Note that I had to disable the auto-resetting of the index order to make this operate as expected, by making the conditional on line 228 of particle_geometry_handler.py always fail.

What I found was that the following patch produced correct behavior for all tested scenarios:

diff --git a/yt/geometry/particle_oct_container.pyx b/yt/geometry/particle_oct_container.pyx
index e4c05453c..3489c4902 100644
--- a/yt/geometry/particle_oct_container.pyx
+++ b/yt/geometry/particle_oct_container.pyx
@@ -852,16 +852,18 @@ cdef class ParticleBitmap:
         cdef ewah_word_type w
         this_collection = BoolArrayCollection()
         cdef ewah_bool_array *refined_arr = NULL
+        out_collection = BoolArrayCollection()
         for it1 in coarse_refined_map:
             mi1 = it1.first
             refined_arr = &this_collection.ewah_coll[0][mi1]
-            this_collection.ewah_keys[0].set(mi1)
-            this_collection.ewah_refn[0].set(mi1)
+            out_collection.ewah_keys[0].set(mi1)
+            out_collection.ewah_refn[0].set(mi1)
             buf = &it1.second
             for vec_i in range(buf.sizeInBytes() / sizeof(ewah_word_type)):
                 w = buf.getWord(vec_i)
                 refined_arr.addWord(w)
-        out_collection = BoolArrayCollection()
         in_collection._logicalor(this_collection, out_collection)
         return out_collection

But I have reason to believe that the main impact of this patch is not to update out_collection but rather to disable the setting on this_collection. I have inspected the _logicalor method and I did not find any errors (including in the order of setting values, which I was briefly concerned about, as compressed EWAH arrays have to be set in increasing order.) What I think may be happening is that by not having this_collection set in the visited areas, it doesn't set any refined values in the coarse region, thus marking the entire coarse cell as being marked. (This essentially negates the entire ghost zone refined zone checks, I believe.)

So that's where I've gotten.

jzuhone · 2022-01-25T17:44:03Z

I still find it very strange that we only seem to see it on Arepo data. Should we keep thinking about if there is some kind of special thing about it that might interact with this?

langmm · 2022-01-25T20:49:26Z

At @matthewturk's request, I am taking a look at this today/tomorrow to see if the bitmap indexing is involved. One idea I am chasing down is the treatment of the refined ghost zones for the Arepo "smoothing lengths". One consequence of estimating the voronoi cells as spheres in the hsml estimation may be that some particles' neighbors are missed in the refined ghost zones if their cells are overly oblong. If this is the cause, the only way to account for this may be to use a larger radius (multiple smoothing lengths) for expanding ghost zones for Arepo datasets.

jzuhone · 2022-01-25T21:00:02Z

Thanks @langmm!

jzuhone · 2022-02-02T00:31:25Z

@langmm were you able to look into this yet?

langmm · 2022-02-02T19:47:08Z

@jzuhone I was able to take a look, but I am still unsure of the cause. Scaling up the hsml used for selecting files/particles without adjusting the hsml used in the pixelation eliminated any gaps, but resulted in a pixelized image for a scale factor of 10. What was odd is that a scale factor of 2 resulted in more files being selected than a factor of 10 which would indicate to me that there is an error in the bitmap file selection. I will be checking those routines today.

langmm · 2022-02-07T21:01:57Z

It turns out this was a bug in the refined bitmap index creation where some coarse cells never got refined.

jzuhone assigned matthewturk and jzuhone Nov 16, 2021

neutrinoceros added the bug label Nov 16, 2021

neutrinoceros added the viz: 2D label Nov 16, 2021

neutrinoceros removed the viz: 2D label Nov 16, 2021

langmm mentioned this issue Feb 7, 2022

Fix bug in bitmap index for particle datasets #3788

Merged

2 tasks

jzuhone closed this as completed in #3788 Feb 7, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Strange square holes in slices and projections of Arepo data #3672

Strange square holes in slices and projections of Arepo data #3672

jzuhone commented Nov 16, 2021

neutrinoceros commented Nov 16, 2021

jzuhone commented Nov 16, 2021

neutrinoceros commented Nov 16, 2021

matthewturk commented Nov 16, 2021 via email

jzuhone commented Nov 16, 2021

matthewturk commented Nov 16, 2021 via email

jzuhone commented Nov 16, 2021

matthewturk commented Nov 17, 2021

jzuhone commented Nov 17, 2021

matthewturk commented Nov 22, 2021

matthewturk commented Nov 23, 2021

matthewturk commented Jan 24, 2022

jzuhone commented Jan 25, 2022

langmm commented Jan 25, 2022

jzuhone commented Jan 25, 2022

jzuhone commented Feb 2, 2022

langmm commented Feb 2, 2022

langmm commented Feb 7, 2022

Strange square holes in slices and projections of Arepo data #3672

Strange square holes in slices and projections of Arepo data #3672

Comments

jzuhone commented Nov 16, 2021

Bug report

neutrinoceros commented Nov 16, 2021

jzuhone commented Nov 16, 2021

neutrinoceros commented Nov 16, 2021

matthewturk commented Nov 16, 2021 via email

jzuhone commented Nov 16, 2021

matthewturk commented Nov 16, 2021 via email

jzuhone commented Nov 16, 2021

matthewturk commented Nov 17, 2021

jzuhone commented Nov 17, 2021

matthewturk commented Nov 22, 2021

matthewturk commented Nov 23, 2021

matthewturk commented Jan 24, 2022

jzuhone commented Jan 25, 2022

langmm commented Jan 25, 2022

jzuhone commented Jan 25, 2022

jzuhone commented Feb 2, 2022

langmm commented Feb 2, 2022

langmm commented Feb 7, 2022