Question/Feature request: Make filters.randomize streamable #4195

jo-chemla · 2023-09-28T16:45:41Z

Hi PDAL community,
I'm wondering what prevents the filters.randomize from being streamable. Using this in combination with other pdal pipeline filters (head, stats, head, crop, splitter, merge, stats, etc) would allow for out-of-core operations that would process pointclouds with requirements higher than RAM capacity.

Having thatrandomize filter produce a random list of indices, and processing points in subsequent pipeline stages entry by entry (or in batches of specified size), looks like this stage could support streaming.
Best, Jonathan

The text was updated successfully, but these errors were encountered:

abellgithub · 2023-10-02T19:28:17Z

I don't understand how this would work. You could theoretically reorder points that are in a loaded chunk (but this operation is not currently supported), but this seems of little use. You will have to provide some more detail on an implementation.

jo-chemla · 2023-10-03T14:28:51Z

Hi Andrew,
It is true that even once the random indices have been computed, the entirety of the pointcloud file has to be crawled to retrieve each point (+ coords and attributes) to feed for other pipeline stages. This question should therefore more be rephrased something like:

Can we execute filters.randomize, so that it process batches of points of the input pointcloud, without ever overflowing RAM?

If so, one could then just chain two pipelines, one to use filter.randomize and produce a resulting intermediate pointcloud that could then be parsed as an input for a streamable pipeline. Thanks for the feedback!

abellgithub · 2023-10-03T15:16:34Z

This just isn't how PDAL works. We never perform random access on points in a pipeline. Access is always sequential. Random access with some file types (notably compressed files), just doesn't work well. I don't know what else to suggest.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question/Feature request: Make filters.randomize streamable #4195

Question/Feature request: Make filters.randomize streamable #4195

jo-chemla commented Sep 28, 2023 •

edited

abellgithub commented Oct 2, 2023

jo-chemla commented Oct 3, 2023

abellgithub commented Oct 3, 2023

Question/Feature request: Make filters.randomize streamable #4195

Question/Feature request: Make filters.randomize streamable #4195

Comments

jo-chemla commented Sep 28, 2023 • edited

abellgithub commented Oct 2, 2023

jo-chemla commented Oct 3, 2023

abellgithub commented Oct 3, 2023

jo-chemla commented Sep 28, 2023 •

edited