Split a pointcloud file over several RDD records instead of having it in one single record. #12

Open
romulogoncalves opened this issue May 28, 2018 · 0 comments

romulogoncalves commented May 28, 2018

If we understood correctly, GeoTrellis creates one RDD record for each LAS/LAZ file it loads. If a data set contains some large files, this leads to unbalanced task processing, i.e., some tasks will take much longer than others.

One solution would be to split large LAS/LAZ files into multiple RDD records. PDAL provides the following filters for this: chipper, divider and splitter. We tried them in a PDAL pipeline, but when we check the number of elements in the RDD, it is still 1 in every case.
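
A pipeline of the kind described above could look roughly like the following sketch. The helper name `chipper_pipeline` and the file name `input.laz` are hypothetical placeholders; `readers.las` and `filters.chipper` (with its `capacity` option) are PDAL stage names. The sketch only builds the pipeline JSON and does not show how GeoTrellis consumes it.

```python
import json


def chipper_pipeline(las_path, capacity=5000):
    """Build a PDAL pipeline definition that reads one LAS/LAZ file and
    asks filters.chipper to split it into chips of at most `capacity` points.

    `las_path` and `capacity` are illustrative values, not taken from the issue.
    """
    return {
        "pipeline": [
            {"type": "readers.las", "filename": las_path},
            {"type": "filters.chipper", "capacity": capacity},
        ]
    }


# Serialize to the JSON text that a PDAL pipeline expects.
pipeline_json = json.dumps(chipper_pipeline("input.laz", capacity=5000), indent=2)
print(pipeline_json)
```

Run on its own, PDAL would emit several point views from such a pipeline, one per chip; the question is why the resulting RDD still holds a single element.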

Is there a way in GeoTrellis to split a LAZ file into multiple RDD records and then run a PDAL pipeline?

We could do tiling, but then how is it possible to run a PDAL pipeline on each tile?
