Feature/xyz routes #88

iacopoff · 2021-07-26T09:25:49Z

So, the service is quite simple, however there are few design choices that may not be ideal:

Data projection: The service's tiling system uses morecantile, therefore one of the available CRS projections can be passed to the tiling function. The input data projection should match that CRS, however, in case there is a mismatch, the service does not try to re-project the data as it expects the users to take care of that. If that sounds OK, then I think there should be a check that raises an error if the CRS don't match.
Data validation: For simplicity the service requires that data spatial dimensions are named 'x' and 'y'. The function that checks that is defined in the router function. Is there a better way (or place) to implement a data input validation functionality?

benbovy

Sorry for the wait @iacopoff. I've played with the example notebooks and it looks very cool!

I left some minor comments.

The notebooks are very helpful, but I'm wondering if this repository is the right place for it. We should try to keep simple the management of the docs and the dependencies. Xpublish is front-end agnostic and FastAPI already provides good tools for documenting the API endpoints.

xpublish/rest.py

xpublish/routers/xyz.py

iacopoff · 2021-08-05T19:24:26Z

Hi @benbovy, I think I have opened this PR too early as I am kind of rethinking the design as I am understanding better xpublish and thinking also how to accommodate the future wmts router. However, it is still very helpful to get your comments at this stage.

Regarding the notebooks, I can definitely delete them. The docstrings will be enough.

I agree with you that a sort of factory class, such as Titiler Router factory, will probably solve the problem of passing options to the individual routers or add data validation logic.

There are 2 sets of parameters that are required by tiling and image production:

Tiling requires a CRS, unless we default to the PseudoMercator.
Image creation requires colours mapping parameters. For the simpler xyz router datashader is enough. For the wmts router I would like to be able to create a colour bar as well to add to the getCapabilities.xml, if possible. That means I need to use, for example, matplotlib.

Regarding point 2. I am not sure whether you think that is outside the scope of this project.

Thanks!

xpublish/routers/__init__.py

benbovy · 2021-08-06T22:13:13Z

xpublish/routers/xyz.py

+xyz_router = XYZRouter()
+
+
+@xyz_router.get("/tiles/{layer}/{time}/{z}/{x}/{y}")


Perhaps we could make less assumptions here about the structure of the dataset. I'd rather see something like /tiles/{var}/{z}/{x}/{y} and allow setting time or any other extra dimension(s) as a query parameter.

Allowing flexible image formats /tiles/{var}/{z}/{x}/{y}.{format} would be great.

(If later xarray supports multiscale datasets pydata/xarray#4118 pydata/xarray#5376 it will be nice to have /tiles/{var}/{z}/{x}/{y}@{scale}x too)

@benbovy

Perhaps we could make less assumptions here about the structure of the dataset. I'd rather see something like /tiles/{var}/{z}/{x}/{y} and allow setting time or any other extra dimension(s) as a query parameter.

yes it makes sense to be minimalist in the assumption about data structure.
In general the get_tile operation should be used by every tile services (XYZ, WMTS...). Different tile services can then implement a certain logic on how other dimensions (time included) can be managed.

These are some references about time dimension in WMTS:

https://wiki.earthdata.nasa.gov/display/GIBS/GIBS+API+for+Developers#GIBSAPIforDevelopers-OGCWebMapTileService(WMTS)

http://demo.geo-solutions.it/share/wmts-multidim/wmts_multidim.html

http://docs.opengeospatial.org/per/16-042r1.html

@benbovy which other image formats do you think it should support?

benbovy · 2021-08-06T22:28:57Z

I started adding router factory classes in #89.

Regarding data validation, I think that it would be better to expose the x and y dimensions as query parameters. Using a router factory class, it will be possible to override the default values (e.g., lon and lat).

Regarding the color mapping parameters, I'm not sure to know why matplotlib would be required unless we want to support by default all colormaps available in matplotlib (I'd be OK with that). There is also colorcet on which datashader depends.

xpublish/routers/xyz.py

davidbrochart · 2021-08-07T06:37:40Z

It looks very promising @iacopoff! In fact, I'd like to base xarray-leaflet on xpublish when this PR gets in, as there is a lot of overlap.
In xarray-leaflet we use the Jupyter Server to serve the tiles, but I'm thinking this is not a great idea. One of the drawbacks is that the server runs in another process than the kernel, which means we have to poll to check if the tile files are generated (by the kernel) before the server can send them. I changed that in xtrude, where I use aiohttp to run a server inside the kernel, in an async task.
@benbovy I'm not sure how xpublish works, is it like a server that runs in a blocking way, or does it allow to run async code concurrently? In the later case, that would work with xarray-leaflet.
I recently added support for a colorbar in xarray-leaflet, based on matplotlib, but I think it is a bit overkill for just using colormaps, and colorcet looks great.
It looks like you're also using rioxarray to do the reprojection, but on the whole data array. In xarray-leaflet we reproject each tile individually. I was wondering if you expect the data array to be chunked, otherwise it could take up a lot of memory/CPU. Or maybe it is done lazily on the tile slices?
Also, in xarray-leaflet we have hooks at different levels of the data pipeline to allow for data transformation. The default transformation coarsens the data in order to get approximately the same resolution as the tile, and thus have an efficient reprojection. This is illustrated in this static map example. But you can customize the transformations as shown in this example of a dynamic map (you might want to run this one locally as sometimes you get a very slow machine in Binder, and dynamic maps are more CPU intensive). Would it make sense to have support for that in this PR?
Anyways, great work and looking forward to it!

benbovy · 2021-08-08T20:48:37Z

Thanks for chiming in @davidbrochart.

I agree that there is quite some overlap with xarray-leaflet, at least for everything between an xarray dataset (it may be chunked data) and the created image tiles (projection / transform, mapping options, etc.). It would be great if somehow we could join efforts on this.

Since xpublish is front-end agnostic it might make sense to implement this functionality here. In fact, I think we could probably pick up many things already implemented in xarray-leaflet since it is in a more advanced stage of development regarding the generation of the tiles.

However, I'm not sure how easy / hard would be for xarray-leaflet to rely on xpublish for serving the tiles:

A fastapi application like xpublish usually runs via uvicorn in a blocking way, although it seems possible to run it in a separate thread: Uvicorn cannot be shutdown programmatically encode/uvicorn#742 (comment). I haven't found anything yet about running it as an async task. Fastapi's path operation functions may be run asynchronously but I don't have experience enough with doing both multi-threading and async programming in Python to know if it's working well together.
Xpublish a-priori serves a static collection of xarray Datasets, but we could probably imagine passing a mutable dictionary to xpublish.Rest. If we are running the server in a separate thread I guess it should be OK since the dictionary should never be updated in the server thread.
In Xpublish tiles are both generated and served on the server side. I'm not sure what would be the best approach for hooks like in xarray-leaflet. Serializing custom transform functions and passing them through the REST API? Or execute the transformations in the main kernel thread and give the transformed dataset to xpublish?

benbovy · 2021-08-08T20:50:28Z

Also, in xarray-leaflet we have hooks at different levels of the data pipeline to allow for data transformation. The default transformation coarsens the data in order to get approximately the same resolution as the tile, and thus have an efficient reprojection. [...] Would it make sense to have support for that in this PR?

Yes I think it makes a lot of sense (at least for the default transformation), in this PR or in a follow-up PR.

benbovy · 2021-08-08T20:56:12Z

Xpublish a-priori serves a static collection of xarray Datasets, but we could probably imagine passing a mutable dictionary to xpublish.Rest.

Alternatively, we could create one server per dataset. Not sure it makes sense to have multiple (many!) threads with each their own event loop, though.

iacopoff · 2021-08-12T10:39:12Z

Hi @davidbrochart, thanks for your inputs!

It looks like you're also using rioxarray to do the reprojection, but on the whole data array. In xarray-leaflet we reproject each tile individually. I was wondering if you expect the data array to be chunked, otherwise it could take up a lot of memory/CPU. Or maybe it is done lazily on the tile slices?

The xyz service so far assumes the dataset projection == map projection, this is required for the tiling to work. In few words the user should take care of the reprojection outside xpublish. I thought this was good to reduce dependencies and to keep the code simple. Regarding chunking, the rioxarray reproject method persists the data in memory, so either you save the reprojected data to disk and then read it again in chunks or indeed it may not fit in memory if the dataset is large.
At the moment if your dataset is chunked then xpublish will persist only once the image is created by datashader shade so on the individual tiles. Do you think the reprojection functionality should be included in xpublish at the tiling level?

Also, in xarray-leaflet we have hooks at different levels of the data pipeline to allow for data transformation. The default transformation coarsens the data in order to get approximately the same resolution as the tile, and thus have an efficient reprojection.

This is a nice functionality, I was thinking that Datashader also supports some of the transformation you probably are referring to by sampling the raster in The Datashader's raster. I would probably keep this development for the next PR, but it is good to think about it now.

I recently added support for a colorbar in xarray-leaflet, based on matplotlib, but I think it is a bit overkill for just using colormaps, and colorcet looks great.

Also to reply to @benbovy, I think that for a simple xyz service we don't need a colorbar. However, if we are going to develop a WMTS, then it would be good to add a colorbar and legend and it seems that datashader alone can't do that How can I get legends and colorbars for my Datashader plot?¶. So this may still be something for a later PR?

@benbovy thanks for developing the router factory! I will have a look at it.
Regarding the server I think it makes sense to make it non-blocking. From my side I think this may need a bit more research on how to do it properly, but I guess it fits well within the scope of this PR?

benbovy · 2021-08-12T10:52:57Z

Regarding the server I think it makes sense to make it non-blocking. From my side I think this may need a bit more research on how to do it properly, but I guess it fits well within the scope of this PR?

This could be done in another PR I think. I'm not sure this would be really useful in most of uses cases, but if that makes sense for xarray-leaflet and/or other use cases we may add a run_in_thread option (defaults to False) to xpublish.Rest.serve() and implement the solution that I mention in my previous comment.

iacopoff · 2021-09-09T13:01:15Z

Hi @benbovy, I have drafted the xyz-router-factory following #89. I think it works well particularly for setting optional parameters as you suggested.

Now, I guess this PR will eventually go through after #89, right?

davidbrochart · 2021-11-08T12:49:52Z

A fastapi application like xpublish usually runs via uvicorn in a blocking way, although it seems possible to run it in a separate thread: Uvicorn cannot be shutdown programmatically encode/uvicorn#742 (comment). I haven't found anything yet about running it as an async task.

It is possible to run a FastAPI application as an async task using uvicorn, since uvicorn.Server.serve() is async (see encode/uvicorn#541 (comment)).

In Xpublish tiles are both generated and served on the server side. I'm not sure what would be the best approach for hooks like in xarray-leaflet. Serializing custom transform functions and passing them through the REST API? Or execute the transformations in the main kernel thread and give the transformed dataset to xpublish?

Since the server will run in the Jupyter kernel as an async task, passing the transformation functions to the FastAPI app shouldn't be a problem, right?

iacopoff · 2021-11-23T19:03:52Z

@davidbrochart, I was experimenting a bit trying finding ways for running uvicorn as non-blocking server and I came up with two options:

The first is to run the server.serve() in a thread, which can be shutdown with the companion method shutdown .
The second is to return a corutine object

I am not familiar enough with async programming in Python, maybe you could tell if the second option is what you would need to make it work in xarray-leaflet.

I think it is possible to pass the transformation functions, they could be defined as class parameters, like in the xyz router factory class.

davidbrochart · 2021-11-24T09:16:41Z

maybe you could tell if the second option is what you would need to make it work in xarray-leaflet.

Yes, the second option would be best.

iacopoff · 2021-12-01T23:24:18Z

Hi @benbovy and @davidbrochart, in the last few days I have progressed a bit on the xyz router development, as some time in the future I will need this feature for another project.
However, I have been working in another branch that benefits from the refactoring in #89 .

I think this current PR should be closed in favour of xyz-router-factory.
Let me know what are your thoughts on this matter.

There are two main improvements in this xyz service:

Transformers which are callbacks that can be passed as parameters in the instantiation of the XYZFactory class.
Renders that take care of the tiles' colour mapping. There are two, one based on datashader (default) and the other based on matplotlib.

I have created a repo with some notebooks that show the usage of transformers (where I basically follow @davidbrochart's dynamic.ipynb ) and renders.

Any feedback is much appreciated!

thanks

davidbrochart · 2021-12-02T12:04:10Z

Hi @iacopoff, looks like https://github.com/iacopoff/xpublish-example/tree/main/notebooks doesn't exist.

iacopoff · 2021-12-02T12:25:49Z

@davidbrochart ops, it was set to private, I have changed to public :)

davidbrochart · 2021-12-05T20:37:56Z

I looked at the xyz_server-tiff.ipynb and xyz_client-tiff.ipynb notebooks, this looks good! You've basically reimplemented xarray-leaflet.
I'm also interested in using xpublish as a jupyverse plugin. Since both use FastAPI, this is a good match. That would allow us to create a JupyterLab extension for Zarr visualization, in 2D using Leaflet or in 3D using deck.gl (the equivalent of xarray-leaflet and xtrude).

benbovy · 2021-12-06T15:13:08Z

@iacopoff this looks great indeed! I should find some time to finish #89, so that we can then merge your work on a XYZ router into the main branch!

@davidbrochart jupyverse looks very interesting.

iacopoff · 2022-02-10T21:06:55Z

Hey @benbovy, any chance that you will have a look at #89?

benbovy · 2022-02-21T21:56:39Z

Hey @iacopoff, sorry for my long absence here! Yeah I should definitely find some time to finish #89. I need to check but I think it is mostly done.

maif added 5 commits July 22, 2021 16:20

first commit xyz router

a3b60ab

xyz example

1b2ee9f

example xyz

920ac17

added xyz example notebook

7ad5dcd

first test with time dimension in endpoint

0a97331

benbovy reviewed Aug 5, 2021

View reviewed changes

xpublish/rest.py Outdated Show resolved Hide resolved

xpublish/routers/xyz.py Outdated Show resolved Hide resolved

xpublish/routers/xyz.py Outdated Show resolved Hide resolved

maif added 3 commits August 5, 2021 19:15

refactor to allow for wmtsrouter

7510660

updated example notebook

a8c6226

update ex notebook

73e451c

benbovy mentioned this pull request Aug 6, 2021

Add router factory classes #89

Closed

benbovy reviewed Aug 6, 2021

View reviewed changes

xpublish/routers/__init__.py Outdated Show resolved Hide resolved

benbovy reviewed Aug 6, 2021

View reviewed changes

xpublish/routers/xyz.py Outdated Show resolved Hide resolved

iacopoff and others added 3 commits August 10, 2021 09:23

removed xyzrouter example notebooks

92441dd

fix import error

b0dc9d0

fix import 2

e28a6e8

iacopoff added 3 commits August 23, 2021 14:17

add format option and caching

683c572

black

adf60fe

x,y as query parameters

e116f19

benbovy mentioned this pull request Aug 30, 2021

Switch to pyproj for new morecantile major release developmentseed/morecantile#62

Closed

benbovy mentioned this pull request Sep 29, 2021

Reuse maps as a jupyter notebook (ipy)widget carbonplan/maps#15

Open

norlandrhagen mentioned this pull request Sep 30, 2021

Support nested collections of datasets (datatree) #92

Open

abarciauskas-bgse mentioned this pull request Jan 6, 2022

Document next steps abarciauskas-bgse/maps#5

Open

davidbrochart mentioned this pull request Jun 28, 2022

Moving xarray-leaflet to xarray-contrib xarray-contrib/xarray-contrib#4

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/xyz routes #88

Feature/xyz routes #88

iacopoff commented Jul 26, 2021

benbovy left a comment •

edited

iacopoff commented Aug 5, 2021 •

edited

benbovy Aug 6, 2021

iacopoff Aug 12, 2021

iacopoff Aug 20, 2021

benbovy commented Aug 6, 2021

davidbrochart commented Aug 7, 2021

benbovy commented Aug 8, 2021

benbovy commented Aug 8, 2021

benbovy commented Aug 8, 2021

iacopoff commented Aug 12, 2021 •

edited

benbovy commented Aug 12, 2021

iacopoff commented Sep 9, 2021

davidbrochart commented Nov 8, 2021

iacopoff commented Nov 23, 2021

davidbrochart commented Nov 24, 2021

iacopoff commented Dec 1, 2021

davidbrochart commented Dec 2, 2021

iacopoff commented Dec 2, 2021

davidbrochart commented Dec 5, 2021

benbovy commented Dec 6, 2021

iacopoff commented Feb 10, 2022

benbovy commented Feb 21, 2022

		xyz_router = XYZRouter()


		@xyz_router.get("/tiles/{layer}/{time}/{z}/{x}/{y}")

Feature/xyz routes #88

Are you sure you want to change the base?

Feature/xyz routes #88

Conversation

iacopoff commented Jul 26, 2021

benbovy left a comment • edited

Choose a reason for hiding this comment

iacopoff commented Aug 5, 2021 • edited

benbovy Aug 6, 2021

Choose a reason for hiding this comment

iacopoff Aug 12, 2021

Choose a reason for hiding this comment

iacopoff Aug 20, 2021

Choose a reason for hiding this comment

benbovy commented Aug 6, 2021

davidbrochart commented Aug 7, 2021

benbovy commented Aug 8, 2021

benbovy commented Aug 8, 2021

benbovy commented Aug 8, 2021

iacopoff commented Aug 12, 2021 • edited

benbovy commented Aug 12, 2021

iacopoff commented Sep 9, 2021

davidbrochart commented Nov 8, 2021

iacopoff commented Nov 23, 2021

davidbrochart commented Nov 24, 2021

iacopoff commented Dec 1, 2021

davidbrochart commented Dec 2, 2021

iacopoff commented Dec 2, 2021

davidbrochart commented Dec 5, 2021

benbovy commented Dec 6, 2021

iacopoff commented Feb 10, 2022

benbovy commented Feb 21, 2022

benbovy left a comment •

edited

iacopoff commented Aug 5, 2021 •

edited

iacopoff commented Aug 12, 2021 •

edited