-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AGU 2016 Poster #40
Comments
I added an image with catchments and streams, along with graphs of (real) streamflow and ET data from August. The highlighted stream and the graph lines use the same cornflower blue as in the simple geometry image to the left. The simple geom images use similar colors to the USGS and NOAA logos in the top left, so I used orange for the catchments to be similar to the UT logo in the top right. Orange is also a complementary color to blue so it makes the image pop. I suggest using a color other than black for the leader lines, graph outlines, and map outline, but that can come later. How does that look for the middle box? What else do you want in there? Another example, some text, or make the existing example bigger? What else do you need help with? I was thinking of filling in text for "What is a simple geometry" next. |
The use of 'instance' should be explained some where else on the slide. I have a hard time describing 'instance' versus 'element'. Maybe... "each simple feature is an instance that we describe with element variables" |
@twhiteaker: Thanks for putting the image together. It looks good, and I agree the color combination is nice. I think we should add a point example (stream gauge, infrastructure). With that, this should be sufficient for a real-world example. Is it possible to put values on the streamflow and evapotranspiration plots? Makes it look more "realistic". Not necessary if difficult.
@twhiteaker: Yes, please tackle the simple geometry section. I will work on the text in the other sections.
@dblodgett-usgs: Graphical layout is excellent with the highlighting and arrows + boxes. I thought we were going to link the CDL example with hydrologic catchment data used in the data graphic? No big deal as this looks sufficient as an explanatory graphic. I think we should try and work in a multi-geometry example here as this is a confusing bit. Is that simple for you to do with your graphic? I noticed the We should stick the full Bull Creek CDL with data variables below the hydrologic catchment graphic in any case. I'll add that. It will also fill out the space I expect.
I'm not sure we should adopt the DSG lingo if we are trying to move towards OGC. I think |
Are we trying to move toward OGC? I would much rather focus on CF adoption. Yeah. Instances are the features, so any variable defined on the instance dimension only is feature attribute data. Elements are those variables defined on temporal or other element dimensions as well as the instance dimension. |
I'll work up a multipolygon example if you guys think it's not going to be too confusing. I was sticking to a hole just to make it a simple demonstration of the contiquous_ragged_dimension details. |
I think we should provide crosswalks with OGC at all times at the very least. CF adoption is most important but that doesn't mean compromising unnecessarily - using instance/element is not that much of a compromise.
I have a couple questions regarding the use of instance identifiers. We can talk about them at a later date, but I am trying to wrap my head around this approach.
I'm looking more closely at your example. I just now noticed that it is for a holed polygon and not a polygon on green background. 😶 With that in mind, I don't think we need a multi-polygon example since you are demonstrating the use of break values. A couple other questions:
|
By my understanding of the DSG spec, this would be handled by putting things in separate files. The reason for this is that the 'featureType' attribute is global. Having multiple featureTypes present in the same file would add quite a bit of complexity. For the purposes of our CF 1.* compatible proposal, I think we should probably stick to that model. That said, I do think putting the 'geometry_type' attribute in the coodinate_index rather than the global attributes, is a good idea to allow for multiple geometry_types in a single file (e.g. watersheds polygons and their associated outlet locations).
They don't HAVE to be strings. This is a common practice though and is a nice way to embed identifiers of any type generically rather than requiring coercion to int or some other data type. The code I've worked up uses strings but I'm hoping to loosen that up as I work forward.
Good call. I missed this in the spec draft. Will add it. While I'm thinking about it, for the contiguous ragged array indexing, I found it helpful to make a distinction between things that index into the contiguous_ragged_dimension and things that index into the coordinate dimension. In my code I've named variables with 'ind' for things in the contiguous ragged dimension and 'coord' in the coordinate dimension. My naming was arbitrary. My point here is that the distinction between the two kinds of indexing is really critical and is pretty easy to stumble over unless you are really explicit about how you talk about them. Just something to think about in the poster. |
@dblodgett-usgs what do you think of adding x- and y-axes to the CDL example geometry so that users can more easily find the coordinates described in the CDL? |
Good idea. But not super easy to do. I just screenshot a GIS rendering of the shapefile. Could we hack it in by hand? |
If you send me the shapefile I could do this in ArcGIS. |
Here it is. |
Added axis labels to the graphs. Is the font size (10) too small? I didn't think the labels were important enough to make them as big as other fonts on the poster. |
Thanks for the explanations, @dblodgett-usgs.
I would really, really, really like to make the spec compatible with different geometry counts per data/element variable. It's probably best to avoid the issue with the poster directly provided the point examples have the same count as the polygons and stream segments. If we do not want to propose this, we should at least make sure that it can be proposed during the next "version". It may be as easy as adding an And, yes, definitely keep the geometry type out of the global attributes.
Interesting to think about. The Python code uses an object to translate in and out of CRAs and mostly relies on variable-length unless reading/writing. |
@dblodgett-usgs I added coordinate grid to the CDL polygon screenshot. How does it look? |
Small font is fine. There are ways to read it if someone is desperate. Looks like real data now. Thanks! 😄 |
FYI, Grid labels in CDL polygon screenshot are Arial 24, RGB (110, 110, 110) |
Added simple geometry text. I talk about "features" in there, so we may want to harmonize that with whatever you all decide to use for feature/instance/element. There's a bullet about what multiparts are for. Not sure if this is important enough to get a bullet, but I think it's something the CF-metadata readers were a little confused about. |
Added some fake points to the catchment/river screenshot. Soil moisture is from NLDAS. |
Since the graphs look like real data now, I added the data source just under the graph title. |
👍 |
Added Bull Creek CDL to the poster. It captures multiple geometries with time-varying data variables. It differs slightly from @dblodgett-usgs's CDL which uses instance identifiers. I think it's okay to have the different approaches. We can use these examples when deciding on draft spec. It's open for editing now of course. @dblodgett-usgs: Were you planning to add text for the CRA v. VLen? I think you moved your CDL graphic around a bit. P.S. Does anyone know the CF standard names for streamflow, evapotranspiration, and soil moisture? |
I think it would be helpful to keep the bull creek example 'conceptual' and give a more basic netcdf3 example on the poster. I've got a list of things to comment about the Bull Creek CDL, but not sure that's worth providing right now since this is a NetCDF-4 VLEN example. |
I could draft the text for CRA/VLEN, but think one of you might be better. I'm not familiar with VLEN at all since I'm focused on a CF1.* spec addition. |
The Bull Creek CDL has upstream node of river segments instead of the three fake soil moisture stations. I would also remove the GNIS_Name and AreaSqKm variables to simplify things. Ah, and then there's Dave's comment above about using a more basic example. I suggest:
|
I think adding a section in the top left briefly summarizing what we're doing would be a nice lead in the story that unfolds naturally from top left to bottom right. Otherwise, the poster doesn't seem to inform the user of what we're doing until bottom middle. This would require some shuffling of the sections around. |
@twhiteaker I'd be happy to do multipolygon for the example. I'll make you the shapefile and switch the CDL in a bit. Should we not do hole then? |
Either a hole or a multipart is good for demonstrating break values. A second geometry is good for demonstrating the coordinate index stop. Let's start with a hole and a second geometry. If there's room and the poster and time, I might play with adding a second part to the first geometry. I don't think it would make the CDL too complex, but I don't think it's vital either. |
@dblodgett-usgs change -2 to -1 in your data. |
Jeez I'm lazy. Ok, here's what I'm suggesting. Also, I took out the hole break value since no holes. And I fixed the coordinates so that they were all anticlockwise...by hand, so hopefully it's right.
|
After playing with several colors for highlighting, I didn't find any of them harmonious with the rest of the poster. Color might also be confusing since we use color to associate sections of CDL with features in the map in the other CDL example. In the end I just lightened the black a bit. |
Opps... @twhiteaker - My code is fine, the WKT I created that gets read in is encoded wrong. The second polygon is encoded as a hole that is outside the first polygon! |
Updated a few things. Here's some better cdl created by this script: https://github.com/dblodgett-usgs/NCDFSG/tree/master/demo
|
Oh, and 👍 to the grey highlighting! |
Thanks for the new CDL, @dblodgett-usgs. After reading #44 (comment), I wondered about Why is All of your rings are clockwise. Either reverse them or use |
OK, thanks for the in-depth review here, @twhiteaker. On the standard name, the My code to add instance data doesn't handle units yet. It just takes a table (DBF-like) and dumps it in NetCDF. We can remove that for the poster. I also hadn't worked on ring order yet. TBH, I don't even know how to programatically check if they are clockwise or counterclockwise. I have just been naively dumping them in NetCDF files and reading them back out. I'll look into this before next week. dblodgett-usgs/NCDFSG#4 |
To me,
|
Got my ring order issue fixed. I regenerated the CDL and updated the poster. |
No major additions on my end! Have at it. |
A note: I don't think we should use |
My interpretation of axis is that it's a declaration of the axis type to be interpreted when you look at the 'coordinates' of a given data variable. You may be right though. |
From @bekozi
Get's a 👍 from me. |
I just realized I need to print this tomorrow morning! YIPES!!! I really don't want to work tonight if I can help it. Any chance you guys can help finalize? |
I'm tied up today. I might have time tonight, but best chance is tomorrow morning. Can you post back say around 4:30pm Central with what still needs to be done? |
Yeah. Let me move stuff around and give it a last read. |
My editing and review is complete. Spacing forced swapping Justification... and Storing... from what I proposed above. I think it looks pretty good! |
I fixed some typos. I changed standard_name to long_name for evapotranspiration. I think it's good enough. But here are some other lingering things to ignore unless you are a glutton for last minute punishment. Is there a reason why stop_encoding = "cra" isn't in the CRA CDL example? CRA example uses axis="X" but VLen example uses cf_role. Did we decide to stick with cf_role? It's all running together in my head. Is |
I think we kind of decided to suffer a bit of disarray. I expect @dblodgett-usgs will speak to the six ways to Sunday nature of these schemas.
Not really with VLen. However, it is needed if the example were CRA as the coordinate index variable dimension would be longer than the number of geometries/instances. Hence, it is left on there. |
I'll take a last pass of these two and strip out stuff that needs to be normalized between then. I think it's better to leave questions than inconsistencies. |
Done and dusted. See here for analytics: https://goo.gl/#analytics/goo.gl/0NI4Sd/all_time |
Congrats, all! 😂 As @twhiteaker said, 'twas a pleasure. |
Template is up: https://docs.google.com/drawings/d/1zwJTWQ9uOkuLxTnDNBdKLlVWDUFoIQ2UcE89P9UzppI/edit.
Please edit as you see fit. I am impressed with Google Drawings for this sort of thing. If we can't get things quite aligned, we can export and refine. Otherwise, I recommend we just continue to use this. Google has PDF and SVG export options.
The text was updated successfully, but these errors were encountered: