Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Representation of "Container" for Bioschema, and Ontology #751

Closed
vsoch opened this issue Sep 17, 2018 · 11 comments
Closed

Representation of "Container" for Bioschema, and Ontology #751

vsoch opened this issue Sep 17, 2018 · 11 comments

Comments

@vsoch
Copy link
Contributor

vsoch commented Sep 17, 2018

hey opencontainers team!

I'm working on creating a representation for a "Container" in bioschemas, the goal being that this feeds into schema.org, and schema.org is the bread and butter powering the Google Datasets search. So you can imagine, being able to represent containers and then have Google discover them across these webby places and index is going to be very important for curation and search of (what is currently) a messy universe of peanut butter and jelly container piles.

Toward this goal, I'm taking a first shot at this addition. Importantly, I want the specification to be entirely derived from the specifications here, and start out by modeling the higher level things. The work is early, but I want to ask you about some mappings (if you think they are reasonable to start). Remember that we can of course tweak these once the initial draft is done! Here we go!

Organization

Please see here for the organization I think is logical for the root of a Container, and also the modeling of a ContainerRecipe and ContainerRegistry (which now I'm thinking might make sense to mirror OCI and be called ContainerDistribution. For easy reference, here is the overall first plan I'm working on:

Thing > CreativeWork > SoftwareApplication > Container > ContainerImage
                                                         ContainerRuntime
                                                         ContainerRecipe
                                                         ContainerDistribution

where the list there would match the Image, runtime, (no match for recipe, but important to model because we have repositories of just Recipes, for example?), and then distribution. I will have a proper drafty thing for you to critique this in detail, for now I just want to ask about particular mappings (below).

Ontology

As far as I can tell, we don't have a container ontology. The closest I found (thanks @satra !) is this one that has virtualization. I reached out to them to see if they had thoughts about modeling containers there, but we would want uris to reference in this schema. Does anyone know anything about options here? For a lot of the below, I'm wondering if we need to tackle this (somewhat scary and uncharted) domain of creating a simple container technology based off the specs here.

Features

The "SoftwareApplication" has a field for "FeatureList." In the image spec I noticed general features (at the bottom) for signatures and federated naming. Is that a reasonable start?

featureList

  • Text or URL
  • Features or modules provided by this application (and possibly required by other applications).

We would want to say "Please choose a value from EDAM:Operation.

And then the example would be like:

"{
  ""@type"": [""SoftwareApplication"", ""Container""],
  ""featureList"": [""http://oci.org/ontology/topic_0821"", 
                    ""http://oci.org/ontology/topic_0821""]
}"

Let's start with just this simple question for now - I'm just looking over the parent terms and attributes now (from SoftwareApplication and CreativeWork) and will tackle the Container specific ones next (from the standards here) and I'll update this issue as I run into questions. In the meantime, please let me know your thoughts on the above, and please excuse my ontology noobness, just learning this stuff :)

@vsoch
Copy link
Contributor Author

vsoch commented Sep 25, 2018

Quick update: started ContainerRecipe draft. This one is harder because it doesn't parallel a spec here - I'll be doing the Image/Distribution and others soon, will post here when I do.

@vsoch
Copy link
Contributor Author

vsoch commented Sep 26, 2018

hey oci maintainers! Is this the right place to have this discussion? I now have generated the following:

and I'd like to get feedback, if possible (and even minimal!) so I can move forward with submission to schema.org. The goal is to be able to have these definitions to tag containers (and their associated stuffs like recipe files) in webby places, and then a crawler like Google Dataset Search can index them. My strategy has been to follow the OCI specs (except for ContainerRecipe since that doesn't seem to have one?) so any feedback you might have would be greatly appreciated! The specifications repository is here --> https://github.com/openschemas/specifications and the rendered versions that it serves are linked above. And once it's submit, I'm going to take a shot at working with biocontainers to tag their containers, of course Singularity Hub, and a repo of Dockerfiles that I made, because :)

Looking forward to your feedback!

@vsoch
Copy link
Contributor Author

vsoch commented Oct 9, 2018

hello... anyone out there? :(

@caniszczyk
Copy link
Contributor

ping @opencontainers/image-spec-maintainers + @vbatts :)

@vsoch I'd consider emailing the OCI dev mailing list too:
https://groups.google.com/a/opencontainers.org/forum/#!forum/dev

@vsoch there's an OCI meeting tomorrow I invited you too

@vsoch
Copy link
Contributor Author

vsoch commented Oct 9, 2018

Thanks @caniszczyk 😊 I’d like to discuss how the work being done here might be integrated into schema.org so we can have containers show up nicely from registries, personal pages, etc in Google Datasets (or similar) search. The entrypoint seems to be scheme.org but the expertise is all here!

@stevvooe
Copy link
Contributor

@vsoch I'm a little confused on what youre actually proposing. What problems does this solve? What changes are required in OCI?

@vsoch
Copy link
Contributor Author

vsoch commented Oct 10, 2018

No changes in OCI! The idea is that the simplified representation for schema.org should be derived from OCI, where the expertise is.

@vsoch
Copy link
Contributor Author

vsoch commented Oct 10, 2018

Google Dataset Search uses schema.org, so the idea is to get a representation of a container in there so we can eventually tag containers and have the universe of them searchable! See -->https://www.blog.google/products/search/making-it-easier-discover-datasets/ for the first work with Datasets. I don't think we need to have all the details (attributes) from OCI, but rather the higher level ones, and this is what I'd like to discuss - what from OCI should map over to schema.org. I'm glad to do the work to generate the mapping, but the second (important) bit is to get support from the community here for the specification there.

@vsoch
Copy link
Contributor Author

vsoch commented Oct 10, 2018

https://github.com/schemaorg/schemaorg/issues/2059

@sudo-bmitch
Copy link
Contributor

I'm looking back through old issues. Is this still an issue @vsoch? I'm rather confused by it myself, so it may make sense to bring it up during a weekly meeting if it's still needed.

@vsoch
Copy link
Contributor Author

vsoch commented Aug 1, 2024

Nah, nobody cares about ontologies, really.

@vsoch vsoch closed this as completed Aug 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants