Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Comments about DPROD 11/24/2023 #24

Open
matthiasautrata opened this issue Nov 24, 2023 · 4 comments
Open

Comments about DPROD 11/24/2023 #24

matthiasautrata opened this issue Nov 24, 2023 · 4 comments
Assignees

Comments

@matthiasautrata
Copy link

matthiasautrata commented Nov 24, 2023

Please don't take offense. My Black Friday search wasn't successful yet and so I need to vent a little bit.

Writing and commenting as I read. Especially, this may mean that I comment on something that you define much further down without a forward reference. That is somewhat intentional.

DCAT is a catalog. DPROD is for products listed in the catalog. Why would one be a profile/extension of the other?

You use the term “Semantic Data…” What is the semantic of “semantic” in this context?

As pointed out elsewhere: Why emphasize or limit this for “Mesh.” It is a cool word and very popular. But what exactly makes products only suitable in a Mesh? Imagine a data-product that lists the geo-coordinates of fire hydrants around the country. Couldn’t I just download the file and move on? What would be meshy or semantic about that?

I disagree with your notion of shift left/right. The data-owner (maker) should also own the meta-data. Granted, all publishers cannot and should not handle data-integration. But they should handle explaining what it is they are offering. Likewise, I really don’t want a central team to manage my ontology. They are just going to make a mess of it. It is my ontology.

“...and ontological classes…” what does this mean?

“... input and output ports…” ports is undefined

If memory serves, in DCAT a data service is a means to access the dataset described in the catalog. Are you suggesting that data services are extended to describe meta-data?

“...semantic meaning…” This basically says: “meaning meaning.” You sure that is what you mean? ;-)

“...This allows for…” Of course it allows. Its absence also would not prohibit it. It just might make it less convenient.

“...semantics ensure that all stakeholders have a common understanding…” You ought to be really careful with such statements. I have seen ontologies and that didn’t mean that they were useful or supported understanding, especially not by humans. Imagine I wrote you an ontology and obfuscated all names and IRIs, replaced them with UUIDs and eliminated all regular comments and explanations.The formal structure and hence implied formal semantics would remain the same. Good luck emailing one this ontology, say as an RDF/Turtle file and expecting anybody to find that it creates a “common understanding.”

“...fundamental idea behind a Data Mesh…” Colloquialism. Intuitive but neither semantic nor appropriate. If you want to explain what a data mesh is supposed to be, put a reference there assuming that something even half normative exists.

“...can be programmatically understood…” will there be LLMs involved? Or did you mean “...can be verified against something and interpreted programmatically…”?

“...ensure that these products can interact…” are products “active” in the sense that they do something? Like “interact?”

“DPROD maps to the Data Mesh notion of a port to the DCAT notion of a DataService, so we can declare a DataProduct by and specify and input and output ports and these ports are Dataservice.” Could not parse.

DCAT states that a data service is defined as: “A collection of operations that provides access to one or more datasets or data processing functions.” You might make it very clear that you are changing that definition. Is it really just that: One inputPort and one OutputPort? Nothing else?

“A data product is a rational, managed, and governed collection of data, with purpose, value and ownership, meeting consumer needs over a planned life-cycle.”

How do you define and measure “rational?” How do you assign and measure “value?” How is value expressed? In US$? I suppose I could write more here. I’ll leave it as: You don’t define what a data product is. You list a bunch of cool, marketable attributes.

I’ll stop here because the rest seems unfinished.Maybe a parting thought: Reading all of this, I still don’t get at all how I would understand (you know, the semantic bit) what a specific dataproduct (like listOfFireHydrants) would be described and how I would use that. I get that somehow a port is involved. Now what?

Footnote: You mention odrl but don’t seem to use/reference it elsewhere.

@tonyseale
Copy link
Contributor

tonyseale commented Nov 25, 2023 via email

@matthiasautrata
Copy link
Author

matthiasautrata commented Nov 25, 2023 via email

@jgeluk
Copy link
Member

jgeluk commented Jan 11, 2024

It would be best to convert these long lists of suggestions into individual GitHub issues that are actionable issue by issue. And then refer to the issue numbers in each individual commit message so that it's clear which changes have been made for any given issue. @tonyseale @rivettp @nvar

@jgeluk
Copy link
Member

jgeluk commented Sep 3, 2024

@matthiasautrata @tonyseale @nvar @FroehlichMarcel are all points addressed?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants