-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Vocabularies x Ontologies #4
Comments
Diego, I completely agree with you. We should target: a simple vocabulary Larisa On Tue, Oct 27, 2015 at 8:24 PM, Diego Esteves notifications@github.com
|
Agreed. What I feel we sort of agree on is that we need a commonly agreed set of concepts and definitions, as a core vocabulary. Then, existing and new ontologies could adapt to that core vocabulary and add interesting information. Agreeing on this core vocabulary is already a nice step forward. I do believe some basic structure is useful (e.g. a classification algorithm reads data and produces models), and that can be expressed in a schema (e.g. in TTL syntax). I would not be too afraid of including such basic structure since it gives meaning to the concepts. |
Great! This issue is dependent on the issue #2 and the generated matrix of correlation based on existing vocabularies and ontologies. |
How broad should it be? What is the aim? ML or DM descriptions? Interoperability (engineering) or Meta-learning strategies (science)? As as much as we make it more complex, more and more people will avoid the usage. I suggest a vocabulary for engineering purposes (tools integrations e description of ML executions) and an ontology for more complex analysis (pre processing, statistical methods and related DM tasks). We should remember we have to provide ways for people understanding and using the methodologies. People from mathematics, statistics, electrical engineering, for instance, are not (commonly) aware of linked data methods and tools. Besides, what’s the sense of describing an ontology for ML if, in the end, we use it for schema representation only, instead of benefit meta-learning approaches, for instance? The benefit of providing this kind of metadata is not clear for everybody in the short term and is pretty hard to convince people to represent their data using a schema. "what is the current volume of (real) metadata we have?" and "what kind of information has been discovered so far?" are examples of questions we should highlight.
The text was updated successfully, but these errors were encountered: