Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

directions for RDBMS support in general #108

Closed
ceteri opened this issue Feb 4, 2021 · 8 comments
Closed

directions for RDBMS support in general #108

ceteri opened this issue Feb 4, 2021 · 8 comments
Assignees
Labels
enhancement New feature or request help wanted Extra attention is needed partnering Projects on which other teams are partnering research Research projects, often tied to partnering or funding
Projects

Comments

@ceteri
Copy link
Collaborator

ceteri commented Feb 4, 2021

Following up after some work with Trino authors yesterday, there are needs ahead for better metadata modeling based on inference techniques, semantic technologies, etc., for example in Iceberg connectors.

Just found this about Morph-RDB :

Probably a good thing to have a spike toward: how to integrate with Trino and Morph-RDB. Will check with @dachafra, et al.

Could be good to discuss with @dvsrepo and Asun about this, too? It could become an integration point for Recognai?

@ceteri ceteri self-assigned this Feb 4, 2021
@ceteri ceteri added the enhancement New feature or request label Feb 4, 2021
@ceteri ceteri added this to the Release 0.4.x milestone Feb 4, 2021
@ceteri ceteri added this to To do in kglab Feb 4, 2021
@ceteri ceteri removed this from the Release 0.4.x milestone May 10, 2021
@ceteri ceteri added help wanted Extra attention is needed partnering Projects on which other teams are partnering research Research projects, often tied to partnering or funding labels May 10, 2021
@dachafra
Copy link

dachafra commented Jun 4, 2021

@ceteri we are currently developing a new python-based engine that could fit better than Morph-RDB (java-based), and it is also able to parse RML and R2RML mappings. It's currently under review so it's not public yet, but it will be soon. @ArenasGuerreroJulian and I would be really happy to help in the integration.

@arenas-guerrero-julian
Copy link
Contributor

Hi @ceteri we have released Morph-KGC if it is useful

@ceteri
Copy link
Collaborator Author

ceteri commented Feb 22, 2022

Hi @ArenasGuerreroJulian @dachafra thank you, this is great to see! We're beginning integration of morph-kgc with kglab now, and have a potential large use case in industry at BASF. I'm working with @paoespinozarias @neobernad @jelisf @jmueller5 on the integration. There may be ways we can collaborate on parallelization, e.g., with Ray, Dask, etc. ?

@Mec-iS
Copy link
Contributor

Mec-iS commented Feb 22, 2022

happy to help when implementation guidelines are available

@arenas-guerrero-julian
Copy link
Contributor

Hi @ceteri awesome! morph-kgc already parallelizes using multiprocessing library. I am exploring further parallelization with Dask, happy to collaborate

@ceteri
Copy link
Collaborator Author

ceteri commented Feb 22, 2022

@ArenasGuerreroJulian that's excellent. We're using Ray in the production environment to scale out graphs, and in Ray there's a drop-in replacement for the standard multiprocessing library: https://github.com/DerwenAI/ray_tutorial/blob/main/ex_04_mult_pool.ipynb

@ceteri
Copy link
Collaborator Author

ceteri commented Feb 23, 2022

@Mec-iS how about this approach:

  • add morph-kgc into project dependencies
  • add a .materialize() method to the kglab.KnowledgeGraph() which takes an .ini file as its parameter
  • merge the graph materialized by Morph with the KnowledgeGraph._g graph

For testing, we could use a CSV file as the input.

Does that seem like a good approach for integration?

@ceteri
Copy link
Collaborator Author

ceteri commented Feb 27, 2022

Many thanks @Mec-iS @ArenasGuerreroJulian @dachafra !

@ceteri ceteri closed this as completed Feb 27, 2022
@ceteri ceteri moved this from To do to Done in kglab Feb 27, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed partnering Projects on which other teams are partnering research Research projects, often tied to partnering or funding
Projects
No open projects
kglab
  
Done
Development

No branches or pull requests

4 participants