Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ADD] pipelines for the whole database #5

Open
jcapels opened this issue Dec 7, 2022 · 0 comments · May be fixed by #9
Open

[ADD] pipelines for the whole database #5

jcapels opened this issue Dec 7, 2022 · 0 comments · May be fixed by #9
Assignees
Labels
enhancement New feature or request

Comments

@jcapels
Copy link
Owner

jcapels commented Dec 7, 2022

Add pipelines for the whole database.

This will include an apache airflow ETL pipeline that will integrate and update the whole database of LIPID MAPS (https://www.lipidmaps.org/), SwissLipids (https://www.swisslipids.org/), and ModelSEED (https://modelseed.org/).

This will have to include

  • the cross-references
  • SMILES string
  • hierarchy
  • the relationship between alcohols and fatty acids with lipids
  • biosynthetic relationships (that will have to be generated by hand-made reaction rules of RHEA reactions)

After that, a cronjob have to be run every month to integrate new data and update the database.

@jcapels jcapels self-assigned this Dec 7, 2022
@jcapels jcapels added good first issue Good for newcomers enhancement New feature or request and removed good first issue Good for newcomers labels Dec 7, 2022
@AdrianoSilva19 AdrianoSilva19 linked a pull request Oct 5, 2023 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant