Put 583 remaining compounds in Wikidata #5

Closed
egonw opened this issue Aug 28, 2019 · 17 comments
Labels: enhancement (New feature or request)

Comments

@egonw
Collaborator

egonw commented Aug 28, 2019

By creating a Bacting script that takes PubChem CIDs and adds the corresponding compounds to Wikidata.

@egonw
Collaborator Author

egonw commented Aug 28, 2019

@petermr, where can I find the list of 200 CIDs?

egonw added the enhancement (New feature or request) label on Aug 28, 2019
@petermr
Owner

petermr commented Aug 28, 2019 via email

@egonw
Collaborator Author

egonw commented Aug 28, 2019

I need to park this until at least next week, I'm afraid. I have some urgent stuff to sort out first :(

@petermr
Owner

petermr commented Aug 28, 2019 via email

@egonw
Collaborator Author

egonw commented Sep 10, 2019

Sorry, there are too many files in that folder... At the moment I have no idea how to see which compounds have not yet been found in Wikidata (and which I should therefore add).

Help/suggestions welcome.

@petermr
Owner

petermr commented Sep 10, 2019 via email

@petermr
Owner

petermr commented Sep 10, 2019 via email

@petermr
Owner

petermr commented Sep 10, 2019 via email

@egonw
Collaborator Author

egonw commented Sep 16, 2019

Yes, thanks!

@egonw
Collaborator Author

egonw commented Sep 18, 2019

Processing the file...

@egonw
Collaborator Author

egonw commented Sep 18, 2019

This looks promising :)

====================
C₁₇H₂₈O₂ is not yet in Wikidata
====================
====================
C₁₅H₂₀O is not yet in Wikidata
====================
====================
C₁₇H₂₄O₂ is not yet in Wikidata
====================
====================
C₁₆H₂₂O₂ is not yet in Wikidata
====================
====================
C₁₀H₁₆O is not yet in Wikidata
====================
====================
C₉H₁₆ is not yet in Wikidata
====================
====================
C₁₅H₂₆O is not yet in Wikidata
====================
====================
C₁₅H₂₂O is not yet in Wikidata
====================
====================
C₁₅H₂₄O is not yet in Wikidata
====================
====================
C₁₅H₂₆O is not yet in Wikidata
====================
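The check that produces this output is not shown in the thread. As a minimal sketch of one way to do it, assuming compounds are matched on the Wikidata property P662 (PubChem CID): the function names and example CIDs below are hypothetical, and the actual script may well match on a different identifier.

```python
def build_cid_lookup_query(cids):
    """Build a SPARQL query listing the Wikidata items that carry the given PubChem CIDs.

    Assumption: matching is done on P662 (PubChem CID), which stores the CID as a string.
    """
    values = " ".join(f'"{int(cid)}"' for cid in cids)
    return (
        "SELECT ?cid ?item WHERE {\n"
        f"  VALUES ?cid {{ {values} }}\n"
        "  ?item wdt:P662 ?cid .\n"
        "}"
    )


def report_missing(cids, cids_in_wikidata):
    """Return the CIDs that had no match in Wikidata, preserving input order."""
    found = {str(cid) for cid in cids_in_wikidata}
    return [str(cid) for cid in cids if str(cid) not in found]


# Hypothetical example: pretend the query only returned a hit for CID 702.
for cid in report_missing([2244, 702, 962], {"702"}):
    print(f"CID {cid} is not yet in Wikidata")
```

The query text would be sent to the Wikidata SPARQL endpoint; the CIDs that come back are then passed to `report_missing` as the second argument.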

@egonw
Collaborator Author

egonw commented Sep 18, 2019

And so does the next step! :) The first 10 missing entries are in (573 to go ;)

image

egonw changed the title from "Put 200 remaining compounds in Wikidata" to "Put 583 remaining compounds in Wikidata" on Sep 18, 2019
@egonw
Collaborator Author

egonw commented Sep 18, 2019

For the next batch, I do find hits in Wikidata, though. But that's not a problem.

@egonw
Collaborator Author

egonw commented Sep 18, 2019

Okay, this is the workflow. On the CSV file linked above, I run this script: https://github.com/egonw/ons-wikidata/blob/master/EssOil/prepareInput.groovy. It fetches the SMILES for the compounds from PubChem and prepares the input for https://github.com/egonw/ons-wikidata/blob/master/Wikidata/createWDitemsFromSMILES.groovy, which I run after that.
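The first step (CID to SMILES) could look roughly like the sketch below. This is not the linked Groovy script. It is a Python illustration that assumes the PubChem PUG REST `property` endpoint with CSV output, and the helper names are hypothetical. The parser reads the second column by position, so it does not depend on the exact header name PubChem returns.

```python
import csv
import io
from urllib.request import urlopen

PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def smiles_url(cids):
    """Build a PUG REST URL requesting the SMILES of the given CIDs as CSV."""
    joined = ",".join(str(int(cid)) for cid in cids)
    return f"{PUG_REST}/compound/cid/{joined}/property/IsomericSMILES/CSV"


def parse_smiles_csv(text):
    """Map CID -> SMILES from a two-column CSV response (the header row is skipped)."""
    rows = csv.reader(io.StringIO(text))
    next(rows, None)  # skip the header line
    return {row[0]: row[1] for row in rows if len(row) >= 2}


def fetch_smiles(cids):
    """Download and parse the SMILES for the given CIDs (needs network access)."""
    with urlopen(smiles_url(cids)) as response:
        return parse_smiles_csv(response.read().decode("utf-8"))
```

Calling `fetch_smiles` with a list of CIDs returns a dictionary that can be written out as input for the item-creation step.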

@egonw
Collaborator Author

egonw commented Sep 18, 2019

I'm now doing the remaining batch: https://tools.wmflabs.org/quickstatements/#/batch/18772
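For readers unfamiliar with QuickStatements: a batch is a list of tab-separated commands, one new item per `CREATE` line followed by `LAST` statements. The sketch below shows how such commands might be generated for one compound. The choice of statements (P31 chemical compound, P233 SMILES, P662 PubChem CID) and the function name are assumptions for illustration, not what the batch above actually contained.

```python
def quickstatements_for(smiles, cid, label):
    """Return QuickStatements V1 commands that create one compound item.

    Assumptions: the item gets an English label, P31 = Q11173 (chemical compound),
    P233 (canonical SMILES) and P662 (PubChem CID). String values are double-quoted.
    """
    def quoted(value):
        # Strip embedded double quotes so the value stays a single quoted string.
        return '"' + str(value).replace('"', "") + '"'

    rows = [
        ["CREATE"],
        ["LAST", "Len", quoted(label)],
        ["LAST", "P31", "Q11173"],
        ["LAST", "P233", quoted(smiles)],
        ["LAST", "P662", quoted(cid)],
    ]
    return "\n".join("\t".join(row) for row in rows)


# Hypothetical example input, for illustration only.
print(quickstatements_for("CCO", 702, "ethanol"))
```

The resulting text can be pasted into the QuickStatements web interface as a new batch.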

@petermr
Owner

petermr commented Sep 18, 2019 via email

@egonw
Collaborator Author

egonw commented Sep 21, 2019

Hi all, so what is next for this issue?
