Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

License issues #1

Closed
stanstrup opened this issue Oct 19, 2017 · 8 comments
Closed

License issues #1

stanstrup opened this issue Oct 19, 2017 · 8 comments
Labels

Comments

@stanstrup
Copy link
Owner

stanstrup commented Oct 19, 2017

  1. Which databases can I include data from?
  2. If there are ones I cannot they will need to be download and table generated by the user. Is there such a thing as "in-package cache"?
  3. Which license can the package have if it includes db data?
  4. Is license a concern at all? As far as I know data cannot be copyrighted so is there any concern at all?

The info I extract is: id, name, inchi, formula, and mass..

For the moment I force-removed the files until this is settled.

@egonw
Copy link

egonw commented Oct 19, 2017

I don't think LipidMaps is Open Data. Wikidata is, PubChem is.

@chasemc
Copy link

chasemc commented Oct 19, 2017

"LMSD lipid structures are deposited into PubChem database (http://pubchem.ncbi.nlm.nih.gov/) periodically and a link to PubChem substance ID (SID) is also maintained within LMSD. Access to complete set of LMSD lipid structures in PubChem is available at www.ncbi.nlm.nih.gov/entrez/query.fcgi?CMD=search&DB=pcsubstance&term=LipidMAPS[sourcename])."

@stanstrup
Copy link
Owner Author

@chasemc thanks! That is very useful info. So I might be able to get around that one by just including PubChem and leave the indicator to lipidmaps so that you can eventually filter for the lipidmaps compounds.

@stanstrup
Copy link
Owner Author

@chasemc It seems the source is only in the SID entries. Not the CIDs. However the lipidmaps ids have been added as a name so it is possible to filter by those prefixes.

@egonw
Copy link

egonw commented Oct 22, 2017

@chasemc also note that PubChem is not formally Open Data: it mixes their own public domain data with copyrighted upstream material. Legally, this is quite hard to untangle.

Generally, just contact LipidMaps and ask if it is OK to index their structures in the table as you want to do, and if you are allowed to make that available under terms compatible with the license of the R package.

For LipidMaps, a subset of about 1400 lipids is available under CCZero from Wikidata: http://tinyurl.com/ycbm9gfq

@stanstrup
Copy link
Owner Author

Thanks! I already contacted LipidMaps. Waiting for an answer.

@stanstrup
Copy link
Owner Author

@egonw what do you mean by upstream material? All the calculated properties? Si if I only use basic info as name and inchi it should be ok?

@stanstrup
Copy link
Owner Author

This issue was moved to rformassspectrometry/CompoundDb#4

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants