CidToChebi

CidToChebi is a Knime Workflow that creates a SDF file by fetching information from PubChem for Structure, Name & Biological Description (IUPAC Condensed and IUPAC). It also looks at glycan resource called GlyTouCan to retrieve the WURCS information.

The pipeline contains two subpipelines first one to create the SDF from PubChem & GlyTouCan, second one classifies the compounds using Classyfire and prepares the SDF as headers required to be loaded into ChEBI Bulk Submission.

Input

A CSV file with the headers "GLYTOUCAN_ACC,PUBCHEM_CID" and please make sure input CID from pubchem should be just numbers as "70678538" not as "CID70678538". These two are requirements for the pipeline, hopefully if the pipeline doesnt run these two should be the main culprits rest shouldn't be an issue.

Dependencies

A Json jar is used (Node 70) to parse the response from PubChem. A mapping file is used (Node 26 XLS Reader) to convert between ChEBI and ClassyFire classifications.

Warning

After Classification of entries, please check whether all the entries are classfied, in some cases ClassyFire fails to classify the entries resulting in missing entries after classification.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Knime workflow		Knime workflow
dependencies		dependencies
LICENSE		LICENSE
README.md		README.md
cidToChebi.knar.knwf		cidToChebi.knar.knwf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CidToChebi

Input

Dependencies

Warning

About

Releases

Packages

License

muthuvenkat/CidToChebi

Folders and files

Latest commit

History

Repository files navigation

CidToChebi

Input

Dependencies

Warning

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages