Script for converting GlycoCT format to INP. Just for GAG for now.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
Old
HP_dp02_0001.glycoct
README.md
convertor2.0.pl
phi_psi_all_disacc.csv

README.md

convert-glycoct-inp

This work is part of MatrixDB project.

This script in perl converts Glycosaminoglycans (GAGs) in GlycoCT format to INP format. The output is then used by POLYS tool to create 3D structure in PDB format.

This script takes as args a CSV file and a GlycoCT file. It prints on stdout the corresponding INP format for POLYS tool.

Important to know

  • This script doesn't support ALT, UND and ISO sections of GlycoCT
  • This script support REP sections but not nested repetitions
  • This script only converts GAG's monosaccharides
  • In order to build a 3D structure, POLYS need to have all information about monosaccharides: alpha/beta, L/D, etc...
  • Iduronic acid can have many conformations (1C4, 4C1, 2S0). We have not this information in GlycoCT format but we need it for POLYS. Thus we decided to choose 1C4 conformation by default.

Example of use:

    perl convertor2.0.pl phi_psi_all_disacc.csv HP_dp02_0001.glycoct > HP_dp02_0001.inp
  • convertor2.0.pl is the script in perl
  • phi_psi_all_disacc.csv is the CSV file in which we have phi and psi values of GAG's disaccharides
  • HP_dp02_0001.glycoct is a GlycoCT file of disaccharide of heparin