Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

List all linked out files in a single new table. #2

Closed
tomgoddard opened this issue Dec 21, 2016 · 4 comments
Closed

List all linked out files in a single new table. #2

tomgoddard opened this issue Dec 21, 2016 · 4 comments

Comments

@tomgoddard
Copy link
Collaborator

In order to use IHM format as a working format while a modeling project is in progress, the externally linked files (comparative models, sequence alignments, EM maps, ensembles of result structures, localization maps, ...) should be listed in a table that refers to local files (on the local disk) instead of referencing a DOI zip archive.

One design would give an integer id to every external file, external database reference, external DOI reference, and all other tables would use this id. Any external data could be referenced in any of these 3 ways (local file, database, DOI archive).

Having external references in one table will simplify validating that deposited structures do not reference missing data.

@brindakv
Copy link
Collaborator

Currently, all input data is listed in IHM_DATASET_LIST. Data present in other databases are described in IHM_DATASET_RELATED_DB_REFERENCE and those present in other resources are referenced via DOI in IHM_EXTERNAL_REFERENCE_INFO and IHM_DATASET_EXTERNAL_REFERENCE_DETAILS categories.

Localization density is an output from the I/H modeling and relates to the molecular system and the ensemble of good scoring models obtained from the modeling. Localization densities can be included as Gaussian mixture models in IHM_GAUSSIAN_OBJ_ENSEMBLE or can be linked out via IHM_EXTERNAL_REFERENCE_INFO and IHM_LOCALIZATION_DENSITY_FILES categories.

Sequence alignments corresponding to the starting structural models can be included via IHM_EXTERNAL_REFERENCE_INFO and IHM_STARTING_MODEL_ALIGNMENT_FILES categories. We will also define categories to capture the alignments internally. This requires coordination with the PMP/Model Archive developers so that we are consistent with their definitions as well.

Modeling scripts, associated documentation and other workflow-related files can be referred to via IHM_EXTERNAL_REFERENCE_INFO and IHM_MODELING_WORKFLOW_FILES categories.

Wherever possible, we provide ways to incorporate data directly into the I/H model file and also provide the option to link out to external files in standard formats (eg: starting model coordinates, localization densities, alignments). The details of the linked out files are split into the different categories mentioned above because of the differences in the nature of the data that is stored in them. If these files and the data stored within are to be useful, they are best described in separate tables. The IHM_EXTERNAL_REFERENCE_INFO table provides a way to collect the non-database external references is one place.

@tomgoddard
Copy link
Collaborator Author

tomgoddard commented Jan 10, 2017 via email

@brindakv
Copy link
Collaborator

Reference to local file has been added.

@brindakv
Copy link
Collaborator

brindakv commented Feb 8, 2017

A new category (IHM_EXTERNAL_FILES) has been added to reference all external files.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants