Update tractability plugin in data pipeline #880

andrewhercules · 2020-03-16T10:28:27Z

To process the latest tractability .tsv file generated by ChEMBL, we need to update the pipeline because the headings for the small molecule modality have changed. There is also a new section for other clinical modalities (e.g. protein, enzyme) with the same clinical precedence buckets 1, 2, and 3, and a single string of ChEMBL IDs that supports the clinical precedence buckets 1, 2, and 3 for the small molecule, antibody, and other clinical modalities.

As such, @cmalangone, can you please update the pipeline with the following changes:

1. Update the pipeline to use the data about other clinical modalities and add an entry to the gene index tractability object. See below for a scaffold of what it could look like:

"tractability": {
  "smallmolecule": {},
  "antibody": {},
  "other_modalities": {
    "buckets": [
      1
    ],
    "categories": {
      "clinical_precedence": 1
    }
  }
}

For other clinical modalities, the buckets are 1, 2, and 3 and the categories are "clinical_precedence".
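As a rough illustration of the scaffold above, here is a minimal sketch of how one row of the .tsv file could be turned into the `other_modalities` entry. The column names in `OTHER_MODALITY_BUCKET_COLUMNS` and the function name are hypothetical, since the headings for the other clinical modalities are not listed in this issue. The rule that `clinical_precedence` is 1 when any bucket is set is also an assumption, not the pipeline's confirmed logic.

```python
from typing import Dict, List, Mapping

# Hypothetical column names: the issue does not give the headings used for
# other clinical modalities in the ChEMBL .tsv file.
OTHER_MODALITY_BUCKET_COLUMNS = {
    1: "Bucket_1_other",
    2: "Bucket_2_other",
    3: "Bucket_3_other",
}


def build_other_modalities(row: Mapping[str, str]) -> Dict[str, object]:
    """Build the "other_modalities" entry for one row of the tractability file."""
    # Collect the bucket numbers whose flag column is "1" in this row.
    buckets: List[int] = [
        bucket
        for bucket, column in sorted(OTHER_MODALITY_BUCKET_COLUMNS.items())
        if row.get(column, "0").strip() == "1"
    ]
    # Assumption: clinical_precedence is 1 when any bucket is set, else 0.
    return {
        "buckets": buckets,
        "categories": {"clinical_precedence": 1 if buckets else 0},
    }
```

A row with only the first bucket flagged would then produce the same shape as the scaffold, with `"buckets": [1]` and `"clinical_precedence": 1`.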

2. Update the pipeline to use the new small molecule column headings

Old column name / heading	New column name / heading
Bucket_1	Bucket_1_sm
Bucket_2	Bucket_2_sm
Bucket_3	Bucket_3_sm
Bucket_4	Bucket_4_sm
Bucket_5	Bucket_5_sm
Bucket_6	Bucket_6_sm
Bucket_7	Bucket_7_sm
Bucket_8	Bucket_8_sm
Bucket_sum	Bucket_sum_sm
Top_bucket	Top_bucket_sm
Category	Category_sm
Clinical_Precedence	Clinical_Precedence_sm
Discovery_Precedence	Discovery_Precedence_sm
Predicted_Tractable	Predicted_Tractable_sm
PDB_Known_Ligand	PDB_Known_Ligand
ensemble	DrugEBIlity_score
High_Quality_ChEMBL_compounds	High_Quality_ChEMBL_compounds
Small_Molecule_Druggable_Genome_Member	Small_Molecule_Druggable_Genome_Member
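One possible way to apply the mapping above is sketched below: the header row is normalised to the new names while reading the file, so the same code can read files with either set of headings. The names `SMALL_MOLECULE_RENAMES`, `rename_small_molecule_headers`, and `read_tractability_rows` are hypothetical and are not taken from the pipeline code.

```python
import csv
import io
from typing import Dict, Iterator, List, TextIO

# Old -> new small molecule headings, taken from the table above.
# Headings that did not change are not listed.
SMALL_MOLECULE_RENAMES: Dict[str, str] = {
    **{f"Bucket_{n}": f"Bucket_{n}_sm" for n in range(1, 9)},
    "Bucket_sum": "Bucket_sum_sm",
    "Top_bucket": "Top_bucket_sm",
    "Category": "Category_sm",
    "Clinical_Precedence": "Clinical_Precedence_sm",
    "Discovery_Precedence": "Discovery_Precedence_sm",
    "Predicted_Tractable": "Predicted_Tractable_sm",
    "ensemble": "DrugEBIlity_score",
}


def rename_small_molecule_headers(headers: List[str]) -> List[str]:
    """Map old headings to the new ones; leave all other headings untouched."""
    return [SMALL_MOLECULE_RENAMES.get(name, name) for name in headers]


def read_tractability_rows(handle: TextIO) -> Iterator[Dict[str, str]]:
    """Yield each row of a tab-separated file keyed by the new headings."""
    reader = csv.reader(handle, delimiter="\t")
    headers = rename_small_molecule_headers(next(reader))
    for values in reader:
        yield dict(zip(headers, values))


# Usage with a small in-memory file that still uses the old headings.
sample = io.StringIO("Bucket_1\tensemble\tPDB_Known_Ligand\n1\t0.5\t0\n")
rows = list(read_tractability_rows(sample))
```

Because headings that are already in the new form are passed through unchanged, running this over a file with the new headings leaves the header row as it is.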

andrewhercules · 2020-03-24T10:17:22Z

ChEMBL have made the data available and it has been uploaded into otar001-core/Tractability/20.04

andrewhercules · 2020-03-26T16:06:49Z

Based on a conversation with @cmalangone, we will not update the pipeline to process the ChEMBL IDs and labels for this release. Rather, we will work with ChEMBL to update the JSON generated by the tractability pipeline for 20.06.

cmalangone · 2020-04-02T17:58:43Z

Points 1 and 2 are done.
I had a conversation with @LucaFumis about the format for the index.

The PR has been created and merged.
I ran a first test and no issues came up.

andrewhercules · 2020-04-06T11:41:24Z

Thank you @cmalangone - the data is now available in the API! 👍

andrewhercules mentioned this issue Mar 18, 2020

Tractability changes for 20.04 #879

Closed

andrewhercules assigned AsierGonzalez Mar 18, 2020

andrewhercules added the Data (Relates to Open Targets data team), Enhancement (Update to existing feature), and Priority: High labels Mar 18, 2020

andrewhercules added this to the 20.04 milestone Mar 18, 2020

andrewhercules assigned cmalangone and unassigned AsierGonzalez Mar 18, 2020

andrewhercules added the Topic: Pipeline label Mar 18, 2020

This was referenced Mar 18, 2020

Update API gene index response to include new tractability data #881

Closed

Update Angular application to display other modality data and link to tractability data file #882

Closed

andrewhercules removed the Data (Relates to Open Targets data team) label Mar 26, 2020

d0choa modified the milestones: 20.04, Wedding crasher sprint Apr 1, 2020

andrewhercules closed this as completed Apr 6, 2020
