Reharmonising a file with error #1226

ljwh2 · 2024-01-10T16:05:30Z

GCST002047 was not harmonised successfully because our harmonisation pipeline cannot recognise the column “Effect_Allele”. The harmonisation pipeline reads the “effect_allele” column in the input file to harmonise the variant. However, all data in this column is NA. This is the reason why all variants give hm 14. If we change the header of this file, it should be able to be harmonised. (same as other_allele)

Please fix the file and re-qeue for harmonisation

karatugo · 2024-03-19T15:25:41Z

When unzipped two files appeared. I fixed the header for both but the metadata yaml files are missing. Since the study is old, the data is not available from the ingest api.

karatugo · 2024-03-19T15:25:46Z

Create metadata yaml files that contain GCST ID and genome assembly for the harmonisation.
Submit the harmonisation

karatugo · 2024-03-19T16:48:57Z

Submitted to codon with the submission script /hps/software/users/parkinso/spot/gwas/prod/scripts/cron/start_harmonisation_pre_standard_goci1226.sh

Job <92779129> is submitted to default queue <standard>.

karatugo · 2024-03-20T10:38:33Z

Compare two studies in FTP ("EduYears" and "College")
If they are the same, replace the zipped file with the correct one - Moved SSGAC_College_Rietveld2013_publicrelease.txt to GCST002001-GCST003000/GCST002047 and removed the zipped file
Harmonise with pre_gwas_ssf
Also, upload only one harmonised file with the correct title

karatugo · 2024-03-20T16:06:53Z

Use variant_id in the header (as per pre_gwas_ssf standard)
Rename the file with their GCST_ID.txt for harmonisation

karatugo · 2024-03-20T16:56:33Z

Using the script at /hps/software/users/parkinso/spot/gwas/prod/scripts/cron/start_harmonisation_pre_standard_goci1226.sh

Job <93028342> is submitted to default queue <standard>.

karatugo · 2024-03-21T10:29:20Z

Added chromosome and bas_pair_location columns filled with NA and submitted again.

Job <93126943> is submitted to default queue <standard>.

karatugo · 2024-03-27T11:54:29Z

Harmonised files, metadata files, running logs and .tbi files are copied to the respective harmonised directories.

sprintell · 2024-04-04T09:39:34Z

This is confirmed done, @earlEBI will double check

earlEBI · 2024-04-04T15:59:43Z

Reopening as the yaml files do not look quite right. (is_harmonised = false).
Also, the .tbi files should be renamed .tbi.gz.

karatugo · 2024-05-17T14:12:42Z

Fixed the following fields:

genome_assembly: GRCh38
is_harmonised: true
is_sorted: true

@earlEBI Could you check again please? Thanks.

ljwh2 · 2024-05-22T09:15:34Z

@earlEBI please confirm

earlEBI · 2024-05-22T09:21:45Z

The yamls are only five lines long. Should they not contain more detail?

karatugo · 2024-05-22T12:12:38Z

I thought that's because it's a very old submission. And also they are not available in the ingest api. @sajo-ebi

https://www.ebi.ac.uk/gwas/ingest/api/v2/studies/GCST008396
https://www.ebi.ac.uk/gwas/ingest/api/v2/studies/GCST002047

sprintell · 2024-05-23T09:02:36Z

Old studies are meant to be retrieved fromthe public rest API: https://www.ebi.ac.uk/gwas/rest/api/studies/GCST008396

karatugo · 2024-05-24T15:05:44Z

TODO: Update sumstats tools so that we fetch the REST API if Ingest API does not return any data.

sprintell · 2024-05-29T09:49:17Z

Harmonization done, but yaml file has some missing data.

karatugo · 2024-06-03T16:05:45Z

Regenerated YAML files for GCST002047 and GCST008396. Expect them in the public ftp in 2 days.

karatugo · 2024-06-10T13:03:05Z

YAML files are in staging FTP but not in public FTP. The reason why it didn't sync is in our ftp-sync code, we only filter the files that start with 'GCST*'. See https://github.com/EBISPOT/gwas-utils/blob/6fbf2c7a6d6fdfc79e0b8c2d1e74539bb1073303/ftpSummaryStatsScript/ftp_sync.py#L186-L188

Will renamed files, expect them in the public ftp in 2 days.

ljwh2 · 2024-06-12T09:23:06Z

Agreed to keep original files as per old guidelines

ljwh2 assigned karatugo Jan 10, 2024

sprintell closed this as completed Apr 4, 2024

earlEBI reopened this Apr 4, 2024

This was referenced May 28, 2024

Fetch the REST API if Ingest API does not return any data EBISPOT/gwas-sumstats-tools#40

Closed

fix: Bump gwas-sumstats-tools to v1.0.20 EBISPOT/gwas-sumstats-service#340

Merged

ljwh2 closed this as completed Jun 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reharmonising a file with error #1226

Reharmonising a file with error #1226

ljwh2 commented Jan 10, 2024

karatugo commented Mar 19, 2024

karatugo commented Mar 19, 2024 •

edited

Loading

karatugo commented Mar 19, 2024

karatugo commented Mar 20, 2024 •

edited

Loading

karatugo commented Mar 20, 2024 •

edited

Loading

karatugo commented Mar 20, 2024

karatugo commented Mar 21, 2024

karatugo commented Mar 27, 2024 •

edited

Loading

sprintell commented Apr 4, 2024

earlEBI commented Apr 4, 2024

karatugo commented May 17, 2024

ljwh2 commented May 22, 2024

earlEBI commented May 22, 2024

karatugo commented May 22, 2024

sprintell commented May 23, 2024

karatugo commented May 24, 2024

sprintell commented May 29, 2024

karatugo commented Jun 3, 2024

karatugo commented Jun 10, 2024 •

edited

Loading

ljwh2 commented Jun 12, 2024

Reharmonising a file with error #1226

Reharmonising a file with error #1226

Comments

ljwh2 commented Jan 10, 2024

karatugo commented Mar 19, 2024

karatugo commented Mar 19, 2024 • edited Loading

karatugo commented Mar 19, 2024

karatugo commented Mar 20, 2024 • edited Loading

karatugo commented Mar 20, 2024 • edited Loading

karatugo commented Mar 20, 2024

karatugo commented Mar 21, 2024

karatugo commented Mar 27, 2024 • edited Loading

sprintell commented Apr 4, 2024

earlEBI commented Apr 4, 2024

karatugo commented May 17, 2024

ljwh2 commented May 22, 2024

earlEBI commented May 22, 2024

karatugo commented May 22, 2024

sprintell commented May 23, 2024

karatugo commented May 24, 2024

sprintell commented May 29, 2024

karatugo commented Jun 3, 2024

karatugo commented Jun 10, 2024 • edited Loading

ljwh2 commented Jun 12, 2024

karatugo commented Mar 19, 2024 •

edited

Loading

karatugo commented Mar 20, 2024 •

edited

Loading

karatugo commented Mar 20, 2024 •

edited

Loading

karatugo commented Mar 27, 2024 •

edited

Loading

karatugo commented Jun 10, 2024 •

edited

Loading