COG_UK
Spike protein structures showing the locations of amino acid residues that are mutated in each variant of concern (VOC). The spike protein protrudes from the surface of the SARS-CoV-2 virus, is responsible for initiating binding to and entry into host cells, and is also the primary target for antibodies that recognise the virus.
Consult our own Metadata Catalogue for the
- COG UK metadata (e.g. cog_metadata.csv), corresponds to alignment patients
- COG UK naive variants (e.g. naive_variants.csv)
- Consensus sequence (e.g. cog_all.fasta) in FASTA format
- Full alignment (e.g. cog_alignment.fasta) in FASTA format
- Unmasked alignment (e.g. cog_unmasked_alignment.fasta) in FASTA format, same set of patients as cog_all
A note about identifiers:
- the main component of the patient identifier is the ISARIC id (in the form ABCD-0123)
- the ISARIC id will be given a numerical suffix (.1, .2, .3 etc) because there may be multiple samples per patient
- the FASTA identifier has that ISARIC id surrounded by /England/ and /year/ (e.g. /England/ABCD-0123.1/2020)
- some patients have multiple ISARIC ids
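The identifier layout described above can be split into its parts with a short parser. This is a minimal sketch: the regular expression is inferred from the single example /England/ABCD-0123.1/2020, and the names `FASTA_ID`, `SampleId` and `parse_fasta_id` are hypothetical, not part of any COG-UK tooling. Real headers may deviate from this pattern, in which case the function returns `None` rather than guessing.

```python
import re
from typing import NamedTuple, Optional

# Pattern inferred from the example "/England/ABCD-0123.1/2020" (an assumption).
# The leading "/" and the numerical suffix are both treated as optional.
FASTA_ID = re.compile(
    r"^/?England/(?P<isaric>[A-Z0-9]+-[0-9]+)"
    r"(?:\.(?P<sample>[0-9]+))?/(?P<year>[0-9]{4})$"
)


class SampleId(NamedTuple):
    isaric_id: str            # e.g. "ABCD-0123"
    sample_no: Optional[int]  # numerical suffix, None if absent
    year: int


def parse_fasta_id(identifier: str) -> Optional[SampleId]:
    """Split a FASTA identifier into ISARIC id, sample suffix and year."""
    match = FASTA_ID.match(identifier.strip().lstrip(">"))
    if match is None:
        return None  # does not follow the assumed layout
    sample = match.group("sample")
    return SampleId(
        isaric_id=match.group("isaric"),
        sample_no=int(sample) if sample else None,
        year=int(match.group("year")),
    )
```

Because some patients have multiple ISARIC ids, grouping records by `isaric_id` alone does not guarantee one group per patient.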
A note about multiple samples per patient: each physical sample should be assigned exactly one COG ID, and any re-sequencing of the same sample should be submitted under the same COG ID. In practice, this is not enforced (and would be hard to enforce anyway), and submitting organisations (including PHE/UKHSA) have often issued new COG IDs for the same sample, so multiple COG IDs can refer to the same swab. De-duplicating on sample date is not entirely reliable: dates are frequently off by a day or two between reports, and a patient can legitimately have more than one distinct sample (such as a throat and a nose swab) taken on the same day.
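Since sample dates cannot safely be used to drop duplicates automatically, one option is to flag candidate pairs for manual review instead. The sketch below assumes each record is a dictionary with hypothetical keys `patient_id`, `cog_id` and `sample_date`; these are illustrative names, not the column names of cog_metadata.csv. The two-day window mirrors the "off by a day or two" observation above and is an arbitrary default.

```python
from datetime import date
from itertools import combinations
from typing import Dict, List, Tuple


def candidate_duplicates(
    samples: List[dict], max_days_apart: int = 2
) -> List[Tuple[str, str]]:
    """Return pairs of distinct COG IDs from one patient with close sample dates.

    The pairs are only candidates: two IDs on nearby dates may still be
    genuinely distinct samples (e.g. a throat and a nose swab).
    """
    # Group records by patient (key names are hypothetical).
    by_patient: Dict[str, List[dict]] = {}
    for row in samples:
        by_patient.setdefault(row["patient_id"], []).append(row)

    pairs = set()
    for rows in by_patient.values():
        for a, b in combinations(rows, 2):
            if a["cog_id"] == b["cog_id"]:
                continue  # same COG ID: a re-sequencing, not a duplicate ID
            gap = abs((a["sample_date"] - b["sample_date"]).days)
            if gap <= max_days_apart:
                pairs.add(tuple(sorted((a["cog_id"], b["cog_id"]))))
    return sorted(pairs)
```

The function only reports pairs and never removes records, which leaves the decision about whether two COG IDs refer to the same swab to the analyst.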
A note about the FASTA format: some of the letters are lower-case because they are masked.
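To see how much of a sequence is masked, the lower-case letters can simply be counted. The following is a minimal sketch using only the standard library; `read_fasta` and `masked_fraction` are hypothetical helper names, and the reader assumes plain multi-line FASTA text with ">" header lines.

```python
from typing import Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (identifier, sequence) pairs from FASTA-formatted lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)  # keep case: lower-case marks masked letters
    if header is not None:
        yield header, "".join(chunks)


def masked_fraction(sequence: str) -> float:
    """Fraction of letters in the sequence that are lower-case (masked)."""
    if not sequence:
        return 0.0
    return sum(1 for base in sequence if base.islower()) / len(sequence)
```

A typical use would be `with open("cog_all.fasta") as fh: for name, seq in read_fasta(fh): ...`, taking care not to upper-case the sequences on load, since that would discard the masking information.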
Data dictionary:
- Data dictionary for CSV and FASTA files COG-UK-TRE_data_dictionary_v1_020721.pdf [PDF]
- Metadata CSV schema cog_uk_metadata_spec_20210415.pdf [PDF]
Web references:
Contents
ISARIC
- Additional Fields
- Case Report Form Definitions
  - Participant Identification Number (PIN)
  - Tier
  - Inclusion Criteria
  - Demographics
  - Onset and Admission
  - Admission Signs and Symptoms
  - Comorbidities
  - Preadmission Treatment
  - Preadmission Medication
  - Reinfection Form
  - Daily Form
  - Infectious Respiratory Disease Pathogen Diagnosis
  - Infectious Respiratory Disease Pathogen Testing
  - Treatment
  - Complications
  - Study Participation
  - Outcome
  - Final Outcome
  - Core Additional Information
  - Withdrawal Form
  - Consent Ctu Dms
  - Confirmed Negative PCR
  - Follow up Consent
  - Follow up Self Assessment
PHOSP
- Additional Fields
- Case Report Form Definitions
  - Forms
  - Timepoints
  - PHOSP ID
  - eConsent Tier 1
  - eConsent Tier 2
  - Informed Consent Form
  - Split Tier Consent
  - Eligibility Checklist
  - CRF1A Part 1
  - CRF1A Part 2
  - CRF1A Part 3
  - CRF1B
  - CRF2A
  - CRF2B
  - CRF3A
  - CRF3B
  - CRF3C
  - CRF4A
  - CRF4B
  - CRF Emergency Visit
  - Adverse Event Log
  - Withdrawals
  - CRF Early Termination
  - CRF Tier 2 Withdrawal
  - Medications Log
  - Activity Monitor Log
  - Mental Health Assessment
  - Nutrition
  - Social History
  - EQ-5D-5L
  - GAD-7
  - PHQ-9
  - MRC Dyspnoea
  - SARC-F
  - GPPAQ
  - Dyspnoea-12
  - FACIT Fatigue
  - PCL-5
  - BPI
  - NEADL
  - MoCA
  - Rockwood Clinical Frailty
  - PSQI
  - MEQ
  - LCQ
  - PFTs
  - Walk Tests
  - Tier 2 Core Test Checklist
  - Tier 2 Research Samples
  - Tier 2 Notification - Blood
  - Tier 2 Notification - Oral Wash
  - Tier 2 Notification - Sputum
  - Tier 2 Notification - Urine
  - QRISK3
  - BIA
  - DXA
  - Muscle Strength
  - SPPB
  - Pre-PSQ
  - PSQ
  - Lab Log - Routine Blood
  - Lab Log - Urine
  - Lab Log - Immuno
  - Lab Log - Additional Tests
  - PCR Swab Tests
GPES
- Additional Fields
- Table Definitions
APC
- Additional Fields
- Table Definitions
IAPT
- Additional Fields
AE
- Additional Fields
- Table Definitions
NIMS
- Additional Fields
- Table Definitions
ECDS
- Additional Fields
- Table Definitions
MHS
- Additional Fields
- Table Definitions
  - MHS Master Patient Index table
  - MHS GP data table
  - NHS Accommodation status table
  - MHS Employment status table
  - MHS Patient Ind table
  - MHS MH Care Coord table
  - MHS Disability type table
  - MHS Care plan type table
  - MHS Care plan arrangement table
  - MHS Assistive technology to support disability type table
  - MHS Social and Personal Circumstances table
  - MHS Overseas Visitor Charging Category table
  - MHS Service or Team Referral table
  - MHS Service or Team Type Referred To table
  - MHS Other Reason for Referral table
  - MHS Referral to Treatment table
  - MHS Onward Referral table
  - MHS Discharge Plan Agreement table
  - MHS Care Contact table
  - MHS Care Activity table
  - MHS Other in Attendance table
  - MHS Indirect Activity table
  - MHS Medical History (Previous Diagnosis) table
  - MHS Provisional Diagnosis table
  - MHS Primary Diagnosis table
  - MHS Secondary Diagnosis table
  - MHS Coded Scored Assessment (Referral) table
  - MHS Coded Scored Assessment (Care Activity) table
  - MHS Care Programme Approach (CPA) Care Episode table
  - MHS Care Programme Approach (CPA) Review table
NDA
- Additional Fields
SGSS
- Additional Fields
- Table Definitions
GDPPR
- Additional Fields
- Table Definitions
VACCINATION
- Additional Fields
- Table Definitions
COG_UK
- Additional Fields
- Table Definitions
COG_UK_VOC
- Additional Fields
- Table Definitions
CHESS
- Additional Fields
- Table Definitions
SGTF
- Additional Fields
- Table Definitions
CIVIL_REG_DEATHS
- Additional Fields
- Table Definitions