-
Notifications
You must be signed in to change notification settings - Fork 16
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #28 from replikation/report_file
rki output, fastq_raw input, and some smaller fixes, results structuree optimized
- Loading branch information
Showing
20 changed files
with
335 additions
and
37 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,19 @@ | ||
#!/bin/bash | ||
#Info: Creates report.csv for RKI from all pangolin.csv-files in the actual working dir. | ||
|
||
SENDING_LAB_ID=$1 | ||
|
||
echo "IMS_ID,SENDING_LAB,DATE_DRAW,SEQ_TYPE,SEQ_REASON,SAMPLE_TYPE,OWN_FASTA_ID" > rki_report.csv | ||
|
||
for FILENAME in lineage*.csv; do | ||
IMS_ID=$(echo "IMS-00000-CVDP-00000") | ||
SENDING_LAB=$(echo "$SENDING_LAB_ID") #In Process export "SENDING_LAB_ID" (given as Input/default when using [--rki]). | ||
DATE_DRAW=$(echo "YYYYMMDD") | ||
SEQU_TYPE=$(echo "OXFORD_NANOPORE") | ||
SEQU_REASON=$(echo "X") | ||
SAMPLE_TYPE=$(echo "X") | ||
OWN_FASTA_ID=$(tail -n+2 "$FILENAME" | rev | cut -f 6-10 -d "," | rev) | ||
echo $IMS_ID","$SENDING_LAB","$DATE_DRAW","$SEQU_TYPE","$SEQU_REASON","$SAMPLE_TYPE","$OWN_FASTA_ID >> rki_report.csv | ||
done | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,110 @@ | ||
# RKI Report-file Elements | ||
|
||
[toc] | ||
|
||
## Source -Please check first | ||
|
||
* https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/DESH/DESH.html | ||
* https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/DESH/Cryptshare-Anleitung.pdf?__blob=publicationFile [v1.1 (2021-01-21)] | ||
|
||
## Data Overview | ||
**The report must contain following seven elements in this order:** | ||
|
||
1. IMS_ID | ||
2. SENDING_LAB | ||
3. DATE_DRAW | ||
4. SEQ_TYPE | ||
5. SEQ_REASON | ||
6. SAMPLE_TYPE | ||
7. OWN_FASTA_ID | ||
|
||
In the following a short description of each element is given. | ||
|
||
|
||
### 1. IMS_ID | ||
|
||
Sequencing-based pseudonym as unique identifier for the aggregation in DEMIS ("Deutschen Elektronischen Melde- und Informationssystem für den Infektionsschutz"). | ||
|
||
**Format: IMS-12345-CVDP-00001** | ||
* IMS: permanent prefix | ||
* 12345: 5-digit identifier of the sequencing laboratory ("Untersuchungslabor"), analog to the already existing DEMIS-system (DEMIS-10001 to currently DEMIS-10563). The list is managed by the DEMIS-Geschäftsstelle. If you are not registered already pls reach out to demis@rki.de | ||
* CVDP: 4-digit DEMIS-abbreviation, which is directly dedicated to the "Meldetatbestand". Later additional pathogen-abbreviations, aside from SARS-CoV-2, applied. | ||
* 00001: Ongoing Number, which in phase 0 is autonmously continued by the laboratory (later in phase 1 the system will automatically gernerate it). | ||
|
||
|
||
### 2. SENDING_LAB | ||
|
||
12345: 5-digit identifier of the sending laboratory, analog to the already existing DEMIS-system. | ||
|
||
**ATTENTION:** | ||
This only applys for laboratories, which don´t sequence on their own, but instead sent their samples to other laboratories for sequencing. In case that the sending lab is also the sequencing lab the digit from the IMS_ID and the SENDING_LAB-id can be identical. | ||
|
||
|
||
### 3. DATE_DRAW | ||
|
||
Date of the sample isolation n ISO8601 (YYYYMMDD) | ||
|
||
|
||
### 4. SEQ_TYPE | ||
|
||
Used sequencing-platform. "OXFORD_NANOPORE" is provided automatically as entry. | ||
|
||
|
||
### 5. SEQ_REASON | ||
|
||
Cause for the sequencing. Choose one entry from the following list: | ||
|
||
|Entry|Description| | ||
|-|-| | ||
|X|Unknown to the sequencing laboratory| | ||
|N|No (e. g. random selection of a PCR-positive sample for sequencing)| | ||
|Y|Yes, but the kind of mutation or variante is unknown (to the sequencing laboratory) | ||
|A|Yes, it exists evidence for the mutation/variante from previous diagnostic [spezifying in textfield after entry-letter]| | ||
|
||
**Note for "A":** | ||
* Textfield, max. length 64 signs | ||
* Mutation to specify in "[ ]", in case of multiple mutations divided by "/" | ||
* Example entry: A[B.1.1.7/B.1.351] | ||
|
||
|
||
### 6. SAMPLE_TYPE | ||
|
||
Type of sample. Choose one entry from the following list: | ||
|
||
|Entry|Description| | ||
|-|-| | ||
|s001|Upper respiratory swab sample (specimen)| | ||
|s002|Nasopharyngeal swab (specimen)| | ||
|s003|Swab from nasal sinus (specimen)| | ||
|s004|Anterior nares swab (specimen)| | ||
|s005|Oropharyngeal aspirate (specimen)| | ||
|s006|Nasopharyngeal aspirate (specimen)| | ||
|s007|Lower respiratory sample (specimen)| | ||
|s008|Bronchoalveolar lavage fluid sample (specimen)| | ||
|s009|Sputum specimen (specimen)| | ||
|s010|Specimen from trachea obtained by aspiration (specimen)| | ||
|s011|Pleural fluid specimen (specimen)| | ||
|s012|Specimen from lung obtained by biopsy (specimen) | ||
|s013|Blood specimen (specimen)| | ||
|s014|Plasma specimen or serum specimen or whole blood specimen (specimen)| | ||
|s015|Whole blood sample (specimen)| | ||
|s016|Stool specimen (specimen)| | ||
|s017|Urine specimen (specimen)| | ||
|s018|Lower respiratory fluid sample (specimen)| | ||
|s019|Nasopharyngeal washings (specimen)| | ||
|s020|Plasma specimen (specimen)| | ||
|s021|Saliva specimen (specimen)| | ||
|s022|Serum specimen (specimen)| | ||
|s023|Specimen unsatisfactory for evaluation (finding)| | ||
|s024|Swab of internal nose (specimen)| | ||
|s025|Throat swab (specimen)| | ||
|X|Unknown (to the sequencing laboratory) | ||
|
||
Value Set is geared to SNOMED CT and SNOMED CT COVID-19 Related Content(https://simplifier.net/covid-19labormeldung/materialsarscov2) | ||
|
||
### 7. OWN_FASTA_ID | ||
|
||
Laboratory-internal identifier, which enables the distinct assignment of the FASTA-file to the sequence (given in the FASTA-header). Autoprovided by PoreCov-workflow, according to your sample-names. | ||
|
||
**NOTE:** | ||
Used in phase 0 for the assignment of the metadata from the .csv-file to the sequence-data in the FASTA-file. |
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,17 @@ | ||
include { collect_fastq } from './process/collect_fastq' | ||
|
||
workflow collect_fastq_wf { | ||
take: | ||
fastq_dir | ||
main: | ||
collect_fastq(fastq_dir) | ||
|
||
if (params.single) { fastq_channel = collect_fastq.out } | ||
else { fastq_channel = collect_fastq.out | ||
.map { it -> it[1] } | ||
.flatten() | ||
.map { it -> [ it.simpleName, it ] } | ||
} | ||
|
||
emit: fastq_channel | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.