Dante for genetic report generation

This repository contains R code for generating clinical genetic reports from the output of a Whole Genome Sequencing (WGS) pipeline. The main functionality transforms raw pipeline data into structured reports (PDF/HTML) that provide variant interpretation and annotations for clinical analysis.

Overview

Key Features

Automated report generation: Generates PDF reports based on patient-specific data.
Customisable input parameters: Allows parameterised execution for single-case analysis.
Data visualisation: Includes PCA plots and tabular summaries of variants.
Standards compliant: Adheres to ACMG guidelines for variant interpretation.

Exported Datasets

We export 4 CSV datasets from the paths specified below:

Core Dataset
- Path: PanelAppData_genes_combined_core
- Contains fields:
  - id
  - Gene
  - confidence_level
  - mode_of_inheritance
  - name
  - disease_group
  - disease_sub_group
  - status
Minimal Dataset
- Path: PanelAppData_genes_combined_minimal
- Contains fields:
  - panel ID
  - Gene symbol
Metadata Dataset
- Path: PanelAppData_genes_combined_meta_names.csv
- Contains fields:
  - panel id
  - panel name
Metadata Counts
- Path: PanelAppData_genes_combined_meta_variable_counts.csv
- Description: Contains metadata counts for each variable
  - Contains:

# A tibble: 52 × 2
Column_Name           Unique_Counts
<chr>                         <int>
1 id                              447
2 entity_type                       1
3 Gene                           6280
4 confidence_level                  4
5 penetrance                        4
6 mode_of_pathogenicity            12
7 publications                  13102
8 evidence                       3653
9 phenotypes                    21636
10 mode_of_inheritance              15
# ℹ 42 more rows

Tree

Currently, the private data from WGS in set in .gitinore for output (some pipeline output for testing) and reports (the reports generated by Dante). Lastly, the main analysis data is source from outside the repository itself. This is currently hardcoded for testing: data_path / file_list (path to Guru output .Rds).

.
├── README.md
├── Rmd
├── images
├── output
└── reports

..
└── guru_data_path_to_files

Requirements

Software

R (>= 4.0)
LaTeX (e.g., TeX Live, MikTeX) with xelatex

R Packages

rmarkdown
kableExtra
dplyr
digest
stringr
ggplot2

Install packages using:

install.packages(c("rmarkdown", "kableExtra", "dplyr", "digest", "stringr", "ggplot2"))

Usage

Input Data

Ensure the following input files are available:

WGS pipeline output files from ACMGuru/GuRu (i.e. .Rds).
Phenotypic and clinical metadata.
Supporting annotation files (e.g., population reference data).

Execution Steps

Prepare input data: Place the required data files in the appropriate directories.
Set parameters: Update parameters in the report.Rmd file or pass them dynamically via the R script.
Run Report Generation:
```
source("report_runner.R")
```
```

Rscript report_runner.R
```
This script processes the input data and generates reports for each case.

Output

Individual PDF reports, named by sample ID and rank (e.g., report_priority_1_sample_ABC123.pdf).

References

Pipedev documentation

Support

For issues, please create a GitHub issue or contact the repository maintainer.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
20250202_Rmd		20250202_Rmd
Rmd		Rmd
data		data
guru_validations		guru_validations
images		images
latex		latex
ollama_test		ollama_test
rag		rag
ref/uniprot		ref/uniprot
.gitignore		.gitignore
README.md		README.md
sync_data.sh		sync_data.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dante for genetic report generation

Overview

Key Features

Exported Datasets

Tree

Requirements

Software

R Packages

Usage

Input Data

Execution Steps

Output

References

Support

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Dante for genetic report generation

Overview

Key Features

Exported Datasets

Tree

Requirements

Software

R Packages

Usage

Input Data

Execution Steps

Output

References

Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages