Skip to content

vjcitn/CMG_EYE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CMG_EYE

CMG_EYE repo for AnVIL days demonstrations

Here's the 'dashboard description' for the AnVIL workspace

Introduction

This workspace was produced to illustrate exploration of CMG data in AnVIL using

  • RStudio to control all interactions
  • version control with git for all textual material related to the computations and the workspace
  • an Rmarkdown document specifying all aspects of
    • software extensions required
    • data acquisition using the AnVIL package for Bioconductor
    • interactive tabulation and visualization
  • a shiny app that explores available FHIR documents on a subset of CMG sequencing data

Resources and activities

  • https://github.com/vjcitn/CMG_EYE is a github repository with cmgeye.Rmd and fhirinspect.R. The following steps will help you import the content from this github repository into your own AnVIL session/workspace.
    • use Rstudio's File/New project/ widget picker

to select a version controlled project

versioncontrol

with git

gitpick

Fill in the git widget

cloner

AnVIL RStudio will run git commands to populate a folder of your workspace's disk

files

Compile cmgeye.Rmd

To begin, use the Rstudio console to run BiocManager::install("BiocStyle") as it is useful for formatting the document we are aiming for.

After installation of BiocStyle completes, click on the cmgeye.Rmd file in the Files pane, and then in the upper left pane, use the Knit to html_document widget to generate and view HTML reporting on the CMG data.

Some commands may require AnVIL_DEV group membership. Set the chunks involving DRS resolution to eval=FALSE if an error related to permissions crops up.

Use the fhirinspector app

These commands may not work if you do not have access to gs://bioc-anvildays-2021. Email stvjc at channing.harvard.edu if you are interested and authorized to work with this information.

Use the commands

source("setup_fhirapp.R")
setup_fhirapp(path=tempdir())

to copy and unzip a file of FHIR JSON.

These can then be explored with

source("fhirinspect.R")
fhirinspector(paste0(tempdir(), "/tmp/CMG"))

The gen3 export process

We've used gen3.theanvil.io/explorer to request data on 35 subjects from CMG:

sample request splash

These were exported to AnVIL via the first tab, leading to a "data view" in this workspace (selecting 'subjects'):

data view

This data view includes DRS identifiers for CRAM files resulting from WGS. The phenotypes are variously annotated, in multiple fields, with terms 'retinitis pigmentosa', 'RETINITIS PIGMENTOSA 69; RP69', 'Joubert syndrome 1'. OMIM codes are often provided.

We can also use the Gen3 system to generate PFB (portable format for bioinformatics) using the second tab in the Gen3 explorer page. This is returned as an Avro file, which can be parsed using sparklyr into a spark_tbl instance.

About

CMG_EYE repo for AnVIL days demonstrations

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages