# Backup Data

Each notebook has datasets available for import from the CyVerse Data Store. If you make a mistake or need to skip ahead, use the commands below to import the data you need.

## Set up iRODS configuration (required)

You must execute the following cell before running any other cell in this notebook.

In [None]:
echo '{"irods_zone_name": "iplant", "irods_host": "data.iplantcollaborative.org","irods_port": 1247,"irods_user_name": "anonymous"}' > /home/jovyan/.irods/irods_environment.json 
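The cell above writes into a `.irods` directory that may not exist yet on a fresh container, in which case the redirect fails. The following is a minimal sketch of the same step that creates the directory first. The function name `write_irods_env` is hypothetical, and defaulting to `$HOME/.irods` is an assumption (the cell above hard-codes `/home/jovyan/.irods`).

```shell
# Hypothetical helper: write the anonymous iRODS environment file,
# creating the target directory first if it is missing.
write_irods_env() {
    # $1: directory for irods_environment.json (assumed default: $HOME/.irods)
    local env_dir="${1:-$HOME/.irods}"
    mkdir -p "$env_dir"
    cat > "$env_dir/irods_environment.json" <<'EOF'
{
    "irods_zone_name": "iplant",
    "irods_host": "data.iplantcollaborative.org",
    "irods_port": 1247,
    "irods_user_name": "anonymous"
}
EOF
    echo "wrote $env_dir/irods_environment.json"
}
```

Calling `write_irods_env /home/jovyan/.irods` should reproduce the file written by the cell above. Listing the shared collection afterwards with `ils /iplant/home/shared/gea` is one way to confirm that the anonymous connection works.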

# Get fastq files 

This will import the 6 high-fat diet control and tumor fastq files and make the appropriate directory if needed. 

In [None]:
mkdir -p /home/gea_user/data/raw_data/
iget -rPVT /iplant/home/shared/gea/rna-seq-leptin/data/raw_data/fastq /home/gea_user/data/raw_data/

Verify the fastq files are now present

In [None]:
ls /home/gea_user/data/raw_data/fastq
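Since six fastq files are expected, the listing can also be checked by count rather than by eye. This is a sketch only: `check_file_count` is a hypothetical helper, and it assumes the downloaded directory contains nothing but the fastq files themselves.

```shell
# Hypothetical helper: compare the number of regular files under a
# directory with an expected count.
check_file_count() {
    local dir="$1" expected="$2" found
    # Count regular files recursively; tr strips padding some wc versions add.
    found=$(find "$dir" -type f | wc -l | tr -d ' ')
    if [ "$found" -eq "$expected" ]; then
        echo "ok: $found files in $dir"
    else
        echo "mismatch: expected $expected files in $dir, found $found" >&2
        return 1
    fi
}

# Example, to be run after the download above:
# check_file_count /home/gea_user/data/raw_data/fastq 6
```

A non-zero return status may indicate an interrupted transfer, in which case re-running the `iget` cell is a reasonable next step.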

# Get FASTQC reports for untrimmed fastq files

This will import fastqc reports for the untrimmed fastq files and make the appropriate directory if needed.

In [None]:
mkdir -p /home/gea_user/rna-seq-project/
iget -rPVT /iplant/home/shared/gea/rna-seq-leptin/worked_results/rna-seq-project/fastqc-untrimmed-results /home/gea_user/rna-seq-project/

Verify the fastqc reports are now present

In [None]:
ls /home/gea_user/rna-seq-project/fastqc-untrimmed-results

# Get trimmed fastq files and fastqc reports

This will import the trimmed fastq files and their fastqc reports and make the appropriate directory if needed.

In [None]:
mkdir -p /home/gea_user/rna-seq-project/
iget -rPVT /iplant/home/shared/gea/rna-seq-leptin/worked_results/rna-seq-project/trimmed-reads /home/gea_user/rna-seq-project/

Verify the trimmed reads and fastqc reports are now present

In [None]:
ls -R /home/gea_user/rna-seq-project/trimmed-reads

# Get transcriptome and indexed transcriptome for Kallisto

This will import the transcriptome and index files and make the appropriate directory if needed.

In [None]:
mkdir -p /home/gea_user/rna-seq-project/
iget -rPVT /iplant/home/shared/gea/rna-seq-leptin/worked_results/rna-seq-project/transcriptome /home/gea_user/rna-seq-project/

Verify the transcriptome and index files are now present

In [None]:
ls /home/gea_user/rna-seq-project/transcriptome

# Get Kallisto pseudoalignments, genomes/transcriptome files, pseudoBAM files

This will import the gene annotation files needed for Kallisto, as well as the Kallisto outputs, and make the appropriate directory if needed.

In [None]:
mkdir -p /home/gea_user/rna-seq-project/
iget -rPVT /iplant/home/shared/gea/rna-seq-leptin/worked_results/rna-seq-project/kallisto /home/gea_user/rna-seq-project/

Verify the Kallisto outputs are now present


In [None]:
ls -R /home/gea_user/rna-seq-project/kallisto
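Every section above follows the same pattern: create the local parent directory, then run `iget -rPVT` (recursive, show progress, very verbose, renew the socket on long transfers) on one shared collection. That pattern could be wrapped as sketched below. The function name `fetch_collection` and the `IGET` override variable are hypothetical, and skipping a collection whose local directory is already non-empty is an assumption about the desired behavior, not something the cells above do.

```shell
# Hypothetical helper: download one iRODS collection unless a non-empty
# local copy already exists. IGET can be overridden for dry runs.
fetch_collection() {
    local remote="$1"   # iRODS collection path
    local dest="$2"     # local parent directory
    local target="$dest/$(basename "$remote")"
    if [ -d "$target" ] && [ -n "$(ls -A "$target")" ]; then
        echo "skip: $target already exists"
        return 0
    fi
    mkdir -p "$dest"
    "${IGET:-iget}" -rPVT "$remote" "$dest"
}

# Example, equivalent to the Kallisto cell above:
# fetch_collection /iplant/home/shared/gea/rna-seq-leptin/worked_results/rna-seq-project/kallisto /home/gea_user/rna-seq-project
```

Note that the skip check cannot tell a complete download from a partial one. If a transfer was interrupted, remove the local directory before calling the function again, or run `iget` directly with `-f` to overwrite.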