Aidan Coyle, afcoyle@uw.edu

Roberts Lab, UW-SAFS

2021-03-18

This script follows the same steps as 11_download_libraries_run_kallisto.ipynb and 31_download_libraries_run_kallisto.ipynb.

11_... dealt with cbai_transcriptomev2.0, which contained sequences from both _C. bairdi_ and _Hematodinium_.

31_... dealt with hemat_transcriptomev1.6, which only contains sequences presumed to be _Hematodinium_.

41_download_libraries_run_kallisto.ipynb uses a new transcriptome - cbai_transcriptomev4.0 - which contains only sequences presumed to be _C. bairdi_

As the title implies, we will first download the libraries and transcriptome, then build a kallisto index from the transcriptome, and then pseudoalign the libraries to the transcriptome, thus providing counts for each library. Finally, we will build matrices for each of the pairwise comparisons we want to perform.

The plan is to perform two pairwise comparisons of gene expression. The first is between individual libraries of Ambient Day 2 and Elevated Day 2 infected crab. In this, we hold day constant but examine different crabs at different temperatures. The second is between individual libraries of Elevated Day 0 and Elevated Day 2 (Day 0 samples were taken prior to exposure to elevated temperatures). In this comparison, we are examining the same crab over time, but at different temperatures and on different days.

Despite the limited number of samples in these pairwise comparisons, we will download and build kallisto libraries for all samples, both individual and pooled. This makes it easier to perform future pairwise comparisons if we decide to. Nevertheless, it is not strictly necessary - feel free to skip if you are confident about the pairwise comparisons to be made.

Some notes:

1. We will build indices for both individual and pooled libraries, but will likely only utilize those for individual libraries
2. Most of this script will be run remotely on the lab's Roadrunner computer, rather than on a local machine. This means that commands will largely be copied and pasted from the command line, rather than run directly in this Jupyter notebook.




Library IDs are as follows. Asterisks label Day 0 crabs that were part of either the elevated or lowered treatment groups - since at Day 0, they had not yet been exposed to changes away from ambient temperature, they are included as part of the ambient treatment group:

| Crab ID    | Library ID | Day| Temperature |
|-------------|----------------|-------------|----------|
| G        | 272             |   2          |   Elevated       |
| H        | 294             |   2          |   Elevated       |
| I        | 280             |   2          |   Elevated       |
|pooled    | 380825          |   2          |   Elevated       |
| G*       | 173*            |   0*         |   Ambient*       |
| H*       | 72*             |   0*         |   Ambient*       |
| I*       | 127*            |   0*         |   Ambient*       |
| A        | 178             |   0          |   Ambient        |
| A        | 359             |   2          |   Ambient        |
| A        | 463             |   17         |   Ambient        |
| B        | 118             |   0          |   Ambient        |
| B        | 349             |   2          |   Ambient        |
| B        | 481             |   17         |   Ambient        |
| C        | 132             |   0          |   Ambient        |
| C        | 334             |   2          |   Ambient        |
| C        | 485             |   17         |   Ambient        |
| E*       | 151*            |   0*         |   Ambient*       |
| pooled   | 380821          |   0          |   Ambient        |
| E        | 254             |   2          |   Lowered        |
| E        | 445             |   17         |   Lowered        |
| pooled   | 380823          |   2          |   Lowered        |


For the sake of completeness, we will also align libraries from uninfected crabs, as described below:

| Crab ID    | Library ID | Day| Temperature |
|-------------|----------------|-------------|----------|
| D        | 073             |   0          |   Ambient*      |
| D        | 221             |   2          |   Lowered       |
| D        | 427             |   17         |   Lowered       |
| F        | 113             |   0          |   Elevated      |
| F        | 222             |   2          |   Lowered       |
| F        | 425             |   17         |   Lowered       |
| pooled   | 380820          |   0          |   Ambient*      |
| pooled   | 380822          |   2          |   Lowered       |
| pooled   | 380824          |   0          |   Elevated      |


Trimmed individual libraries were downloaded from Gannet, available [here](https://gannet.fish.washington.edu/Atumefaciens/20200318_cbai_RNAseq_fastp_trimming/), at 11:00 PST on 2021-03-08.

Trimmed pooled libraries were downloaded from Gannet, available [here](https://gannet.fish.washington.edu/Atumefaciens/20200414_cbai_RNAseq_fastp_trimming/), at 12:00 PST on 2021-03-08


Transcriptome used is **cbai_transcriptomev4.0**. 

Link and background info available [here](https://robertslab.github.io/sams-notebook/2021/03/17/Transcriptome-Assembly-C.bairdi-Transcriptome-v4.0-Using-Trinity-on-Mox.html). 

Direct link to folder with data available [here](https://gannet.fish.washington.edu/Atumefaciens/20210317_cbai_trinity_RNAseq_transcriptome-v4.0/cbai_transcriptome_v4.0.fasta_trinity_out_dir/). 

Transcriptome md5sum is 6450d6f5650bfb5f910a5f42eef94913

Transcriptome was downloaded at 2:00 PST on 2021-03-18


## Download individual libraries

Working on Roadrunner as user afcoyle. Working directory: ~/projects/hemat_bairdi_transcriptome/scripts to start

### If you have already run 11_download_libraries_run_kallisto or 31_download_libraries_run_kallisto, you have already downloaded all these same libraries - no need to re-download. Skip to downloading the transcriptome and building the kallisto index

In [None]:
# Download all files in directory
!wget --no-check-certificate --no-parent --recursive --reject "index.html" https://gannet.fish.washington.edu/Atumefaciens/20200318_cbai_RNAseq_fastp_trimming/

In [None]:
# Move into new directory
%cd gannet.fish.washington.edu/Atumefaciens/20200318_cbai_RNAseq_fastp_trimming/

In [None]:
# Remove all files that aren't .fq.gz or .md5
!rm *.html
!rm *.zip
!rm index.html*
!rm *.json
!rm *.sh
!rm *.log
!rm *.out
!rm *.txt
!rm -r multiqc_data*

In [None]:
# Move files from data/gannet.fish.washington.edu/Atumefaciens/20200318_cbai_RNAseq_fastp_trimming into data/libraries
%cd ..
!mv 20200318_cbai_RNAseq_fastp_trimming/* ../../../data/libraries/
# Delete old directory
%cd ../..
!rm -r gannet.fish.washington.edu
# Move into directory with all libraries
%cd ../data/libraries/

In [None]:
# Rename checksum file to clarify it is specific to individual libraries
!mv trimmed_fastq_checksums.md5 trimmed_indivfastq_checksums.md5

In [None]:
# Check that files downloaded properly with checksums
!md5sum -c trimmed_indivfastq_checksums.md5
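
If `md5sum` is not available (for example, on a machine without GNU coreutils), the same check can be approximated in Python. This is a minimal sketch, not the method used in this notebook: the function names `md5_of_file` and `verify_checksums` are hypothetical, and it assumes each line of the checksum file has the usual `digest  filename` layout, with filenames relative to the directory holding the checksum file.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_checksums(checksum_file):
    """Check every 'digest  filename' line; return {filename: True/False}."""
    checksum_file = Path(checksum_file)
    results = {}
    for line in checksum_file.read_text().splitlines():
        if not line.strip():
            continue
        expected, name = line.split(maxsplit=1)
        name = name.lstrip("*")  # md5sum marks binary-mode entries with '*'
        # Assumption: filenames are relative to the checksum file's directory
        target = checksum_file.parent / name
        results[name] = target.is_file() and md5_of_file(target) == expected
    return results
```

Calling `verify_checksums("trimmed_indivfastq_checksums.md5")` from within data/libraries would return a dictionary in which any `False` value flags a missing or corrupted file.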

## Download pooled libraries


In [None]:
# Move up a directory to keep download simpler
%cd ..

In [None]:
# Download all files in directory
!wget --no-check-certificate --no-parent --recursive --reject "index.html" https://gannet.fish.washington.edu/Atumefaciens/20200414_cbai_RNAseq_fastp_trimming/

In [None]:
# Move into our new file structure
%cd gannet.fish.washington.edu/Atumefaciens/20200414_cbai_RNAseq_fastp_trimming

In [None]:
# Remove all files that aren't .fq.gz or .md5
!rm *.html
!rm *.zip
!rm index.html*
!rm *.json
!rm *.log
!rm *.out
!rm *.txt
!rm -r multiqc*

In [None]:
!mv trimmed_fastq_checksums.md5 trimmed_pooledfastq_checksums.md5

In [None]:
# Check that files downloaded properly with checksums
!md5sum -c trimmed_pooledfastq_checksums.md5

In [None]:
# Move files from data/gannet.fish.washington.edu/Atumefaciens/20200414_cbai_RNAseq_fastp_trimming into data/libraries
%cd ..
!mv 20200414_cbai_RNAseq_fastp_trimming/* ../../libraries
# Delete old directory
%cd ../..
!rm -r gannet.fish.washington.edu
%cd libraries

In [None]:
# Merge libraries by lanes, removing un-merged files
!cat 380820_S1_L001_R1_001.fastp-trim.202004143431.fq.gz 380820_S1_L002_R1_001.fastp-trim.202004143700.fq.gz > 380820_S1_R1_001.fastp-trim.fq.gz
!cat 380820_S1_L001_R2_001.fastp-trim.202004143431.fq.gz 380820_S1_L002_R2_001.fastp-trim.202004143700.fq.gz > 380820_S1_R2_001.fastp-trim.fq.gz
!rm 380820_S1_L00*
!cat 380821_S2_L001_R1_001.fastp-trim.202004143925.fq.gz 380821_S2_L002_R1_001.fastp-trim.202004144145.fq.gz > 380821_S2_R1_001.fastp-trim.fq.gz
!cat 380821_S2_L001_R2_001.fastp-trim.202004143925.fq.gz 380821_S2_L002_R2_001.fastp-trim.202004144145.fq.gz > 380821_S2_R2_001.fastp-trim.fq.gz
!rm 380821_S2_L00*
!cat 380822_S3_L001_R1_001.fastp-trim.202004144409.fq.gz 380822_S3_L002_R1_001.fastp-trim.202004144633.fq.gz > 380822_S3_R1_001.fastp-trim.fq.gz
!cat 380822_S3_L001_R2_001.fastp-trim.202004144409.fq.gz 380822_S3_L002_R2_001.fastp-trim.202004144633.fq.gz > 380822_S3_R2_001.fastp-trim.fq.gz
!rm 380822_S3_L00*
!cat 380823_S4_L001_R1_001.fastp-trim.202004144852.fq.gz 380823_S4_L002_R1_001.fastp-trim.202004145106.fq.gz > 380823_S4_R1_001.fastp-trim.fq.gz
!cat 380823_S4_L001_R2_001.fastp-trim.202004144852.fq.gz 380823_S4_L002_R2_001.fastp-trim.202004145106.fq.gz > 380823_S4_R2_001.fastp-trim.fq.gz
!rm 380823_S4_L00*
!cat 380824_S5_L001_R1_001.fastp-trim.202004145320.fq.gz 380824_S5_L002_R1_001.fastp-trim.202004145558.fq.gz > 380824_S5_R1_001.fastp-trim.fq.gz
!cat 380824_S5_L001_R2_001.fastp-trim.202004145320.fq.gz 380824_S5_L002_R2_001.fastp-trim.202004145558.fq.gz > 380824_S5_R2_001.fastp-trim.fq.gz
!rm 380824_S5_L00*
!cat 380825_S6_L001_R1_001.fastp-trim.202004145835.fq.gz 380825_S6_L002_R1_001.fastp-trim.202004140109.fq.gz > 380825_S6_R1_001.fastp-trim.fq.gz
!cat 380825_S6_L001_R2_001.fastp-trim.202004145835.fq.gz 380825_S6_L002_R2_001.fastp-trim.202004140109.fq.gz > 380825_S6_R2_001.fastp-trim.fq.gz
!rm 380825_S6_L00*
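
The lane merges above all follow one pattern, so they could also be generated rather than typed out. The sketch below is an illustration under stated assumptions, not what was run here: `merge_lanes` and `LANE_PATTERN` are hypothetical names, and the regular expression assumes every pooled file is named like `380820_S1_L001_R1_001.fastp-trim.<timestamp>.fq.gz`. It relies on the same property as `cat`: gzip files concatenated byte-for-byte form a valid multi-member gzip file.

```python
import re
import shutil
from collections import defaultdict
from pathlib import Path

# Assumed naming scheme: <sample>_S<n>_L<lane>_R<1|2>_001.fastp-trim.<timestamp>.fq.gz
LANE_PATTERN = re.compile(
    r"^(?P<sample>\d+_S\d+)_L(?P<lane>\d{3})_(?P<read>R[12])_001"
    r"\.fastp-trim\.\d+\.fq\.gz$"
)


def merge_lanes(library_dir, remove_inputs=False):
    """Concatenate per-lane gzip files into one file per sample and read."""
    library_dir = Path(library_dir)
    groups = defaultdict(list)
    # Collect all lane files first, grouped by (sample, read direction)
    for path in library_dir.iterdir():
        match = LANE_PATTERN.match(path.name)
        if match:
            groups[(match["sample"], match["read"])].append((match["lane"], path))

    merged = []
    for (sample, read), lanes in sorted(groups.items()):
        out_path = library_dir / f"{sample}_{read}_001.fastp-trim.fq.gz"
        with open(out_path, "wb") as out:
            for _, lane_path in sorted(lanes):  # L001 before L002
                with open(lane_path, "rb") as src:
                    shutil.copyfileobj(src, out)
        if remove_inputs:
            for _, lane_path in lanes:
                lane_path.unlink()
        merged.append(out_path)
    return merged
```

Running `merge_lanes(".", remove_inputs=True)` inside data/libraries would mirror the `cat` and `rm` pairs above. Leaving `remove_inputs` at its default keeps the per-lane files until the merged output has been checked.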

## Download transcriptome
Transcriptome is filtered by taxa to only include presumed _C. bairdi_ sequences

In [None]:
%cd ../transcriptomes
# Download transcriptome 4.0
!curl -O -k https://gannet.fish.washington.edu/Atumefaciens/20210317_cbai_trinity_RNAseq_transcriptome-v4.0/cbai_transcriptome_v4.0.fasta_trinity_out_dir/cbai_transcriptome_v4.0.fasta
# Transcriptome checksum: 6450d6f5650bfb5f910a5f42eef94913
!md5sum cbai_transcriptome_v4.0.fasta | grep "6450d6f5650bfb5f910a5f42eef94913"
# Matches

## Create an index for kallisto
Warning: if running on local machine, could take days. Ran in much shorter time on Roadrunner.

If kallisto is not installed, follow instructions [here](http://pachterlab.github.io/kallisto/download)

In [None]:
%cd ../../output/kallisto_indices
# Index for transcriptome
!kallisto index -i kallisto_cbai_index_v4.0.idx ../../data/transcriptomes/cbai_transcriptome_v4.0.fasta


## Run kallisto quantification for all libraries for cbai_transcriptomev4.0, starting with individual libraries
Order is same as table at start of script, with all pooled libraries examined last

In [None]:
# Quantify ID 272
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id272 \
../../data/libraries/272_R1_001.fastp-trim.202003184536.fq.gz \
../../data/libraries/272_R2_001.fastp-trim.202003184536.fq.gz \
2> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 294
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id294 \
../../data/libraries/294_R1_001.fastp-trim.202003180701.fq.gz \
../../data/libraries/294_R2_001.fastp-trim.202003180701.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 280
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id280 \
../../data/libraries/280_R1_001.fastp-trim.202003185124.fq.gz \
../../data/libraries/280_R2_001.fastp-trim.202003185124.fq.gz  \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 173
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id173 \
../../data/libraries/173_R1_001.fastp-trim.202003181159.fq.gz \
../../data/libraries/173_R2_001.fastp-trim.202003181159.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 072
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id072 \
../../data/libraries/72_R1_001.fastp-trim.202003181709.fq.gz \
../../data/libraries/72_R2_001.fastp-trim.202003181709.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 127
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id127 \
../../data/libraries/127_R1_001.fastp-trim.202003185538.fq.gz \
../../data/libraries/127_R2_001.fastp-trim.202003185538.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 178
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id178 \
../../data/libraries/178_R1_001.fastp-trim.202003181815.fq.gz \
../../data/libraries/178_R2_001.fastp-trim.202003181815.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 359
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id359 \
../../data/libraries/359_R1_001.fastp-trim.202003182247.fq.gz \
../../data/libraries/359_R2_001.fastp-trim.202003182247.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 463
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id463 \
../../data/libraries/463_R1_001.fastp-trim.202003185732.fq.gz \
../../data/libraries/463_R2_001.fastp-trim.202003185732.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 118
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id118 \
../../data/libraries/118_R1_001.fastp-trim.202003184931.fq.gz \
../../data/libraries/118_R2_001.fastp-trim.202003184931.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 349
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id349 \
../../data/libraries/349_R1_001.fastp-trim.202003181609.fq.gz \
../../data/libraries/349_R2_001.fastp-trim.202003181609.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 481
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id481 \
../../data/libraries/481_R1_001.fastp-trim.202003180047.fq.gz \
../../data/libraries/481_R2_001.fastp-trim.202003180047.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 132
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id132 \
../../data/libraries/132_R1_001.fastp-trim.202003180140.fq.gz \
../../data/libraries/132_R2_001.fastp-trim.202003180140.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 334
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id334 \
../../data/libraries/334_R1_001.fastp-trim.202003181149.fq.gz \
../../data/libraries/334_R2_001.fastp-trim.202003181149.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 485
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id485 \
../../data/libraries/485_R1_001.fastp-trim.202003181245.fq.gz \
../../data/libraries/485_R2_001.fastp-trim.202003181245.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 151
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id151 \
../../data/libraries/151_R1_001.fastp-trim.202003180619.fq.gz \
../../data/libraries/151_R2_001.fastp-trim.202003180619.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 254
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id254 \
../../data/libraries/254_R1_001.fastp-trim.202003184228.fq.gz \
../../data/libraries/254_R2_001.fastp-trim.202003184228.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 445
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id445 \
../../data/libraries/445_R1_001.fastp-trim.202003185018.fq.gz \
../../data/libraries/445_R2_001.fastp-trim.202003185018.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

Continue with building libraries for individual libraries of uninfected crab

In [None]:
# Quantify ID 073
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id073 \
../../data/libraries/73_R1_001.fastp-trim.202003182229.fq.gz \
../../data/libraries/73_R2_001.fastp-trim.202003182229.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 221
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id221 \
../../data/libraries/221_R1_001.fastp-trim.202003182309.fq.gz \
../../data/libraries/221_R2_001.fastp-trim.202003182309.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 427
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id427 \
../../data/libraries/427_R1_001.fastp-trim.202003184618.fq.gz \
../../data/libraries/427_R2_001.fastp-trim.202003184618.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 113
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id113 \
../../data/libraries/113_R1_001.fastp-trim.202003184225.fq.gz \
../../data/libraries/113_R2_001.fastp-trim.202003184225.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 222
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id222 \
../../data/libraries/222_R1_001.fastp-trim.202003182800.fq.gz \
../../data/libraries/222_R2_001.fastp-trim.202003182800.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify ID 425
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id425 \
../../data/libraries/425_R1_001.fastp-trim.202003182722.fq.gz \
../../data/libraries/425_R2_001.fastp-trim.202003182722.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

Continue with building libraries for pooled samples of infected crab

In [None]:
# Quantify 380821
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id380821 \
../../data/libraries/380821_S2_R1_001.fastp-trim.fq.gz \
../../data/libraries/380821_S2_R2_001.fastp-trim.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify 380823
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id380823 \
../../data/libraries/380823_S4_R1_001.fastp-trim.fq.gz \
../../data/libraries/380823_S4_R2_001.fastp-trim.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify 380825
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id380825 \
../../data/libraries/380825_S6_R1_001.fastp-trim.fq.gz \
../../data/libraries/380825_S6_R2_001.fastp-trim.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

Continue with building libraries for pooled samples of uninfected crab

In [None]:
# Quantify 380820
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id380820 \
../../data/libraries/380820_S1_R1_001.fastp-trim.fq.gz \
../../data/libraries/380820_S1_R2_001.fastp-trim.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify 380822
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id380822 \
../../data/libraries/380822_S3_R1_001.fastp-trim.fq.gz \
../../data/libraries/380822_S3_R2_001.fastp-trim.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt

In [None]:
# Quantify 380824
!kallisto quant \
-i kallisto_cbai_index_v4.0.idx \
-o ../kallisto_libraries_cbai_transcriptomev4.0/id380824 \
../../data/libraries/380824_S5_R1_001.fastp-trim.fq.gz \
../../data/libraries/380824_S5_R2_001.fastp-trim.fq.gz \
2>> ../kallisto_libraries_cbai_transcriptomev4.0/std_errortracking.txt
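
Every quantification cell above has the same shape: one index, one output directory per library, and an R1/R2 pair. As a hedged alternative to writing each cell by hand, the commands could be assembled in Python. In this sketch, `build_quant_commands` and `R1_PATTERN` are hypothetical names, the filename pattern is an assumption based on the files used above, and output directories are named `id` plus the library ID zero-padded to three digits (so library 72 maps to `id072`). The function only builds argument lists and does not run kallisto.

```python
import re
from pathlib import Path

# Assumed R1 naming: <lib>[_S<n>]_R1_001.fastp-trim[.<timestamp>].fq.gz
R1_PATTERN = re.compile(
    r"^(?P<lib>\d+)(?:_S\d+)?_R1_001\.fastp-trim(?:\.\d+)?\.fq\.gz$"
)


def build_quant_commands(library_dir, index_path, output_root):
    """Return one `kallisto quant` argument list per paired-end library."""
    commands = []
    for r1 in sorted(Path(library_dir).iterdir()):
        match = R1_PATTERN.match(r1.name)
        if not match:
            continue
        r2 = r1.with_name(r1.name.replace("_R1_", "_R2_", 1))
        if not r2.exists():
            continue  # skip libraries without a mate file
        out_dir = Path(output_root) / f"id{int(match['lib']):03d}"
        commands.append(
            ["kallisto", "quant", "-i", str(index_path),
             "-o", str(out_dir), str(r1), str(r2)]
        )
    return commands
```

Each returned list can be printed for inspection or passed to `subprocess.run(cmd, check=True)`. Redirecting standard error to std_errortracking.txt, as the cells above do, would need to be added separately.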

In [None]:
%cd ../kallisto_libraries_cbai_transcriptomev4.0/

In [None]:
# Create file of checksums for std error file and all subdirectories. This only goes one level down, but that's all we need
!md5sum * */* > checksums.md5

## Move kallisto libraries to local machine

Commands from here are run on local machine unless otherwise specified

In [1]:
!pwd

/mnt/c/Users/acoyl/Documents/GitHub/hemat_bairdi_transcriptome/scripts


In [None]:
# Download files from remote machine to local directory. Run this on command line directly, as interactivity is needed
!rsync -chavzP --stats \
afcoyle@roadrunner.fish.washington.edu:/home/afcoyle/projects/hemat_bairdi_transcriptome/output/kallisto_libraries_cbai_transcriptomev4.0/* \
/mnt/c/Users/acoyl/Documents/GitHub/hemat_bairdi_transcriptome/output/kallisto_libraries/cbai_transcriptomev4.0/

In [5]:
# Move into directory with new files to check rsync worked properly
# Note: use %cd because !cd has no lasting effect in a Jupyter notebook - each ! command runs in its own temporary subshell
%cd ../output/kallisto_libraries/cbai_transcriptomev4.0/

/mnt/c/Users/acoyl/Documents/GitHub/hemat_bairdi_transcriptome/output/kallisto_libraries/cbai_transcriptomev4.0
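
The note about `!cd` in the cell above can be illustrated in plain Python. This is a small sketch with hypothetical function names, and it assumes a POSIX `sh` is available. A `cd` issued in a child shell ends with that shell, while changing the directory of the current process persists, which is what `%cd` does for the notebook kernel.

```python
import os
import subprocess
import tempfile


def cwd_after_subshell_cd(target_dir):
    """Run `cd` in a child shell and report the parent's working directory."""
    subprocess.run(["sh", "-c", 'cd "$1"', "sh", target_dir], check=True)
    return os.getcwd()  # unchanged: the child shell has already exited


def cwd_after_chdir(target_dir):
    """Change the directory of the current process itself."""
    os.chdir(target_dir)
    return os.getcwd()
```

The first function returns the original directory regardless of `target_dir`. The second returns the new one.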


In [7]:
!md5sum --check checksums.md5

std_errortracking.txt: OK
id072/abundance.h5: OK
id072/abundance.tsv: OK
id072/run_info.json: OK
id118/abundance.h5: OK
id118/abundance.tsv: OK
id118/run_info.json: OK
id127/abundance.h5: OK
id127/abundance.tsv: OK
id127/run_info.json: OK
id132/abundance.h5: OK
id132/abundance.tsv: OK
id132/run_info.json: OK
id151/abundance.h5: OK
id151/abundance.tsv: OK
id151/run_info.json: OK
id173/abundance.h5: OK
id173/abundance.tsv: OK
id173/run_info.json: OK
id178/abundance.h5: OK
id178/abundance.tsv: OK
id178/run_info.json: OK
id254/abundance.h5: OK
id254/abundance.tsv: OK
id254/run_info.json: OK
id272/abundance.h5: OK
id272/abundance.tsv: OK
id272/run_info.json: OK
id280/abundance.h5: OK
id280/abundance.tsv: OK
id280/run_info.json: OK
id294/abundance.h5: OK
id294/abundance.tsv: OK
id294/run_info.json: OK
id334/abundance.h5: OK
id334/abundance.tsv: OK
id334/run_info.json: OK
id349/abundance.h5: OK
id349/abundance.tsv: OK
id349/run_info.json: OK
id359/abundance.h5: OK
id359/abundance.tsv: OK
id35

In [9]:
# Move back into our scripts directory
%cd ../../../scripts/

/mnt/c/Users/acoyl/Documents/GitHub/hemat_bairdi_transcriptome/scripts


## Begin building transcript expression matrices
Again, we are using cbai_transcriptomev4.0, which only includes sequences presumed to be _C. bairdi_

Now working on local machine. Build matrices to compare Day 0/2 ambient-temperature crabs vs. Day 2 elevated-temperature crabs

#### This utilizes a perl script from the Trinity pipeline, available [here](https://github.com/trinityrnaseq/trinityrnaseq/wiki/Trinity-Transcript-Quantification)

It'll throw an error like `Error, cmd: R --no-save --no-restore --no-site-file --no-init-file -q < kallisto.isoform.TPM.not_cross_norm.runTMM.R 1>&2  died with ret (32512)  at /mnt/c/Users/acoyl/Documents/GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/support_scripts/run_TMM_scale_matrix.pl line 105.
Error, CMD: /mnt/c/Users/acoyl/Documents/GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/support_scripts/run_TMM_scale_matrix.pl --matrix kallisto.isoform.TPM.not_cross_norm > kallisto.isoform.TMM.EXPR.matrix died with ret 6400 at ../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl line 385.
`

But that's totally fine - the script should have already given us the matrix we need
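
To make clear what the counts matrix contains, here is a minimal Python sketch that combines the `est_counts` column of several kallisto `abundance.tsv` files into one table. It is not Trinity's implementation and performs no normalization. The names `build_counts_matrix` and `write_matrix` are hypothetical, and samples are named after the directory holding each file, loosely mirroring `--name_sample_by_basedir`.

```python
import csv
from pathlib import Path


def build_counts_matrix(abundance_paths, column="est_counts"):
    """Combine one column from several kallisto abundance.tsv files.

    Returns (samples, rows): rows maps each target_id to a list of values
    ordered like samples. Targets absent from a file keep a value of 0.0.
    """
    paths = [Path(p) for p in abundance_paths]
    samples = [p.parent.name for p in paths]  # e.g. 'id173'
    rows = {}
    for index, path in enumerate(paths):
        with open(path, newline="") as handle:
            for record in csv.DictReader(handle, delimiter="\t"):
                values = rows.setdefault(record["target_id"], [0.0] * len(paths))
                values[index] = float(record[column])
    return samples, rows


def write_matrix(samples, rows, out_path):
    """Write the matrix as tab-separated text, one transcript per line."""
    with open(out_path, "w", newline="") as handle:
        writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
        writer.writerow([""] + samples)
        for target_id in sorted(rows):
            writer.writerow([target_id] + rows[target_id])
```

Passing the same list of `abundance.tsv` paths given to the Perl script should yield a table comparable to kallisto.isoform.counts.matrix, though row order and number formatting may differ.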

In [10]:
!pwd

/mnt/c/Users/acoyl/Documents/GitHub/hemat_bairdi_transcriptome/scripts


#### Elevated Day 0 vs. Elevated Day 2, individual libraries only

Build a matrix to compare infected individual Elevated libraries from Day 0 with individual Elevated libraries from Day 2. Reminder: Day 0 samples were taken when all crabs were held at ambient-temperature waters, and the same crabs were sampled on Day 0 and Day 2.

Effectively, this compares the same infected crab prior to exposure to elevated temps and post-exposure.

In [11]:
# Use the perl script from Trinity to create matrix. Output will appear to have errors, but runs just fine for our purposes - all we need is the kallisto.isoform.counts.matrix file
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id173/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id072/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id127/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id272/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id294/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id280/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id173/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id072/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id127/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id272/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id294/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id280/abundance.tsv


* Outputting combined matrix.

/mnt/c/Users/acoyl/Documents/GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/support_scripts/run_TMM_scale_matrix.pl --matrix kallisto.isoform.TPM.not_cross_norm > kallisto.isoform.TMM.EXPR.matrixCMD: R --no-save --no-restore --no-site-file --no-init-file -q < kallisto.isoform.TPM.not_cross_norm.runTMM.R 1>&2 
/mnt/c/Users/acoyl/Downloads/anaconda3/lib/R/bin/exec/R: error while loading shared libraries: libreadline.so.6: canno

In [12]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/elev0_vs_elev2_indiv/

#### Lowered Day 0 vs. Lowered Day 2

In [1]:
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id221/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id254/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id222/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id221/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id254/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id222/abundance.tsv


* Outputting combined matrix.

/mnt/c/Users/acoyl/Documents/GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/support_scripts/run_TMM_scale_matrix.pl --matrix kallisto.isoform.TPM.not_cross_norm > kallisto.isoform.TMM.EXPR.matrixCMD: R --no-save --no-restore --no-site-file --no-init-file -q < kallisto.isoform.TPM.not_cross_norm.runTMM.R 1>&2 
/mnt/c/Users/acoyl/Downloads/anaconda3/lib/R/bin/exec/R: error while loading shared libraries: libreadline.so.6: canno

In [2]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/low0_vs_low2_indiv/

#### Lowered Day 0 vs. Lowered Day 2 + 17

In [3]:
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id221/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id254/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id222/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id427/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id445/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id425/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id221/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id254/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id222/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id427/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id445/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id425/abundance.tsv


* Outputting combined matrix.

/mnt/c/Users/acoyl/Documents/GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/support_scripts/run_TMM_scale_matrix.pl --matrix kallisto.isoform.TPM.not_cross_norm > kalli

In [4]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/low0_vs_low217_indiv/

#### Lowered Day 0 + Ambient Day 0 + 2 + 17 vs. Lowered Day 2 + 17

In [5]:
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id485/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id221/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id254/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id222/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id427/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id445/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id425/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv
-reading file: ../output/kallisto_libraries

In [6]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/low0_amb0217_vs_low217_indiv/

#### Lowered Day 2 + 17 vs. Ambient Day 2 + 17

In [7]:
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id221/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id254/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id222/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id427/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id445/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id425/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id485/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id221/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id254/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id222/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id427/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id445/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id425/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv
-reading file: ../output/kallisto_libraries

In [8]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/low217_vs_amb217_indiv/

#### Ambient Day 2 vs. Elevated Day 2, individual libraries only

In [13]:
# Use the Perl script from Trinity to create the matrix. The output ends with an R error
# (the TMM normalization step cannot load libreadline.so.6), but that is fine for our
# purposes - all we need is the kallisto.isoform.counts.matrix file
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id272/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id294/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id280/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id272/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id294/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id280/abundance.tsv


* Outputting combined matrix.

/mnt/c/Users/acoyl/Documents/GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/support_scripts/run_TMM_scale_matrix.pl --matrix kallisto.isoform.TPM.not_cross_norm > kallisto.isoform.TMM.EXPR.matrixCMD: R --no-save --no-restore --no-site-file --no-init-file -q < kallisto.isoform.TPM.not_cross_norm.runTMM.R 1>&2 
/mnt/c/Users/acoyl/Downloads/anaconda3/lib/R/bin/exec/R: error while loading shared libraries: libreadline.so.6: canno

In [14]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/amb2_vs_elev2_indiv/
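
Because the R step in the output above fails, it can be worth confirming that the counts matrix was written and contains every library before moving on. This is a minimal sketch, assuming the matrix is tab-delimited with a header row whose first field labels the transcript column (possibly empty) and whose remaining fields are sample names. Both helper names are hypothetical.

```python
import csv
from pathlib import Path


def read_matrix_samples(matrix_path):
    """Return the sample names from the header row of a tab-delimited matrix.

    Assumption: the first header field belongs to the transcript column
    (it may be empty) and the remaining fields are sample names.
    """
    path = Path(matrix_path)
    if not path.is_file():
        raise FileNotFoundError(f"matrix not found: {path}")
    with path.open(newline="") as handle:
        header = next(csv.reader(handle, delimiter="\t"), [])
    return header[1:]


def missing_samples(matrix_path, expected_ids):
    """List the expected sample IDs that are absent from the matrix header."""
    present = set(read_matrix_samples(matrix_path))
    return [sample for sample in expected_ids if sample not in present]
```

Since the cells use `--name_sample_by_basedir`, the sample names are expected to match the library folder names (`id359`, `id349`, and so on). An empty list from `missing_samples` would indicate that every library listed in the cell made it into the matrix.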

#### Lowered Day 0 vs. Ambient Day 0

In [9]:
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv


* Outputting combined matrix.

/mnt/c/Users/acoyl/Documents/GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/support_scripts/run_TMM_scale_matrix.pl --matrix kallisto.isoform.TPM.not_cross_norm > kallisto.isoform.TMM.EXPR.matrixCMD: R --no-save --no-restore --no-site-file --no-init-file -q < kallisto.isoform.TPM.not_cross_norm.runTMM.R 1>&2 
/mnt/c/Users/acoyl/Downloads/anaconda3/lib/R/bin/exec/R: error while loading shared libraries: libreadline.so.6: canno

In [11]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/low0_vs_amb0_indiv/

#### Ambient Day 0 vs. Ambient Day 2, individual libraries only

In [1]:
# Use the Perl script from Trinity to create the matrix. The output ends with an R error
# (the TMM normalization step cannot load libreadline.so.6), but that is fine for our
# purposes - all we need is the kallisto.isoform.counts.matrix file
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv


* Outputting combined matrix.

/mnt/c/Users/acoyl/Documents/GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/support_scripts/run_TMM_scale_matrix.pl --matrix kallisto.isoform.TPM.not_cross_norm > kallisto.isoform.TMM.EXPR.matrixCMD: R --no-save --no-restore --no-site-file --no-init-file -q < kallisto.isoform.TPM.not_cross_norm.runTMM.R 1>&2 
/mnt/c/Users/acoyl/Downloads/anaconda3/lib/R/bin/exec/R: error while loading shared libraries: libreadline.so.6: canno

In [2]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/amb0_vs_amb2_indiv/

#### Ambient Day 0 vs. Ambient Day 17

In [1]:
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id485/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id485/abundance.tsv


* Outputting combined matrix.

/mnt/c/Users/acoyl/Documents/GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/support_scripts/run_TMM_scale_matrix.pl --matrix kallisto.isoform.TPM.not_cross_norm > kallisto.isoform.TMM.EXPR.matrixCMD: R --no-save --no-restore --no-site-file --no-init-file -q < kallisto.isoform.TPM.not_cross_norm.runTMM.R 1>&2 
/mnt/c/Users/acoyl/Downloads/anaconda3/lib/R/bin/exec/R: error while loading shared libraries: libreadline.so.6: canno

In [2]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/amb0_vs_amb17_indiv/

#### Ambient Day 0, 2, 17, individual libraries only

This matrix is not used in scripts 40-49. Instead, it tracks gene expression in individual ambient-temperature crabs across all three time points (Days 0, 2, and 17). Analysis continues in 51_DEGpattern_test.Rmd.

In [1]:
# Use the Perl script from Trinity to create the matrix. The output may report an error from
# the TMM normalization step, but that is fine for our purposes - all we need is the
# kallisto.isoform.counts.matrix file
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id485/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id485/abundance.tsv


* Outputting combined matrix.

/mnt/c/Users/acoyl/Documents/GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/support_scripts/run_TMM_scale_matrix.pl --matrix kallisto.isoform.TPM.not_cross_norm > kalli

In [2]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/amb0_amb2_amb17_indiv/

#### Lowered Day 0 vs. Lowered Day 17

In [3]:
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id427/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id445/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id425/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id427/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id445/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id425/abundance.tsv


* Outputting combined matrix.

/mnt/c/Users/acoyl/Documents/GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/support_scripts/run_TMM_scale_matrix.pl --matrix kallisto.isoform.TPM.not_cross_norm > kallisto.isoform.TMM.EXPR.matrixCMD: R --no-save --no-restore --no-site-file --no-init-file -q < kallisto.isoform.TPM.not_cross_norm.runTMM.R 1>&2 
/mnt/c/Users/acoyl/Downloads/anaconda3/lib/R/bin/exec/R: error while loading shared libraries: libreadline.so.6: canno

In [4]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/low0_vs_low17_indiv/

#### Lowered Day 0 + Ambient Day 0 vs. Lowered Day 17 + Ambient Day 17

In [5]:
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id427/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id445/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id425/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id485/abundance.tsv 

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id427/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id445/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id425/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv
-reading file: ../output/kallisto_libraries

In [6]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/low0_amb0_vs_low17_amb17_indiv/

#### Lowered Day 0 + Elevated Day 0 vs. Lowered Day 2 + Elevated Day 2

In [1]:
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id173/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id072/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id127/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id221/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id254/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id222/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id272/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id294/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id280/abundance.tsv 

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id173/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id072/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id127/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id221/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id254/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id222/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id272/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id294/abundance.tsv
-reading file: ../output/kallisto_libraries

In [2]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/low0_elev0_vs_low2_elev2_indiv/
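
Every comparison above draws on the same eight groups of three libraries. The grouping below is inferred from which IDs appear under each heading in this notebook; it is not stated explicitly anywhere, so treat it as an assumption. The group keys and the `paths_for` helper are hypothetical names. The sketch generates the `abundance.tsv` path lists instead of typing them by hand.

```python
# Library IDs per treatment group, inferred from the cells in this notebook.
GROUPS = {
    "amb_day0": ["id178", "id118", "id132"],
    "amb_day2": ["id359", "id349", "id334"],
    "amb_day17": ["id463", "id481", "id485"],
    "low_day0": ["id073", "id151", "id113"],
    "low_day2": ["id221", "id254", "id222"],
    "low_day17": ["id427", "id445", "id425"],
    "elev_day0": ["id173", "id072", "id127"],
    "elev_day2": ["id272", "id294", "id280"],
}

# Directory holding one kallisto output folder per library (from the cells above).
LIBRARY_DIR = "../output/kallisto_libraries/cbai_transcriptomev4.0"


def paths_for(*group_names, library_dir=LIBRARY_DIR):
    """Return the abundance.tsv paths for the named groups, in the order given."""
    paths = []
    for name in group_names:
        for sample_id in GROUPS[name]:
            paths.append(f"{library_dir}/{sample_id}/abundance.tsv")
    return paths
```

For example, `paths_for("low_day0", "low_day17")` yields the six paths used for the Lowered Day 0 vs. Lowered Day 17 comparison, in the same order as the cell.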

## All Individual Crabs

Create one more matrix containing all individual libraries. This matrix is used to run a PCA across every library in the analysis.

In [2]:
!../../../GradSchool/RobertsLab/Tools/Trinity/trinityrnaseq-v2.11.0/util/abundance_estimates_to_matrix.pl \
--est_method kallisto \
--gene_trans_map 'none' \
--out_prefix kallisto \
--name_sample_by_basedir \
../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id485/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id221/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id427/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id151/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id254/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id445/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id113/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id222/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id425/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id173/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id272/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id072/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id294/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id127/abundance.tsv \
../output/kallisto_libraries/cbai_transcriptomev4.0/id280/abundance.tsv

-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id178/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id359/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id463/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id118/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id349/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id481/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id132/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id334/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id485/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id073/abundance.tsv
-reading file: ../output/kallisto_libraries/cbai_transcriptomev4.0/id221/abundance.tsv
-reading file: ../output/kallisto_libraries

In [3]:
# Since the script outputs files in the working directory without an 
# option to change output dir, move all output over manually
!mv kallisto.isoform.* ../output/kallisto_matrices/cbai_transcriptomev4.0/all_indiv_libraries/
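
The Trinity call used throughout this notebook is identical apart from the list of libraries. The sketch below assembles that call as an argument list, using only the flags that appear in the cells above. `build_matrix_command` and its `script` argument are hypothetical names, and the command is built and printed but not executed here.

```python
import shlex


def build_matrix_command(script, sample_ids,
                         library_dir="../output/kallisto_libraries/cbai_transcriptomev4.0"):
    """Assemble the abundance_estimates_to_matrix.pl call as an argv list.

    Raises ValueError if the same library ID is listed more than once,
    which guards against pasting a library into the list twice.
    """
    duplicates = sorted({s for s in sample_ids if sample_ids.count(s) > 1})
    if duplicates:
        raise ValueError(f"duplicate sample IDs: {duplicates}")
    command = [
        script,
        "--est_method", "kallisto",
        "--gene_trans_map", "none",
        "--out_prefix", "kallisto",
        "--name_sample_by_basedir",
    ]
    command += [f"{library_dir}/{sample_id}/abundance.tsv" for sample_id in sample_ids]
    return command


# Print a copy-pasteable shell command for three of the libraries.
cmd = build_matrix_command("abundance_estimates_to_matrix.pl", ["id178", "id359", "id463"])
print(shlex.join(cmd))
```

The returned list could also be passed to `subprocess.run(cmd, check=True)` on a machine where the Trinity script path is valid.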

This completes our kallisto analysis and matrix creation. Next, move to the Rmd file 42_kallisto_to_deseq_to_accessionIDs to begin differential gene expression analysis using DESeq2.