# IgBLAST for TCR

Given the FASTA file `A2-i131.fasta`, use IgBLAST to assign germline V, D, and J segments, and post-process using Change-O.

## Obtain reference sequences

Go to [http://www.imgt.org/vquest/refseqh.html](http://www.imgt.org/vquest/refseqh.html) and download the human TRBV, TRBD, and TRBJ sequences in FASTA format. Save them as `IMGT_Human_TRBV.fasta`, `IMGT_Human_TRBD.fasta`, and `IMGT_Human_TRBJ.fasta`.

(Advanced users can instead download the whole database from [here](http://www.imgt.org/download/GENE-DB/) and postprocess it themselves.)
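
Before building the databases, it can help to confirm that each download is a non-empty FASTA file. The following is a minimal sketch, assuming the files were saved in the current directory under the `IMGT_Human_TRB[VDJ].fasta` names; `count_fasta_records` is a hypothetical helper and not part of IgBLAST or Change-O.

```shell
# Hypothetical helper: count FASTA records by counting header lines ('>').
count_fasta_records() {
    grep -c '^>' "$1"
}

# Report the number of sequences in each expected reference file.
for seg in V D J; do
    f="IMGT_Human_TRB${seg}.fasta"
    if [ -s "$f" ]; then
        echo "$f: $(count_fasta_records "$f") sequences"
    else
        echo "$f: missing or empty" >&2
    fi
done
```

A count of zero, or a "missing or empty" message, usually means the download saved an HTML page or used a different file name.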

## Converting IMGT FASTA files to IgBLAST databases

In [None]:
%%bash
# Create the output directory if it does not exist yet
mkdir -p database
# V-segment database
perl ./edit_imgt_file.pl IMGT_Human_TRBV.fasta > database/human_trb_v
makeblastdb -parse_seqids -dbtype nucl -in database/human_trb_v
# D-segment database
perl ./edit_imgt_file.pl IMGT_Human_TRBD.fasta > database/human_trb_d
makeblastdb -parse_seqids -dbtype nucl -in database/human_trb_d
# J-segment database
perl ./edit_imgt_file.pl IMGT_Human_TRBJ.fasta > database/human_trb_j
makeblastdb -parse_seqids -dbtype nucl -in database/human_trb_j

## Use IgBLAST

IgBLAST has many options (see the `-help` output below), but the most important ones are listed here. Each is passed on the command line with a single leading dash, e.g. `-query`:

- germline_db_V: the V gene database
- germline_db_D: the D gene database
- germline_db_J: the J gene database
- auxiliary_data: a file with annotations for the germline J sequences, such as the coding frame start position
- domain_system: the system used (e.g. imgt) for defining the domains
- ig_seqtype: Ig or TCR
- organism: e.g. human, mouse
- outfmt: the output format; for postprocessing with Change-O, this has to be '7 std qseq sseq btop'
- query: the input data in FASTA format
- out: the output filename
- num_threads: the number of threads to use
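
To see how these options fit together, the sketch below assembles an `igblastn` command line as a string and prints it without running anything. It is a dry run under stated assumptions: the auxiliary file path `optional_file/human_gl.aux` and the thread count are placeholders that depend on your IgBLAST installation and machine, while the database, query, and output names follow the ones used in this notebook.

```shell
# Sketch only: assemble the igblastn call as a string and print it for review.
DB_DIR=database                      # directory used when building the databases
AUX_FILE=optional_file/human_gl.aux  # ASSUMPTION: path depends on your IgBLAST install
THREADS=4                            # ASSUMPTION: pick a value that suits your machine

CMD="igblastn \
-germline_db_V ${DB_DIR}/human_trb_v \
-germline_db_D ${DB_DIR}/human_trb_d \
-germline_db_J ${DB_DIR}/human_trb_j \
-auxiliary_data ${AUX_FILE} \
-domain_system imgt \
-ig_seqtype TCR \
-organism human \
-outfmt '7 std qseq sseq btop' \
-query A2-i131.fasta \
-out A2-i131.fmt7 \
-num_threads ${THREADS}"

# Print the assembled command on a single line.
echo "$CMD"
```

Once the printed command looks right, it can be executed with `eval "$CMD"` or pasted into a `%%bash` cell.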

In [None]:
!igblastn -help

Complete the following cell to run `A2-i131.fasta` against the TRB databases generated previously. Ensure that the `outfmt` option is '7 std qseq sseq btop', and save the output as `A2-i131.fmt7`.

In [None]:
%%bash
igblastn

## Postprocess IgBLAST

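The following cells postprocess the IgBLAST output using Change-O. `MakeDb.py` combines the IgBLAST output, the original sequences, and the IMGT reference files into a tab-delimited database; `ParseDb.py split` then splits that database on the `FUNCTIONAL` field.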

In [None]:
%%bash
MakeDb.py igblast -i A2-i131.fmt7 -s A2-i131.fasta -r IMGT_Human_TRB[VDJ].fasta \
    --regions --scores

In [None]:
%%bash
ParseDb.py split -d A2-i131_db-pass.tab -f FUNCTIONAL
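
As a quick check on the split, the values in the `FUNCTIONAL` column can be tallied directly from the database file. This is a minimal sketch, assuming `A2-i131_db-pass.tab` is tab-delimited with a header row containing a column named `FUNCTIONAL`; `tally_column` is a hypothetical helper and not part of Change-O.

```shell
# Hypothetical helper: tally the values in a named column of a tab-delimited file.
tally_column() {
    # $1 = file, $2 = column name as it appears in the header row
    awk -F '\t' -v col="$2" '
        NR == 1 { for (i = 1; i <= NF; i++) if ($i == col) c = i; next }
        c       { n[$c]++ }
        END     { for (v in n) print v "\t" n[v] }
    ' "$1" | sort
}

# Only run the tally if the MakeDb.py output is present.
if [ -f A2-i131_db-pass.tab ]; then
    tally_column A2-i131_db-pass.tab FUNCTIONAL
fi
```

The counts printed for each value should match the number of records in the corresponding files written by `ParseDb.py split`.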