The Database command can be used to prepare a protein FASTA file for the following analysis. The database can be downloaded using the UniProt Proteome ID or by using a pre-formed FASTA file.
philosopher database [flags]
Allows you to add one or more custom sequences to your database. Only UniProt FASTA formatting is allowed.
Process a ready-to-use database.
Add 116 common contaminants found in LC-MS/MS experiments. More information can be found here.
Skips the downloading of a fresh database and use an existing one instead. The custom file will also be used to create decoys and contaminants if desired.
The Proteome ID used to find and download an organism proteome. See below where to find the ID.
Allows isoform sequences to be added to the download.
Decoy prefix to be added, default is rev_.
Download only reviewed sequences from Swiss-Prot.
Download a complete human proteome snapshot without isoforms, using Trypsin for protein digestion and adding contaminants.
philosopher database --id UP000005640 --contam
Download the reviewed version of the human proteome, containing isoforms and contaminants.
philosopher database --id UP000005640 --reviewed --contam
Prepare a custom protein FASTA file for the analysis (skip the download).
philosopher database --custom protein.fas --contam
Download the complete human proteome and add external sequences from another FASTA file.
philosopher database --id UP000005640 -add spikes.fas
This example will download all reference sequences from the Human proteome, contaminants will be added to the database with the rev_ prefix and the resulting file will be digested using Lys C.
philosopher database --contam --reviewed --id UP000005640
Where can I find the UniProt Proteome ID for my organism?
The list of all existing UniProt Identifiers can be found here. Before using Philosopher, you need to search the UniProt website for the correct id.