This repository stores taxonomic information about mosquitoes. The tables and plots are built from data published by the Mosquito Taxonomic Inventory (MTI). There are three tsv files: mos_tax.tsv, built from this pdf file; aedes_tax.tsv, built from this other pdf file, which MTI describes as follows: "A second pdf lists the aedine species that are classified for convenience as members of a very large composite genus Aedes by Wilkerson et al. (2015)."; and mos_mixed_tax.tsv, a fusion of the two previous files, in which I replaced the entries of mos_tax.tsv that correspond to mosquitoes of the genus Aedes as defined by Wilkerson et al. (2015) with those from aedes_tax.tsv. Be careful about which taxonomic standard you use (see the plots in the Usage section).
This script was built with Python 3.6.5+ and has these dependencies:
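The fusion described above can be sketched with pandas. This is only an illustration under stated assumptions, not the procedure actually used to build mos_mixed_tax.tsv: the column names `genus` and `species`, the helper `mix_taxonomies`, and the toy rows are all hypothetical, and matching entries by species epithet is an assumed rule.

```python
import io

import pandas as pd

# Tiny in-memory stand-ins for mos_tax.tsv and aedes_tax.tsv.
# The column names "genus" and "species" are assumptions for illustration;
# check the real headers of the tsv files before adapting this.
mos_tsv = (
    "genus\tspecies\n"
    "Anopheles\tgambiae\n"
    "Ochlerotatus\tcaspius\n"
    "Stegomyia\taegypti\n"
)
aedes_tsv = (
    "genus\tspecies\n"
    "Aedes\tcaspius\n"
    "Aedes\taegypti\n"
)

mos = pd.read_csv(io.StringIO(mos_tsv), sep="\t")
aedes = pd.read_csv(io.StringIO(aedes_tsv), sep="\t")


def mix_taxonomies(mos, aedes):
    """Hypothetical helper: swap in the composite-Aedes rows.

    Rows of `mos` whose species epithet also appears in `aedes` are
    dropped, then every row of `aedes` is appended.
    """
    kept = mos[~mos["species"].isin(aedes["species"])]
    return pd.concat([kept, aedes], ignore_index=True)


mixed = mix_taxonomies(mos, aedes)
print(mixed.to_csv(sep="\t", index=False))
```

Matching on the epithet alone is a simplification: the same epithet can occur in unrelated genera, so a real merge would probably need a stricter key (for example epithet plus author and year, if the tables carry them).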
- pandas;
- numpy;
- argparse (part of the Python standard library);
- seaborn;
- matplotlib;
- Mosquito Taxonomic Inventory : To understand the criteria involved in this taxonomy structure.
- Recommended literature about mosquito taxonomy : To keep up to date with this field.
- Color palettes of seaborn : To pass with the -cm argument.
- To plot the number of species by genus in a bar plot.
- To plot the number of species in specific genera in a donut plot.
python plots.py -in mos_tax.tsv -st Anopheles Culex Aedes -pt Donut -gb 1
python plots.py -in mos_mixed_tax.tsv -st Anopheles Culex Aedes -pt Donut -gb 1
- To get help on plots.py usage:
python plots.py --help
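For readers who want to see the idea behind the donut plot without reading plots.py, here is a minimal sketch of counting species per genus and drawing a donut with matplotlib. It is not the code of plots.py: the column names `genus` and `species`, the helpers `species_per_genus` and `donut`, and the toy rows are assumptions made for illustration.

```python
import io

import pandas as pd

# Toy table standing in for one of the tsv files; the "genus" and
# "species" column names are assumptions, not the real file headers.
tsv = (
    "genus\tspecies\n"
    "Anopheles\tgambiae\n"
    "Anopheles\tstephensi\n"
    "Culex\tpipiens\n"
    "Aedes\talbopictus\n"
    "Sabethes\tcyaneus\n"
)
df = pd.read_csv(io.StringIO(tsv), sep="\t")


def species_per_genus(df, genera=None):
    """Count distinct species per genus, optionally for selected genera."""
    counts = df.groupby("genus")["species"].nunique()
    if genera is not None:
        # Keep only the requested genera, in the requested order;
        # genera absent from the table get a count of 0.
        counts = counts.reindex(genera, fill_value=0)
    return counts


def donut(counts, path):
    """Save a donut chart (a pie chart with a hole) of the counts."""
    # Imported here so the counting helper works without matplotlib.
    import matplotlib
    matplotlib.use("Agg")  # render to a file, no display needed
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    # A wedge width below 1 leaves the centre empty, giving the donut shape.
    ax.pie(counts.values, labels=counts.index, wedgeprops={"width": 0.4})
    ax.set_title("Species per genus")
    fig.savefig(path)
    plt.close(fig)


counts = species_per_genus(df, ["Anopheles", "Culex", "Aedes"])
print(counts)
# donut(counts, "donut.png")  # uncomment to write the figure
```

Running the same counting step on mos_tax.tsv and on mos_mixed_tax.tsv is a quick way to see how much the choice of taxonomic standard changes the size of Aedes.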
- BOLDSYSTEMS : To get barcode information;
- VectorBase : To get genomes, transcriptomes, proteomes and other molecular information about mosquitoes and other vectors;
- TIMETREE: Enter Culicidae in the BUILD A TIME TREE Group field to get an evolutionary tree with information about the geological time scale and atmospheric levels, or just click here to see a cladogram of mosquitoes.
If you use one of these plots, or the tsv tables, for any purpose, please cite the correct sources: plot(https://github.com/dezordi/mosquitotax), taxonomy(http://mosquito-taxonomic-inventory.info/).
All data used in these plots were retrieved from MTI.
- I'm not a computer engineer or a related professional; I wrote this script to study Python and to automate some bioinformatics tasks. So feel free to commit changes that make the code more efficient or cleaner.
- This repository will continue to be developed to cover other information, such as histograms of mosquito collection years.
- This repository will continue to be updated according to information available on MTI.