Create all functional elements datasets you ever wanted.
Ensemble scraper is command-line tool for accessing data from Ensemble and creating classification datasets from them.
Ensembl is a genome browser for vertebrate genomes that supports research in comparative genomics, evolution, sequence variation and transcriptional regulation. Ensembl annotate genes, computes multiple alignments, predicts regulatory function and collects disease data.
pip install git+https://github.com/katarinagresova/ensembl_scraper.git
- downloading loci of functional elements for organisms of interest
- converting data to specified format
- preprocessing to remove low quality data
- generating negative class for classification dataset