Skip to content

katarinagresova/ensembl_scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Ensemble scraper

Create all functional elements datasets you ever wanted.

Ensemble scraper is command-line tool for accessing data from Ensemble and creating classification datasets from them.

Ensembl is a genome browser for vertebrate genomes that supports research in comparative genomics, evolution, sequence variation and transcriptional regulation. Ensembl annotate genes, computes multiple alignments, predicts regulatory function and collects disease data.

Instalation

pip install git+https://github.com/katarinagresova/ensembl_scraper.git

Features

  • downloading loci of functional elements for organisms of interest
  • converting data to specified format
  • preprocessing to remove low quality data
  • generating negative class for classification dataset

About

Create all functional elements datasets you ever wanted.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages