General Feature Format (GFF) files contain one line per feature, and each line has 9 tab-separated columns. For a detailed description of the format and its columns, see https://www.ensembl.org/info/website/upload/gff.html.
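As a rough illustration of the nine-column layout, the sketch below splits one line into named fields. The column names follow the Ensembl page linked above. The `Feature` tuple, the `parse_line` helper, and the sample line are made up for this example and are not part of `GeneralFormat`.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """One GFF/GTF line; field names follow the Ensembl format description."""
    seqname: str
    source: str
    feature: str
    start: int
    end: int
    score: str
    strand: str
    frame: str
    attribute: str


def parse_line(line: str) -> Feature:
    """Split a tab-separated feature line into its nine columns (hypothetical helper)."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        raise ValueError(f"expected 9 tab-separated columns, got {len(fields)}")
    seqname, source, feature, start, end, score, strand, frame, attribute = fields
    # start and end are coordinates, so convert them to integers
    return Feature(seqname, source, feature, int(start), int(end),
                   score, strand, frame, attribute)


# Invented sample line, for illustration only
example = 'chr1\ttoy\texon\t100\t200\t.\t+\t.\tgene_id "G1"; transcript_id "T1";'
record = parse_line(example)
print(record.feature, record.start, record.end)
```

Score and frame are kept as strings here because both columns may hold a `.` placeholder instead of a number.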
```python
import GeneralFormat as gf
file_path = "hg38_5k.gtf"
gtf = gf.GeneralFormat(file_path)
gtf.nb_nr_tx()
gtf.ex_per_tx()
gtf.cdna_per_tx()
gtf.tx_coverage()
```
- Each of the calls above returns a dictionary that maps each transcript to the requested value.
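
To make the shape of such a dictionary concrete, here is a minimal standalone sketch that counts exon lines per transcript. It is not the implementation behind `GeneralFormat`; the function name `exons_per_transcript`, the toy lines, and the assumption that a per-transcript exon count resembles what `ex_per_tx()` reports are all illustrative guesses.

```python
import re
from collections import Counter

# Matches the transcript_id key in the attribute column, e.g. transcript_id "T1";
TRANSCRIPT_ID = re.compile(r'transcript_id "([^"]+)"')


def exons_per_transcript(lines):
    """Return {transcript_id: number of exon lines} (hypothetical helper)."""
    counts = Counter()
    for line in lines:
        # Skip header/comment lines and blank lines
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        # Only well-formed exon features are counted
        if len(fields) != 9 or fields[2] != "exon":
            continue
        match = TRANSCRIPT_ID.search(fields[8])
        if match:
            counts[match.group(1)] += 1
    return dict(counts)


# Invented toy records, for illustration only
toy_lines = [
    "# toy annotation",
    'chr1\ttoy\ttranscript\t100\t500\t.\t+\t.\tgene_id "G1"; transcript_id "T1";',
    'chr1\ttoy\texon\t100\t200\t.\t+\t.\tgene_id "G1"; transcript_id "T1";',
    'chr1\ttoy\texon\t400\t500\t.\t+\t.\tgene_id "G1"; transcript_id "T1";',
    'chr1\ttoy\texon\t900\t950\t.\t-\t.\tgene_id "G2"; transcript_id "T2";',
]
result = exons_per_transcript(toy_lines)
print(result)  # {'T1': 2, 'T2': 1}
```

The keys are transcript identifiers and the values are the per-transcript quantity, which is the same key-to-value layout described above.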