Skip to content

ginolhac/umi_trim

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

UMI tools in rust

UMI stands for Unique Molecular Identifier.

The goal is reproduce some functionalities from umi-tools especially the extract command.

For example, convert the following read

@VH00666:90:AAAWVCCHV:1:1101:24026:1000
GTCAGTTATAGCGGGCGCGCAAAAAAAAAAAAAAAAAAAGATCGGAAGAGCACACGTCTGAACTCCAGTCACTCCC
[...]

into:

@VH00666:90:AAAWVCCHV:1:1101:24026:1000_GTCAGT
GCGGGCGCGCAAAAAAAAAAAAAAAAAAAGATCGGAAGAGCACACGTCTGAACTCCAGTCACTCCC

The UMI was GTCAGT and append to the read name, while being removed from the sequence along with the TATA linker.

TODO

About

move UMI from sequences to names, trim the linker

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages