BulbaSeq is a sequence statistics and k-mer analysis tool for FASTA files.
It computes basic genome summaries (length, GC content) and exact k-mer frequencies using a streaming design.
BulbaSeq targets the foundational preprocessing layer used by modern genomics pipelines.
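To illustrate the streaming design described above, here is a minimal Python sketch that computes total length, GC fraction, and exact k-mer counts in a single pass over FASTA lines. It is not BulbaSeq's actual code: the function name `summarize_fasta` and its return shape are hypothetical. It also assumes that k-mers do not span record boundaries and that ambiguous bases such as `N` are counted like any other character.

```python
from collections import Counter
from typing import Iterable, Tuple


def summarize_fasta(lines: Iterable[str], k: int) -> Tuple[int, float, Counter]:
    """Hypothetical helper: one-pass length, GC fraction, and k-mer counts.

    Only the last k-1 bases are carried between lines, so memory use
    depends on the number of distinct k-mers, not on sequence length.
    """
    if k < 1:
        raise ValueError("k must be at least 1")

    length = 0
    gc = 0
    kmers: Counter = Counter()
    tail = ""  # last k-1 bases of the current record

    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            # New record: assume k-mers never span two sequences.
            tail = ""
            continue

        chunk = line.upper()
        length += len(chunk)
        gc += chunk.count("G") + chunk.count("C")

        # Prepend the carried-over bases so k-mers that straddle a
        # line break are counted exactly once.
        window = tail + chunk
        for i in range(len(window) - k + 1):
            kmers[window[i:i + k]] += 1
        tail = window[-(k - 1):] if k > 1 else ""

    gc_fraction = gc / length if length else 0.0
    return length, gc_fraction, kmers


if __name__ == "__main__":
    demo = [">seq1", "ACGT", "AC", ">seq2", "GG"]
    total, gc_frac, counts = summarize_fasta(demo, k=2)
    print(total, gc_frac, counts.most_common(3))
```

Because the function accepts any iterable of lines, an open file handle can be passed directly (for example `summarize_fasta(open(path), k=21)`, where `path` is a placeholder), so the whole file is never held in memory.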
Warning
The current implementation is a Python prototype, written for clarity, not performance.
This prototype serves as a reference model for a planned high-performance C++/CUDA reimplementation.