lrvdijk/README.md

Hi there 👋

I am a software engineer and computational biologist with a strong background in algorithm design, machine learning, and genomic data analysis. I am comfortable with low-level performance engineering in languages such as C++ and Rust, and experienced in high-level data analysis and visualization with Python and JavaScript.

Python Rust C++ NumPy SciPy scikit-learn TensorFlow JAX

Interests

  • Machine learning for multi-modal data integration (genomic, transcriptomic, and proteomic)
  • Protein language models
  • Sequence alignment algorithms
  • Microbiome and metagenomics

Bioinformatics tools and analyses

🤖 Protein machine learning and language models

  • Flamino. A Flax NNX-based reimplementation of the ESM-2 protein language model.

🧬 Sequence alignment and graph genomes

  • POASTA. A new algorithm for Partial Order Alignment (POA), a form of DNA sequence-to-DAG alignment. POA is a common component in pangenome graph construction pipelines, such as the pipelines used to construct the human pangenome reference graph [1, 2]. POASTA outperformed existing tools by 4.1x and enabled alignments not possible before, while retaining the guarantee of optimality. Published in Oxford Bioinformatics (2025).
  • Pyfrost. A high-performance, low-memory Python library to construct and analyze compacted, colored de Bruijn graphs (ccDBGs). The ccDBG is a commonly used data structure in de novo genome assemblers. The library provides Python bindings to Bifrost, a fast and memory-efficient C++ ccDBG library, and exposes a NetworkX-like API.
  • Tesserae. A recombination-aware DNA sequence aligner that uses a hidden Markov model (HMM) to determine the optimal alignment of a query sequence to a panel of potential reference sequences. This is an improved, much faster version of the HMM described in a paper analyzing de novo genetic variants in experimental crosses of the malaria parasite Plasmodium falciparum [1].
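
To make the idea of sequence-to-DAG alignment concrete, here is a minimal sketch of the classic dynamic programming formulation, in which each graph node takes the best score over all of its predecessors. It is an illustration only and not POASTA's algorithm: it assumes unit edit costs rather than gap-affine penalties, and the function name `align_to_dag` and the toy graph are hypothetical.

```python
from graphlib import TopologicalSorter  # Python 3.9+


def align_to_dag(query, labels, edges):
    """Minimum edit distance between `query` and any source-to-sink path.

    labels: dict mapping node id -> single-character label
    edges:  iterable of (u, v) pairs describing a DAG
    Assumes unit costs for mismatches, insertions, and deletions.
    """
    preds = {v: [] for v in labels}
    has_successor = set()
    for u, v in edges:
        preds[v].append(u)
        has_successor.add(u)

    m = len(query)
    start = object()  # virtual node placed before every source node
    score = {start: list(range(m + 1))}  # only insertions so far

    # Visit nodes so that all predecessors are scored before their successors.
    for v in TopologicalSorter(preds).static_order():
        ps = preds[v] or [start]
        row = [min(score[p][0] for p in ps) + 1]  # empty query: delete node v
        for j in range(1, m + 1):
            match = min(score[p][j - 1] for p in ps) + (labels[v] != query[j - 1])
            delete = min(score[p][j] for p in ps) + 1  # skip node v
            insert = row[j - 1] + 1  # skip query character j
            row.append(min(match, delete, insert))
        score[v] = row

    sinks = [v for v in labels if v not in has_successor]
    return min(score[v][m] for v in sinks)


# Toy graph encoding ACGT and ATGT as a single bubble.
labels = {0: "A", 1: "C", 2: "T", 3: "G", 4: "T"}
edges = [(0, 1), (0, 2), (1, 3), (2, 3), (3, 4)]
print(align_to_dag("ATGT", labels, edges))
print(align_to_dag("AGT", labels, edges))
```

This fills the full table of nodes by query positions, which is simple but does more work than necessary on large graphs.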

💩 Microbiome and metagenomics

🛠️ Tooling and editors

  • Zed WDL. An extension for the code editor Zed providing syntax highlighting and code completion support for the Workflow Description Language (WDL).
  • dotfiles. My personal configuration files for various tools and editors.

Old projects

🥳 Social and fun

  • Reddit /r/place headless pixel placement bot. A CLI tool written in Python that automatically placed pixels on Reddit's /r/place canvas during its 2022 April Fools' event. It obtains instructions from a command and control server, then logs in to Reddit to submit the correct pixel color.

💻 Embedded software and electronics

  • TLC5940 Raspberry Pi Driver. The TLC5940 is a commonly used LED driver chip that can drive up to 16 LEDs (or more if chaining multiple chips). This C++ library provides a simple interface to control the TLC5940 from a Raspberry Pi.
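
As a rough illustration of what driving the chip involves, the sketch below packs one 12-bit grayscale value per channel into the 24-byte frame that is shifted into the TLC5940. It is written in Python for brevity and does not show the C++ library's API. The function name `pack_grayscale` is hypothetical, and the bit order (channel 15 first, most significant bit first) is an assumption that should be checked against the datasheet.

```python
def pack_grayscale(values):
    """Pack 16 grayscale values (0..4095) into 24 bytes.

    Assumed wire order: channel 15 first, most significant bit first.
    """
    if len(values) != 16:
        raise ValueError("expected one value per channel (16 in total)")

    word = 0
    for value in reversed(values):  # channel 15 ends up in the top bits
        if not 0 <= value <= 0xFFF:
            raise ValueError("grayscale values must fit in 12 bits")
        word = (word << 12) | value

    return word.to_bytes(24, "big")  # 16 channels x 12 bits = 192 bits


# Channel 0 at full brightness, all other channels off.
frame = pack_grayscale([0xFFF] + [0] * 15)
print(frame.hex())
```

The resulting bytes would then be clocked out over SPI or bit-banged GPIO, which is outside the scope of this sketch.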

📊 Data visualization

Pinned

  1. broadinstitute/poasta Public

    Fast and exact gap-affine partial order alignment

    Rust 52 5

  2. broadinstitute/pyfrost Public

    Python bindings for Bifrost's compacted colored de Bruijn Graph with a NetworkX-compatible API

    Python 28 1

  3. broadinstitute/StrainGE Public

    strain-level analysis tools

    Python 38 11

  4. PlaceNL2022/rPlace2022 Public archive

    Python version of PlaceNL's headless bot with automatic access token refresh

    Python 19 16

  5. dotfiles Public

    Several configuration files for vim, zsh, tmux and more.

    Shell

  6. tlc5940-raspberry Public

    A C++ library to control the TLC5940 LED driver on your raspberry pi.

    C++ 12 4