Skip to content

vmikk/phylotwin-preprocessor

Repository files navigation

PhyloTwin-Preprocessor - GBIF Occurrence Data Processing Pipeline

A Nextflow pipeline for processing GBIF species occurrence data with spatial outlier detection and H3 grid binning.

The pipeline offers two processing modes:

  • "Atomic" mode: Processes each species independently
  • "Batched" mode: Processes multiple species in batches (optimized for HPC environments)

Dependencies

Primary dependencies

These are the tools that the user must install and configure on their system to run the pipeline:

Secondary dependencies

These are the containerized tools or packages required by the pipeline, which will be automatically handled within the containers:

R packages:

About

Species occurrence pre-processing pipeline

Resources

License

Stars

Watchers

Forks

Contributors