New module: clustering#11337
Closed
dbaku42 wants to merge 25 commits into
Closed
Conversation
* Specify more guidelines on input channels * Linting * Updates based on code review * Update README.md * Fix broken sentence
Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>
* add umicollapse * initial clippy module commit * find working test data * complete test * prettier * add a doi * Delete main.nf * Delete meta.yml * Delete main.nf * Delete nextflow.config * Delete test.yml * remove umicollapse from pytest * rename gtf to include that its from gencode * add test for clippy intergenic mode * remove quay.io
* initial trycycler subsample commit * Update modules/nf-core/trycycler/subsample/main.nf to resolve issue with nf-test and empty gzipped files Co-authored-by: Simon Pearce <24893913+SPPearce@users.noreply.github.com> * addressing comments on initial commit, fixed version number, added stub tests --------- Co-authored-by: Simon Pearce <24893913+SPPearce@users.noreply.github.com>
…index files (nf-core#10319) * Update modules * Fix meta * Update samtools stub
module update version
* Specify more guidelines on input channels * Linting * Updates based on code review * Update README.md * Fix broken sentence
Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>
* add umicollapse * initial clippy module commit * find working test data * complete test * prettier * add a doi * Delete main.nf * Delete meta.yml * Delete main.nf * Delete nextflow.config * Delete test.yml * remove umicollapse from pytest * rename gtf to include that its from gencode * add test for clippy intergenic mode * remove quay.io
* initial trycycler subsample commit * Update modules/nf-core/trycycler/subsample/main.nf to resolve issue with nf-test and empty gzipped files Co-authored-by: Simon Pearce <24893913+SPPearce@users.noreply.github.com> * addressing comments on initial commit, fixed version number, added stub tests --------- Co-authored-by: Simon Pearce <24893913+SPPearce@users.noreply.github.com>
…index files (nf-core#10319) * Update modules * Fix meta * Update samtools stub
module update version
- Added flashpca module using custom container ghcr.io/dbaku42/flashpca:2.0 - Fixed file handling for PLINK inputs inside Docker (cp -L + fixed basename) - Updated nf-test to use proper input tuple - Test now passes with --profile docker
…ate environment/meta files - Added containerOptions '--entrypoint ""' to fix argument parsing in Docker - Separated environment.yml (Conda config only) from meta.yml (module documentation) - Hardcoded version to 2.0 for snapshot stability - Fixed dependencies list in environment.yml to prevent linting TypeError
- Remove name field from environment.yml - Fix meta.yml output structure to match schema - Update versions output to tuple with meta for topic compatibility
…abling autoMounts - Add tests/config/nextflow.config with Docker profile - Update tests/main.nf.test - Improve container declaration and version capture in main.nf - Address mount parsing error on GitHub Actions
Performs KMeans or DBSCAN clustering on PLINK2 --pca .eigenvec files. - Supports both KMeans and DBSCAN - Uses conda environment - Includes nf-test with snapshot testing
0701e49 to
54bc054
Compare
54bc054 to
9e6f155
Compare
fb9fc21 to
ed916d5
Compare
- Updated import path from local to nf-core/clustering - Removed custom container reference for now
auto-merge was automatically disabled
May 13, 2026 20:49
Pull request was closed
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Description
New module:
clusteringPerforms KMeans or DBSCAN clustering on principal components from PLINK2
--pca(.eigenvecfile).Features
kmeansanddbscanalgorithms via parameterscondaenvironment (environment.yml)*_clusters.csv) and metadata (*_clustering_info.json)nf-testcoverage (normal + stub test)Author
Checklist
nf-core modules lint clustering→ passednf-core modules test clustering→ passed (module only)snpclusteringstill uses custom container (will be updated separately)Closes # (if you have an issue)
This PR adds a new standalone module for population clustering.