[Microbiome] Add tutorial for MAGs building and annotation from raw reads #6556

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

paulzierep merged 7 commits into galaxyproject:main from bebatut:mags-building

Dec 18, 2025

Member

bebatut commented Dec 8, 2025 •

edited

Loading

In complement of the MAGs learning pathway, this tutorial takes raw reads from a publication and, using workflows from IWC:

Preprocess the reads (QC, host & contamination removal)
Build, Refine, and Annotate Metagenome-Assembled Genomes (MAGs)

Each step is explained (with a link to the corresponding dedicated tutorial for a more advanced explanation), and the results are commented on.

I prepared histories with preprocessed data on UseGalaxy.eu, UseGalaxy.org, UseGalaxy.org.au, and UseGalaxy.fr.

bebatut added 2 commits

December 2, 2025 17:48


          Add MAGs building tutorial

fc8c150


          Merge branch 'main' of github.com:galaxyproject/training-material int…

e7288d0

…o mags-building

bebatut requested review from paulzierep and shiltemann as code owners

December 8, 2025 15:09

github-actions bot added faqs microbiome labels

bebatut force-pushed the mags-building branch 2 times, most recently from f1be644 to cc01163 Compare

December 8, 2025 15:48


          Continue writing MAGs building tutorial

b99bf18

bebatut force-pushed the mags-building branch from cc01163 to b99bf18 Compare

December 8, 2025 16:02


          Update results with new workflow version

f98492e

paulzierep reviewed

View reviewed changes

Collaborator

paulzierep left a comment

Need to continue with the main tutorial later.

faqs/galaxy/workflows_run.md

    
              7. **Recommended**: click **Import** (left of Run) to make your own local copy under *Workflows / My Workflows*.

              You may have to refresh your history to see the queued jobs

Collaborator

paulzierep Dec 12, 2025

not needed anymore ?

topics/microbiome/faqs/ani.md Show resolved Hide resolved

topics/microbiome/faqs/ani_dereplication_threshold.md Outdated Show resolved Hide resolved

topics/microbiome/faqs/ani_dereplication_threshold.md Show resolved Hide resolved

topics/microbiome/faqs/ani_dereplication_threshold.md Show resolved Hide resolved

topics/microbiome/faqs/minimum_mag_completeness_percentage.md Show resolved Hide resolved

topics/microbiome/faqs/minimum_mag_completeness_percentage.md Show resolved Hide resolved

topics/microbiome/tutorials/mags-building/generate-input-dataset.md

    
            @@ -0,0 +1,44 @@
          
              # Generate input datasets for the training

Collaborator

paulzierep Dec 12, 2025

This needs a update, now its just two samples, no preprocessing, right ?

Member Author

bebatut Dec 18, 2025

This Markdown is not for learners. It was meant to be as a documentation on how that data were generated

topics/microbiome/tutorials/mags-building/tutorial.md Show resolved Hide resolved

paulzierep reviewed

View reviewed changes

Collaborator

paulzierep left a comment

Amazing work, if you could address the comments still, I think we should merge it for now !

topics/microbiome/tutorials/mags-building/tutorial.md Show resolved Hide resolved

topics/microbiome/tutorials/mags-building/tutorial.md

    
              > >    Forward (Read 1) - Before | 34.5 | 34.6

              > >    Forward (Read 1) - After | 34.5 | 34.6

              > >    Reverse (Read 2) - Before | 33.4 | 32.7

              > >    Reverse (Read 2) - After | 33.4 | 32.7

Collaborator

paulzierep Dec 18, 2025

maybe a comment why this does not change ?

Collaborator

paulzierep Dec 18, 2025

I guess they deposited trimmed reads

topics/microbiome/tutorials/mags-building/tutorial.md

    
              > > <solution-title></solution-title>

              > >

              > > 1. 2.8% for SRR24759598 and 1.1% for SRR24759616

              > > 2. 2.8% for SRR24759598 and 1.1% for SRR24759616

Collaborator

paulzierep Dec 18, 2025

this is not the number of reads ... but if we take percentage the question is a bit useless no ?

topics/microbiome/tutorials/mags-building/tutorial.md Outdated Show resolved Hide resolved

topics/microbiome/tutorials/mags-building/tutorial.md

    
              >      > <comment-title></comment-title>

              >      > metaSPAdes is an alternative assembler.

              >      >

              >      > MEGAHIT is less computationally intensive and generate higher quality single and shorter contigs but shorter. metaSPAdes is very computationally intensive, but generates longer/more complete assemblies.

Collaborator

paulzierep Dec 18, 2025

strange sentence, and maybe point to https://doi.org/10.1093/bib/bbad087 for a benchmark

topics/microbiome/tutorials/mags-building/tutorial.md Outdated Show resolved Hide resolved

topics/microbiome/tutorials/mags-building/tutorial.md Outdated Show resolved Hide resolved

topics/microbiome/tutorials/mags-building/tutorial.md Outdated Show resolved Hide resolved

topics/microbiome/tutorials/mags-building/tutorial.md Outdated Show resolved Hide resolved

topics/microbiome/tutorials/mags-building/tutorial.md

    
              Beyond simply comparing the total number of bins, we can also examine the **contigs per bin** for each binning tool, which provides deeper insight into the **quality and granularity** of the reconstructed microbial genomes. 

              For that, we will use the `collection X, collection Y, and others (as list)` collection of collection. This structure contains two sub-collections—one for each sample. Within each sub-collection, there are four tables, each corresponding to a different binning tool (MetaBAT2, MaxBin2, SemiBin, and CONCOCT). Each table consists of two columns: the contig identifier and its assigned bin ID.

Collaborator

paulzierep Dec 18, 2025

For that, we will use the collection X, collection Y, and others (as list) collection of collection. - not clear

paulzierep and others added 3 commits

December 18, 2025 13:14


          Merge branch 'main' into mags-building

57337b3


          Apply suggestions from code review

b4a38ed

Co-authored-by: paulzierep <paul.zierep@googlemail.com>


          Update tutorial.md

2c19369

paulzierep mentioned this pull request

Fix pending issue with tutorial for MAGs building #6579

Open

paulzierep approved these changes

View reviewed changes

paulzierep merged commit d1574c1 into galaxyproject:main

3 checks passed

Collaborator

paulzierep commented Dec 18, 2025

thanks @bebatut

bebatut deleted the mags-building branch

December 18, 2025 12:46

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

faqs microbiome