Add files via upload
naurasd committed Mar 26, 2024
1 parent 0b75a95 commit dffc76e
Showing 29 changed files with 52,214 additions and 0 deletions.
Binary file not shown.
25 changes: 25 additions & 0 deletions taxonomic_assignments/README.md
@@ -0,0 +1,25 @@
This folder holds the output files containing the taxonomic assignments from the PEMA processing of the ARMS-MBON sequences from samples taken between 2018 and 2020, for the COI, 18S, and ITS marker genes.

The first group of files are the "Extended final tables", which contain the following information:
* an ASV/OTU identifier (these numbers are unique only within a single PEMA run)
* the number of reads for each sample that was processed: each sample has its own column, headed by the material sample ID that can be found in ENA (note: these material sample IDs are not always exactly the same as those used in the sampling logsheets in the [sample data folder](https://github.com/arms-mbon/data_workspace/tree/main/qualitycontrolled_data/combined), but they are close)
* the full taxonomic classification as returned by the reference database used*
* the associated NCBI taxon ID (where there is one)

*PEMA v.2.1.4 was used during this processing phase. In this version, the taxonomic classification of COI gene sequences in the Extended final tables stops at the genus level; the species-level classification is not included. To obtain species-level classification for COI gene sequences, we used the "tax_assignments" files (see below), which give the classification beyond the genus level for each ASV in the Extended final tables.
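
As a rough illustration of the table layout described above, the sketch below sums the reads per sample. It assumes the table has been exported to tab-separated text; the column names (`ID`, `SAMPLE_A`, `Classification`, `NCBI_taxon_ID`) and all values are hypothetical placeholders, not taken from the real files.

```python
import csv
import io

# Hypothetical stand-in for an Extended final table exported as tab-separated
# text. Column names and values are invented for illustration only.
EXAMPLE_TABLE = (
    "ID\tSAMPLE_A\tSAMPLE_B\tClassification\tNCBI_taxon_ID\n"
    "Otu1\t10\t0\tlevel1_x;level2_y\t111\n"
    "Otu2\t3\t7\tlevel1_x;level2_z\t\n"  # no taxon ID available for this row
)


def reads_per_sample(table_text, id_col, non_sample_cols):
    """Sum the read counts of every sample column.

    Every column that is neither the identifier column nor listed in
    non_sample_cols is treated as a sample column (an assumption).
    """
    reader = csv.DictReader(io.StringIO(table_text), delimiter="\t")
    sample_cols = [
        col for col in reader.fieldnames
        if col != id_col and col not in non_sample_cols
    ]
    totals = {col: 0 for col in sample_cols}
    for row in reader:
        for col in sample_cols:
            totals[col] += int(row[col])
    return totals


totals = reads_per_sample(
    EXAMPLE_TABLE,
    id_col="ID",
    non_sample_cols={"Classification", "NCBI_taxon_ID"},
)
print(totals)
```

Passing the non-sample columns explicitly avoids guessing which headers are material sample IDs; adjust the names to whatever the actual table uses.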

The filenames of the extended final tables contain:
* the date the samples were sequenced (e.g. April2021)
* the gene type (e.g. COI)
* whether or not the blanks (blank sequences) were included

The second group of files are the more detailed taxonomic assignments, which are produced only for the COI dataset. These contain:
* an ASV identifier (these numbers are unique only within a single PEMA run; the first part of this ID, before the "_", is included in the ID in the first column of its linked Extended_final_table)
* for each node of the taxonomic classification: its name and its confidence level
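
A minimal sketch of how one row of such a file might be read, assuming the fields after the ASV identifier alternate between node name and confidence value. That column order, the example row, and the helper names `parse_assignment`, `confident_lineage`, and `link_key` are assumptions for illustration; check them against the actual files.

```python
def parse_assignment(fields):
    """Split one row into (asv_id, [(node_name, confidence), ...]).

    Assumes the fields after the ID alternate name, confidence, name, ...
    """
    asv_id, rest = fields[0], fields[1:]
    if len(rest) % 2 != 0:
        raise ValueError("expected name/confidence pairs after the ID")
    nodes = [(rest[i], float(rest[i + 1])) for i in range(0, len(rest), 2)]
    return asv_id, nodes


def confident_lineage(nodes, threshold):
    """Return node names from the root down, stopping at the first node
    whose confidence falls below the threshold."""
    kept = []
    for name, confidence in nodes:
        if confidence < threshold:
            break
        kept.append(name)
    return kept


def link_key(asv_id):
    """Part of the ASV ID before the first "_", used to find the matching
    row in the linked Extended final table."""
    return asv_id.split("_", 1)[0]


# Invented example row (tab-separated), not real data.
row = "Otu7_120\tnodeA\t1.0\tnodeB\t0.9\tnodeC\t0.4".split("\t")
asv_id, nodes = parse_assignment(row)
print(link_key(asv_id), confident_lineage(nodes, threshold=0.8))
```

The threshold is a free choice of the user; the README does not prescribe one. Because the key is only "included in" the Extended final table ID, the matching rule on that side (exact or substring) needs to be confirmed on the real tables.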

Their filenames contain:
* the date the samples were sequenced (e.g. April2021)
* the gene type (e.g. COI)
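
The filename parts can be pulled apart with a regular expression. The sketch below assumes the pattern `tax_assignments_<Month><Year>_<gene>_<tag>.tsv`, inferred from the filenames in this upload (e.g. `tax_assignments_April2021_COI_noBlank.tsv`); the function name and the returned keys are hypothetical, and the trailing tag is returned as-is without interpretation.

```python
import re

# Pattern inferred from filenames such as
# tax_assignments_April2021_COI_noBlank.tsv (an assumption, not a spec).
FILENAME_PATTERN = re.compile(
    r"^tax_assignments_"
    r"(?P<month>[A-Za-z]+)(?P<year>\d{4})_"
    r"(?P<gene>[A-Za-z0-9]+)_"
    r"(?P<tag>[A-Za-z]+)\.tsv$"
)


def parse_tax_assignments_filename(filename):
    """Return the sequencing month, year, gene and trailing tag,
    or None if the name does not follow the assumed pattern."""
    match = FILENAME_PATTERN.match(filename)
    if match is None:
        return None
    return {
        "month": match.group("month"),
        "year": int(match.group("year")),
        "gene": match.group("gene"),
        "tag": match.group("tag"),
    }


info = parse_tax_assignments_filename("tax_assignments_April2021_COI_noBlank.tsv")
print(info)
```

Returning `None` for non-matching names lets a caller skip unrelated files in the folder without raising an error.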

There are also two further files: one indicating which samples produced [no results](https://github.com/arms-mbon/analysis_release_001/tree/main/taxonomic_assignments/Samples_with_no_results.xlsx),
and one listing the OTUs/ASVs that [were removed or modified because they occurred in the blanks](https://github.com/arms-mbon/analysis_release_001/tree/main/taxonomic_assignments/OTUs_ASVs%20that%20were%20removed_modified%20because%20they%20occurred%20in%20the%20blanks.xlsx).
Binary file not shown.

Large diffs are not rendered by default.

1,327 changes: 1,327 additions & 0 deletions taxonomic_assignments/tax_assignments_April2021_COI_noBlank.tsv

1,524 changes: 1,524 additions & 0 deletions taxonomic_assignments/tax_assignments_August2023_COI_noBlank.tsv

9,595 changes: 9,595 additions & 0 deletions taxonomic_assignments/tax_assignments_January2020_COI_noBlank.tsv

4,369 changes: 4,369 additions & 0 deletions taxonomic_assignments/tax_assignments_January2022_COI_noBlank.tsv

14,028 changes: 14,028 additions & 0 deletions taxonomic_assignments/tax_assignments_July2019_COI_noBlank.tsv

19,458 changes: 19,458 additions & 0 deletions taxonomic_assignments/tax_assignments_May2021_COI_noBlank.tsv

1,798 changes: 1,798 additions & 0 deletions taxonomic_assignments/tax_assignments_September2020_COI_noBlank.tsv
