Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge ISA-Tab specification with ISA-XLSX changes #72

Merged
merged 11 commits into from
Oct 9, 2023
Merged

Conversation

HLWeil
Copy link
Member

@HLWeil HLWeil commented Sep 13, 2023

Merged all current differences between ISA-Tab and ISA-XLSX as we use it into the base ISA-Tab specification.

Differences are listed here: #71

This is no PR on the main specification but a preliminary PR on a side branch to be able to have a real discussion on the specification of ISA-Tab/ISA-XLSX differences.

@muehlhaus @Freymaurer @Brilator @kMutagene

Please take a look and let me know about inconsistencies, miswordings or missing specification patterns.

ISA-XLSX.md Outdated
@@ -1,19 +1,15 @@
# ISA-Tab format
# ISA-XLSX format

For detail on ISA framework terminology, please read the [ISA Abstract Model specification](isamodel.md).
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The linked md file does not exist

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Naturally 👍

ISA-XLSX.md Outdated
> - `STUDY ASSAYS`
> - `STUDY PROTOCOLS`
> - `STUDY CONTACTS`
The purpose of top-level metadata sheets is aggregating and listing top-level metadata. Each sheet consists of sections consisting of a section header and key-value fields. Section headers MUST be completely written in upper case (e.g. STUDY), field headers MUST have the first letter of each word in upper case (e.g. Study Identifier); with the exception of the referencing label (REF).

In the following sections, examples of each section block are given beside the specification of each section.

For a full example of a complete Investigation File, please see [https://git.io/vD1va](https://git.io/vD1va).
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this links to an ISA-Tab example that is txt not xlsx

@@ -127,7 +129,7 @@ Notes:

### ISA-XLSX Format

ISA-XLSX follows the ISA model specification (v1.0) saved in a XLSX format. The XLSX format uses the SpreadsheetML markup language and schema to represent a spreadsheet document. Conceptually, using the terminology of the Spreadsheet ML specification [ISO/IEC 29500-1](https://www.loc.gov/preservation/digital/formats/fdd/fdd000398.shtml#:~:text=The%20XLSX%20format%20uses%20the,a%20rectangular%20grid%20of%20cells.), the document comprises one or more worksheets in a workbook. Every worksheet MUST contain one table object storing the metadata. Comments or auxillary information MAY be stored alongside with table objects in a worksheet.
https://github.com/nfdi4plants/ARC-specfication/blob/main/ISA-XLSX.md
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

does not exist

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yet

ISA-XLSX.md Outdated
@@ -22,74 +18,47 @@ experiment; experimental steps (or sequences of events) are described in the Stu
Investigation file there may be one or more Studies defined with a corresponding Study file; for each Study there may
be one or more Assays defined with corresponding Assay files.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe specify that one assay can be referenced in multiple studies

ISA-XLSX.md Outdated
| Investigation Publication Status Term Source REF | String | Identifies the controlled vocabulary or ontology that this term comes from. The Source REF has to match one the Term Source Name declared in the in the Ontology Source Reference section. |

For example, the `INVESTIGATION PUBLICATIONS` section of an ISA-Tab `i_*.txt` file may look as follows:
For example, the `INVESTIGATION PUBLICATIONS` section of an ISA-XLSX `isa.investigation.xlsx` file may look as follows:

```default
INVESTIGATION PUBLICATIONS
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe do these blocks as table. As different columns are very difficult to recognize in a code block.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done, although i really do not like the examples from the initial ISA specs. the descriptions are far too large. I get where they come from, their sample should represent a 'real' investigation description. but that huge text block does really not help. I think @HLWeil will adapt the samples anyway.

ISA-XLSX.md Outdated

| Characteristics[Organism] | Term Source REF | Term Accession Number |
| Characteristics [organism] | Term Source REF (OBI_0100026) | Term Accession Number (OBI_0100026) |
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we allow underscore term ids in headers? OBI_0100026

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nope, copy error 👍

@kMutagene
Copy link
Member

I think this is a nice first draft, one addition that i'd like would be internal links in the document. For example in the Study File section, there is this text block:

The Study File MUST contain one Top-Level Metadata sheet. This sheet MUST be named isa_study and MUST contain the following sections:

  • STUDY
  • STUDY DESIGN DESCRIPTORS
  • STUDY PUBLICATIONS
  • STUDY FACTORS
  • STUDY ASSAYS
  • STUDY PROTOCOLS
  • STUDY CONTACTS

all of the mentioned sections should include a link to their respective specification.

ISA-XLSX.md Outdated
- [INVESTIGATION](#investigation)
- [Example](#example-1)
- [INVESTIGATION PUBLICATIONS](#investigation-publications)
- [Example](#example-2)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I do not like how this looks too much, the idea was giving direct links to the samples, but i think this looks weird. other opinions?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe use **Example** instead of #### Example?

@HLWeil
Copy link
Member Author

HLWeil commented Sep 14, 2023

I think this is a nice first draft, one addition that i'd like would be internal links in the document. For example in the Study File section, there is this text block:

The Study File MUST contain one Top-Level Metadata sheet. This sheet MUST be named isa_study and MUST contain the following sections:

  • STUDY
  • STUDY DESIGN DESCRIPTORS
  • STUDY PUBLICATIONS
  • STUDY FACTORS
  • STUDY ASSAYS
  • STUDY PROTOCOLS
  • STUDY CONTACTS

all of the mentioned sections should include a link to their respective specification.

YES! I think this will make navigating the file much easier. Also for stuff like "Annotation Table sheet" and "Ontology Annotation"

@kMutagene
Copy link
Member

@HLWeil do you still intend to adapt the examples (e.g. shorter descriptions, other names, etc.)? Otherwise LGTM as far as i can tell

@HLWeil
Copy link
Member Author

HLWeil commented Sep 27, 2023

@HLWeil do you still intend to adapt the examples (e.g. shorter descriptions, other names, etc.)? Otherwise LGTM as far as i can tell

Yes I'm still planning to 😄

@HLWeil HLWeil self-assigned this Oct 4, 2023
@HLWeil HLWeil added the enhancement New feature or request label Oct 4, 2023
@HLWeil HLWeil merged commit c893ec0 into status_quo Oct 9, 2023
@HLWeil HLWeil deleted the isa_xlsx branch January 12, 2024 08:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants