feat(schema): Load subject data #1977

effigies · 2024-05-24T01:55:32Z

This fixes the implementation of:

rules.checks.dataset.SubjectFolders
rules.checks.dataset.ParticipantIDMismatch
rules.checks.dataset.PhenotypeSubjectsMissing

These started failing with bids-standard/bids-specification#1833, which activated previously disabled checks.

Closes #1978.

effigies · 2024-05-24T01:56:55Z

The implementation here is really suboptimal. I don't know if we have a way of creating a baseline context.dataset object that just gets passed around, but if so, this should go there. If not, we should make it so we're not potentially parsing many TSV files every time we look at another file.

codecov · 2024-05-24T01:59:12Z

Codecov Report

Attention: Patch coverage is 98.33333% with 1 lines in your changes are missing coverage. Please review.

Project coverage is 87.43%. Comparing base (6e14422) to head (75cb145).

❗ Current head 75cb145 differs from pull request most recent head 0aa1665

Please upload reports for the commit 0aa1665 to get more accurate results.

Files	Patch %	Lines
bids-validator/src/schema/context.ts	98.18%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1977      +/-   ##
==========================================
+ Coverage   85.68%   87.43%   +1.75%     
==========================================
  Files          91      130      +39     
  Lines        3792     6232    +2440     
  Branches     1220     1510     +290     
==========================================
+ Hits         3249     5449    +2200     
- Misses        457      692     +235     
- Partials       86       91       +5

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

effigies · 2024-05-24T02:31:03Z

Apparently Javascript doesn't allow you to do array equality with == or anything else, so checks like sorted(columns.participant_id) == sorted(dataset.subjects.sub_dirs) will always fail. I think the only way to handle this will be to create an isEqual(x,y) function in the expression language.

rwblair · 2024-05-24T17:43:33Z

The implementation here is really suboptimal. I don't know if we have a way of creating a baseline context.dataset object that just gets passed around, but if so, this should go there. If not, we should make it so we're not potentially parsing many TSV files every time we look at another file.

I believe that a single reference to the dsContext is used in normal validation. Its instantiated here and then passed to the context generator:
https://github.com/bids-standard/bids-validator/blob/master/bids-validator/src/validators/bids.ts#L48

'll look at if it has the information needed to populate the subjects at creation time. If not we can add a check to the file context to only run load subjects if its undefined. I also need to remember why and when the default dsContext is used.

effigies · 2024-05-24T18:23:21Z

Added a commit to leave .subjects undefined until the first call, and then only create and populate it if undefined. That way we only duplicate effort if the context is regenerated.

rwblair · 2024-05-24T18:26:03Z

Looking good.

A heads up the subjects listed in summary aren't dependent on the dsContext. Every context's filename gets checked for a sub entity and that is added as a subject to the summary:
https://github.com/bids-standard/bids-validator/blob/master/bids-validator/src/summary/summary.ts#L106

I think its trying to capture a case in which a malformed dataset has more subjects files in it than it has proper sub- directories. Another unlikely case is that subjects with completely empty subject dirs won't be counted in the summary.

I can see this potentially as a point of confusion, but also don't think it will matter 99.9% of the time. We can change it in a future PR if need be.

effigies · 2024-05-24T18:35:07Z

I think if there are no files in a subject dir, it won't make it into the fileTree, but I'm okay with a deviation between these if it does.

effigies force-pushed the feat/subjects branch from fe45751 to b7a7c18 Compare May 24, 2024 02:29

effigies mentioned this pull request May 24, 2024

Implement allequal in expression language. #1978

Closed

effigies force-pushed the feat/subjects branch from 2214490 to 34dbe15 Compare May 24, 2024 18:06

effigies and others added 4 commits May 24, 2024 15:29

feat(schema): Load subject data

8b84ae2

feat(expr): Add allequal, fix sorted to not be in-place

49cd18f

feat(context): Load subjects context only once

0680d03

add simple test for populating dscontext subjects object

75cb145

effigies force-pushed the feat/subjects branch from 3099205 to 75cb145 Compare May 24, 2024 19:29

chore: Bump examples module

0aa1665

effigies merged commit ba937d6 into bids-standard:master May 24, 2024
26 of 31 checks passed

effigies deleted the feat/subjects branch May 24, 2024 20:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(schema): Load subject data #1977

feat(schema): Load subject data #1977

effigies commented May 24, 2024 •

edited

Loading

effigies commented May 24, 2024

codecov bot commented May 24, 2024 •

edited

Loading

effigies commented May 24, 2024

rwblair commented May 24, 2024

effigies commented May 24, 2024

rwblair commented May 24, 2024

effigies commented May 24, 2024

feat(schema): Load subject data #1977

feat(schema): Load subject data #1977

Conversation

effigies commented May 24, 2024 • edited Loading

effigies commented May 24, 2024

codecov bot commented May 24, 2024 • edited Loading

Codecov Report

effigies commented May 24, 2024

rwblair commented May 24, 2024

effigies commented May 24, 2024

rwblair commented May 24, 2024

effigies commented May 24, 2024

effigies commented May 24, 2024 •

edited

Loading

codecov bot commented May 24, 2024 •

edited

Loading