How are features obtained if you don’t specify any class information for grouping? #678

Open
Pembs opened this issue Jun 5, 2023 · 1 comment

Pembs commented Jun 5, 2023

How are features selected during grouping and filling if you don't specify any class information for grouping?
I am curious how features are obtained in the older xcmsSet workflow without sample group/class information (specifying sample groups was not mandatory for feature grouping in that workflow).
All parameters were kept the same across the workflows.

1. xcmsSet workflow, no class column in phenoData = 5649 features (class column left out by mistake; same number of samples as below)
2. XCMSnExp workflow, sample_groups were 300 samples or 76 QCs = 999 features
3. XCMSnExp workflow, samples and QCs all in the same group = 712 features

I can understand why there are slightly more features when samples and QCs are treated as separate groups: with minFraction = 0.5, each group is smaller, so fewer samples are needed to reach the threshold. But I am curious how features are selected when there is no sample class information. It seems minFraction is not taken into account in that case. Is that how they are obtained, effectively with minFraction = 0?
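To make the reasoning about group sizes concrete, here is a minimal sketch of a per-group minFraction check, assuming a feature is kept when at least one sample group has peaks in at least minFraction of its samples. The function name `keeps_feature` and the peak counts are hypothetical, the sketch is written in Python rather than R, and it ignores other grouping parameters such as minSamples; it is not the xcms implementation.

```python
def keeps_feature(peak_counts, group_sizes, min_fraction=0.5):
    """Return True if any group has peaks in >= min_fraction of its samples.

    peak_counts: number of samples with a peak, per group (hypothetical data)
    group_sizes: total number of samples, per group
    """
    return any(
        peak_counts.get(group, 0) >= min_fraction * size
        for group, size in group_sizes.items()
    )


# Hypothetical feature: peaks in 40 of the 76 QCs and in 20 of the 300 samples.
# Evaluated per group, the QC group passes (40 >= 0.5 * 76).
two_groups = keeps_feature({"sample": 20, "QC": 40}, {"sample": 300, "QC": 76})

# The same 60 peaks evaluated against one group of 376 fail (60 < 0.5 * 376).
one_group = keeps_feature({"all": 60}, {"all": 376})

print(two_groups, one_group)
```

Under this assumption, splitting the data into smaller groups can only keep the same or more features, which would be consistent with 999 features for two groups versus 712 for one.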

Pembs closed this as completed Jun 6, 2023
Pembs reopened this Jun 6, 2023
Pembs changed the title from "Why are there less features when specifying sample groups ( samples or QCs) vs not?" to "How are features obtained if you don't specify any class information for grouping?" Jun 6, 2023
jorainer (Collaborator) commented

The correspondence method for xcmsSet silently uses sampclass(object), with object being the xcmsSet. I assume the xcmsSet puts all samples into one group if it's not specified, but there is also an automatic sample group estimation based on the folders in which the provided files are stored.
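As a rough illustration of what a folder-based group estimation could look like, the sketch below assigns each file the name of its parent folder as its group. The function `groups_from_folders` and the file paths are hypothetical, and the sketch is in Python; the actual rule xcms applies may differ.

```python
from pathlib import PurePosixPath


def groups_from_folders(paths):
    """Use each file's parent folder name as its sample group.

    Files with no parent folder fall back to a single group named "all".
    """
    return [PurePosixPath(p).parent.name or "all" for p in paths]


# Hypothetical file layout with one folder per class, plus one loose file.
files = ["data/QC/qc01.mzML", "data/sample/s01.mzML", "run.mzML"]
groups = groups_from_folders(files)
print(groups)
```

With a rule like this, files that all sit in the same folder end up in one group, so any per-group threshold would be evaluated against the full set of samples.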
