Order of operations for applying filters.

> Major question/concern: The order in which the filters are applied will affect the results. Currently, the filters are applied in the order Fraction, Fragment, and then Taxonomy. Which sequences pass the Fraction filter is somewhat arbitrary and has nothing to do with the biology; that filter just blindly removes sequences based on a modulo boolean check. Whatever sequences remain may still fail the other filters. This has the potential to really hamstring an analysis.
> 
> Applying the Fraction filter last would let the other filters thoroughly remove the biologically uninteresting sequences first, leaving a set of sequences that we know the user is interested in, from which the Fraction filter can then remove sequences in its arbitrary, biology-agnostic fashion.
> 
> But changing the order of filtering operations may return results that differ from those of the EFI v1 code base.

_Originally posted by @rbdavid in https://github.com/EnzymeFunctionInitiative/EST/pull/151#discussion_r2040025459_
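
To make the concern concrete, here is a minimal Python sketch of how the two orderings can diverge. It is not the EST implementation: the `Seq` record, its field names, the three filter functions, and the assumption that the Fraction filter keeps every N-th sequence by position are all hypothetical stand-ins for the "modulo boolean check" described above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Seq:
    # Hypothetical record; these fields are illustrative, not EST's schema.
    acc: str
    is_fragment: bool
    taxon: str


def fraction_filter(seqs, fraction):
    # Assumed behaviour: keep every `fraction`-th sequence by position.
    return [s for i, s in enumerate(seqs) if i % fraction == 0]


def fragment_filter(seqs):
    # Drop sequences flagged as fragments.
    return [s for s in seqs if not s.is_fragment]


def taxonomy_filter(seqs, taxon):
    # Keep only sequences from the requested taxon.
    return [s for s in seqs if s.taxon == taxon]


seqs = [
    Seq("A1", True, "Bacteria"),
    Seq("A2", False, "Bacteria"),
    Seq("A3", False, "Archaea"),
    Seq("A4", False, "Bacteria"),
    Seq("A5", True, "Bacteria"),
    Seq("A6", False, "Bacteria"),
]

# Current order: Fraction, then Fragment, then Taxonomy.
fraction_first = taxonomy_filter(
    fragment_filter(fraction_filter(seqs, 2)), "Bacteria"
)

# Proposed order: Fragment, then Taxonomy, then Fraction.
fraction_last = fraction_filter(
    taxonomy_filter(fragment_filter(seqs), "Bacteria"), 2
)

print([s.acc for s in fraction_first])
print([s.acc for s in fraction_last])
```

With this toy input, the Fraction-first order happens to keep only sequences that the later filters reject, so nothing survives, while the Fraction-last order retains A2 and A6. Real data will rarely be this extreme, but the sketch shows why the two orders are not interchangeable and why results could differ from EFI v1.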


> I would put this as a low-priority question to ask John/Remi later this summer. It can be added as a low-priority issue if you want.

_Originally posted by @nilsoberg in https://github.com/EnzymeFunctionInitiative/EST/pull/151#discussion_r2047429454_
            

Order of operations for applying filters. #161