Need to build a specification verification system for all experiment sheets #4
Basically, this would be a full-featured 'sanity checker' for any experiment configuration. It would run on any command to Aerobio and determine that all the configuration sheets are properly specified for the experiment in question. This issue will be for enumerating all the possible things we can think of to check for. It will likely remain open for some time... To start off:
The idea here is that if some 'samples' fastq.gz files seem to be 'empty', it may well be due to incorrect library barcodes used in defining the samples.
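A minimal sketch of that size-based warning, in Python purely for illustration. The function names and the `min_bytes` threshold are hypothetical and not part of Aerobio; an "empty" fastq.gz is still a few dozen bytes of gzip framing, so the check compares against a small threshold rather than zero.

```python
import os


def fastq_sizes(paths):
    """Map each fastq.gz path to its size on disk in bytes."""
    return {path: os.path.getsize(path) for path in paths}


def flag_small_fastqs(sizes, min_bytes=1024):
    """Return the names whose size falls below min_bytes, sorted.

    min_bytes is an assumed cutoff; a real check would need a value
    tuned to the sequencing run.
    """
    return sorted(name for name, size in sizes.items() if size < min_bytes)
```

Anything flagged this way would only be a hint to recheck the library barcodes for those samples, not proof that they are wrong.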
Actually, making sure that SampleName/IDs do not repeat in the SampleSheet should be done generally!! This is because bcl2fastq will catch this, but it has no good way of relaying the error to Aerobio... Generally, this sort of error is due to a copy/paste of some strain-condition-repid where the repid is not incremented to reflect the replicates (they are all '1' or 'a' across several identical strain-condition names).
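The repeat check could look roughly like the following Python sketch. `duplicate_sample_names` is a hypothetical helper, and it assumes the SampleName/ID column has already been read out of the SampleSheet into a list of strings.

```python
from collections import Counter


def duplicate_sample_names(names):
    """Return {name: count} for every sample name that occurs more than once."""
    counts = Counter(names)
    return {name: n for name, n in counts.items() if n > 1}
```

For the copy/paste case described above, `duplicate_sample_names(["AbC-WTNoAB-1", "AbC-WTNoAB-1", "AbC-WTNoAB-2"])` reports that `AbC-WTNoAB-1` appears twice, which is the cue that a repid was not incremented.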
Make sure all names match, including case!! For example, 12,AbC-WTNoAB-1,ATCGATCG,GAGGCAGAAGC in the Exp sheet vs AbC-WTT0,AbC-WTNoAb,... in the Comparison sheet!
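One way to catch this is to look for Comparison sheet names that fail an exact match but succeed once case is ignored. The Python sketch below is illustrative only: `case_mismatches` is a hypothetical helper, and it assumes Exp sheet sample names have the form strain-condition-repid, so dropping the final `-repid` gives the strain-condition name used in the Comparison sheet.

```python
def case_mismatches(exp_sample_names, comparison_names):
    """Find comparison names that differ from an Exp sheet name only by case.

    Returns a list of (comparison_name, [exp_names_it_nearly_matches]).
    """
    # Assumption: the text after the last '-' is the replicate id.
    exp_conditions = {name.rsplit("-", 1)[0] for name in exp_sample_names}

    # Index the Exp sheet names by their case-folded form.
    by_folded = {}
    for cond in exp_conditions:
        by_folded.setdefault(cond.casefold(), set()).add(cond)

    mismatches = []
    for name in comparison_names:
        if name in exp_conditions:
            continue  # exact match, nothing to report
        near = by_folded.get(name.casefold())
        if near:
            mismatches.append((name, sorted(near)))
    return mismatches
```

With the names from the example above, `case_mismatches(["AbC-WTNoAB-1", "AbC-WTT0-1"], ["AbC-WTT0", "AbC-WTNoAb"])` flags `AbC-WTNoAb` as a near match for `AbC-WTNoAB`. Names that match nothing even after case folding are a separate error and are not reported by this sketch.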
OK, most of this is done in the new validation system. Only two parts are not yet done: using fastq sizes to warn of a possible barcode mess-up, and the check for repeating replicate names. I'm not sure about the barcode stuff anymore anyway, as that can only be checked once bcl2fastq and phase-0c have run for the sample fqs. So, that is likely just out of scope. So, I will close this and open one just for the repeating rep names.