Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

More flexible filename handling #82

Merged
merged 2 commits into from
Jan 27, 2024
Merged

Conversation

lisakmalins
Copy link
Contributor

This is a fix for #81, where run_DESeq_stats.R and Subsystems_DESeq_stats.R will fail if filenames do not follow the pattern experimental_#_... or control_#_....

Rather than splitting the filenames by underscores, the new code removes experimental_ or control_ from the beginning and .cleaned from the end in order to get the sample names for matching.

Also, I solved the problem of R wanting to change the counts to factors by setting the sample names as row names before transposing. (Previously, if the sample names were not numeric, transposing the dataframe would create columns that were mixed so R would convert all columns to factors. Pulling the sample names into the row names before transposing allows them to directly become column names without affecting the data types.)

Thank you and let me know if you have any questions!

Fix for issue transcript#81. In a nutshell, sample names are now extracted more precisely from filenames so that file naming can be more flexible.
@transcript transcript merged commit 04de463 into transcript:master Jan 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants