read multiple files and overlaying plots #4
Great job, man, and thank you for your contribution. Being able to plot a collection of samples will be extremely useful to the community. Please make sure to add your name to the documentation so that you get the credit.
7c317e7 into kassambara:master
I am glad you find the suggestions interesting, @kassambara. I will follow up with another pull request with the plotting functions for the other modules.
Hi @kassambara,
I have been working on a package similar to fastqcr, then I found out about yours, and I think it's well written. I would like to suggest some code to enable a visual comparison between multiple samples:
`qc_read_collection`: a simple wrapper that uses your function `qc_read` to read multiple FastQC files in a way consistent with the other package functions. The output of this function is an object of class `qc_read_collection` that can be used to make overlaying plots.

This is supposed to work along the lines of the original function, so the output is a list of tibbles, and each tibble has an extra column `sample` to track the original file.

Output of `r head(qc)`:

    $per_sequence_gc_content
       sample GC Content Count
    1      S1          0  81.0
    2      S1          1  44.0
    3      S1          2  14.0
    4      S1          3  39.5
    5      S1          4  58.0
    6      S1          5  78.5
    7      S1          6 143.0
    8      S1          7 264.5
    9      S1          8 342.5
    10     S1          9 427.5

`plot_gc_content`: produces a line graph of the GC content of multiple samples.

And here is the output of calling `plot_gc_content_collection(qc)`: [plot image]

If you find these suggestions interesting, I can write the extensions for plotting the other modules and figure out a dispatch method on the original `qc_read` class and `qc_read_collection`.
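
For readers following along, here is a minimal base-R sketch of the two ideas described above: stacking per-sample module tables with an extra `sample` column, and dispatching on the `qc_read` and `qc_read_collection` classes. It is not the code from this pull request. The names `collect_modules`, `gc_table`, `plot_gc_overlay` and `toy_qc` are hypothetical, the S2 counts are invented toy data, and base graphics stand in for whatever plotting approach the package actually uses.

```r
# Hypothetical helper (not part of fastqcr): merge per-sample module lists
# into one list of data frames, tagging each row with its sample name.
collect_modules <- function(per_sample) {
  stopifnot(is.list(per_sample), length(per_sample) > 0,
            !is.null(names(per_sample)))
  modules <- names(per_sample[[1]])
  out <- lapply(modules, function(m) {
    parts <- lapply(names(per_sample), function(s) {
      df <- per_sample[[s]][[m]]
      data.frame(sample = rep(s, nrow(df)), df,
                 check.names = FALSE, stringsAsFactors = FALSE)
    })
    do.call(rbind, parts)
  })
  names(out) <- modules
  structure(out, class = c("qc_read_collection", "list"))
}

# S3 generic with one method per class, so the same call works on a single
# sample or on a collection. The generic name is made up for this sketch.
gc_table <- function(x, ...) UseMethod("gc_table")

gc_table.qc_read <- function(x, sample = "sample", ...) {
  df <- x$per_sequence_gc_content
  data.frame(sample = rep(sample, nrow(df)), df,
             check.names = FALSE, stringsAsFactors = FALSE)
}

gc_table.qc_read_collection <- function(x, ...) {
  x$per_sequence_gc_content
}

# Overlay one line per sample using base graphics (an assumption; the
# pull request's own plotting function is not shown in this thread).
plot_gc_overlay <- function(x) {
  df <- gc_table(x)
  samples <- unique(df$sample)
  plot(range(df[["GC Content"]]), range(df$Count), type = "n",
       xlab = "GC content", ylab = "Count")
  for (i in seq_along(samples)) {
    d <- df[df$sample == samples[i], ]
    lines(d[["GC Content"]], d$Count, col = i)
  }
  legend("topright", legend = samples, col = seq_along(samples), lty = 1)
  invisible(df)
}

# Toy stand-in for the result of reading one FastQC file. Only the first
# three S1 counts come from the output shown above; S2 is invented.
toy_qc <- function(counts) {
  gc <- data.frame(`GC Content` = seq_along(counts) - 1, Count = counts,
                   check.names = FALSE)
  structure(list(per_sequence_gc_content = gc), class = c("qc_read", "list"))
}

per_sample <- list(S1 = toy_qc(c(81, 44, 14)), S2 = toy_qc(c(10, 20, 30)))
qc <- collect_modules(per_sample)
```

With real data, `per_sample` could presumably be built with something like `lapply(files, fastqcr::qc_read)` over a named vector of FastQC result paths, and `plot_gc_overlay(qc)` would then draw one curve per sample.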