You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I hope you are having a great day! Thank you again for creating and maintaining this awesome pipeline. It has been extremely useful for a project I have involving A-I editing.
I just wanted to point out an issue I noticed while running FLARE. I will submit a PR shortly, but here is a brief description of the problem below.
I first noticed the issue while viewing some of the output files of SALIOR/FLARE within IGV. In the image(s) below, I have zoomed into a random region. In this figure, the top/bottom coverage tracks are colored by allele frequency.
For this specific project, we were interested in A-I editing sites, so we are looking for A>G editing on the forward strand. These sites are represented as green and yellow coverage bars at any given location.
As you can see here, SAILOR is doing an awesome job calling the A>G sites, and FLARE is also doing an amazing job calling A-I editing regions for the WT samples. But for my KO samples, FLARE did not call this region. After taking a closer look, I noticed something strange in my KO samples.
I tried to grep for this gene within the KO samples' FLARE results, and there were no hits (as expected by the IGV screenshot), so I started looking back further into FLARE's previous output files, and I noticed this had gene inf background_rates.
For this particular region, the math works out to: (2-1)/(22-22). This causes more issues later on in the script here when finding the mean (resulting in inf values) and then the background_rate calculation (resulting in inf values again).
I will submit a PR to fix the issue now. Please feel free to review it whenever you have some free time. Thank you again for your time, and thank you again for creating this awesome pipeline! Please let me know if you have any questions, and have a wonderful weekend!
Hello there,
I hope you are having a great day! Thank you again for creating and maintaining this awesome pipeline. It has been extremely useful for a project I have involving A-I editing.
I just wanted to point out an issue I noticed while running FLARE. I will submit a PR shortly, but here is a brief description of the problem below.
I first noticed the issue while viewing some of the output files of SALIOR/FLARE within IGV. In the image(s) below, I have zoomed into a random region. In this figure, the top/bottom coverage tracks are colored by allele frequency.
For this specific project, we were interested in A-I editing sites, so we are looking for A>G editing on the forward strand. These sites are represented as green and yellow coverage bars at any given location.
As you can see here, SAILOR is doing an awesome job calling the A>G sites, and FLARE is also doing an amazing job calling A-I editing regions for the WT samples. But for my KO samples, FLARE did not call this region. After taking a closer look, I noticed something strange in my KO samples.
I tried to grep for this gene within the KO samples' FLARE results, and there were no hits (as expected by the IGV screenshot), so I started looking back further into FLARE's previous output files, and I noticed this had gene
inf
background_rates.So, then I looked into the step that created this output file, and I found where the error was occurring.
In this script on line 45, there is an edge-case where division by 0 can occur:
![image](https://private-user-images.githubusercontent.com/18038345/253693303-ff63eba6-8f70-40af-8b59-813b377e7ac3.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjI2MTE4MDksIm5iZiI6MTcyMjYxMTUwOSwicGF0aCI6Ii8xODAzODM0NS8yNTM2OTMzMDMtZmY2M2ViYTYtOGY3MC00MGFmLThiNTktODEzYjM3N2U3YWMzLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA4MDIlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwODAyVDE1MTE0OVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTlhMjdjYmQ2OTMxN2ViNzBhZmMzYjIwNzg1MGE3MjYxYmEzYWE2Njk3NzMwOTQxZGM1MjM4OTQ3NDAzZGE1MDMmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.8Q5svZqrez1gqNEjssut5uJBNWx23aYiX_5sED0zbpE)
For my dataset, it happened right here:
For this particular region, the math works out to:
(2-1)/(22-22)
. This causes more issues later on in the script here when finding the mean (resulting in inf values) and then the background_rate calculation (resulting in inf values again).I will submit a PR to fix the issue now. Please feel free to review it whenever you have some free time. Thank you again for your time, and thank you again for creating this awesome pipeline! Please let me know if you have any questions, and have a wonderful weekend!
Best Regards,
@skchronicles
The text was updated successfully, but these errors were encountered: