Update batch processing length normalization to match non-batch processing length normalization #441

bound-to-love · 2024-05-15T10:05:41Z

No description provided.

update help print statements

mschilli87

I had a quick look and thinks this needs work.

mschilli87 · 2024-05-15T11:11:07Z

src/main.cpp

       //<< "    --error_rate              Estimated error rate of long reads (required for --long)" << endl
-       << "    --threshold		            Threshold for rate of unmapped kmers per read" << endl


This looks accidental.

Could you clarify what looks accidental?

According to the diff on github, you are removing whitespace here:

In line 2073 you fix the alignment of 'Treat' but in line 2075, 'Threshold' is shifted to left too much.

mschilli87 · 2024-05-15T11:11:25Z

src/main.cpp

@@ -2180,7 +2180,7 @@ void usageTCCQuant(bool valid_input = true) {
       << "                              (default: equivalence classes are taken from the index)" << endl
       << "-f, --fragment-file=FILE      File containing fragment length distribution" << endl
       << "                              (default: effective length normalization is not performed)" << endl
-       << "--long			                    Use version of EM for long reads " << endl 


Could you clarify your question?

The same problem as in line 2075.

mschilli87 · 2024-05-15T11:12:15Z

src/main.cpp

@@ -2380,7 +2380,7 @@ int main(int argc, char *argv[]) {
              	         if (fld_lr_c[i] > 0.5) {
 		             //Good results with comment below. 
 		             //flensout_f << std::fabs((double)fld_lr[i] / (double)fld_lr_c[i] - index.k);//index.target_lens_[i] - (double)fld_lr[i] / (double)fld_lr_c[i] - k); // take mean of recorded uniquely aligning read lengths 
- 		             flensout_f << std::fabs(index.target_lens_[i] - ((double)fld_lr[i] / (double)fld_lr_c[i]) - index.k);


Care to elaborate a bit? Ideally in comment, else maybe at least in the commit message?

Hi! Based on our analysis for effective length normalization for long reads, the updated effective length provides better results.

I still don't follow what you are changing and why but I am not familiar with the code. So if this makes sense to others without further explanation, feel free to ignore my comment.

mschilli87

@bound-to-love: I tried to answer your questions.

mschilli87 · 2024-05-15T16:04:02Z

src/main.cpp

@@ -2380,7 +2380,7 @@ int main(int argc, char *argv[]) {
              	         if (fld_lr_c[i] > 0.5) {
 		             //Good results with comment below. 
 		             //flensout_f << std::fabs((double)fld_lr[i] / (double)fld_lr_c[i] - index.k);//index.target_lens_[i] - (double)fld_lr[i] / (double)fld_lr_c[i] - k); // take mean of recorded uniquely aligning read lengths 
- 		             flensout_f << std::fabs(index.target_lens_[i] - ((double)fld_lr[i] / (double)fld_lr_c[i]) - index.k);


I still don't follow what you are changing and why but I am not familiar with the code. So if this makes sense to others without further explanation, feel free to ignore my comment.

mschilli87 · 2024-05-15T16:04:06Z

src/main.cpp

@@ -2180,7 +2180,7 @@ void usageTCCQuant(bool valid_input = true) {
       << "                              (default: equivalence classes are taken from the index)" << endl
       << "-f, --fragment-file=FILE      File containing fragment length distribution" << endl
       << "                              (default: effective length normalization is not performed)" << endl
-       << "--long			                    Use version of EM for long reads " << endl 


The same problem as in line 2075.

mschilli87 · 2024-05-15T16:04:09Z

src/main.cpp

       //<< "    --error_rate              Estimated error rate of long reads (required for --long)" << endl
-       << "    --threshold		            Threshold for rate of unmapped kmers per read" << endl


According to the diff on github, you are removing whitespace here:

In line 2073 you fix the alignment of 'Treat' but in line 2075, 'Threshold' is shifted to left too much.

bound-to-love added 3 commits May 13, 2024 19:43

Update main.cpp

0aec950

Update main.cpp

d553ea2

Update main.cpp

55b0749

update help print statements

mschilli87 suggested changes May 15, 2024

View reviewed changes

Merge branch 'devel' into master

5db77fc

Yenaled merged commit 98ec302 into pachterlab:devel May 16, 2024

Yenaled mentioned this pull request May 16, 2024

Revert "Update batch processing length normalization to match non-batch processing length normalization" #442

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update batch processing length normalization to match non-batch processing length normalization #441

Update batch processing length normalization to match non-batch processing length normalization #441

bound-to-love commented May 15, 2024

mschilli87 left a comment

mschilli87 May 15, 2024

bound-to-love May 15, 2024

mschilli87 May 15, 2024

mschilli87 May 15, 2024

bound-to-love May 15, 2024

mschilli87 May 15, 2024

mschilli87 May 15, 2024

bound-to-love May 15, 2024

mschilli87 May 15, 2024

mschilli87 left a comment

mschilli87 May 15, 2024

mschilli87 May 15, 2024

mschilli87 May 15, 2024

		//<< " --error_rate Estimated error rate of long reads (required for --long)" << endl
		<< " --threshold Threshold for rate of unmapped kmers per read" << endl

Update batch processing length normalization to match non-batch processing length normalization #441

Update batch processing length normalization to match non-batch processing length normalization #441

Conversation

bound-to-love commented May 15, 2024

mschilli87 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mschilli87 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment