Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

compute the "ideal" LSE to compare to the actual LSE output by GRNmap #96

Closed
kdahlquist opened this issue May 27, 2015 · 13 comments
Closed

Comments

@kdahlquist
Copy link
Owner

The minimum of the LSE would be attained when Log2 xi (t, theta) - 1/# flasks average of datapoints (not sure if this is transcribed right)

@kdahlquist
Copy link
Owner Author

@GraceJohnson, it occurred to me that you should add up the LSEs from all of the single strain runs and then compare it to the all strain run, otherwise it isnt a fair comparison.

@bengfitzpatrick
Copy link
Collaborator

alternatively, we should compare single strain estimations to ideal single strain sums of squares.

@GraceJohnson
Copy link
Collaborator

@bengfitzpatrick I have compared single strain estimation (for wt) to its ideal sum of squares. I also computed the ideal using all five strains (wt plus the four deletion strains), by following the same procedure and adding up all the sums and dividing by the grand total of data points. Is this how the code computes the LSE when data from multiple strains must be estimated? Or does is add up the individual LSE values for each strain, resulting in a higher total LSE?

@bengfitzpatrick
Copy link
Collaborator

@GraceJohnson can you provide some numbers related to your comparisons?

@GraceJohnson
Copy link
Collaborator

@bengfitzpatrick This page contains MATLAB output LSE values for several strains, alone and combined: http://openwetware.org/wiki/GRNmap_Testing_Report:_Strain_Run_Comparisons_2015-05-27. As each strain is added to the analysis, the LSE ouput by the code seems to increase linearly (i.e. the LSE for one strain is around 7, for two strains around 15, etc.). For the ideal sum of squares calculations, I got an LSE 0.4875 for wt alone and 0.5520 for all strains. I will email you the spreadsheet I used to get these values.

@kdahlquist
Copy link
Owner Author

@bengfitzpatrick doesn't think that the LSE computed for the multiple strains is being computed correctly. It might be dividing by the number of datapoints for a single strain, but it should be dividing by the total number of datapoints. He needs to check the MATLAB code.

What should be put out in diagnostic worksheet #106 is

  • LSE
  • penalty
  • minimum (ideal) LSE

@trixr4kdz
Copy link
Collaborator

@bengfitzpatrick I've merged beta with master, but I didn't fix the order of the worksheets yet. You can check the code now though since I've just finished the release

@bengfitzpatrick
Copy link
Collaborator

@trixr4kdz I downloaded the master last night. When did you do the merge?

@trixr4kdz
Copy link
Collaborator

I merged them yesterday, before 7 I think

@bengfitzpatrick
Copy link
Collaborator

@trixr4kdz sehr gut. I aim to post a new beta by Monday with the diagnostics spread sheet added.

@kdahlquist
Copy link
Owner Author

@GraceJohnson should test this (beta branch)

@GraceJohnson
Copy link
Collaborator

Beta tested. New version correctly computes min LSE. Notes and files can be found here: http://openwetware.org/wiki/Katherine_Grace_Johnson_Electronic_Lab_Notebook#June_15.2C_2015

In addition, for the strain I ran (wt alone), the LSE was only slightly more than the min LSE, 0.508 compared to 0.487. Correspondingly, the model outputs are more dynamic and seem to fit the data's average. However, it should be noted that, with an alpha of 0.001 and estimating P and b, this run took over 400,000 iterations and 2.5 hours.

@kdahlquist
Copy link
Owner Author

verified and closing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants