proposed conversion from u_kln->u_kn formalism #83

mrshirts · 2014-07-09T23:18:52Z

includes a slightly modified harmonic_oscillator_update.py, which gives identical behavior with the new code to pymbar-examples/harmonic_oscillator.py run with the previous code up to numerical precision.

This new code automatically converts from old-style u_kln matrices internally.

It supports all existing functionality, including updating the weave code and the _pymbar.c code.

The one major difference is in handling computeExpectations. This change hopefully provides a much more general solution which can possibly reduce the total amount of repetitive code as well.

Previously, computeExpectations recognized whether observables were state_dependent or state_independent by whether the observable array was KxN_max, or KxLxN_max. Now, this is done by giving an explicit state_dependent flag. If state_dependent is False, computeExpectations handles the code like the 2D version before (although now the array is 1D, the observable at each state).
The big difference is if state_dependent is True. In this case, computeExpectations expects K different observables (the K different observables at each state - for example, the potential energies in each of the K states), as well as K different energies (which defaults to just the u_kn matrix MBAR was generated with). It then calculates the expectations of the first observable and the first state, 2nd observable and the 2nd state, and so forth, which is what the previous code did.

In order to make this happen, I've created a computeGeneralExpectations routine, that takes in I observable arrays and K state observables (see see #89 ).

I'm up for lots of other improvements; but I though it would be easier to start by solving the biggest problem first independently of other issues.

kyleabeauchamp · 2014-07-09T23:23:42Z

Do you know why this can't be automatically merged? It seems like your branch is messed up somehow.

kyleabeauchamp · 2014-07-09T23:24:21Z

Also, eventually we should probably get all tests running via nose, but we can help you migrate your tests later.

mrshirts · 2014-07-09T23:43:13Z

No idea why it can't be merged automatically. I can try to find a way to commit it some other way as a patch if you want -- it's two clean commits, plus manually adding the master patch which was on top of it.

Nose tests can be done after in the polishing if we like what's here.

kyleabeauchamp · 2014-07-09T23:44:15Z

Are you basing this on a branch that doesn't include your recently merged pull request? I think that could be the issue.

mrshirts · 2014-07-09T23:53:11Z

Yes. It's a trivial change to do, but perhaps GitHub can't handle. I supposed I should have rebased onto that version first! If you can't think of an easy way, I can try to just create a new branch with the changes.

mrshirts · 2014-07-09T23:54:01Z

Resolving the conflicts will be trivial, I just need to figure out how to do the right resolves . . .

kyleabeauchamp · 2014-07-09T23:55:40Z

Maybe this page helps:

http://blog.michelemattioni.me/2013/01/29/what-do-i-do-when-my-pull-request-does-not-merge-automatically-in-master/

mrshirts · 2014-07-10T00:03:00Z

OK, looks better now. Travis CI will probably fail, I'm guessing (since it wasn't designed to get it right). Might be best to check it out independently and look at it directly for the logic. Not that important to get it perfect, since it's in its own branch.

mrshirts · 2014-07-10T00:05:42Z

Looks like it failed on the imports? I'm not really sure why, that didn't change at all.

kyleabeauchamp · 2014-07-10T00:06:04Z

That's the version issue which I will look into independently.

kyleabeauchamp · 2014-07-10T00:06:34Z

I guess you've made lots of changes to pymbar.py? It seems like GitHub can't even show the diff because there are so many changes. Does that sound right?

mrshirts · 2014-07-10T00:18:50Z

Yes, mbar.py has a lot of changes. All pretty straightforward, but a lot. Lots of overhaul required to change the entire formalism.

kyleabeauchamp · 2014-07-15T15:02:57Z

So I've gone through most of the code (halfway through mbar.py, through the rest). Overall things are looking good. Below is what I think is the to-do list. For clarity, we can make some of these changes in later pull requests where a single pull request is a single "transformation" of the code. However, things that are obvious bugs should probably get fixed before merging IMHO.

Run pyflakes to fix various possible syntax errors (see below for log)
Move MBAR._kln_to_kn(u_kn) and MBAR._kn_to_n() to separate functions in utils.py. These function are generally useful beyond just MBAR and should be exposed to users looking to migrate their workflows to the new style.
"# take the square root of the matrix" we should probably change this comment, as it's not the "square root of a matrix"
Pull the various #todo comments into separate issues
Sort out useGeneral stuff--right now, we've doubled the number of branches in computeExpectations(). I would prefer reducing the branching and having separate member functions (private?) for the different ways of calculating things. IMHO having product-spaces of code branches inevitably leads to corner case errors.
Next time we make a giant change like this, we should separate the numpy -> np conversion to a separate pull request that only contains that change. That will make our bookkeeping and code review much simpler and hopefully make it easier to identify bugs. This will also make our lives easier if we ever need to go back and look at the log of code changes.

pymbar/mbar.py:71: 'ConvergenceError' imported but unused
pymbar/mbar.py:71: 'BoundsError' imported but unused
pymbar/mbar.py:72: 'pdb' imported but unused
pymbar/mbar.py:197: '_pymbar' imported but unused
pymbar/mbar.py:441: undefined name 'numpy'
pymbar/mbar.py:524: local variable 'L' is assigned to but never used
pymbar/mbar.py:736: undefined name 'general_returns'
pymbar/mbar.py:818: undefined name 'dot'
pymbar/mbar.py:818: local variable 'T_kk' is assigned to but never used
pymbar/mbar.py:1230: undefined name 'numpy'
pymbar/mbar.py:1361: undefined name 'numpy'
pymbar/mbar.py:1633: undefined name 'bin_k'
pymbar/mbar.py:1659: undefined name 'bin_kn'
pymbar/mbar.py:2134: undefined name 'x_kindices'
pymbar/mbar.py:2134: undefined name 'x_kindices'
pymbar/mbar.py:2138: undefined name 'x_kindices'
pymbar/mbar.py:2138: undefined name 'x_kindices'
pymbar/mbar.py:2187: local variable 'z' is assigned to but never used
pymbar/mbar.py:2193: local variable 'K' is assigned to but never used
pymbar/mbar.py:2194: local variable 'f_k' is assigned to but never used
pymbar/mbar.py:2195: local variable 'N' is assigned to but never used
pymbar/mbar.py:2196: local variable 'N_k' is assigned to but never used
pymbar/mbar.py:2197: local variable 'u_kn' is assigned to but never used
pymbar/mbar.py:2226: local variable 'info' is assigned to but never used
pymbar/mbar.py:2532: undefined name 'N_k'

kyleabeauchamp · 2014-07-15T15:03:15Z

I've also run the test script on my machine--it seemed to be working OK.

kyleabeauchamp · 2014-07-15T17:19:28Z

Possible typo in docstring of computePMF(): "bin_n[n]" (later called "bin_kn")

Also in computePMF_states()

kyleabeauchamp · 2014-07-15T17:38:57Z

OK I made a complete pass through the code and have no additional complaints at this time...

mrshirts · 2014-07-16T01:48:38Z

Commenting one at a time;

Next time we make a giant change like this, we should separate the numpy -> np conversion to a
separate pull request that only contains that change. That will make our bookkeeping and code
review much simpler and hopefully make it easier to identify bugs. This will also make our lives easier > if we ever need to go back and look at the log of code changes.

Ah, my bad. I was getting carried away with editing and getting tired of writing numpy instead of np. Should I revert that change in the pull request, and then we put it back in in separate commit?

mrshirts · 2014-07-16T01:50:30Z

"# take the square root of the matrix" we should probably change this comment, as it's not the "square root of a matrix"

Fixed (will update the pull request after number of these).

mrshirts · 2014-07-16T02:41:53Z

Run pyflakes to fix various possible syntax errors (see below for log)

Done, except for ones that are caused by weave or importing the helper functions (pyflakes doesn't seem to know about the syntax there).

jchodera · 2014-07-16T02:47:01Z

We seem to be having the import version error we've had with other projects here now too:
https://travis-ci.org/choderalab/pymbar/builds/29558424#L518

jchodera · 2014-07-16T02:50:49Z

@mrshirts: @kyleabeauchamp is out for a week, so it's up to us to proceed, I think.

Do all the nosetests work on your local version? If so, I wonder if we should merge this and debug the from pymbar import version issue separately?

mrshirts · 2014-07-16T03:01:43Z

@kyleabeauchamp is out for a week, so it's up to us to proceed, I think.

Ah, OK. I'll probably get the quick fixes in. The one remaining item is point 5, Expectations vs. General Expectations. Let's have that conversation when Kyle comes back.

I'll check the nosetests again. If it passes those, and when everything except point 5 is resolved, yes, I think we should go ahead and merge it in to the kn branch. Maybe hold off merging the branches.

jchodera · 2014-07-16T03:09:17Z

What is point 5?

jchodera · 2014-07-16T03:09:46Z

Most importantly, did you fix the undefined variable names? Those are actual bugs which apparently don't have corresponding tests to catch them.

mrshirts · 2014-07-16T03:18:20Z

What is point 5?

[5] Sort out useGeneral stuff

from Kyle's big comment. I'll file a separate issue on this -- it deserves some thinking. The current state of computeExpectations is not what we want, that's just to demonstrate the potential new approach (And that it can be made equivalent to what we want to do next).

Current approach - writing very similar functions 10x
Potential new approach- Writing one somewhat more complex general function with 10 wrappers.

mrshirts · 2014-07-16T03:18:54Z

Most importantly, did you fix the undefined variable names? Those are actual bugs which apparently > don't have corresponding tests to catch them.

Yes, those are fixed, mod ones that are actually weird syntax due to C++ imports or weave.

jchodera · 2014-07-16T03:24:57Z

If the current approach works, then we can work on simplifying the codebase in subsequent PRs. If the current approach is actually "everything is broken", we've got a bit of a problem...

mrshirts · 2014-07-16T03:31:01Z

Current approach works. this is a question of simplifying the codebase (and in parallel, the theory itself).

jchodera · 2014-07-16T04:31:30Z

Great. Go ahead and merge, and we'll simplify in subsequent PRs.

Please make small, individual changes in future! And as soon as you start working on a feature, open a new PR and title it "My new feature [WIP]" to denote work in progress...

mrshirts · 2014-07-16T13:40:09Z

Please make small, individual changes in future! And as soon as you start working on a feature,
open a new PR and title it "My new feature [WIP]" to denote work in progress...

I think in this case, most of it all had to go in together so that the u_kn infrastructure could be supported. I agree smaller changes are usually better.

jchodera · 2014-07-16T13:50:57Z

I think in this case, most of it all had to go in together so that the u_kn infrastructure could be supported. I agree smaller changes are usually better.

Logically grouped changes should occur together, but as Kyle suggested, this shouldn't include things like changing numpy. to np. at the same time, since it completely eliminates our ability to do meaningful code review by looking at the autogenerated diffs. That's all that was meant!

mrshirts · 2014-07-16T14:44:58Z

Sounds good. Also, the latest pull request reverted the numpy change.

jchodera · 2014-07-16T18:25:06Z

Thanks! OK, I'll merge this, and we can work out the import version issues separately!

Extensive conversion from u_kln->u_kn formalism. Still need to debug `import version` issues with travis-ci.

Michael Shirts added 3 commits July 9, 2014 18:35

Updated to be essentially working mapping of u_kln to u_kn.

51598ae

Added a test script that works with the updated u_kln->u_kn code.

ac9111a

Manually adding a change to make manual merging easier.

f3da3a3

resolving issues between master and nk, all in pymbar/__init__.py

e7a5237

Address a number of issues raied with the pull request.

eeb3ecb

jchodera added a commit that referenced this pull request Jul 16, 2014

Merge pull request #83 from mrshirts/nk

7041caa

Extensive conversion from u_kln->u_kn formalism. Still need to debug `import version` issues with travis-ci.

jchodera merged commit 7041caa into choderalab:nk Jul 16, 2014

kyleabeauchamp mentioned this pull request Jul 28, 2014

Move to supporting a NxK data representation #77

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

proposed conversion from u_kln->u_kn formalism #83

proposed conversion from u_kln->u_kn formalism #83

mrshirts commented Jul 9, 2014

kyleabeauchamp commented Jul 9, 2014

kyleabeauchamp commented Jul 9, 2014

mrshirts commented Jul 9, 2014

kyleabeauchamp commented Jul 9, 2014

mrshirts commented Jul 9, 2014

mrshirts commented Jul 9, 2014

kyleabeauchamp commented Jul 9, 2014

mrshirts commented Jul 10, 2014

mrshirts commented Jul 10, 2014

kyleabeauchamp commented Jul 10, 2014

kyleabeauchamp commented Jul 10, 2014

mrshirts commented Jul 10, 2014

kyleabeauchamp commented Jul 15, 2014

kyleabeauchamp commented Jul 15, 2014

kyleabeauchamp commented Jul 15, 2014

kyleabeauchamp commented Jul 15, 2014

mrshirts commented Jul 16, 2014

mrshirts commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014

jchodera commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014

jchodera commented Jul 16, 2014

mrshirts commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014

proposed conversion from u_kln->u_kn formalism #83

proposed conversion from u_kln->u_kn formalism #83

Conversation

mrshirts commented Jul 9, 2014

kyleabeauchamp commented Jul 9, 2014

kyleabeauchamp commented Jul 9, 2014

mrshirts commented Jul 9, 2014

kyleabeauchamp commented Jul 9, 2014

mrshirts commented Jul 9, 2014

mrshirts commented Jul 9, 2014

kyleabeauchamp commented Jul 9, 2014

mrshirts commented Jul 10, 2014

mrshirts commented Jul 10, 2014

kyleabeauchamp commented Jul 10, 2014

kyleabeauchamp commented Jul 10, 2014

mrshirts commented Jul 10, 2014

kyleabeauchamp commented Jul 15, 2014

kyleabeauchamp commented Jul 15, 2014

kyleabeauchamp commented Jul 15, 2014

kyleabeauchamp commented Jul 15, 2014

mrshirts commented Jul 16, 2014

mrshirts commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014

jchodera commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014

jchodera commented Jul 16, 2014

mrshirts commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014

mrshirts commented Jul 16, 2014

jchodera commented Jul 16, 2014