Inclusion of ENCORE into MDAnalysis #797

mtiberti · 2016-03-23T09:34:02Z

Fixes #

Changes made in this Pull Request:

Added ArrayReader trajectory reader
Added encore module under MDAnalysis.analysis

PR Checklist

Tests?
Docs?
CHANGELOG updated?
Issue raised/referenced?

into HEAD

- added header to similarity
- Updated header to Ensemble
- added examples for
  - hes()
  - ces()
  - dres()
  - Ensemble
- added docstrings for hes(), ces(), dres()
- updated docstrings to follow numpy style doc
- updated codestyle to follow PEP8

…ptions

- updated examples to work
- added description of the stochastic nature of dres
- added note of stochastic nature of dres

- changed return values header back to numpy array

- changed returns back to numpy array
- added note about None in return array

…ysis into develop

…utput formats

…w passed to the individual methods instead. Removed coordinates attributes from Ensemble object; this is now delegated to the new ArrayReader trajectory reader. Added ArrayReader trajectory reader for reading from a numpy array through the standard trajectory interface.

…ysis into develop

Fixed minor issues in ArrayReader. Updated documentation strings in all Encore files.

coveralls · 2016-09-11T21:31:04Z

Coverage decreased (-3.2%) to 81.193% when pulling 43c2159 on encore-similarity:feature-encore into 396e040 on MDAnalysis:develop.

wouterboomsma · 2016-09-16T11:01:37Z

With #976 merged, I'll be returning my attention to this PR. We'll of course start by merging in the newest changes, which should simplify things a bit, now that the MemoryReader stuff is gone. However, it would be good to have a discussion about how best to proceed from here. Given the success of splitting out the MemoryReader into a separate PR, would it make sense to try and do the same for the clustering and dimensionality reduction code? This would only make sense if there is general consensus to move analysis.encore.clustering and analysis.encore.dimensionality_reduction into separate analysis components (so becoming analysis.clustering and analysis.dimensionality_reduction). I see that in the time it took us to finish this pull request, someone has contributed a PCA module. These efforts should of course be merged somehow if we follow this path. Any thoughts on this?

richardjgowers · 2016-09-16T14:58:16Z

package/MDAnalysis/analysis/encore/utils.py

+            yield (i, j)
+
+
+def merge_universes(universes):


Is this concatenating the trajectories of the Universes? It could do with more docs on how it combines the different Universes.

richardjgowers · 2016-09-16T15:05:58Z

package/MDAnalysis/coordinates/memory.py

@@ -0,0 +1,257 @@
+# -*- Mode: python; tab-width: 4; indent-tabs-mode:nil; coding:utf-8 -*-


I'm not sure why this is showing up in this PR after we merged this in another PR?

Yeah, sorry. As I mentioned in the comment right before yours, we were still in the process of merging in the newest changes, which is why MemoryReader still appeared. This is done now - so all MemoryReader changes should now be gone.

richardjgowers

Needs a few better error messages to avoid headaches later on.

richardjgowers · 2016-09-16T19:17:04Z

package/MDAnalysis/analysis/encore/utils.py

+        `scalar` : float
+            Scalar to multiply with.
+        """
+        newMatrix = TriangularMatrix(self.size)


Why can't this just be self._elements *= scalar, return self?

Thanks for your comments, I'm almost done addressing them. About this one: if we return self as you suggest, cases in which the returned value is assigned to another variable (as in tm2 = tm1 * scalar) would behave unexpectedly, because both the assigned variable (tm2) and the matrix being multiplied (tm1) would refer to the same modified object. The solution you're proposing would be a better fit for __imul__ (and of course for __iadd__). Any thoughts on this?

Ah yeah, it does need to return a new object, you're right - I'm thinking of everything as an __iadd__.
If you want to be super safe you could write this as newMatrix = self.__class__(self.size) which might save someone a headache if they ever subclass this.
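
A minimal sketch of the distinction discussed in this thread, not the PR's actual code: it assumes a toy TriangularMatrix that keeps a flat numpy array in _elements and takes only size in its constructor. The LabelledMatrix subclass is hypothetical and exists only to show what self.__class__ buys.

```python
import numpy as np


class TriangularMatrix(object):
    """Toy stand-in for the class under review (assumed layout, not the PR's)."""

    def __init__(self, size):
        self.size = size
        # Lower triangle including the diagonal, stored as a flat array.
        self._elements = np.zeros(size * (size + 1) // 2, dtype=np.float64)

    def __mul__(self, scalar):
        # Binary operator: leave self untouched and return a new object.
        # self.__class__ keeps the result's type correct for subclasses.
        new_matrix = self.__class__(self.size)
        new_matrix._elements = self._elements * scalar
        return new_matrix

    __rmul__ = __mul__

    def __imul__(self, scalar):
        # Augmented assignment: mutate in place and return self.
        self._elements *= scalar
        return self


class LabelledMatrix(TriangularMatrix):
    """Hypothetical subclass, used only to show that the type is preserved."""


tm1 = LabelledMatrix(3)
tm1._elements[:] = 1.0

tm2 = tm1 * 2.0   # new object, tm1 is unchanged
alias = tm1
tm1 *= 3.0        # in place, so alias sees the change too
```

With this split, tm2 = tm1 * scalar never modifies tm1, while tm1 *= scalar avoids allocating a second matrix.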

richardjgowers · 2016-09-16T19:17:32Z

package/MDAnalysis/analysis/encore/utils.py

+        return newMatrix
+
+    __rmul__ = __mul__
+


Could we add __add__ too?

richardjgowers · 2016-09-16T19:17:58Z

package/MDAnalysis/analysis/encore/utils.py

+import MDAnalysis as mda
+from ...coordinates.memory import MemoryReader
+
+class TriangularMatrix(object):


This is a good candidate for lib.util

richardjgowers · 2016-09-16T19:20:04Z