Managing default selection keywords #104

GoogleCodeExporter · 2015-04-04T00:41:58Z

Hi,
Following-up on a recent thread regarding issue with cholesterol selection, I 
think it'd be a good idea to have more control on the selection keywords used 
by MDAnalysis.
One the one hand having some default keywords built-in is nice and easy and 
will probably suit most users/common operations.

On the other terminology conflict may arise when trying to cover the 
nomenclatures used by different forcefields (e.g. CHO can refer to cholesterol 
but is also a special CHARMM residue).

To leverage on MDAnalysis powerful selection tools I'd suggest implementing a 
way to load one's own dictionary of selection keywords. Maybe the user could be 
alerted if the customised dictionary conflicts with the built-in dictionary and 
could choose whether or not to override it.

This would allow users to tweak dictionary to the particular 
forcefields/systems they use. 

What version of the product are you using? On what operating system?
MDAnalysis-0.7.5.1-py2.7-macosx-10.6-i386

Original issue reported on code.google.com by Jean.He...@gmail.com on 25 Apr 2012 at 10:31

The text was updated successfully, but these errors were encountered:

GoogleCodeExporter · 2015-04-04T00:41:58Z

Points for discussion:

- We could have a module-level dictionary with keywords such as 

  selection_keywords = {'protein': ['ALA', 'ARG', ..., 'VAL'],
                        'nucleic': ['ADE', 'URA', 'CYT', 'GUA', 'THY'],
                        ...}

  and have the selection classes dynamically look up the keywords. 

  This would make it possible for a user to hack (I mean, adapt...) the dictionary. One could probably provide some frontend getter/setter functions such as

  set_keywords('protein', ....)

  which could do some sanity checking. It could work similar to the way that matplotlib manages its rc parameters. 

  One could also use the MDAnalysis.core.Flags registry (which is, I think, somewhat similar to 'traits').

- Maybe we should consider creating a "rc" file for MDAnalysis, such as 

   ~/.MDAnalysisrc

  where defaults for the Flags and selection keywords are set. Then one could easily customize MDAnalysis for one's preferred force field.

One question is, how likely would one want to change the residue selection 
definitions during the run of a script, i.e. is it important to be able to 
change the definitions at run time or would a static initialization (e.g. 
purely through an rc file) suffice?

Original comment by orbeckst on 26 Apr 2012 at 10:35

Added labels: Type-Enhancement
Removed labels: Type-Defect

GoogleCodeExporter · 2015-04-04T00:41:59Z

I would suggest that an .rc file would suffice. The nomenclature that one uses 
may be depending on one's studied system or forcefield used but should be 
fairly constant.

Original comment by Jean.He...@gmail.com on 26 Apr 2012 at 10:49

GoogleCodeExporter · 2015-04-04T00:41:59Z

[deleted comment]

GoogleCodeExporter · 2015-04-04T00:41:59Z

My ten cents would be against a static loading. Atom naming conventions could 
cause conflicts within a set of selection keywords. For example, selection 
string working fine on atoms named according to naming convention/force-field 
A, could be incompatible with the same set of atoms named according to a 
different ff/convention.

Secondly, there is no reason why this has to be static, other than being 
simpler to code up (which is more of an excuse than a reason, in my view).

Original comment by jan...@gmail.com on 1 Dec 2012 at 4:59

GoogleCodeExporter · 2015-04-04T00:41:59Z

We could package default tables and configurations with MDAnalysis but provide 
means to read in user files. 

If so, what format should we choose for such files? Ad-hoc parsing (e.g. what 
we're currently doing with the hard-coded tables in tables.py) or ini-style 
(ConfigParser) or YAML (does python come with a default YAML parser) or XML or 
<insert suggestion here>?

We could still have an RC file but that would really only  specify which data 
files are to be loaded on startup. Without any entries (or without rc file!), 
MDAnalysis should just behave as before and read its internal defaults.

Parts of the code that could benefit from moving data into data/configuration 
files:
- topology building (atom names, masses, radii, ...)
- HBond analysis (define donor/acceptor heavy atoms)
- selections (what counts as protein, nucleic acid, lipid, ion, water, ...) 


For anyone interested: For GromacsWrapper 
https://github.com/orbeckst/GromacsWrapper I'm using ConfigParser and ini-style 
files to manage initialization and some data files although the use case is not 
quite the same as for MDAnalysis. Nevertheless, most of the logic is in 
gromacs.config 
https://github.com/orbeckst/GromacsWrapper/blob/develop/gromacs/config.py , see 
also the docs 
http://orbeckst.github.com/GromacsWrapper/gromacs/core/config.html and perhaps 
some of this could be useful for a configuration module for MDAnalysis — 
although I am certainly open to better solutions than what I hacked together 
:-).

Original comment by orbeckst on 2 Dec 2012 at 10:57

kritika12298 · 2020-03-14T17:12:38Z

Hello, I am looking forward to contribute to the project related to Atom for GSOC 2020.
Kindly guide me on how to get started.

IAlibay · 2020-03-14T17:21:01Z

Welcome to MDAnalysis @kritika12298,

If possible, we would ask you that you write an introduction on the MDAnalysis developer list. Initially I would suggest having a look at the MDAnalysis GSOC blog post and FAQ, and then work on an issue. Further discussions on the projects also can be had over the developer list.

GoogleCodeExporter added Priority-Medium Type-Enhancement labels Apr 4, 2015

orbeckst added enhancement API and removed Type-Enhancement labels Apr 9, 2015

orbeckst added Priority-Low and removed Priority-Medium labels May 20, 2015

orbeckst mentioned this issue Jun 11, 2015

Coarse-grained #301

Closed

orbeckst added the Component-Configuration label Jun 16, 2015

orbeckst mentioned this issue Jun 16, 2015

flexible configuration system #315

Closed

richardjgowers added the Component-Selections label Jul 11, 2015

orbeckst mentioned this issue Oct 24, 2015

MDAnalysis CG / MARTINI awareness #507

Open

orbeckst added the Component-Topology label Dec 15, 2015

orbeckst mentioned this issue Mar 18, 2016

remove flag registry #782

Closed

3 tasks

richardjgowers removed the auto-migrated label Nov 28, 2017

mimischi mentioned this issue Sep 25, 2018

Add "lipid" keyword to select_atoms #2082

Open

orbeckst mentioned this issue Jan 23, 2020

Allow for more flexibility with wildcard in selections #2436

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Managing default selection keywords #104

Managing default selection keywords #104

GoogleCodeExporter commented Apr 4, 2015

GoogleCodeExporter commented Apr 4, 2015

GoogleCodeExporter commented Apr 4, 2015

GoogleCodeExporter commented Apr 4, 2015

GoogleCodeExporter commented Apr 4, 2015

GoogleCodeExporter commented Apr 4, 2015

kritika12298 commented Mar 14, 2020

IAlibay commented Mar 14, 2020

Managing default selection keywords #104

Managing default selection keywords #104

Comments

GoogleCodeExporter commented Apr 4, 2015

GoogleCodeExporter commented Apr 4, 2015

GoogleCodeExporter commented Apr 4, 2015

GoogleCodeExporter commented Apr 4, 2015

GoogleCodeExporter commented Apr 4, 2015

GoogleCodeExporter commented Apr 4, 2015

kritika12298 commented Mar 14, 2020

IAlibay commented Mar 14, 2020