issue with the tutorial Chignolin_Coarse-Grained_Tutorial #23

pojeda · 2022-03-11T19:04:33Z

Hi,

I am trying the Chignolin tutorial but I got some errors at the step "PDB to PSF conversion":

---------------------------------------------------------------------------
InvalidIndexError                         Traceback (most recent call last)
Input In [2], in <cell line: 6>()
      3 PDB_file = 'data/chignolin_cln025.pdb'
      4 PSF_file = 'data/chignolin_ca_top.psf'
----> 6 pdb2psf_CA(PDB_file, PSF_file, bonds = True, angles = False)

File torchmd_cg/utils/psfwriter.py:7, in pdb2psf_CA(pdb_name_in, psf_name_out, bonds, angles)
      6 def pdb2psf_CA(pdb_name_in, psf_name_out, bonds=True, angles=True):
----> 7     mol = Molecule(pdb_name_in)
      8     mol.filter("name CA")
     10     n = mol.numAtoms

...

File pandas/core/indexes/base.py:5637, in Index._check_indexing_error(self, key)
   5633 def _check_indexing_error(self, key):
   5634     if not is_scalar(key):
   5635         # if key is not a scalar, directly raise an error (the code below
   5636         # would convert to numpy arrays and raise later any way) - GH29926
-> 5637         raise InvalidIndexError(key)

InvalidIndexError: []

Do you know about this issue?

The text was updated successfully, but these errors were encountered:

MaciejMajew · 2022-03-12T11:25:39Z

Hi! It seems like moleculekit is not able to read the molecule. You should check if your PDB file is not broken.

MaciejMajew · 2022-03-12T11:26:08Z

If you can share the file you're trying to convert I can take a look

pojeda · 2022-03-15T16:13:46Z

I attached the file

files.zip

MaciejMajew · 2022-03-17T09:33:57Z

I can't reproduce your error. You might be better off just reinstalling the package in the fresh environment. Sorry I cannot be of much help here. Maybe @stefdoerr will know better.

stefdoerr · 2022-03-17T11:11:08Z

I can't reproduce it either. I would also suggest installing a clean env with the latest moleculekit etc.

pojeda · 2022-03-18T09:31:51Z

I rebuilt torchmd-cg in a new env but still I get the same problem at the step of pdb to psf conversion. My only guess now is the python version, I am using 3.9.6. Have you installed and tested torchmd-cg with this python version?

InvalidIndexError                         Traceback (most recent call last)
Input In [4], in <cell line: 6>()
      3 PDB_file = 'data/chignolin_cln025.pdb'
      4 PSF_file = 'data/chignolin_ca_top.psf'
----> 6 pdb2psf_CA(PDB_file, PSF_file, bonds = True, angles = False)

File /env-torchmd/lib/python3.9/site-packages/torchmd_cg/utils/psfwriter.py:7, in pdb2psf_CA(pdb_name_in, psf_name_out, bonds, angles)
      6 def pdb2psf_CA(pdb_name_in, psf_name_out, bonds=True, angles=True):
----> 7     mol = Molecule(pdb_name_in)
      8     mol.filter("name CA")
     10     n = mol.numAtoms

File /env-torchmd/lib/python3.9/site-packages/moleculekit/molecule.py:299, in Molecule.__init__(self, filename, name, **kwargs)
    296 self.viewname = name
    298 if filename is not None:
--> 299     self.read(filename, **kwargs)

File /env-torchmd/lib/python3.9/site-packages/moleculekit/molecule.py:1147, in Molecule.read(self, filename, type, skip, frames, append, overwrite, keepaltloc, guess, guessNE, _logger, **kwargs)
   1145 for rr in readers:
   1146     try:
-> 1147         mol = rr(fname, frame=frame, topoloc=tmppdb, **kwargs)
   1148     except FormatError:
   1149         continue

File /env-torchmd/lib/python3.9/site-packages/moleculekit/readers.py:1100, in PDBread(filename, mode, frame, topoloc, validateElements, uniqueBonds)
   1098 if "element" in parsedtopo:
   1099     idx, newelem = pdbGuessElementByName(parsedtopo.element, parsedtopo.name)
-> 1100     parsedtopo.at[idx, "element"] = newelem
   1102 for field in topodtypes:
   1103     if (
   1104         field in parsedtopo
   1105         and topodtypes[field] == str
   1106         and parsedtopo[field].dtype == object
   1107     ):

File /env-torchmd/lib/python3.9/site-packages/pandas/core/indexing.py:2274, in _AtIndexer.__setitem__(self, key, value)
   2271     self.obj.loc[key] = value
   2272     return
-> 2274 return super().__setitem__(key, value)

File /env-torchmd/lib/python3.9/site-packages/pandas/core/indexing.py:2229, in _ScalarAccessIndexer.__setitem__(self, key, value)
   2226 if len(key) != self.ndim:
   2227     raise ValueError("Not enough indexers for scalar access (setting)!")
-> 2229 self.obj._set_value(*key, value=value, takeable=self._takeable)

File /env-torchmd/lib/python3.9/site-packages/pandas/core/frame.py:3869, in DataFrame._set_value(self, index, col, value, takeable)
   3867 else:
   3868     series = self._get_item_cache(col)
-> 3869     loc = self.index.get_loc(index)
   3871 # setitem_inplace will do validation that may raise TypeError
   3872 #  or ValueError
   3873 series._mgr.setitem_inplace(loc, value)

File /env-torchmd/lib/python3.9/site-packages/pandas/core/indexes/range.py:388, in RangeIndex.get_loc(self, key, method, tolerance)
    386         except ValueError as err:
    387             raise KeyError(key) from err
--> 388     self._check_indexing_error(key)
    389     raise KeyError(key)
    390 return super().get_loc(key, method=method, tolerance=tolerance)

File /env-torchmd/lib/python3.9/site-packages/pandas/core/indexes/base.py:5637, in Index._check_indexing_error(self, key)
   5633 def _check_indexing_error(self, key):
   5634     if not is_scalar(key):
   5635         # if key is not a scalar, directly raise an error (the code below
   5636         # would convert to numpy arrays and raise later any way) - GH29926
-> 5637         raise InvalidIndexError(key)

InvalidIndexError: []

stefdoerr · 2022-03-18T09:37:03Z

Can you try just the following please and see if it works?

from moleculekit.molecule import Molecule
mol = Molecule('data/chignolin_cln025.pdb')

pojeda · 2022-03-18T09:49:32Z

the output of those lines is:

---------------------------------------------------------------------------
InvalidIndexError                         Traceback (most recent call last)
Input In [2], in <cell line: 2>()
      1 from moleculekit.molecule import Molecule
----> 2 mol = Molecule('data/chignolin_cln025.pdb')

File /env-torchmd/lib/python3.9/site-packages/moleculekit/molecule.py:299, in Molecule.__init__(self, filename, name, **kwargs)
    296 self.viewname = name
    298 if filename is not None:
--> 299     self.read(filename, **kwargs)

File /env-torchmd/lib/python3.9/site-packages/moleculekit/molecule.py:1147, in Molecule.read(self, filename, type, skip, frames, append, overwrite, keepaltloc, guess, guessNE, _logger, **kwargs)
   1145 for rr in readers:
   1146     try:
-> 1147         mol = rr(fname, frame=frame, topoloc=tmppdb, **kwargs)
   1148     except FormatError:
   1149         continue

File /env-torchmd/lib/python3.9/site-packages/moleculekit/readers.py:1100, in PDBread(filename, mode, frame, topoloc, validateElements, uniqueBonds)
   1098 if "element" in parsedtopo:
   1099     idx, newelem = pdbGuessElementByName(parsedtopo.element, parsedtopo.name)
-> 1100     parsedtopo.at[idx, "element"] = newelem
   1102 for field in topodtypes:
   1103     if (
   1104         field in parsedtopo
   1105         and topodtypes[field] == str
   1106         and parsedtopo[field].dtype == object
   1107     ):

File /env-torchmd/lib/python3.9/site-packages/pandas/core/indexing.py:2274, in _AtIndexer.__setitem__(self, key, value)
   2271     self.obj.loc[key] = value
   2272     return
-> 2274 return super().__setitem__(key, value)

File /env-torchmd/lib/python3.9/site-packages/pandas/core/indexing.py:2229, in _ScalarAccessIndexer.__setitem__(self, key, value)
   2226 if len(key) != self.ndim:
   2227     raise ValueError("Not enough indexers for scalar access (setting)!")
-> 2229 self.obj._set_value(*key, value=value, takeable=self._takeable)

File /env-torchmd/lib/python3.9/site-packages/pandas/core/frame.py:3869, in DataFrame._set_value(self, index, col, value, takeable)
   3867 else:
   3868     series = self._get_item_cache(col)
-> 3869     loc = self.index.get_loc(index)
   3871 # setitem_inplace will do validation that may raise TypeError
   3872 #  or ValueError
   3873 series._mgr.setitem_inplace(loc, value)

File /env-torchmd/lib/python3.9/site-packages/pandas/core/indexes/range.py:388, in RangeIndex.get_loc(self, key, method, tolerance)
    386         except ValueError as err:
    387             raise KeyError(key) from err
--> 388     self._check_indexing_error(key)
    389     raise KeyError(key)
    390 return super().get_loc(key, method=method, tolerance=tolerance)

File /env-torchmd/lib/python3.9/site-packages/pandas/core/indexes/base.py:5637, in Index._check_indexing_error(self, key)
   5633 def _check_indexing_error(self, key):
   5634     if not is_scalar(key):
   5635         # if key is not a scalar, directly raise an error (the code below
   5636         # would convert to numpy arrays and raise later any way) - GH29926
-> 5637         raise InvalidIndexError(key)

InvalidIndexError: []

stefdoerr · 2022-03-18T09:59:00Z

Please paste here the results of these two commands

conda list moleculekit
conda list pandas

pojeda · 2022-03-18T10:03:03Z

I am not using conda as the center where I work don't support it. I use pip only.

stefdoerr · 2022-03-18T10:12:02Z

Yes ok that explains the issues. Dependency handling is very bad in pip and many packages don't even exist or are on different versions.
Is it not possible for you to install Miniconda in your home directory?

If not, then show me also:

pip show pandas

But I think you will run into many more issues down the line if you try to make it work with pip.

pojeda · 2022-03-18T10:21:56Z

Name: pandas
Version: 1.4.1
Summary: Powerful data structures for data analysis, time series, and statistics
Home-page: https://pandas.pydata.org
Author: The Pandas Development Team
Author-email: pandas-dev@python.org
License: BSD-3-Clause
Location: /env-torchmd/lib/python3.9/site-packages
Requires: python-dateutil, pytz, numpy
Required-by: seaborn, moleculekit

stefdoerr · 2022-03-18T10:36:50Z

That looks correct. And pip show moleculekit?

pojeda · 2022-03-18T10:40:04Z

Is there any issues with the license "unknown" setting?

Name: moleculekit
Version: 0.9.14
Summary: A molecule reading/writing and manipulation package.
Home-page: https://github.com/acellera/moleculekit/
Author: Acellera
Author-email: info@acellera.com
License: UNKNOWN
Location: /env-torchmd/lib/python3.9/site-packages
Requires: networkx, numpy, tqdm, pandas, scipy
Required-by:

stefdoerr · 2022-03-18T11:13:17Z

that's too old. try pip install moleculekit==1.1.8

pojeda · 2022-03-18T11:34:36Z

Hi, that command creates some issues with metadata, I used the following one and it worked:

pip install --upgrade --no-cache-dir --use-deprecated=legacy-resolver moleculekit==1.1.8

Thanks!

stefdoerr closed this as completed Mar 21, 2022

MaciejMajew mentioned this issue Mar 29, 2022

Error while running the tutorial walkthrough #24

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

issue with the tutorial Chignolin_Coarse-Grained_Tutorial #23

issue with the tutorial Chignolin_Coarse-Grained_Tutorial #23

pojeda commented Mar 11, 2022

MaciejMajew commented Mar 12, 2022

MaciejMajew commented Mar 12, 2022

pojeda commented Mar 15, 2022

MaciejMajew commented Mar 17, 2022

stefdoerr commented Mar 17, 2022

pojeda commented Mar 18, 2022

stefdoerr commented Mar 18, 2022

pojeda commented Mar 18, 2022 •

edited by stefdoerr

Loading

stefdoerr commented Mar 18, 2022

pojeda commented Mar 18, 2022

stefdoerr commented Mar 18, 2022

pojeda commented Mar 18, 2022

stefdoerr commented Mar 18, 2022

pojeda commented Mar 18, 2022

stefdoerr commented Mar 18, 2022

pojeda commented Mar 18, 2022

issue with the tutorial Chignolin_Coarse-Grained_Tutorial #23

issue with the tutorial Chignolin_Coarse-Grained_Tutorial #23

Comments

pojeda commented Mar 11, 2022

MaciejMajew commented Mar 12, 2022

MaciejMajew commented Mar 12, 2022

pojeda commented Mar 15, 2022

MaciejMajew commented Mar 17, 2022

stefdoerr commented Mar 17, 2022

pojeda commented Mar 18, 2022

stefdoerr commented Mar 18, 2022

pojeda commented Mar 18, 2022 • edited by stefdoerr Loading

stefdoerr commented Mar 18, 2022

pojeda commented Mar 18, 2022

stefdoerr commented Mar 18, 2022

pojeda commented Mar 18, 2022

stefdoerr commented Mar 18, 2022

pojeda commented Mar 18, 2022

stefdoerr commented Mar 18, 2022

pojeda commented Mar 18, 2022

pojeda commented Mar 18, 2022 •

edited by stefdoerr

Loading