Python Materials Genomics (pymatgen) is a robust materials analysis code that defines core object representations for structures and molecules with support for many electronic structure codes. It is currently the core analysis code powering the Materials Project.


Pymatgen is the Python library that powers the Materials Project. This repo contains the public version of the library. These are some of the key features:

  1. Highly flexible classes for the representation of Element, Site and Structure objects.
  2. Powerful I/O capabilities to manipulate many VASP input and output files and the crystallographic information file (CIF) format. This includes generating Structure objects from VASP input and output.
  3. A comprehensive tool to generate and view compositional and grand canonical phase diagrams.

The public version of pymatgen is free (as in free beer) to download and to use. However, we would also like you to help us improve this library by making your own contributions. These contributions can be additional tools or modules you develop, or even simple things such as bug reports. Please contact the maintainer of this library to report bugs or to find out how to include your contributions via GitHub.


The following are required for proper functioning of the code.

  1. Python 2.7+ required. New default modules such as json are used, as well as new unittest features in Python 2.7.
  2. numpy - For array, matrix and other numerical manipulations. Used extensively by all core modules.
  3. scipy 0.9+ - For interpolation, physical constants and other functions. In particular, scipy.spatial.Delaunay is used for phase diagram construction.
  4. PyYAML - For parsing of important YAML configuration files.
  5. nose - For complete unit testing. This is NOT optional!
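One way to confirm that the required libraries above are present is a small check script. The following is a minimal sketch, not part of pymatgen: the function names `missing_modules` and `python_is_recent_enough` are hypothetical, and the only assumption about the libraries is their import names (PyYAML is imported as `yaml`). It does not verify the scipy 0.9+ version requirement.

```python
import importlib
import sys

# Import names for the required libraries listed above.
# PyYAML is imported as "yaml"; the others match their package names.
REQUIRED_MODULES = ["numpy", "scipy", "yaml", "nose"]


def missing_modules(names):
    """Return the subset of module names that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


def python_is_recent_enough(minimum=(2, 7)):
    """Return True if the running interpreter meets the minimum version."""
    return sys.version_info[:2] >= minimum


if __name__ == "__main__":
    if not python_is_recent_enough():
        print("Python 2.7+ is required")
    for name in missing_modules(REQUIRED_MODULES):
        print("Missing required library: " + name)
```

If the script prints nothing, the interpreter and all four libraries were found.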

Optional Python Libraries

Optional Python libraries that are required only if you need certain features.

  1. matplotlib : For plotting (e.g., Phase Diagrams).
  2. PyCifRW : For reading and writing Crystallographic Information Format (CIF) files.

Optional non-Python programs

Optional non-Python programs (because no good pythonic alternative exists at the moment) required only for certain features.

  1. Qhull : Needed for bond length analysis. The executables qconvex and qvoronoi must be in the path.
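To check whether the Qhull executables are reachable, a short path search is enough. This is a minimal sketch under stated assumptions: `find_executable` is a hypothetical helper written here for illustration, not a pymatgen function, and it looks for the bare names `qconvex` and `qvoronoi` (on Windows the names would need an `.exe` suffix).

```python
import os


def find_executable(name, path=None):
    """Return the full path of an executable found on PATH, or None."""
    if path is None:
        path = os.environ.get("PATH", "")
    for directory in path.split(os.pathsep):
        if not directory:
            continue
        candidate = os.path.join(directory, name)
        # Accept only regular files that the current user may execute.
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            return candidate
    return None


if __name__ == "__main__":
    for exe in ("qconvex", "qvoronoi"):
        location = find_executable(exe)
        print(exe + ": " + (location if location else "not found on PATH"))
```

The optional `path` argument exists only so the search can be pointed at a specific directory list instead of the environment's PATH.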

Basic Setup

  1. Clone the repo.
  2. Install the necessary python libraries.
  3. (Recommended) Add pymatgen to your PYTHONPATH.
  4. (Recommended for developers) Copy hooks from the example-hooks directory into the .git/hooks/ directory in your local repo.

With the first two steps, you should be able to use most of the pymatgen code. I recommend that you start by reading some of the unit tests in the tests subdirectory for each package. The unit tests demonstrate the expected behavior and functionality of the code.
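The two recommended setup steps (3 and 4) can also be scripted. The sketch below is an illustration only: `add_to_sys_path` and `copy_hooks` are hypothetical helpers, and it assumes that the repo root is the directory containing the pymatgen package, that example-hooks sits directly under the repo root, and that .git/hooks already exists (as it does in a normal clone). Note that modifying `sys.path` affects only the running process, whereas setting PYTHONPATH in your shell is persistent.

```python
import os
import shutil
import sys


def add_to_sys_path(repo_root):
    """Make the repo importable for the current process only."""
    repo_root = os.path.abspath(repo_root)
    if repo_root not in sys.path:
        sys.path.insert(0, repo_root)
    return repo_root


def copy_hooks(repo_root):
    """Copy every file in example-hooks into .git/hooks; return the names copied."""
    source = os.path.join(repo_root, "example-hooks")
    target = os.path.join(repo_root, ".git", "hooks")
    copied = []
    for name in sorted(os.listdir(source)):
        src = os.path.join(source, name)
        if os.path.isfile(src):
            # shutil.copy also copies permission bits, so executable hooks stay executable.
            shutil.copy(src, os.path.join(target, name))
            copied.append(name)
    return copied
```

For example, calling `copy_hooks(".")` from the root of your local clone would copy the hooks in place.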

However, some extra functionality does require additional setup, as outlined below.

Generating POTCARs

For the code to generate POTCAR files, it needs to know where the VASP pseudopotential files are. We are not allowed to distribute these under the VASP license. The good news is that we have included a setup script to help you along.

  1. cd to the root directory of the repo where a file called is present.
  2. Run the file, which will generate a resources directory in a location of your choosing. Please choose a location outside of the repo itself. The script will also write a pymatgen.cfg file in the pymatgen subdir.

Basic usage

Some example scripts have been provided in the scripts directory. In general, most file format conversions, manipulations and I/O can be done with a few quick lines of code. For example, to read a POSCAR and write a CIF:

from import Poscar
from import CifWriter

p = Poscar('POSCAR')
w = CifWriter(p.struct)

For more examples, please take a look at the wiki.