GeomeTRIC Interface #1813

zachglick · 2020-02-17T21:37:47Z

Description

Allows for the use of the GeomeTRIC optimizer within a Psi4 input. The desired optimization engine, either geometric or optking (default), can now specified with an engine argument to the optimize() function. In addition, a dictionary of GeomeTRIC-specific keywords and options (like constraints) may be passed to the optimizer. The test_h2o_constrained pytest demonstrates how this is done.

e = optimize(..., engine=`geometric`, optimizer_keywords={...})

Output is consistent with Psi4's default geometry optimization:

Example result

>>> grep "~" output.dat


  ==> GeomeTRIC Optimizer <==                                                                   ~
  Psi4 convergence criteria QCHEM  not recognized by GeomeTRIC, switching to GAU_TIGHT          ~
  Measures of convergence in internal coordinates in au.                                        ~
  Criteria marked as inactive (o), active & met (*), and active & unmet ( ).                    ~
  --------------------------------------------------------------------------------------------- ~
   Step     Total Energy     Delta E     MAX Force     RMS Force      MAX Disp      RMS Disp    ~
  --------------------------------------------------------------------------------------------- ~
    Convergence Criteria    1.00e-06      1.50e-05      1.00e-05      6.00e-05      4.00e-05    ~
  --------------------------------------------------------------------------------------------- ~
      0  -7.64427364e+01    --------      5.01e-02      4.03e-02      --------      --------    ~
      1  -7.64446505e+01   -1.91e-03      2.68e-03      1.95e-03      3.06e-02      2.16e-02    ~
      2  -7.64446681e+01   -1.77e-05      5.27e-04      4.17e-04      4.22e-03      3.98e-03    ~
      3  -7.64446684e+01   -3.06e-07 *    2.27e-05      2.03e-05      4.11e-04      2.93e-04    ~
      4  -7.64446684e+01    6.91e-10 *    3.28e-06 *    2.74e-06 *    1.78e-05 *    1.49e-05 *  ~
  Optimization converged!                                                                       ~

Todos

Checklist

Tests added for any new features
All or relevant fraction of full tests run

Status

Ready for review
Ready for merge

psi4/driver/driver.py

dgasmith · 2020-02-18T15:53:19Z

You can pull geometric apart like so: https://github.com/leeping/geomeTRIC/blob/master/geometric/tests/test_batch_opt.py

This is a bit complex compared to what you wish to do, but it may give a better product.

Here is everything unwound:

import qcengine as qcng
import qcelemental as qcel
import geometric
import pkg_resources
import logging.config
import sys

mol_uc2 = qcel.models.Molecule.from_data(
    """
O 0 0 0
H 0 0 1
H 0 1 0
"""
)

input_data = {
    "keywords": {
        "convergence_set": "GAU_LOOSE",
        "coordsys": "tric",
        "maxiter": 25,
        "enforce": 0.1,
        "constraints": {
            "set": [
                {"type": "distance", "indices": [0, 1], "value": 1},
            ]
        },
#        "program": "psi4",
        "program": "mopac",
    },
    "input_specification": {
        "driver": "gradient",
        "model": {"method": "pm6-d3"},
#        "model": {"method": "b3lyp-d3", "basis": "sto-3g"},
    },
    "initial_molecule": mol_uc2.dict(),
}



# Set a temporary logger to capture output
log_stream = geometric.nifty.RawStreamHandler(stream=sys.stdout)
#log_stream = geometric.nifty.RawStreamHandler(stream=StringIO())
logger = geometric.nifty.logger
logger.addHandler(log_stream)


# Parse JSON
input_opts = geometric.run_json.parse_input_json_dict(input_data)
M, engine = geometric.optimize.get_molecule_engine(**input_opts)

# Handle constraints
constraints_dict = input_opts.get('constraints', {})
constraints_string = geometric.run_json.make_constraints_string(constraints_dict)
Cons, CVals = None, None
if constraints_string:
    if 'scan' in constraints_dict:
        raise ValueError("No scan!")
    Cons, CVals = geometric.optimize.ParseConstraints(M, constraints_string)


# Set up the internal coordinate system
coordsys = input_opts.get('coordsys', 'tric')
CoordSysDict = {
    'cart': (geometric.internal.CartesianCoordinates, False, False),
    'prim': (geometric.internal.PrimitiveInternalCoordinates, True, False),
    'dlc': (geometric.internal.DelocalizedInternalCoordinates, True, False),
    'hdlc': (geometric.internal.DelocalizedInternalCoordinates, False, True),
    'tric': (geometric.internal.DelocalizedInternalCoordinates, False, False)
}

# Build internal coordinates
CoordClass, connect, addcart = CoordSysDict[coordsys.lower()]
IC = CoordClass(
    M,
    build=True,
    connect=connect,
    addcart=addcart,
    constraints=Cons,
    cvals=CVals[0] if CVals is not None else None)



# Get initial coordinates in bohr
coords = M.xyzs[0].flatten() * geometric.nifty.ang2bohr

# Setup an optimizer object
params = geometric.optimize.OptParams(**input_opts)
optimizer = geometric.optimize.Optimizer(coords, M, IC, engine, None, params)

# Print
IC.printConstraints(coords, thre=-1)

def compute(coords, opt):
    mol_dict = mol_uc2.dict()
    mol_dict['geometry'] = coords

    inpmodel = {
        "molecule": mol_dict,
        "driver": "gradient",
        "model": {"method": "pm6"}
    }
    ret = qcng.compute(inpmodel, "mopac")
    opt.E = ret.properties.return_energy
    opt.gradx = ret.return_result
    return ret


optimizer.calcEnergyForce()
optimizer.prepareFirstStep()
logger.info("[AU]: e=%.5f bl=%.5f,%.5f g=%.4f" % (
        optimizer.E, optimizer.X[0],optimizer.X[3], optimizer.gradx[0]))

while True:
    if optimizer.state in [geometric.optimize.OPT_STATE.CONVERGED, geometric.optimize.OPT_STATE.FAILED]:
        logger.info("Optmization convereged!")
        break

    optimizer.step()
    optimizer.calcEnergyForce()
    optimizer.evaluateStep()
    logger.info("[AU]: e=%.5f bl=%.5f,%.5f g=%.4f" % (
            optimizer.E, optimizer.X[0],optimizer.X[3], optimizer.gradx[0]))

You may want to build another Engine.

zachglick · 2020-02-19T03:57:42Z

You can pull geometric apart like so: https://github.com/leeping/geomeTRIC/blob/master/geometric/tests/test_batch_opt.py

This is a bit complex compared to what you wish to do, but it may give a better product.

Here is everything unwound:

import qcengine as qcng
import qcelemental as qcel
import geometric
import pkg_resources
import logging.config
import sys

mol_uc2 = qcel.models.Molecule.from_data(
    """
O 0 0 0
H 0 0 1
H 0 1 0
"""
)

input_data = {
    "keywords": {
        "convergence_set": "GAU_LOOSE",
        "coordsys": "tric",
        "maxiter": 25,
        "enforce": 0.1,
        "constraints": {
            "set": [
                {"type": "distance", "indices": [0, 1], "value": 1},
            ]
        },
#        "program": "psi4",
        "program": "mopac",
    },
    "input_specification": {
        "driver": "gradient",
        "model": {"method": "pm6-d3"},
#        "model": {"method": "b3lyp-d3", "basis": "sto-3g"},
    },
    "initial_molecule": mol_uc2.dict(),
}



# Set a temporary logger to capture output
log_stream = geometric.nifty.RawStreamHandler(stream=sys.stdout)
#log_stream = geometric.nifty.RawStreamHandler(stream=StringIO())
logger = geometric.nifty.logger
logger.addHandler(log_stream)


# Parse JSON
input_opts = geometric.run_json.parse_input_json_dict(input_data)
M, engine = geometric.optimize.get_molecule_engine(**input_opts)

# Handle constraints
constraints_dict = input_opts.get('constraints', {})
constraints_string = geometric.run_json.make_constraints_string(constraints_dict)
Cons, CVals = None, None
if constraints_string:
    if 'scan' in constraints_dict:
        raise ValueError("No scan!")
    Cons, CVals = geometric.optimize.ParseConstraints(M, constraints_string)


# Set up the internal coordinate system
coordsys = input_opts.get('coordsys', 'tric')
CoordSysDict = {
    'cart': (geometric.internal.CartesianCoordinates, False, False),
    'prim': (geometric.internal.PrimitiveInternalCoordinates, True, False),
    'dlc': (geometric.internal.DelocalizedInternalCoordinates, True, False),
    'hdlc': (geometric.internal.DelocalizedInternalCoordinates, False, True),
    'tric': (geometric.internal.DelocalizedInternalCoordinates, False, False)
}

# Build internal coordinates
CoordClass, connect, addcart = CoordSysDict[coordsys.lower()]
IC = CoordClass(
    M,
    build=True,
    connect=connect,
    addcart=addcart,
    constraints=Cons,
    cvals=CVals[0] if CVals is not None else None)



# Get initial coordinates in bohr
coords = M.xyzs[0].flatten() * geometric.nifty.ang2bohr

# Setup an optimizer object
params = geometric.optimize.OptParams(**input_opts)
optimizer = geometric.optimize.Optimizer(coords, M, IC, engine, None, params)

# Print
IC.printConstraints(coords, thre=-1)

def compute(coords, opt):
    mol_dict = mol_uc2.dict()
    mol_dict['geometry'] = coords

    inpmodel = {
        "molecule": mol_dict,
        "driver": "gradient",
        "model": {"method": "pm6"}
    }
    ret = qcng.compute(inpmodel, "mopac")
    opt.E = ret.properties.return_energy
    opt.gradx = ret.return_result
    return ret


optimizer.calcEnergyForce()
optimizer.prepareFirstStep()
logger.info("[AU]: e=%.5f bl=%.5f,%.5f g=%.4f" % (
        optimizer.E, optimizer.X[0],optimizer.X[3], optimizer.gradx[0]))

while True:
    if optimizer.state in [geometric.optimize.OPT_STATE.CONVERGED, geometric.optimize.OPT_STATE.FAILED]:
        logger.info("Optmization convereged!")
        break

    optimizer.step()
    optimizer.calcEnergyForce()
    optimizer.evaluateStep()
    logger.info("[AU]: e=%.5f bl=%.5f,%.5f g=%.4f" % (
            optimizer.E, optimizer.X[0],optimizer.X[3], optimizer.gradx[0]))

You may want to build another Engine.

This is super helpful, thanks!

lgtm-com · 2020-02-19T04:10:34Z

This pull request introduces 2 alerts when merging 1bf69ab into 3121918 - view on LGTM.com

new alerts:

2 for Unused local variable

tests/pytests/test_geometric.py

psi4/driver/driver.py

PeterKraus

This looks great already and I'm looking forward to trying it out. I have a couple of minor comments, though. Also, can GeomeTRIC do TS searches? If yes, perhaps a test would be good.

psi4/driver/driver.py

zachglick · 2020-04-05T03:28:15Z

This looks great already and I'm looking forward to trying it out. I have a couple of minor comments, though. Also, can GeomeTRIC do TS searches? If yes, perhaps a test would be good.

Thanks Peter, your comments are very helpful! I don't think transition state searches are implemented yet, although I see that there is a PR in the GeomeTRIC repo:
leeping/geomeTRIC#107

loriab

looks great!

doc/sphinxman/source/optking.rst

psi4/driver/driver.py

tests/pytests/test_geometric.py

loriab · 2020-04-07T18:04:31Z

tests/pytests/test_geometric.py

+    pytest.param({'name': 'hf', 'options': {'scf_type': 'pk'}, 'ref_ene' : -76.02082389228, 'ref_nuc': 9.26528625744628}, id='rhf(pk)'),
+    pytest.param({'name': 'mp2', 'options': {'mp2_type': 'df'}, 'ref_ene' : -76.22711819393223, 'ref_nuc': 9.09137805747361}, id='mp2(df)'),
+    pytest.param({'name': 'mp2', 'options': {'mp2_type': 'conv'}, 'ref_ene' : -76.2271678506303, 'ref_nuc': 9.091178486990861}, id='mp2(conv)'),
+    pytest.param({'name': 'b3lyp', 'options': {'scf_type': 'df'}, 'ref_ene' : -76.41632755714534, 'ref_nuc': 9.04535641436914}, id='b3lyp'),


How (step & opt convergence conditions) did you get the ref E and nre? It can be difficult to make opt tests checks that are robust to the optimizer taking an extra step (either b/c of platform or noise or algorithm changes). It can be good to crank up opt crit to verytight and all the e/d/r_convergences to 10 to gather the ref values. Then, at least when the test fails, runtime opt convergence and compare_values atol are rationally adjustable.

I got the reference energies exactly as you described: e_convergence : 10, d_convergence : 8, g_convergence : GAU_TIGHT. I wouldn't have any problem with relaxing the comparison criteria, since I imagine any future error that would cause a test to fail will do so in a major way.

Sounds workable. It can be good to get the ref nre from a verytight opt convergence, then run at plain tight opt convergence so that an extra opt step moves closer to ref value rather than away from it. But existing will be fine for now.

loriab · 2020-04-07T18:10:47Z

psi4/driver/driver.py

+
+    # run gradient at optimized geometry to get a wfn
+    if return_wfn:
+        g, wfn = gradient(name, return_wfn=True, **kwargs)


I got nervous about different P::e.globals or other settings depending on whether return_wfn = T/F. Even if it's fine now, may change btwn psithon/psiapi, if qcng is involved whether extras['psiapi'] = T/F, and with ddd or not. I don't particularly want to waste cycles by always running the extra gradient calc, but may be worth it to avert routing bugs.

Are you saying we should run that gradient calculation regardless of the value of return_wfn? I think that would be a pretty reasonable decision.

I wouldn't be too worried about the extra cycle. In my limited experience, GeomeTRIC usually requires many more optimization cycles than Optking, so if a user has made the decision to use GeomeTRIC, a single additional cycle would not be that much of a problem.

Good point on number of iterations. In that case, yes, go for final gradient() call under all circumstances.

Even better, I did away with the final gradient call. I can stick the wfn to the GeomeTRIC Engine object during the optimization, so there's no need to make any additional calls after the optimization loop.

loriab · 2020-04-08T16:08:08Z

tests/pytests/test_geometric.py

+    pytest.param({'name': 'hf', 'options': {'scf_type': 'pk'}, 'ref_ene' : -76.02082389228, 'ref_nuc': 9.26528625744628}, id='rhf(pk)'),
+    pytest.param({'name': 'mp2', 'options': {'mp2_type': 'df'}, 'ref_ene' : -76.22711819393223, 'ref_nuc': 9.09137805747361}, id='mp2(df)'),
+    pytest.param({'name': 'mp2', 'options': {'mp2_type': 'conv'}, 'ref_ene' : -76.2271678506303, 'ref_nuc': 9.091178486990861}, id='mp2(conv)'),
+    pytest.param({'name': 'b3lyp', 'options': {'scf_type': 'df'}, 'ref_ene' : -76.41632755714534, 'ref_nuc': 9.04535641436914}, id='b3lyp'),


Sounds workable. It can be good to get the ref nre from a verytight opt convergence, then run at plain tight opt convergence so that an extra opt step moves closer to ref value rather than away from it. But existing will be fine for now.

PeterKraus

Thanks for taking the suggestions aboard. LGTM.

dgasmith

FYI @leeping
Thanks, this looks great and will help a lot of requests for constrained op.

dgasmith · 2020-04-15T11:16:19Z

doc/sphinxman/source/optking.rst

+:py:func:`~psi4.optimize`. The optimization will respect the keywords |optking__g_convergence|
+and |optking__geom_maxiter|. Any other GeomeTRIC-specific options (including constraints)
+may be specified with the ``geometric_opts`` argument to :py:func:`~psi4.optimize`.
+


I think we need a few constrained optimization examples here.

doc/sphinxman/source/bibliography.rst

andysim · 2020-04-15T13:56:24Z

psi4/driver/driver.py

+    from qcelemental.util import which_import
+
+    if not which_import('geometric', return_bool=True):
+        raise ModuleNotFoundError('Python module geometric not found. Solve by installing it: `conda install -c conda-forge geometric` or `pip install geometric`')


Excellent! This kind of error message really helps

psi4/driver/driver.py

leeping · 2020-04-17T05:35:46Z

Hi there, thanks a lot for including me on this. :)

Transition state optimization is implemented, but we have not tested it extensively against other codes. It does work quite well for in-house applications containing 50+ atoms.

I'm very interested to see how you run these optimizations directly in Psi4. It should be a lot more efficient than calling Psi4 repeatedly on the command line.

Also happy to provide examples of constrained optimization. Let me know if you need any.

susilehtola · 2020-04-17T07:14:30Z

Is there a performance benefit over running the program in the command line? Nuclear forces / hessians i/o is inconsequential compared to the quantum chemistry part. Any savings would come from reusing checkpoint information for the Fock / density matrices... right?

leeping · 2020-04-17T08:01:59Z

I'm just referring to the overhead associated with setup & teardown of the calculation. It could slow things down if the calculations are fast (which might be the case with semiempirical methods or minimal basis sets).

dgasmith · 2020-04-17T12:46:22Z

It can a bit, Psi's startup time is ~0.4 seconds or so with all of Python loading in. In general QC will dwarf this time so it isn't much of an issue. Your right though with XTB and DFTB it gets more interesting.

Co-Authored-By: Andy Simmonett <andy.simmonett@gmail.com>

…ometric

zachglick added 3 commits February 16, 2020 23:02

minimal working GeomeTRIC

5b3e589

add keywords

b6be726

printing

5e9c2e7

loriab reviewed Feb 17, 2020

View reviewed changes

psi4/driver/driver.py Outdated Show resolved Hide resolved

loriab reviewed Feb 17, 2020

View reviewed changes

psi4/driver/driver.py Outdated Show resolved Hide resolved

psi4/driver/driver.py Outdated Show resolved Hide resolved

psi4/driver/driver.py Outdated Show resolved Hide resolved

manual optimization loop

1bf69ab

zachglick added 2 commits February 18, 2020 23:31

lgtm

6d7e594

pytest added

1558442

dgasmith reviewed Feb 27, 2020

View reviewed changes

tests/pytests/test_geometric.py Outdated Show resolved Hide resolved

psi4/driver/driver.py Outdated Show resolved Hide resolved

psi4/driver/driver.py Outdated Show resolved Hide resolved

loriab added this to the Psi4 1.4 milestone Feb 29, 2020

custom geometric engine and handling of geometric options

0c4213e

PeterKraus requested changes Apr 4, 2020

View reviewed changes

peter kraus suggestions and return_history enabled

9660993

update docs

be6f7b9

zachglick changed the title ~~GeomeTRIC optimizer in native Psi4 input~~ GeomeTRIC Interface Apr 6, 2020

ImportError handling

78f286d

loriab reviewed Apr 7, 2020

View reviewed changes

loriab added enhancement optking labels Apr 7, 2020

LAB suggestions

b985e72

zachglick force-pushed the geometric branch from 8e8e921 to b985e72 Compare April 8, 2020 00:47

remove post-optimization gradient call

c5868f5

loriab approved these changes Apr 8, 2020

View reviewed changes

PeterKraus approved these changes Apr 9, 2020

View reviewed changes

zachglick requested a review from dgasmith April 10, 2020 19:23

dgasmith approved these changes Apr 15, 2020

View reviewed changes

dgasmith reviewed Apr 15, 2020

View reviewed changes

andysim reviewed Apr 15, 2020

View reviewed changes

doc/sphinxman/source/bibliography.rst Outdated Show resolved Hide resolved

andysim reviewed Apr 15, 2020

View reviewed changes

psi4/driver/driver.py Outdated Show resolved Hide resolved

andysim reviewed Apr 15, 2020

View reviewed changes

psi4/driver/driver.py Show resolved Hide resolved

zachglick and others added 4 commits April 21, 2020 20:40

constrained opt examples in docs

9c9eb7a

Fix citation

9171466

Co-Authored-By: Andy Simmonett <andy.simmonett@gmail.com>

geometric_opts renamed optimizer_keywords, optimizer_keywords sanitized

9ebe494

Merge branch 'geometric' of https://github.com/zachglick/psi4 into ge…

3664ca0

…ometric

loriab merged commit 0ec2032 into psi4:master Apr 22, 2020

jeffschriber mentioned this pull request Jul 1, 2021

Psi4 v1.4 Release Notes #1562

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GeomeTRIC Interface #1813

GeomeTRIC Interface #1813

zachglick commented Feb 17, 2020 •

edited

Loading

dgasmith commented Feb 18, 2020

zachglick commented Feb 19, 2020

lgtm-com bot commented Feb 19, 2020

PeterKraus left a comment

zachglick commented Apr 5, 2020

loriab left a comment

loriab Apr 7, 2020

zachglick Apr 8, 2020

loriab Apr 8, 2020

loriab Apr 7, 2020

zachglick Apr 8, 2020

loriab Apr 8, 2020

zachglick Apr 8, 2020

loriab Apr 8, 2020

PeterKraus left a comment •

edited

Loading

dgasmith left a comment

dgasmith Apr 15, 2020

zachglick Apr 22, 2020

andysim Apr 15, 2020

leeping commented Apr 17, 2020

susilehtola commented Apr 17, 2020

leeping commented Apr 17, 2020

dgasmith commented Apr 17, 2020

GeomeTRIC Interface #1813

GeomeTRIC Interface #1813

Conversation

zachglick commented Feb 17, 2020 • edited Loading

Description

Todos

Checklist

Status

dgasmith commented Feb 18, 2020

zachglick commented Feb 19, 2020

lgtm-com bot commented Feb 19, 2020

PeterKraus left a comment

Choose a reason for hiding this comment

zachglick commented Apr 5, 2020

loriab left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

PeterKraus left a comment • edited Loading

Choose a reason for hiding this comment

dgasmith left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

leeping commented Apr 17, 2020

susilehtola commented Apr 17, 2020

leeping commented Apr 17, 2020

dgasmith commented Apr 17, 2020

zachglick commented Feb 17, 2020 •

edited

Loading

PeterKraus left a comment •

edited

Loading