NWChem new attributes/metadata by amandadumi · Pull Request #1215 · cclib/cclib

amandadumi · 2023-06-22T04:49:15Z

This is to move forward on (and will replace) #1143 .

Changes:

This brings the changes from @jvalegre and myself, but rebased against main. (I wasn't able to figure out how to do this by pushing to the pull request, but can try again if preferred).
Implement parsing of the requested metadata objects in the NWChem parser.
Remove additional attributes and those that need further discussion.
Add a test for vibrations in NWChem 7.

Questions/To Do:
Is it okay to introduce data attributes that are not yet parsed? (See oniom_energy and nmr_anis as examples.)
Possible solutions:

1. Only add the attributes for metadata that is being parsed, and separate the other unparsed attributes to a draft PR.
2. ~~Merge as is since the unparsed attributes do not introduce a breaking change and parsing can be implemented in a future PR~~
3. ~~Keep open until parsing of all attributes is incorporated.~~

Notes:
Once the newly parsed metadata attributes are merged, the cjson writer additions for metadata (#1148) can be merged.

berquist · 2023-06-22T20:55:08Z

I wasn't able to figure out how to do this by pushing to the pull request, but can try again if preferred

I don't think it's so bad as long as he's the author of the commit, which he still is. "Add more commits by pushing to the nwchem_fixes branch on jvalegre/cclib." means it should work. What I do is gh pr checkout 1143 and it automatically sets up the remote and its tracking branch. I have gh set to use SSH instead of HTTPS for remotes, but I think an HTTPS remote will work too if you log in via gh auth ..., not sure though. Mine looks like

$ gh auth status
github.com
  ✓ Logged in to github.com as berquist (/home/eric/.config/gh/hosts.yml)
  ✓ Git operations for github.com configured to use ssh protocol.
  ✓ Token: gho_************************************
  ✓ Token scopes: gist, read:org, repo

Is it okay to introduce data attributes that are not yet parsed?

No, because it can make the PR harder to review if they're intermingled with other new attributes that are used. That holds even if we don't have the previous problem of multiple PRs adding the same things, used or otherwise. It also means we'll be able to isolate discussions for them better and most changes are isolated to the parser(s).

My vote is for 1, "Only add the attributes for metadata that is being parsed, and separate the other unparsed attributes to a draft PR".

berquist · 2023-06-24T21:21:15Z

            self.append_attribute("dispersionenergies", dispersion)
+
+        # type of dispersion
+        if line.strip().find('disp vdw 3') > -1:


We need a test for this because these lines aren't present at all in our examples. From dvb_dispersion_bp86_d3zero.out,

Dispersion Parameters
---------------------
DFT-D3 Model
s8 scale factor : 1.000000000000
sr6 scale factor : 1.683000000000
sr8 scale factor : 1.139000000000
vdW contrib : 1.000000000000
DFT-D3 Model
s8 scale factor : -0.014719930985
sr6 scale factor :

This is D3 (D3(zero), no damping), I haven't tried D3(BJ) in NWChem yet.

I see the disp vdw 3 in the input, but that is not repeated in the output from what I can tell. This might need to be changed in general. (Mostly just a note to myself so I don't forget.)

https://nwchemgit.github.io/Density-Functional-Theory-for-Molecules.html?h=disp#disp-empirical-long-range-contribution-vdw

I've added a file to test this for D3BJ with the dvb molecule, but it wouldn't be used through the typical dispersion tests on D3, but instead just to check metadata parsing. Is this overkill?

Sorry I missed this, I think it's ok since more people are doing D3(BJ) than D3(0).
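Because the `disp vdw 3` keyword does not appear to be repeated in the output, one option is to key the metadata off the "Dispersion Parameters" block instead. A minimal sketch, assuming the block is a `DFT-D3 Model` line followed by `name : value` lines as in the excerpt from dvb_dispersion_bp86_d3zero.out above. `parse_dispersion_block` is a hypothetical helper, not part of cclib, and how NWChem labels D3(BJ) in its output is not shown in this thread, so that distinction is left out.

```python
def parse_dispersion_block(lines):
    """Collect the model name and 'name : value' pairs from a dispersion block.

    Hypothetical helper for illustration only; the layout is assumed from
    the excerpt quoted in the review comment.
    """
    model = None
    params = {}
    for line in lines:
        stripped = line.strip()
        if stripped.endswith("Model"):
            # e.g. "DFT-D3 Model" -> "DFT-D3"
            model = stripped[: -len("Model")].strip()
        elif ":" in stripped:
            name, _, value = stripped.partition(":")
            try:
                params[name.strip()] = float(value)
            except ValueError:
                # Skip lines whose right-hand side is not a number.
                continue
    return model, params


example = [
    "DFT-D3 Model",
    "s8 scale factor : 1.000000000000",
    "sr6 scale factor : 1.683000000000",
    "sr8 scale factor : 1.139000000000",
]
model, params = parse_dispersion_block(example)
```

Reading from the output block rather than the input echo means the metadata still gets set when the input deck is not echoed.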

berquist · 2023-08-02T16:21:02Z

+                rotemp.append(float(split_line)[4])
+                line=next(inputfile)
+            self.set_attribute('rotconsts', roconst)
+            self.set_attribute('rottemp', rotemp)


What do we think about rotational temperature? Per #1093 (comment), it should definitely be a method in the future, but should we parse it too?

Hm, if they are available, having them immediately accessible without the extra work of recalculating through a method seems like a benefit to me, but maybe I am missing a reason this could be problematic?
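For context, the recalculation in question is small. A minimal sketch, assuming the rotational constants are in GHz and using Θ = hB/k_B; `rotational_temperature` is a hypothetical helper name, not an existing cclib method, and the input values are made up for illustration.

```python
PLANCK_J_S = 6.62607015e-34    # Planck constant in J s (exact SI value)
BOLTZMANN_J_K = 1.380649e-23   # Boltzmann constant in J/K (exact SI value)


def rotational_temperature(rotconst_ghz):
    """Return the rotational temperature in K for a rotational constant in GHz."""
    # Convert GHz to Hz, then apply theta = h * B / k_B.
    return PLANCK_J_S * rotconst_ghz * 1.0e9 / BOLTZMANN_J_K


# Hypothetical rotational constants (GHz), for illustration only.
rottemps = [rotational_temperature(b) for b in (1.0, 2.5, 10.0)]
```

Parsing the printed values and offering a method are not mutually exclusive: the method can serve as a fallback when the output does not report rotational temperatures.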

berquist · 2023-08-30T01:25:13Z

It's dumb but if you apply 9f7dea1 until we remove this it'll solve the CI problem.

berquist

I haven't investigated but it hangs for me on parsing the vibrational frequency output.

berquist · 2023-09-24T12:51:56Z

+            self.metadata['num_processors'] = line.split()[-1]
+        if "Memory information" in line:
+            self.skip_lines(inputfile,['d','b','heap','stack','global'])
+            self.metadata['memory'] = line.split()[-2:]


int, but we also want this in bytes. I don't know if Mbytes is actually megabytes. I guess it isn't, since the number of doubles should be the number of bytes.

berquist · 2023-09-30T00:08:23Z

+            self.metadata['num_processors'] = int(line.split()[-1])
+        if "Memory information" in line:
+            self.skip_lines(inputfile,['d','b','heap','stack','global'])
+            self.metadata['memory'] = int(line.split()[2:])*8


Suggested change

-            self.metadata['memory'] = int(line.split()[2:])*8
+            self.metadata['memory'] = int(line.split()[2])*8
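The difference matters because `line.split()[2:]` is a list, and calling `int()` on a list raises `TypeError`. A minimal sketch, assuming the line being parsed looks like the `total` line of NWChem's memory summary and that a double is 8 bytes. The sample line is an assumption for illustration, not taken from this PR, and `memory_bytes_from_line` is a hypothetical helper.

```python
def memory_bytes_from_line(line):
    """Convert the double count on a memory summary line to bytes."""
    tokens = line.split()
    # tokens[2] is the number of doubles; tokens[2:] would be a list,
    # which int() cannot convert.
    return int(tokens[2]) * 8


sample = "    total    =   52428797 doubles =    400.0 Mbytes"  # assumed format
memory = memory_bytes_from_line(sample)
```

Storing the value as a plain `int` number of bytes avoids having to interpret what "Mbytes" means in the output.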

amandadumi force-pushed the nwchem_fixes_pr branch from c5cf6ef to 480bffb Compare June 22, 2023 14:59

berquist self-requested a review June 22, 2023 20:31

berquist added parsers NWChem labels Jun 22, 2023

berquist added this to the v1.8.1 milestone Jun 22, 2023

berquist requested changes Jun 24, 2023

amandadumi marked this pull request as draft July 16, 2023 19:11

amandadumi force-pushed the nwchem_fixes_pr branch from 793445f to 9918621 Compare August 1, 2023 23:00

amandadumi changed the title ~~NWChem new metadata (parsed) and objects (unparsed)~~ NWChem new attributes/metadata Aug 2, 2023

amandadumi force-pushed the nwchem_fixes_pr branch from 89a3eb4 to d6570e4 Compare August 2, 2023 15:59

berquist reviewed Aug 2, 2023

amandadumi force-pushed the nwchem_fixes_pr branch from d6570e4 to 8ab3867 Compare August 16, 2023 03:13

amandadumi force-pushed the nwchem_fixes_pr branch from f13b3ff to 06917b9 Compare September 1, 2023 17:50

jvalegre and others added 14 commits September 22, 2023 17:06

1. NWChem fixes

56b2ed5

starting to move metadata to dictionary, additional parsing

b33a66a

parsing new metadata

e32dc6d

removing metadate attributes

691f066

incoporate upstream changes

c49fab1

relabel attributes not parsed by json yet

e6c8c70

remove unparsed attributes from data.py

c593923

fix rotconsts parse and test, some additional parsing issues

43d7d2c

skiplines

7eae12b

remove undecided attributes

0c658f3

incorrect comment

df959bc

formatting

b3352a2

D3 vs D3BJ metadata detection

adc39e0

remove extra next() call

cdaacac

amandadumi force-pushed the nwchem_fixes_pr branch from 4d7b370 to cdaacac Compare September 22, 2023 23:08

berquist self-assigned this Sep 23, 2023

berquist requested changes Sep 24, 2023

amandadumi added 2 commits September 29, 2023 15:11

ensure int dtypes, and D3 fix

c7c743f

memory in bytes from the reported doubles

a89bfce

berquist reviewed Sep 30, 2023

berquist modified the milestones: v1.8.1, v2.0 Dec 18, 2023

3 participants
