
test Results against target values #3

Closed
wants to merge 3 commits
Conversation

sbenthall
Owner

This PR addresses llorracc#7 by providing a test script that compares the data in the Results/ files with target values.

There are a few tricky things about this PR:

  • Only one set of target values, PYbetaPointIndNetWorthResults, is from the original cstwMPC paper. The other target values are currently taken from the data files in this repository as of commit 3aefa00, generated using HARK 0.11.0.
  • Some of those Results do not compare favorably with the original paper's. The third quintile target is 7.285, but commit 3aefa00 gets 5.241, about 30% off the original target, whereas @llorracc has set the criterion as 'not much more than 10%'. However, this change in results was accepted in an earlier review by @llorracc and @mnwhite, so it's not clear what to do here.
  • Because of the design of the original cstwMPC code, which involves a lot of non-standard code execution (using exec()) and file output (saving .txt files with custom formatting), I designed this script to work with the files written to the Results/ directory. This means it doesn't operate like a normal Python unit test of some part of the code; rather, the test must be run by hand to verify results.
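The tolerance check the script performs can be sketched as a small helper. The function name and dictionary layout below are illustrative, not the script's actual interface:

```python
# Illustrative sketch of the tolerance check: flag any value whose
# relative deviation from its target exceeds the allowed fraction.
# (Names and data layout here are hypothetical, not the real script's.)

def compare_results(actual, targets, tolerance=0.10):
    """Return {key: relative_error} for entries outside `tolerance`."""
    failures = {}
    for key, target in targets.items():
        rel_err = abs(actual[key] - target) / abs(target)
        if rel_err > tolerance:
            failures[key] = rel_err
    return failures

# The third-quintile discrepancy discussed above: relative error is
# roughly 0.28, well above the 10% criterion.
print(compare_results({"third_quintile": 5.241},
                      {"third_quintile": 7.285}))
```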

@sbenthall sbenthall closed this Jan 24, 2023
@llorracc

Seb,

Great, I'm very glad to see this!

  • Some of those Results do not compare favorably with the original paper's. The third quintile target is 7.285, but commit 3aefa00 gets 5.241, about 30% off the original target, whereas @llorracc has set the criterion as 'not much more than 10%'. However, this change in results was accepted in an earlier review by @llorracc and @mnwhite, so it's not clear what to do here.

The crucial statistics are those for the MPC, which are all within reasonable tolerance of the cstwMPC paper's numbers. Probably not worth the archaeology to run down why the third quintile is off so much -- except insofar as we want to be sure that these numbers are stable in the sense that if we increase, say, the number of consumers being simulated, they don't change much.

  • Because of the design of the original cstwMPC code, which involves a lot of non-standard code execution (using exec()) and file output (saving .txt files with custom formatting), I designed this script to work with the files written to the Results/ directory. This means it doesn't operate like a normal Python unit test of some part of the code; rather, the test must be run by hand to verify results.

I'd really like to get this into a form that can be run automatically whenever we update the development branch of the HARK toolkit. Whether that requires something in the form of a unit test, I don't know. My goal is to choose a small number of REMARKs that are "unpinned" because they give a thorough workout to the substantive, quantitative results of the toolkit; any code merge that changes those substantive results needs to be closely scrutinized to understand why.

@sbenthall
Owner Author

Point of order -- this PR is a duplicate of econ-ark#9.
At first I made the mistake of opening this PR on my personal fork.

I'll respond to these comments in appropriate places in the econ-ark repository.
