Making things "pythonic" #17

iliakur · 2014-08-31T20:04:43Z

This is a biggish update from me (mostly addressing #10 and #11) and I would like to discuss a couple of things in it before you merge my changes into the master branch. Treat this as a work in progress rather than a finished product. Please do take advantage of commenting on specific lines (GitHub lets you do that, hurrah!). I'm looking forward to your feedback.
I've worked on the following:

The PUF file is now read from a pickled binary file, which is much faster than reading puf.csv with Pandas. What gets read in is already a dictionary in which every variable name is a key and its PUF column (an np.array) is the value. While generating the pickle file I saved the len() of the pandas DataFrame that came from puf.csv under the key 'PUF DIM'. Sameer's PUF() function then uses this value to set the variables we're missing to arrays of zeros of the right length (the same as the other PUF variables). This length is also used in a couple of other functions, and I'd like to talk about those in more detail; see my comments on (and in!) the source code below.
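To make the pickling idea concrete, here is a minimal sketch, not the code in this PR. It assumes the columns are already in memory as plain sequences (the PR itself starts from a pandas DataFrame), and the helper names `build_puf_dict` and `fill_missing`, as well as the variable names `var_a`, `var_b` and `var_c`, are hypothetical. Only the 'PUF DIM' key is taken from the description above.

```python
import io
import pickle

import numpy as np


def build_puf_dict(columns):
    """Turn {name: sequence} into {name: np.array} plus a 'PUF DIM' entry."""
    puf = {name: np.asarray(values) for name, values in columns.items()}
    lengths = {len(arr) for arr in puf.values()}
    if len(lengths) != 1:
        raise ValueError("all PUF columns must have the same length")
    puf['PUF DIM'] = lengths.pop()
    return puf


def fill_missing(puf, names):
    """Give every missing variable its own zero array of the stored length."""
    for name in names:
        if name not in puf:
            puf[name] = np.zeros((puf['PUF DIM'],))
    return puf


# Hypothetical two-column stand-in for the data in puf.csv.
columns = {'var_a': [10.0, 20.0, 30.0], 'var_b': [1.0, 2.0, 3.0]}

# Round-trip through pickle; an in-memory buffer stands in for the file on disk.
buffer = io.BytesIO()
pickle.dump(build_puf_dict(columns), buffer, protocol=pickle.HIGHEST_PROTOCOL)
buffer.seek(0)
puf = fill_missing(pickle.load(buffer), ['var_a', 'var_c'])
```

In this sketch each missing variable gets a fresh zero array, so later in-place writes to one variable cannot leak into another.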
Plan X and (optionally!) Plan Y can now be read in as JSON files. I've created a version of the Plan X JSON and will have to show it to Dan to finalize it. It is a copy of all the variables defined in Sameer's code and the script runs fine with it, so I presume I haven't broken anything.
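A rough sketch of what reading the plans could look like, assuming each plan is a flat JSON object and that a missing Plan Y is represented as `None`. The function name `load_plans` and the parameter names `param_a` and `param_b` are hypothetical and the values are made up; the actual Plan X file is not shown in this PR description.

```python
import json


def load_plans(plan_x_text, plan_y_text=None):
    """Parse Plan X and, if supplied, Plan Y; Plan Y is None when omitted."""
    plan_x = json.loads(plan_x_text)
    plan_y = json.loads(plan_y_text) if plan_y_text is not None else None
    return plan_x, plan_y


# Made-up parameters standing in for the real Plan X JSON.
plan_x_text = '{"param_a": 0.15, "param_b": [0, 100, 200]}'

# Plan Y is optional, so it can simply be left out.
plan_x, plan_y = load_plans(plan_x_text)
```

Callers can then check `plan_y is None` before applying anything that depends on the second plan.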
Some "pythony" things (based on documentation and this):
- all tabs have been replaced with spaces (1 tab char = 4 space chars)
- I've started removing leading underscores from variable names, but would like to finish that before we merge
- Python doesn't require one to preallocate memory for an array, so I removed some np.zeros() calls, but would like to chat more about that with @SameerSarkar
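On the preallocation point, a small illustration of the two styles, using made-up arrays (`wages` and `interest` are hypothetical names, not variables from the script). It is only meant to show why many np.zeros() calls become unnecessary once the arithmetic is vectorized; preallocation is still needed wherever an array is filled in by index.

```python
import numpy as np

# Made-up input columns of equal length.
wages = np.array([100.0, 250.0, 0.0])
interest = np.array([5.0, 0.0, 12.5])

# Preallocating style: create zeros first, then fill element by element.
total_loop = np.zeros((len(wages),))
for i in range(len(wages)):
    total_loop[i] = wages[i] + interest[i]

# Vectorized style: the expression allocates its own result array,
# so no np.zeros() call is needed beforehand.
total_vec = wages + interest
```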


iliakur · 2014-08-31T20:06:50Z

translation.py

-namesCap = [str.upper(n) if str.isalpha(n) else n for n in names]
+# This should be dealt with so that we don't keep referencing the same array
+# simply find all uses of this and figure out if they're called for (mostly not)
+DIMARRAY = np.zeros((139651,))


I know this is sketchy (basically every place that used to call np.zeros() now just references this one array), but it's here on purpose: to draw our attention to the cases where we've been using np.zeros() and make us revisit them critically.
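To spell out why sharing one array is risky, here is a small standalone sketch. It uses a hypothetical length of 4 rather than the real dimension, and the names `a`, `b`, `c` and `d` are illustrative only.

```python
import numpy as np

DIM = 4  # hypothetical length, just for illustration
DIMARRAY = np.zeros((DIM,))

# Two "separate" variables that are really the same object.
a = DIMARRAY
b = DIMARRAY
a[0] = 1.0     # in-place write is visible through b and DIMARRAY as well
shared = b[0]  # reads back the value written through a

# An explicit copy (or a fresh np.zeros call) breaks the link.
c = DIMARRAY.copy()
c[1] = 2.0     # DIMARRAY[1] is unaffected

# Rebinding to the result of an expression is also safe: it makes a new array.
d = a + 1
```

So the shared array only causes trouble where a variable is modified in place before being reassigned, and those are the uses worth checking one by one.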

iliakur · 2014-10-23T02:03:32Z

This is stale, closing.

SameerSarkar and others added 10 commits August 26, 2014 14:15

rough translation

692cb69

Merge pull request PSLmodels#9 from SameerSarkar/master

79394c7

rough translation

added option of null planY

136cb08

puf now dict read from pickle file, all dependent vars as keys

41fafd2

plan X vars now read from JSON

69ec5ae

added puf update code and tested all dict lookups, they work

2e86aed

Merge branch 'pickling' of https://github.com/Copper-Head/taxcalc int…

7d0f2bf

…o params-dict

Getting latest pickling code

261eda2

removed unnecessary np.zeros calls, replaced tabs with spaces.

615eb7e

added some inline comments

3795bcd

iliakur reviewed Aug 31, 2014

removed all var-initial underscores

bee1296

iliakur closed this Oct 23, 2014
