Join GitHub today
GitHub is home to over 20 million developers working together to host and review code, manage projects, and build software together.
ssa_national set is not gender balanced #9
Comments
|
Thanks for pointing this out, @bmschmidt. I'll have to take it into account. |
lmullen
added the
bug
label
Jun 25, 2014
|
First stab at a solution with justification for the reasoning: http://rpubs.com/lmullen/gender-imbalance-ssa |
lmullen
added a commit
that referenced
this issue
Jul 22, 2014
|
|
lmullen |
a210ee1
|
|
Fixed on |
lmullen
closed this
Jul 22, 2014
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
bmschmidt commentedJun 25, 2014
method="ssa"could probably use some adjustment to compensate for something I noticed in the Social Security dataset: before about 1918, it's two-thirds women.I assume this has to do with who was eligible for benefits when the program was created in the late 30s: either the men born around 1900 are dead, or more likely they're not eligible for survivor benefits for spouses or something.
All the ratios for years around 1900 from this method are distorting the female % of the name: for example, in 1901
merlehas 91 women and 52 men counted, but since 69% of the sample is female that year, that male number should be 2.2x higher: the right prediction that it's male, not female.An illustrative plot: