Population variance and ability to suppress computation of standard errors #48

tzoltak · 2019-03-14T00:13:55Z

Hi!

I added two functions: survey_var and survey_sd allowing computation of population variance and standard deviation (with svyvar underneath). I also added some basic tests of this functions to test_survey_statistics.r.
I patched all the survey_ functions, so it is possible to suppress computation of (any kind of) standard errors (resolving Calculating contingency tables without showing SE #45 ). And I also added drop=FALSE to one row in summarise.grouped_svy so it is able to carry on with such results.
I made survey_median don't add "_q50" suffix to names of all the variables it generates.
I made some assertion checks and error messages among survey_ functions a little more consistent.

Hope it will be somehow helpful.

gergness · 2019-03-14T14:30:28Z

Wow, awesome, thank you so much! I'll take a look in the next week or so!

codecov-io · 2019-03-14T14:34:38Z

Codecov Report

Merging #48 into master will increase coverage by 1.72%.
The diff coverage is 90.9%.

@@            Coverage Diff             @@
##           master      #48      +/-   ##
==========================================
+ Coverage   79.44%   81.17%   +1.72%     
==========================================
  Files          17       17              
  Lines         905      956      +51     
==========================================
+ Hits          719      776      +57     
+ Misses        186      180       -6

Impacted Files	Coverage Δ
R/summarise.r	`88.57% <100%> (ø)`	⬆️
R/survey_statistics_helpers.R	`83.59% <100%> (+1.97%)`	⬆️
R/survey_statistics.r	`90.37% <90%> (+3.91%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update caf5055...0d125b4. Read the comment docs.

tzoltak · 2019-03-14T22:34:20Z

I hope to implement joins (apart of full join and right join) in the future - they're reasonable if only there is no one-to-many relations between survey object data and data that is to be joined. So this is something that has to be checked before doing actual join and if such relations are found, it will throw an error. With inner, semi and anti joins subsetting of the survey design information must be also implemented but it won't be hard to do with all the functions that are already in the package.

gergness

Well, not quite one week, sorry about that!

This is great, but a few minor comments. Do you have time to fix them? I really appreciate it.

R/survey_statistics.r

tests/testthat/test_as_survey_twophase.r

R/survey_statistics.r

tzoltak · 2019-03-26T21:52:21Z

I can fix this issues tomorrow evening.

tzoltak · 2019-03-28T09:38:09Z

I didn't manage yesterday - I started to rewrite tests regarding survey_quantile() and survey_median a little (I checked how to get SE from objects returned by svy_quantile) and also I want to add tests to cover calls to functions in survey_statistics.r that result in errors thrown on assertion stage (missing x in case of non-grouped objects and factor/character x). However I'll be working on this today and perhaps tomorrow (if some other things don't let me spend enough time on this today).

Regarding CI for survey_var I thought about and changed my mind: if srvyr is a kind of front-end for survey, perhaps we shouldn't try to be wiser than survey is. If it is possible to obtain CI for variance in survey (in fact it is) and this CI is estimated using t (or normal with df=Inf) distribution - let it be. We may write a warning in description within help page of survey_var that such CI can be a very poor one (especially when analyzing subgroups) but nothing more.
What do you think about this?

gergness · 2019-03-28T17:33:04Z

No worries, thanks for the update!

Yeah that makes a lot of sense to me, I’ve generally tried to defer the statistical reasoning to the survey package, as it makes support easier. A warning in the documentation seems like a reasonable compromise to me.

gergness · 2019-04-11T01:32:48Z

Ugh, sorry that took me so long, it's been a month of sickness in my house.

Thank you so so much for your work!

tzoltak · 2019-04-11T07:18:29Z

Great! :)
Perhaps you should only tidy up version number, because in my pull request it was a rather provisional one.

gergness · 2019-04-11T22:35:52Z

Yep, will do a full CRAN release soon

tzoltak added 2 commits March 14, 2019 00:49

calculation of population variance

3fa4da7

NEWS updated

caf6e22

gergness requested changes Mar 26, 2019

View reviewed changes

tzoltak added 2 commits March 29, 2019 00:34

survey_var accepts vartype="ci" and df

6443894

tidying and adding some new tests

0d125b4

gergness approved these changes Apr 11, 2019

View reviewed changes

gergness merged commit 192e06a into gergness:master Apr 11, 2019

gergness mentioned this pull request Jun 23, 2019

Calculating contingency tables without showing SE #45

Closed

gergness mentioned this pull request Jan 24, 2021

survey_sd with vartype and level #112

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Population variance and ability to suppress computation of standard errors #48

Population variance and ability to suppress computation of standard errors #48

tzoltak commented Mar 14, 2019 •

edited

gergness commented Mar 14, 2019

codecov-io commented Mar 14, 2019 •

edited

tzoltak commented Mar 14, 2019

gergness left a comment

tzoltak commented Mar 26, 2019

tzoltak commented Mar 28, 2019

gergness commented Mar 28, 2019

gergness commented Apr 11, 2019

tzoltak commented Apr 11, 2019

gergness commented Apr 11, 2019

Population variance and ability to suppress computation of standard errors #48

Population variance and ability to suppress computation of standard errors #48

Conversation

tzoltak commented Mar 14, 2019 • edited

gergness commented Mar 14, 2019

codecov-io commented Mar 14, 2019 • edited

Codecov Report

tzoltak commented Mar 14, 2019

gergness left a comment

Choose a reason for hiding this comment

tzoltak commented Mar 26, 2019

tzoltak commented Mar 28, 2019

gergness commented Mar 28, 2019

gergness commented Apr 11, 2019

tzoltak commented Apr 11, 2019

gergness commented Apr 11, 2019

tzoltak commented Mar 14, 2019 •

edited

codecov-io commented Mar 14, 2019 •

edited