Weighted frequencies #136

andresimi · 2020-09-13T12:22:00Z

Hi, is it possible to apply a weight to have weighted frequencies in modelsummary?
Or to work with survey data from survey package?
thanx

vincentarelbundock · 2020-09-13T16:00:00Z

If you give me an example with real data that I can replicate, and possibly the survey package command you want to emulate, then I will almost certainly find a solution for you.

…

On Sun, Sep 13, 2020, at 08:22, andresimi wrote: Hi, is it possible to apply a weight to have weighted frequencies in modelsummary? Or to work with survey data from survey package? thanx — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#136>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAHQ7MLREA5R5ROVHBL4NWTSFS2PLANCNFSM4RKUGR7Q>.

-- Vincent Arel-Bundock Professeur agrégé / Associate professor http://arelbundock.com Université de Montréal, Science politique 3150 rue Jean-Brillant, Pav. Lionel-Groulx, C-4020 Montréal, Québec, Canada, H3T 1N8

andresimi · 2020-09-15T18:13:58Z

Thanx a lot.
You can dowload the data from here.

The table I am trying to reproduce is something like this one:

And the code I was working until now is the following:

library(rio); library(tidyverse); library(sjmisc); library(sjlabelled); library(modelsummary)

data <- readRDS("table_example.rds") %>% 
  mutate(across(starts_with("dc"), as_label))
data %>% names()
#data %>% select(starts_with("dc")) %>% names %>% paste(collapse = " + ")

# Table one
f <- dcany + dcanyanx + dcgena + dcpanic + dcagor + dcsoph + dcspph + dcptsd + dcocd + dcsepa + dcotanx + 
  dcanydep + dcmadep + dcotdep + dcanyhk + dcadhdi + dcadhdh + dcadhdc + dcadhdo + dcanycd + dcodd + 
  dccd + dcothcd + dcanyother + dcmania + dcpsych + dceat + dcpdd + dctic ~ redcap_event_name*selection*age_group*DropEmpty()*(1+Percent(denom = "col"))

t <- data %>% 
  datasummary(f, data = ., output = "data.frame", title = "title") %>% 
  as.tibble() %>% 
  set_na(1, na="") %>% 
  fill(1) %>%
  filter(`  `=="Yes") %>% 
  select(` `, 
         starts_with("Wave0 Random 5-9"), starts_with("Wave0 Random 10-14"), starts_with("Wave0 High Risk 5-9"), starts_with("Wave0 High Risk 10-14"),
         starts_with("Wave1 Random 9-12"), starts_with("Wave1 Random 13-17"), starts_with("Wave1 High Risk 9-12"), starts_with("Wave1 High Risk 13-17"),
         starts_with("Wave2 Random 12-17"), starts_with("Wave2 Random 18-21"), starts_with("Wave2 High Risk 12-17"), starts_with("Wave2 High Risk 19-21")) %>% 
  as_hux(add_colnames=T) %>% 
  set_bottom_border(row=1, col = everywhere)
t


# Weighted table
f <- dcany + dcanyanx + dcgena + dcpanic + dcagor + dcsoph + dcspph + dcptsd + dcocd + dcsepa + dcotanx + 
  dcanydep + dcmadep + dcotdep + dcanyhk + dcadhdi + dcadhdh + dcadhdc + dcadhdo + dcanycd + dcodd + 
  dccd + dcothcd + dcanyother + dcmania + dcpsych + dceat + dcpdd + dctic ~ redcap_event_name*age_group*DropEmpty()*(1+Percent(denom = "col"))

tw <- data %>%
  as_survey_design(weights = weights) %>% 
  datasummary(f, data = ., output = "data.frame", title = "title") %>% 
  as.tibble() %>% 
  set_na(1, na="") %>% 
  fill(1) %>%
  filter(`  `=="Yes") %>% 
  select(` `, 
         starts_with("Wave0 5-9"), starts_with("Wave0 10-14"), starts_with("Wave0 5-9"), starts_with("Wave0 10-14"),
         starts_with("Wave1 9-12"), starts_with("Wave1 13-17"), starts_with("Wave1 9-12"), starts_with("Wave1 13-17"),
         starts_with("Wave2 12-17"), starts_with("Wave2 18-21"), starts_with("Wave2 12-17"), starts_with("Wave2 19-21")) %>% 
  as_hux(add_colnames=T) %>% 
  set_bottom_border(row=1, col = everywhere)
tw

vincentarelbundock · 2020-09-15T19:10:30Z

The easiest way to do this would probably be to divide your weights by the sum of weights in the subgroup which you want to use as margins (i.e., in the subgroup where the sum of frequencies should equal 100%). Then, you just take the sum of these new weights.

Does something like this work for you?

library(tidyverse)
library(modelsummary) 

data <- readRDS("table_example.rds") %>%
        group_by(redcap_event_name, age_group, selection) %>%
        mutate(weights = weights / sum(weights) * 100)

f <- Factor(dcany) + Factor(dcanyanx) + 1 ~ 
     sum * weights * redcap_event_name * age_group * selection * DropEmpty()
datasummary(f, data)

vincentarelbundock added the supported already label Sep 25, 2020

vincentarelbundock closed this as completed Jul 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Weighted frequencies #136

Weighted frequencies #136

andresimi commented Sep 13, 2020

vincentarelbundock commented Sep 13, 2020 via email

andresimi commented Sep 15, 2020 •

edited

Loading

vincentarelbundock commented Sep 15, 2020 •

edited

Loading

Weighted frequencies #136

Weighted frequencies #136

Comments

andresimi commented Sep 13, 2020

vincentarelbundock commented Sep 13, 2020 via email

andresimi commented Sep 15, 2020 • edited Loading

vincentarelbundock commented Sep 15, 2020 • edited Loading

andresimi commented Sep 15, 2020 •

edited

Loading

vincentarelbundock commented Sep 15, 2020 •

edited

Loading