Replication materials for "Gender, Institutions, and Legislative Speech" (2024), by Eduardo Alemán, Tiffany Barnes, Juan Pablo Micozzi, and Sebastián Vallejo Vera, at the Comparative Politics journal.
Abstract: Speechmaking is a vital resource for legislators and holds particular importance for women lawmakers, who frequently constitute a numerical minority and face constraints on their political influence. We argue that formal and informal institutions, district characteristics, and issue priorities shape women’s speech participation. Analyzing twenty-eight years of speeches from Chile’s Chamber of Deputies, we first show that women’s speeches constitute a small share of all speeches, directly corresponding to their numeric representation. Proportionally, however, women are over/under-represented in speechmaking across different policy areas. After controlling for various factors correlating with gender, including committee assignments, tenure, and district characteristics, women’s relative participation is similar to men’s in most topics but exceeds men’s in areas that disproportionately affect women’s lives, reflecting their commitment to substantive representation.
A link to the latest pre-print is available here. The published version is here.
This README file provides an overview of the replications materials for the article. The Data section describes the main dataset required to reproduce the tables and figures in the paper. The Analysis section summarizes the purpose of each R or python script.
data/data_rep.Rdata: dataset with information on the number of speeches delivered in every topic by a legisator, across differnt legislative sessions. The unit of analysis is the session-legislator-topic. We include all control variables. It contains the following variables:- perodo: legislative session k
- candidatoa: candidate i name
- female: indicator variable that takes value of 1 if legislator is a woman and 0 otherwise
- topics_sp: topic j
- total_sp_leg_type_topic: total number of speeches delivered by candidate i on topic j during session k
- total_sp_leg_type_topic2: total number of speeches not mentioning women delivered by candidate i on topic j during session k
- total_sp_leg_period: total number of speeches delivered by candidate i during session k
- total_mentions_muj: total number of speeches delivered mentioning women by candidate i during session k
- mesa: indicator variable that takes value of 1 if the legislator is part of the executive board of the Chamber of Deputies (Mesa) and 0 otherwise
- membc1:membc31: indicator variable that takes value of 1 if the legislator is a member of committee p and 0 otherwise (see
data/committees.xlsxfor the names of the committees matching the numbers) - chairc1:chairc31: indicator variable that takes value of 1 if the legislator is president of committee p and 0 otherwise (see
data/committees.xlsxfor the names of the committees matching the numbers) - alianza: indicator variable that takes value of 1 if the legislator is part of the Alianza coalition and 0 otherwise
- other: indicator variable that takes value of 1 if the legislator is part of the Other coalition and 0 otherwise
- tenure: the number of terms the legislator has served in the chamber
- lndist: logged distance between main town of the legislator’s district and La Moneda, the presiden- tial building located in Santiago, computed using Google Maps
- porcRural: share of rural inhabitants in the district, obtained from Chilean census information
- margenlist: difference between the votes received by the legislator and her list’s partner
data/keyword mujer.xlsx: dictionary with words associated with women-related topics.data/committees.xlsx: names of the committees matching the numbers (see variables fromdata/data_rep.Rdata.data/topics_committees.xlsx: names of topics that are matched to committees.
- figure 2.R to replicate Figure 2 of the paper, where we show the proportion of speeches delivered by women by topic.
- figure 3.R to replicate Figure 3 of the paper, where we show gender difference in speechmaking by topic.
- figure 4.R to replicate Figures 4a and 4b of the paper, where we show gender differences in Law and Crime speeches when we include and exclude women issues.
- figure 5.R to replicate Figure 5 of the paper, where we plot the relationship between being a women a delivering non women-related speeches and women-related speeches.
- table 1b appendix.R to replicate Table 1.B of the Appendix, where we estimate negative binomial models for the determinants of speech topic. We use the odds ratio of the variable female to plot Figure 3.
data/data_rep.Rdata contains the training set used to train an XLM-RoBERTa model to identify topics in speeches. A full tutorial on how to train Transformer-based models can be found here. A complete course on Computational Text Analysis can be found here




