Replication: Do Women Give Up Competing More Easily? Evidence from the Lab and the Dutch Math Olympiad
Author of the paper: Thomas Buser & Huaiping Yuan
This project replicates and expands upon the field study in the paper "Do Women Give Up Competing More Easily? Evidence from the Lab and the Dutch Math Olympiad" by Thomas Buser and Huaiping Yuan. The study investigates gender differences in the likelihood of continuing to compete after losing. Using field data from the Dutch Math Olympiad (2010–2014), the project analyzes the impact of scoring just above or below the threshold for qualifying for the second round. A regression discontinuity design is employed to estimate the causal effect of losing versus winning on the likelihood of participating again the following year, with a particular focus on gender differences. The results show that girls who narrowly miss the threshold are 14.3% less likely to compete again, while boys do not exhibit a statistically significant change in behavior. Despite this, when applying clustered standard errors, the gender difference becomes less significant. The findings highlight that girls may react more strongly to losing, offering insights into how gender impacts competitive behavior in academic settings.
For the full report, please see: Replication_Peijin_pl724.html
American Economic Journal: Applied Economics 2019, 11(3): 225–252 https://doi.org/10.1257/app.20170160
This paper mainly answers the question: Do women give up competing more easily? The authors use lab experiments and field data from the Dutch Math Olympiad to conduct this research. My replication focuses on the field study part. In this part, the authors use the field data from the Dutch Math Olympiad to determine whether the gender difference in the reaction to losing carries over to the field.
This paper takes the binary decision of whether to compete again one year later as the outcome measure. This study uses anonymized data for all participants in the 2010–2014 Olympiads, including their score, gender, and whether they participated again the year after. The data present a sharp regression discontinuity design because winning and losing depends on a strict cutoff (the threshold score). Comparing the subsequent participation choices of individuals just below and just above the cutoff makes it possible to estimate the causal effect of losing versus winning on the likelihood of participating again one year later. In particular, this paper is interested in the gender difference in this effect.
In this thesis, I replicated the results of Figure 2,3,4 of the paper. Firstly, I conducted a set of estimates. I made subsets of boys and girls separately and estimate the following equation and present regression discontinuity results separately for each gender:
where Y is a binary indicator for participating again a year later, T is a binary indicator for scoring above the cutoff (that is, making the second round).
to learn the gender difference, I added F as a binary indicator for being female and an interaction variable of F and T, which estimates the gender difference in the reaction to loss. I also interacted the NetScore with the cutoff indicator T to allow for different slopes left and right of the cutoff. And the interaction of NetScore and T with F to allow for different slopes for each gender.
To remove unobserved heterogeneity between students from different schools, I introduced school fixed effects, and I also clustered the standard errors at the score level to account for situations where observations within each score group are not independently and identically distributed. In practice, I estimated the following equation:
According to the results, we could learn that in the non-clustered regression, the coefficient of our interest T is 0.143, which means that those girls who get a score higher than the threshold, have a 14.3% higher possibility of participating in the competition in the following year. This difference between girls is statistically significant under the confidence level of 95%. In other words, according to girls who score within 5 points of the cutoff, the effect of losing roughly translates to a 14.3 percent reduction in the likelihood of participating again.
For boys, the influence of their score results on the following year’s participation is not significant. In regression (2), we could see that the gender difference in the reaction to losing is significant, with a confidence level of 90%. However, these differences are not significant with school fixed effects and standard errors clustered at the score level.
According to Figures 3 and 4, regardless of the degree of the polynomial and the bandwidth used, the discontinuity estimates for the boys are always very close to zero. Unlike the paper's results, I found the discontinuity estimates for the girl are more distant from zero but fail to reject the null hypothesis that the effect of losing is different between girls who pass the threshold and who do not. In the estimation of gender difference in the effect of losing in Figure 3, we could see that with the bandwidth to be 7, we could reject the null hypothesis that there is no gender difference in the effect of losing. Even though we can not reach the conclusion that girls are sensitive to the effect of losing, we learn that losing exerts different effects on different genders. According to Figure 2 and the results of regression(2) and regression(3), we could conclude that girls react significantly more strongly to losing than boys.
In the replication of this paper, I found that the results were statistically significant when I applied non-cluster regression in replicating Figures 3 and 4. If I employ clustered regression, in many cases, I cannot reject the null hypothesis. In the replicated regression, I noticed that the significance of coefficients is diminished with clustered regression. This leads to the reflection of Clustering. In the paper-- When Should You Adjust Standard Errors for Clustering?( https://economics.mit.edu/files/13927), the authors discussed this condition of adjusting standard errors for Clustering. They proposed that Clustering is in essence a design problem, either a sampling design or an experimental design issue. It is a sampling design issue if sampling follows a two-stage process in which, in the first stage, a subset of clusters is randomly drawn from the population, and in the second stage, units are randomly drawn from the sampled clusters. In this case, the clustering adjustment is justified because there are clusters in the population that we do not see in the sample. And Clustering is an experimental design issue if the assignment is correlated within the clusters.
In the case of studying the gender difference in losing effect, it is neither a sampling design nor an experimental design issue, since we cannot analogize a student achieving a certain score to a random assignment of scores, and there is no treatment effect in this case. Therefore, the use of Clustering in this case may lead to underestimation of significance and commit the second type of error in statistical inference. Last but not least, when I conducted the OLS estimates of the discontinuity for a range of score bandwidths around the cutoff using the linear and quadratic approaches without cluster on “Score”, I found the gender difference in losing effect is statistically significant and generated the conclusion that girls give up competing more easily than boys, as demonstrated by the fault at the threshold in Figure 2.
References
- Abadie, Alberto, et al. When Should You Adjust Standard Errors for Clustering? Oct. 2017.
- Buser, Thomas, et al. “Do Women Give Up Competing More Easily? Evidence from the Lab and the Dutch Math Olympiad.” American Economic Journal: Applied Economics, 2019, pp. 225–52.