Genetics of educational attainment and social problem behaviors: Robust estimates of direct genetic effects
Abstract
This study examines the genetic underpinnings of (problematic) socioemotional behaviors in children (6–18 years old) by leveraging a within-family trio design using data from the Lifelines cohort (N=3,090–4,510). Socio-emotional development is key to understanding long-term educational, occupational, and mental health outcomes. Yet, their genetic foundations are not fully understood. We estimate direct genetic effects of educational attainment (EA)-related polygenic indices (PGIs), as a whole and decomposed into cognitive (Cog) and non-cognitive (NonCog) components, on a comprehensive set of internalizing and externalizing behaviors assessed via self- and parental-reported Achenbach System of Empirically Based Assessment (ASEBA) measures. Our findings reveal that higher EA-related PGIs, particularly their cognitive component, are robustly associated with lower levels of attention problems, rule-breaking, and somatic complaints. These effects persist after adjusting for parental genetics, supporting a causal interpretation. Some associations—such as those of the NonCog EA PGI with attention problems and rule-breaking, and of the full EA PGI with externalizing behaviors—lose statistical significance when parental genetics are included, suggesting that small environmentally mediated parental influences may play a role. We also find notable sex differences, with stronger associations in girls, especially for internalizing traits, and observe modest (although insignificant) differences between self- and parent-reported outcomes. Our findings contribute novel evidence that cognitive genetics influence behavioral development and highlight the value of family-based genetic designs for uncovering the complex interplay of genes and environment in socio-emotional functioning.
Article type: Research Article
Keywords: Social behavioral traits, cognitive/non-cognitive traits, educational attainment, polygenic indices, within-family trio design, sex
Affiliations: Vrije Universiteit Amsterdam (VU), The Netherlands; Amsterdam UMC, University of Amsterdam, The Netherlands; University of Southern California, Los Angeles, United States
License: CC BY 4.0 This work is licensed under a Creative Commons Attribution 4.0 International License, which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
Article links: DOI: 10.21203/rs.3.rs-7328005/v1 | PubMed: 40909789 | PMC: PMC12408008
Relevance: Moderate: mentioned 3+ times in text
Full text: PDF (7.5 MB)
Introduction
Socioemotional behaviors are shaped by complex interactions of genetic, environmental, and developmental factors. Unlike personality traits, which are considered relatively stable, socioemotional behaviors exhibit dynamic patterns of emotional regulation, impulse control, and social interaction that can vary across contexts. Understanding the role of genetic factors in socioemotional behaviors may provide insight into their etiology and help to refine theory.
A growing body of research has demonstrated the importance of social and emotional functioning behaviors in shaping long-term outcomes, including educational attainment, career success, and social functioning (ref. Belfi and Borghans, 2025). Heckman and colleagues emphasize that perseverance and self-control play a crucial role in labor-market success and economic stability, sometimes exceeding the predictive power of cognitive abilities (ref. Heckman and Kautz, 2012; ref. Heckman et al., 2006; ref. Heckman and Rubinstein, 2001). Furthermore, meta-analytic evidence shows that emotional intelligence is a significant predictor of academic performance (ref. MacCann et al., 2020).
Given the central role of social and emotional behaviors in shaping educational outcomes, genetic predictors of educational attainment should also capture variance in these behaviors. Recent advances in genome-wide association studies (GWAS) provide an opportunity to explore such associations. In this paper, we investigate the role of genetic factors in social behavior, estimating robust associations with educational attainment (EA)-related genetic factors, on a comprehensive set of parental- and self-reported socioemotional behaviors.
Large-measure GWASs on educational attainment (EA) have identified thousands of genome-wide significant single-nucleotide polymorphisms (SNPs) associated with years of schooling, with polygenic indices (PGIs), which aggregate the millions of small genetic effects of SNPs across the genome, explaining up to 16% of the variance in EA (ref. Lee et al., 2018; ref. Okbay et al., 2022). EA-related genetic correlates extend beyond educational achievement, demonstrating predictive power for a range of diverse outcomes, including self-control, depression, and personality traits (ref. Belsky et al., 2016; ref. Smith-Woolley et al., 2019; ref. Okbay et al., 2022). Given the substantial sample size of EA GWASs, and therefore predictive power, and the broad predictive validity of EA-related PGIs beyond educational attainment, we use the EA PGI in our analyses to explore its relevance for (problematic) socioemotional behaviors.
Recent work has demonstrated that EA-related genetic influences are multifaceted, encompassing both cognitive and non-cognitive components. Using GWAS-by-subtraction, ref. Demange et al. (2021) developed two orthogonal polygenic indices (PGIs) that isolate the portion of EA-related genetic variance attributable to performance on cognitive tasks from the residual, non-cognitive component. This decomposition offers a powerful empirical tool for probing how distinct dimensions of EA-related genetic variance—cognitive and non-cognitive1—are associated with socioemotional behaviors. While the cognitive components (i.e. those related to performance on standardized tests) of EA-related genetics are well-documented, growing attention has been given to the role of non-cognitive factors, such as motivation, self-regulation, and personality (ref. Demange et al., 2021,0; ref. Malanchini et al., 2024). These are increasingly recognized as critical to understanding the broader associations of EA-related genetics on socioemotional behaviors.
The cognitive / non-cognitive decomposition aligns with what ref. Belsky and Harden (2019) term phenotypic annotation—a framework for bridging the interpretive gap between GWAS findings and psychological or developmental mechanisms. Rather than tracing the biological pathways linking individual genetic variants to outcomes, phenotypic annotation emphasizes mapping polygenic associations onto behavioral, developmental, and social phenotypes across the life course. It advocates for shifting focus from specific genes to whole-genome scores, from narrow target traits to broader nomological networks, and from proximate biology to long-run developmental trajectories.
Our study builds on the phenotypic annotation framework by examining how distinct dimensions of EA-related genetic variance—cognitive and non-cognitive—are associated with internalizing and externalizing socioemotional behaviors, as measured by the Achenbach System of Empirically Based Assessment (ASEBA) (ref. Achenbach, 2001), across different reporter perspectives. Internalizing behaviors—processes within the self—are linked to mental health conditions such as depression and anxiety, and include somatic complaints and social withdrawal. Externalizing behaviors, by contrast, manifest as outward-directed actions such as aggression and rule-breaking, often associated with conduct disorders and substance abuse. By relating EA-related PGIs to these behavioral domains across informants, our approach contributes to the broader goal of phenotypic annotation: unpacking the psychological mechanisms through which polygenic influences on educational attainment operate.
The heritability of ASEBA (problem) behaviors has been widely studied, with twin studies consistently reporting moderate to high estimates ranging from 30% to 90% (ref. Hicks et al., 2004; ref. Kendler and Myers, 2014; ref. Kendler et al., 2016; ref. Nikstat and Riemann, 2020). These estimates vary by age, gender, source of symptom reporting, and methodology. Heritability estimates tend to increase with age, while the contribution of shared environment decreases (ref. Nikstat and Riemann, 2020). Studies of adolescents and young adults consistently show higher heritability of externalizing behaviors (ref. Kendler and Myers, 2014; ref. Kendler et al., 2016; ref. Hicks et al., 2004), whereas research in younger children suggests a greater contribution from family-level factors early in development (ref. Franić et al., 2014; ref. de Zeeuw et al., 2020).
Sex differences also shape these patterns. ref. Kendler et al. (2016) observe that genetic factors play an important role in females, whereas shared environmental influences are more prominent in males. Relatedly, ref. Kendler and Myers (2014) report sex-specific patterns in disorders associated with the genetic likelihood for externalizing and internalizing behaviors, emphasizing the need to consider sex-specific pathways to better understand these behaviors.
Reporter source is another key moderator: heritability tends to be higher for other-reported data than for self-reports, likely reflecting discrepancies in perception (ref. Nikstat and Riemann, 2020). Externalizing behaviors generally show higher heritability than internalizing behaviors (ref. Nikstat and Riemann, 2020), though this may partly reflect greater difficulty of measuring internalizing behaviors, which involve processes within the self. Nonetheless, both domains exhibit overlapping as well as unique genetic liabilities (ref. Hicks et al., 2004; ref. Franić et al., 2014; ref. Kendler et al., 2016).
GWASs on socioemotional behaviors have begun to illuminate their molecular underpinnings. ref. Karlsson Linnér et al. (2021) identify over 500 genome-wide significant SNPs associated with traits related to self-regulation and addiction in a GWAS of 1.5 million individuals. Polygenic indices (PGIs), aggregating over SNP (genetic loci) associations estimated from these GWASs predict a range of behavioral and medical outcomes, including opioid use disorder, suicide, and unemployment.
However, findings for ASEBA-relevant domains remain limited. A family-based GWAS by ref. Barr et al. (2020) found no robust genome-wide associations for externalizing behaviors, likely due to limited sample size (N=7,568). Similarly, a meta-analysis of univariate GWASs on internalizing symptoms by ref. Jami et al. (2022) reported low SNP heritability (~2%) and no genome-wide significant hits in a meta-analysis of internalizing symptoms—again likely a result of limited sample size (N=64,561)— although self-reported symptoms showed somewhat higher heritability (6%) than other-reported ones—again underscoring the importance of reporter heterogeneity.
Building on these insights, this study aims to address the following research questions:
- RQ1 To what extent are educational attainment (EA)-related genetics robustly associated with socioemotional behaviors?
- RQ2 To what extent do these robust associations run through cognitive (Cog) and non-cognitive (NonCog) educational attainment (EA)-related genetics?
- RQ3 To what extent do these robust associations differ between boys and girls or reporter source (parental and self-reported outcomes)?
By identifying robust genetic associations, distinguishing between cognitive and non-cognitive genetic pathways, and by comparing parental and offspring self-reported measures, this study provides nuanced insight into the genetic and environmental influences on socio-emotional development.
This study leverages the Lifelines genotyped (and partially imputed) trio sample to investigate these questions. We employ PGIs for educational attainment in a within-family design, controlling for parental genetics. Incorporating parental genotypes allows us to distinguish direct genetic effects in the offspring from the total genetic effects, which are biased by indirect genetic effects, population stratification and non-random mating. We define the direct effect as the association between offspring genetics and offspring outcomes after controlling for parental genetics2.
Furthermore, this study design allows for a causal interpretation of direct genetic effects. This is because Mendelian inheritance gives rise to a natural experiment whereby conditional on parental genotype the offspring’s genotype is as good as random. Such a conceptualization of ”causality” relies on counterfactual reasoning (ref. Biroli et al., 2025; ref. Harden, 2021), which considers what would have happened if an individual had inherited a different genetic variant from their parents while holding environmental factors constant. A direct test of such counterfactuals is unfeasible—arguably also unethical—in practice, which is why within-family and trio-based designs prove useful (see section PGI construction for a detailed explanation of PGI derivation and how they are used to support causal inference)
It is necessary to emphasize that the identification of causal genetic effects does not imply biological determinism; rather, genetic influences unfold through complex and long causal chains involving gene-environment interactions. The same inherited genetic variant may lead to different behavioral outcomes depending on external conditions. This perspective is particularly relevant when interpreting PGIs, which aggregate the estimated additive effects of millions of SNPs each with small effects. Although social scientists often describe PGIs as proxies for a “genetic propensity” or “genetic endowment” for a trait, such terms can be misleading, as they may suggest that genetic effects are fixed or innate. PGIs are better understood as statistical summaries of genetic associations that are shaped by the environmental and social structures in which genetic differences are expressed. Even when using within-family designs, PGIs still correlate with environmental selection processes—such as individuals actively choosing certain environments (active rGE) or eliciting particular responses from others (evocative rGE)—even though they do not directly estimate these mechanisms. In this sense, PGIs can be conceptualized as an aggregate of active and evocative gene-by-environment interactions (G×E) (ref. Biroli et al., 2025). It is therefore crucial to interpret their effects within a broader biopsychosocial framework, reflecting how genetic influences unfold dynamically through environmental contingencies.
Data
Source
Lifelines is a multi-disciplinary prospective population-based cohort study examining in a unique three-generation design the health and health-related behaviours of 167,729 persons living in the North of the Netherlands3. It employs a broad range of investigative procedures in assessing the biomedical, socio-demographic, behavioural, physical and psychological factors which contribute to the health and disease of the general population, with a special focus on multi-morbidity and complex genetics. Lifelines data has been collected in three waves (2007–2013; 2014–2017; 2019–2023).
Genetic data processing
Lifelines comprises two major genotyped cohorts: CytoSNP and UGLI. The CytoSNP cohort includes only unrelated adults aged 18 and over, and is therefore excluded from our analysis. Instead, we utilize the UGLI cohort, which consists of 64,589 individuals aged eight years and older, genotyped using the Infinium Global Screening Array (691,072 SNPs). A series of rigorous quality control (QC) procedures (ref. Marees et al., 2018) were applied to the genetic data obtained from the Lifelines Biobank. Unreliable genetic information was removed, such as multiallelic SNPs, low-frequency loci (minor allele frequency <1%), and SNPs with low imputation quality (info score <0.8). Additionally, participants with excess homozygosity were excluded, as were SNPs that violated Hardy-Weinberg Equilibrium at a stringent threshold (p < 10−6).
Since many participants were missing genetic data, we imputed parental genotypes using Mendelian imputation (ref. Young et al., 2022). This approach infers the parental genotype by leveraging data from a genotyped sibling or a single genotyped parent. Mendelian imputation relies on the principle that siblings inherit different combinations of their parents’ genetic material, allowing for a statistical reconstruction of the unobserved parental genome. This technique provides an unbiased estimate of parental genetic effects while minimizing measurement error (ref. Young et al., 2022).
Final sample
We focus on children aged 6–18 years with genotyped data available for both offspring and parents (i.e., genetic trios, with some parental genotypes imputed). This final analysis sample consists of 3,960 unique individuals, contributing a total of 7,692 observations, of which 6,219 include at least one imputed parental genotype. A majority of individuals (81.3%) are observed more than once, and 12.9% are observed more than twice. The observations are drawn from two versions of the Achenbach System of Empirically Based Assessment (ASEBA): 4,554 from the self-reported Youth Self-Report (YSR) and 3,138 from the parent-reported Child Behavior Checklist (CBCL). For more details, see subsection Problem behavior as assessed by ASEBA.
PGI construction
After quality control, PGIs for EA were constructed. PGIs quantify genetic influences on complex traits by summarizing associations between ~ million genetic independent loci and a particular phenotype (ref. Becker et al., 2021). They are particularly useful for polygenic traits like educational attainment, where millions of SNPs each have small associations with the outcome (ref. Mills et al., 2020). Essentially, a PGI is a weighted sum of the tiny associations of these individual’s SNPs, capturing their combined influence on a given trait:
Here, represents the allele count at SNP for individual , combined with its weight , derived from genome-wide association studies (GWAS). In GWAS, each SNP is regressed on years of schooling, with controls for sex, age, and genetic principal components to minimize spurious associations due to ancestral differences. These regression coefficients serve as the SNP-weights in the PGI, enhancing prediction for polygenic traits like educational attainment.
PGIs were constructed leveraging the latest genome-wide association study (GWAS). More specifically, we used two sets of polygenic indices as predictors. The first includes the full educational attainment (EA) PGI, derived from the EA GWAS conducted by ref. Okbay et al. (2022) (excluding the 23andMe sample), which explains 12–16% of the variation in educational attainment between individuals. The second includes educational attainment PGIs decomposed into cognitive (Cog) and non-cognitive (NonCog) components, obtained using GWAS-by-subtraction (ref. Demange et al., 2021) with weights provided by ref. Lee et al. (2018)4.
Given that standard GWAS-based PGI estimates can be biased due to sample overlap and population structure, MetaSubtract was applied to correct for potential biases (ref. Nolte, 2020). In other words, we removed Lifelines-related contributions to the GWAS from the discovery GWAS dataset. To maximize predictive power, SBayesR, a Bayesian regression approach that accounts for linkage disequilibrium, was used to ensure that SNP weights were optimally adjusted ref. Lloyd-Jones et al. 2019. The resulting PGI was standardized to allow for comparability between individuals.
In line with the aims of our study, we construct PGIs not only for the offspring but also one for the parents by summing the PGIs of both individual parents. This enables us to leverage triobased designs and Mendelian inheritance and ensures that, conditional on parental genotypes, the child’s genotype is as good as randomly assigned (see Introduction for a longer discussion) This principle underpins our identification strategy and justifies causal interpretation. Visual evidence supporting the randomness of genetic transmission is provided in Figure 1 of the Appendix

Problem behavior as assessed by ASEBA
The Lifelines study uses modified versions of the Child Behavior Checklist (CBCL) and Youth Self Report (YSR) from the Achenbach System of Empirically Based Assessment (ASEBA) (ref. Achenbach, 2001). We use the CBCL and YSR problem behavior measures observed in by parents, as well as children and adolescents themselves. The CBCL and YSR are widely used in clinical and educational settings to identify developmental problems, monitor changes over time, and guide intervention strategies.
The CBCL6/18 is administered to the parents of children aged 8–17 years in the first wave and aged 4–12 years in subsequent waves (parental reports). The YSR is completed directly by adolescents aged 13–17 years (self reports). This age range is particularly relevant for studying how genetic influences on socioemotional behaviors evolve, as externalizing behaviors tend to stabilize in adolescence, while the heritability of internalizing behaviors increases over time (ref. Nikstat and Riemann, 2020). Furthermore, previous findings suggest that the predictive power of PGIs varies between stages of development, reflecting differential and changing gene-environment interactions and the influence of genetic nurturing (ref. de Zeeuw et al., 2020; ref. Malanchini et al., 2024).
The CBL and YSR have identical response options (0: Not at all; 1: A bit or sometimes; 2: Clearly or often). Each assessment includes approximately 90 items, which are not mapped one-to-one with individual behaviors, but rather contribute to a set of empirically derived scales that reflect patterns of related symptoms. These scales are constructed by summing specific items associated with particular behavioral domains. Some of these scales are further grouped into broader categories of internalizing and externalizing problem behaviors (ref. Achenbach, 2001).
The problem behavior measures used in this study span internalizing and externalizing domains and are displayed below from most internalizing to most externalizing. Note that three intermediate domains—Social, Thought, and Attention Problems—are placed between these groupings.
These behaviors are relevant for children’s everyday functioning, particularly in how they interact with others, engage with learning tasks, and adapt to structured environments such as schools. Internalizing behaviors may influence emotional well-being and lead to social withdrawal. The intermediate domains often reflect difficulties with attention, social engagement, or atypical patterns of thinking. Externalizing behaviors, on the other hand, may involve outward expressions of conflict or rule-breaking that challenge social expectations.
Internalizing Problem Behaviors:
Reflect issues directed inwardly. These are assessed through:
- Anxious Scale, which assesses symptoms of anxiety and depression.
- Withdrawn Scale, which captures symptoms of withdrawal and depression.
- Somatic Complaints Scale, which evaluates physical symptoms associated with emotional distress.
Social Problems Scale
Measures difficulties in social interactions, such as issues with forming and maintaining friendships, dependence, or isolation.
Thought Problems Scale:
Assesses unusual or obsessive thoughts, behaviors, and experiences, including hallucinations, compulsive actions, or bizarre ideas.
Attention Problems Scale:
Measures symptoms such as difficulty in concentrating, impulsivity, and hyperactivity, which are common in conditions like attention-deficit/hyperactivity disorder (ADHD).
Externalizing Problem Behaviors:
Outwardly directed behaviors that can disrupt social environments. These are assessed through:
- Rule-Breaking Behavior Scale, which includes behaviors that violate social norms and rules.
- Aggressive Behavior Scale, which captures conduct-related issues.
Scores are standardized within samples (CBCL and YSR samples, respectively) to facilitate comparisons across different age groups, accounting for developmental variations. The densities of these standardized scores (Figures 2–4)5 are presented for each problem behavior scale in the Appendix.


Covariates
The analyses control for several covariates (Z), each selected for its relevance to educational and genetic studies:
- Offspring sex: Binary variable to account for sex-based differences in problem behavior. Mean-centered within CBCL and YSR samples, respectively.
- Offspring age at inclusion: Measured in years to control for age-related variability in problem behavior. Standardized within CBCL and YSR samples, respectively.
- Offspring birth year: To adjust for unobserved differences across birth cohorts. Standardized within CBCL and YSR samples, respectively.
- First 10 principal components for genetic similarity: To account for ancestral differences within the sample. Standardized within CBCL and YSR samples, respectively.
- Wave number: To control for unobserved differences across collection waves. Standardized within CBCL and YSR samples, respectively.
Statistical analysis
We fit two ordinary least squares (OLS) regression models. Each model is estimated separately for the multiple outcome variables, for the two sources (self- and parental reported outcomes), and the two sets of predictors (EA, cognitive and non-cognitive PGIs). All models are estimated with the same set of controls Z. These include offspring’s sex, birth year, and age at inclusion, the first ten principal components of the genetic similarity, and wave number.
Following ref. de Zeeuw et al. (2020), we set the significance threshold (alpha level) at 0.01, rather than the conventional 0.05. Using a stricter threshold helps mitigate the risk of spurious findings, given the multiple hypothesis testing. However, considering that the behavioral outcomes are correlated, we believe that applying highly stringent corrections such as the Bonferroni method—which assumes independence—would be overly punitive and likely reduce statistical power.
Additionally, some individuals were observed across multiple waves, introducing serial correlation in the error terms. While this does not bias point estimates of regression coefficients, it can lead to underestimation of the standard errors, thereby affecting hypothesis testing.
In principle, clustering at the individual level could account for serial correlation and within-individual dependence. In our final analysis sample, over 80% of individuals are observed more than once; however, the majority of those are observed only twice, with just 13% contributing more than two observations. This limited within-individual variation reduces the stability of standard error estimates when clustering at such a fine level. In contrast, our analytic focus is on family-level variation—leveraging PGIs from both children and parents—making the family the conceptually appropriate clustering unit. Given this, we opted to cluster standard errors at the family level. This approach strikes a practical balance—addressing within-family correlation and repeated observations—while ensuring sufficient variation for valid estimation of robust standard errors.
We employ two models: Model 1 estimates the total genetic effect measured by the PGI, where subscript O refers to the offspring.
Model 2 adds an additional parameter for the parent’s (subscript P) PGI and thereby decomposes the total genetic effect into a direct effect measured by and an indirect effect measured by . In this model, the estimated coefficient for the offspring can be interpreted as the causal effect of the PGI on the outcome .
To test sex differences, we ran the models with a sex-PGI interaction, as well as separately for boys and girls. In the interaction term approach, all covariates (excluding sex) are also interacted (following ref. Keller, 2014).
If variation is observed in estimated coefficients between each sample (CBCL and YSR), a Wald test is applied to test the significance of their difference.
Results
Effects of full and decomposed EA PGI on socioemotional traits in parental and self-reported samples
Figures 1 and 2 display the estimated coefficients for the offspring PGIs from Models 1 and 2 in the CBCL and YSR samples, respectively. The top panels of each figure show results for the full EA PGI, while the bottom panels present results for its cognitive (Cog) and non-cognitive (NonCog) components. Model 1 captures the total genetic effect, reflecting the association between offspring PGI and behavioral outcomes without adjusting for parental genotype. Model 2, by contrast, estimates the direct genetic effect by conditioning on parental PGIs, thereby isolating the effect of the offspring’s inherited genetics from potential confounding due to shared familial environment or parental genetic influences.


In the parental report sample (CBCL)
(Figure 1) significant and robust associations are observed for Attention Problems, Rule-Breaking Behavior, and Externalizing Problem Behavior. All associations are negative, implying that higher EA PGIs are associated with lower scores on these ASEBA problem behavior scales.
Attention Problems and Rule-Breaking Behavior exhibit robust negative associations with both the full EA PGI (top panel) and its cognitive component (bottom panel), even after accounting for parental genetics. For Attention Problems, the direct genetic effect—captured by the coefficient on the offspring PGI in Model 2—is −0.081 (p < 0.002) for EA and −0.288 (p < 0.001) for Cog EA. For Rule-Breaking Behavior, the direct genetic effect is −0.078 (p < 0.002) for EA and −0.103 (p < 0.009) for Cog EA. Interestingly, while the NonCog EA PGI is initially significant for both behaviors in Model 1 (bottom panel), these associations are not robust to the inclusion of parental genetics. No other behavioral scale shows a significant relationship with the non-cognitive EA PGI in either model.
Externalizing Problem Behavior shows a statistically significant negative total genetic effect of the EA PGI. However, these effects are not robust to controlling for parental PGIs. In contrast, the Cog EA PGI demonstrates both significant total and direct effects, with a direct genetic effect of Cog EA PGI is −0.305 (p < 0.007).
In the self-reported sample (YSR)
(Figure 2) only Somatic Complaints show robust negative associations with the EA and the Cog EA PGI, including after controlling for parental genetics. The direct genetic effect is −0.115 (p < 0.001) for EA and −0.254 (p < 0.001) for Cog EA. In the self-reported sample, there were no significant NonCog EA PGI associations.
We tested for the statistical significance of the differences in the estimated EA PGI coefficients in the parental versus self-reported samples using a Wald test. At a significance level of 1%, results are statistically identical across CBCL and YSR for all scales.
Complete results for all behavioral measures are presented in Tables 1 and 2 in the Supplementary material. These tables also display the estimated coefficients for parental genetic factors, which are statistically insignificant for all observed ASEBA behavioral scales.
Sex Differences
Figures 3 to 6 present the estimated coefficients for the full and decomposed EA PGIs of the offspring from Models 1 (total effect) and 2 (direct effect), disaggregated by sex, in the CBCL and YSR samples, respectively.


In the parental report sample (CBCL)
(Figures 3 and 4), stratifying by sex reveals patterns consistent with the full sample, with robust associations observed for Attention Problems, Rule-Breaking Behavior, and Externalizing Problem Behavior.

Among boys, only the association between the full EA PGI and Attention Problems reaches significance (β = −0.105, p < 0.01). For girls, significant associations are observed across all three behavioral measures. These remain robust to adjustment for parental genetics only in the case of the Cog EA PGI, with direct effects of −0.319 (p < 0.004) for Attention Problems, −0.148 (p < 0.008) for Rule-Breaking Behavior, and −0.424 (p < 0.005) for Externalizing Problem Behavior.
Among girls, interesting patterns emerge for the Thought Problems outcome. The NonCog EA PGI has a significant coefficient only after controlling for parental genetics (β = 0.149 p < 0.01. Furthermore, for this outcome we observe the only statistically significant parental PGI effect across all models, the NonCog EA PGI (β = −0.096 p < 0.01, see Table 4 in the Supplementary material). Notably, the offspring and parental PGIs genetic effects exhibiting opposing signs—an unusual pattern that warrants further interpretation.
In the self-reported sample (YSR)
(Figures 5 and 6), sex differences are more pronounced and associations reach higher levels of statistical significance. No robust genetic associations are found for boys. In contrast, girls show a strong and consistent relationship between Somatic Complaints and both the full EA PGI (β = −0.167, p < 0.001) and the EA Cog PGI (β = −0.446, p < 0.001).

To formally test sex differences
we estimated interaction models including sex-PGI terms6. Sex interactions with the full and decomposed EA PGIs show almost no significant hits in the parental report sample, with the effect of the interaction of sex and the NonCog EA PGI on Thought Problems being the only exception (pM1 < 0.05 and pM2 < 0.009). In the self-reported sample, genetic effects for many measures show significant sex differences, especially for the Cog EA PGI. These include Attention Problems, Anxious, Somatic Complaints, Withdrawn, Social Problems, Externalizing and Internalizing Problem Behavior. From these, Somatic Complaints is the only measure that exhibits some significant differences for both the full and the cog EA PGI (pM1 < 0.02, pM2 < 0.08, and pM1 < 0.003, pM2 < 0.02, respectively).
Complete results by sex and sex-interactions for all behavioral measures are presented in Tables 3 to 8 in the Supplementary material. Parental genetics as a predictor remains statistically insignificant for all observed behavioral measures.
Results for Wald tests are presented in Table 9 in the Supplementary material.
Discussion
Main Findings
This study investigated the genetic underpinnings of (problematic) socioemotional behaviors in children and adolescents (ages 6–18) using polygenic indices (PGIs) for educational attainment (EA) and its cognitive (Cog) and non-cognitive (NonCog) components. Leveraging a trio-based, within-family design in the Lifelines cohort, we account for confounding due to population stratification, assortative mating, and indirect genetic effects, allowing for a causal interpretation of offspring PGI estimates. While not implying genetic determinism, this approach strengthens the case for genetic influence on socio-emotional traits beyond purely correlational inference. We examined whether EA-related PGIs predict socioemotional behaviors, whether cognitive and non-cognitive components show differential effects, and whether associations vary by sex or reporter source (parental vs. self-report). We find robust evidence for all three, offering novel insights into the distinct pathways through which genetics contribute to socio-emotional development.
Our findings provide four key insights: (i) EA-related genetic factors contribute positively to the development of healthier socioemotional behaviors, with cognitive genetic components predominantly driving these associations; (ii) Sex differences emerge in the genetic influences on socioemotional behaviors, with stronger genetic effects observed for females particularly for Somatic Complaints; (iii) We observe modest differences between self-reported and parental-reported measures, albeit not statistically significant; (iv) We find little robust evidence for indirect genetic effects, though some attenuations after controlling for parental genetics suggest they may exist in modest form, primarily through non-cognitive pathways.
Cognitive vs. Non-Cognitive Genetics
We find that EA-related genetic influences on socioemotional behaviors are predominantly driven by cognitive genetic components. Behavioral measures such as Attention Problems and Rule-Breaking Behavior exhibit robust associations with both the full EA PGI and the cognitive EA PGI, whereas non-cognitive genetic influences are weaker or absent. These results align with prior studies that emphasize the importance of cognitive traits in behavioral and educational outcomes (ref. Demange et al., 2021; ref. de Zeeuw et al., 2020).
The significant role of cognitive genetics raises critical questions. One interpretation is that problematic socioemotional behaviors are inherently linked to cognitive functioning, reflecting behavioral characteristics that align with success in standardized tests. Traits such as focus, self-regulation, and impulse control, which are essential for educational success, may also shape social behavior, explaining the strong cognitive genetic associations we observe. The developmental literature further supports a more integrated view of cognitive and behavioral traits. ref. Best and Miller (2010) emphasize that executive functions—defined as cognitive processes orchestrated by the prefrontal cortex, including inhibition, working memory, and shifting—develop across childhood and are foundational for goal-directed behavior. While these processes are traditionally classified as “cognitive,” their role in behavior regulation highlights the limitations of sharply separating cognitive and non-cognitive domains. Rather than reflecting distinct genetic influences, the traits captured by Cog and NonCog PGIs may overlap substantially in both biological basis and developmental function. Executive function, as described by Best and Miller, may thus serve as a useful conceptual bridge for understanding how cognitive development underpins both academic and socio-emotional outcomes, even if their paper does not explicitly frame EF in those terms.
Alternatively, these findings might reflect methodological nuances in how NonCog PGIs are constructed. Although ref. Demange et al. (2021) suggest that their methodology effectively isolates non-cognitive components, residual overlap may still influence observed associations. The GWAS-by-subtraction procedure assumes by construction that cognitive and non-cognitive genetic influences are orthogonal. This strong assumption excludes any genetic loci that contribute to both cognitive and non-cognitive traits—such as self-regulation or motivation that are partly grounded in cognitive function—from the NonCog PGI. Instead, such pleiotropic effects are captured entirely within the Cog PGI. As a result, the NonCog PGI likely captures only a subset of the genetic variance in non-cognitive contributors to educational attainment—specifically, those that are genetically uncorrelated with cognition. This modeling decision, while useful for isolating distinct components, may lead to an underestimation of the true role of non-cognitive traits in predicting behavioral outcomes. In turn, it may help explain why we observe stronger associations between socioemotional behaviors and the Cog EA PGI across our analyses.
Sex Differences
Our findings also reveal significant sex differences in the genetic influences on socioemotional behaviors. Girls exhibit stronger genetic associations across multiple behavioral traits, while boys show more limited associations, primarily for Attention Problems in parental-reported data.
These results align with the findings of ref. Kendler and Myers (2014), who demonstrated that the genetic structure of internalizing and externalizing traits differs by sex. Specifically, genetic influences on internalizing traits (e.g., anxiety, depression) are stronger in females, whereas shared environmental influences play a larger role in male externalizing behaviors (ref. Kendler et al., 2016).
A possible explanation for this is that boys’ socio-emotional development is more strongly shaped by environmental influences, such as peer interactions and family dynamics, rather than direct genetic predispositions. This aligns with prior evidence that shared environmental effects contribute more substantially to externalizing traits in males (ref. Kendler et al., 2016). Moreover, the fact that sex-stratified GWASs do not find substantial genetic differences (ref. Jami et al., 2022; ref. Okbay et al., 2022) suggests that sex-specific gene-environment interactions may drive the observed disparities.
Reporter Variability
Although prior research has highlighted substantial differences in genetic associations depending on the source of behavioral reports—particularly for internalizing symptoms (ref. Jami et al., 2022)—our findings suggest only modest and statistically insignificant variation between self-reported and parental-reported outcomes. For example, while self-reported Somatic Complaints showed robust associations with EA and Cog EA PGIs, and parental-reported Attention Problems and Rule-Breaking Behavior were more strongly associated with cognitive genetics, these patterns did not reach statistical significance when directly compared across reporter types. This may reflect true similarities in underlying genetic influences across sources, or limitations in statistical power. Nonetheless, our results are directionally consistent with the notion that self-reports may better capture internalized emotional states, whereas parental reports may more reliably reflect observable, externalizing behaviors. As such, even in the absence of strong statistical differences, the choice of reporter remains an important methodological consideration for genetic studies of behavioral traits.
Lack of Robust Evidence for Indirect Genetic Effects
We find little robust evidence for indirect genetic effects of parental EA-related genetics on offspring socioemotional behaviors. Parental PGIs show no statistically significant associations with any socio-emotional traits, and for most outcomes, the estimated effects of the offspring PGIs remain virtually unchanged after conditioning on parental genetics.
These findings are consistent with those of ref. Tanksley et al. (2025), who examined the genetic underpinnings of child externalizing behavior using an externalizing PGI derived from a large-scale adult GWAS of self-regulatory traits (e.g., ADHD, problematic alcohol use, cannabis use, risk tolerance). Together, these results could suggest that indirect genetic effects—or gene-environment correlations—may play a more limited role in shaping socio-emotional behaviors than they do in shaping educational outcomes.
They are also in line with prior findings from Dutch cohorts, such as ref. Demange et al. (2022), who reported similarly limited evidence for indirect genetic effects on educational outcomes. These parallels may reflect contextual factors specific to the Netherlands, such as its relatively egalitarian social structure and parenting norms that emphasize autonomy and peer influence. Our final analysis sample (3,960 individuals from 2,469 families) is comparable in size—and in fact slightly larger—than the trio sample used by ref. Demange et al. (2022), suggesting that our null findings are unlikely to be driven by a smaller or less informative design.
An exception to this pattern emerged in the parental reported Thought Problems outcome for girls, where we observed the only statistically significant parental PGI effect across the full set of models. Notably, this association is accompanied by direct and indirect genetic effects with opposing signs—an unusual pattern that may indicate an environmentally mediated influence of parental EA-related traits that contrasts with the child’s own genetic likelihood. One possible interpretation is that parents with high EA-related PGIs may shape environments marked by high cognitive demand, structure, or achievement expectations, which could heighten internalizing or obsessive tendencies in daughters. Alternatively, this may reflect reporter bias, whereby higher-PGI parents are more attentive to or more likely to report subtle or subclinical psychological symptoms.
Some attenuation in offspring PGI associations after controlling for parental genetics also warrant a more nuanced interpretation. Specifically, total genetic effects of the NonCog EA PGI on Attention Problems and Rule-Breaking Behavior, as well as the total effect of the full EA PGI on Externalizing Problem Behavior, became non-significant once parental genetics were included in the model. In sex-stratified analyses, similar attenuation patterns emerged among girls, where associations of the full EA PGI with Attention Problems, Rule-Breaking Behavior, and Externalizing Problem Behavior lost significance after adjusting for parental PGIs. These patterns could reflect weak environmentally mediated parental influences—such as parental behaviors correlated with their own genetic predispositions shaping the child’s social environment—consistent with modest indirect genetic effects or gene–environment correlation processes (e.g., active or evocative rGE).
At the same time, the overall pattern of results suggests that any indirect effects are likely to be small. While our simulation-based power analysis demonstrates high sensitivity to modest genetic effects (e.g., β ≥ 0.10, see Appendix), power for detecting smaller effects is substantially lower. Given that indirect genetic effects are typically weaker, our models may lack sufficient power to reliably detect them. This limitation is compounded by potential measurement constraints: the behavioral measures used here are based on parental and self-reports rather than teacher reports, which have been shown to more reliably capture socioemotional behaviors (ref. Feng et al., 2022). The absence of such third-party reports may further reduce sensitivity to environmentally mediated parental influences.
Future research with larger samples, cross-national comparisons, and richer environmental measures (e.g., teacher or school-based assessments) will be crucial to determine whether these attenuations reflect genuine indirect genetic effects or statistical noise. Notably, the direct genetic effects of the Cog EA PGI remain robust across nearly all traits—even in models adjusting for parental genetics—suggesting that the primary causal pathway linking EA-related genetics to socioemotional behaviors operates through cognitive rather than non-cognitive mechanisms. To the extent that indirect effects exist, they appear more likely to operate through non-cognitive pathways, which are weaker and less consistently associated with problem behaviors in this cohort.
Implications
These findings offer several important implications for research, policy, and intervention. First, the robust associations between cognitive genetic factors and problematic socioemotional behaviors highlight the potential of cognitive-based interventions—such as those targeting executive functioning, attentional control, and self-regulation—not only to support academic achievement but also to improve behavioral outcomes. This reinforces our understanding about the interconnected nature of cognitive and socio-emotional development.
Second, although reporter differences in genetic associations were modest and statistically insignificant, the distinct patterns observed for internalizing versus externalizing traits suggest that multi-informant approaches remain valuable. Self-reports may be more sensitive to internal emotional states, while parental reports more accurately reflect external behaviors. Future assessments and interventions should consider the complementary insights offered by each perspective.
Third, the observed sex differences in genetic effects—particularly the stronger associations found among girls for internalizing symptoms—point to the need for sex-sensitive intervention strategies. Tailoring supports to the differential genetic and environmental influences across sexes may improve the effectiveness of socio-emotional development programs.
Lastly, while we find little robust evidence for indirect genetic effects, some of our results attenuations suggest that environmentally mediated effects may exist, particularly through non-cognitive pathways. In more egalitarian societies like the Netherlands, with parenting norms that emphasize autonomy and peer influence, such effects may be attenuated, highlighting the need for cross-national and school-based studies to examine how social and cultural contexts shape the strength of gene–environment correlations.
Limitations and Future Directions
Despite its strengths, this study has several important limitations. First, although PGIs offer a powerful tool for estimating genetic influences, they currently explain only a modest proportion of the variance in socio-emotional outcomes. The EA-related PGIs used here—particularly the non-cognitive component—may not fully capture the relevant genetic architecture of socioemotional behaviors. Continued refinement of GWAS discovery efforts may help to improve predictive power.
Second, the behavioral phenotypes analyzed were derived from standardized ASEBA instruments (CBCL and YSR), which, while validated and widely used, may not encompass the full spectrum of socio-emotional behaviors. These instruments may miss subtle, situational, or culturally specific behaviors, and rely on self- or parent-reporting, which can introduce measurement error and informant biases. The absence of teacher-reported or observational data may limit the detection of environmentally mediated or context-dependent behaviors. Future work could benefit from integrating broader behavioral assessments.
Third, while we observe sex differences in genetic associations, the underlying mechanisms remain unclear. It is uncertain whether these differences reflect developmental trajectories, environmental buffering, or sex-specific gene–environment interactions. Addressing these questions will require more targeted examination of moderating factors and longitudinal developmental processes.
Fourth, our sample size—though substantial for trio-based analyses—remains modest relative to the statistical power typically required in polygenic research. Simulation-based power analyses (see Appendix) indicate that our design is sufficiently powered (≥ 99.8%) to detect modest genetic effects (β ≥ 0.10), even at the lower end of our sample range (N = 3,090). However, statistical power is meaningfully reduced for smaller effect sizes, particularly for subgroup analyses. As such, null or inconsistent findings for non-cognitive PGIs or reporter differences may reflect limitations in statistical power rather than the absence of true underlying effects. Larger samples, ideally spanning multiple cohorts, are needed to strengthen the robustness and generalizability of these findings.
Lastly, our findings are based on a single cohort drawn from the Netherlands—a country characterized by relatively low socioeconomic inequality and parenting norms that emphasize autonomy. These contextual features may attenuate indirect genetic effects and limit the generalizability of our findings. While the within-family design controls for many confounders, broader influences—such as school environments, peer dynamics, and community context—are not directly captured. Future research should seek to replicate these findings across more diverse populations, including cohorts of non-European ancestry and different sociocultural backgrounds, to better understand the universality or specificity of genetic effects on socio-emotional development.

Table 1:: Simulation-based power estimates by sample size (N), effect size (β), and observed intra-cluster correlation (ICC).
| N | Clusters | β | Mean ICC | Power |
|---|---|---|---|---|
| 3,090 | 1,931 | 0.05 | 0.083 | 0.556 |
| 3,090 | 1,931 | 0.10 | 0.083 | 0.998 |
| 3,090 | 1,931 | 0.15 | 0.080 | 1.000 |
| 3,090 | 1,931 | 0.20 | 0.080 | 1.000 |
| 3,090 | 1,931 | 0.25 | 0.079 | 1.000 |
| 3,090 | 1,931 | 0.30 | 0.078 | 1.000 |
| 4,510 | 2,469 | 0.05 | 0.082 | 0.758 |
| 4,510 | 2,469 | 0.10 | 0.082 | 1.000 |
| 4,510 | 2,469 | 0.15 | 0.080 | 1.000 |
| 4,510 | 2,469 | 0.20 | 0.081 | 1.000 |
| 4,510 | 2,469 | 0.25 | 0.078 | 1.000 |
| 4,510 | 2,469 | 0.30 | 0.076 | 1.000 |
References
- T. M. Achenbach. Manual for the ASEBA School-Age Forms & Profiles: An Integrated System of Multi-Informant Assessment., 2001
- P. B. Barr, J. E. Salvatore, L. Wetherill, A. Anokhin, G. Chan, H. J. Edenberg, S. Kuperman, J. Meyers, J. Nurnberger, B. Porjesz. A family-based genome wide association study of externalizing behaviors.. Behavior genetics,, 2020. [PubMed]
- J. Becker, C. A. Burik, G. Goldman, N. Wang, H. Jayashankar, M. Bennett, D. W. Belsky, R. Karlsson Linnér, R. Ahlskog, A. Kleinman. Resource profile and user guide of the polygenic index repository.. Nature human behaviour,, 2021
- B. Belfi, L. Borghans. Handbook of Education and Work, pages, 2025
- D. W. Belsky, K. P. Harden. Phenotypic annotation: using polygenic scores to translate discoveries from genome-wide association studies from the top down.. Current Directions in Psychological Science,, 2019. [PubMed]
- D. W. Belsky, T. E. Moffitt, D. L. Corcoran, B. Domingue, H. Harrington, S. Hogan, R. Houts, S. Ramrakha, K. Sugden, B. S. Williams. The genetics of success: How single-nucleotide polymorphisms associated with educational attainment relate to life-course development.. Psychological science,, 2016. [PubMed]
- J. R. Best, P. H. Miller. A developmental perspective on executive function.. Child development,, 2010. [PubMed]
- P. Biroli, T. Galama, S. von Hinke, H. Van Kippersluis, C. A. Rietveld, K. Thom. The economics and econometrics of gene–environment interplay.. Review of Economic Studies, page, 2025
- E. L. de Zeeuw, J.-J. Hottenga, K. G. Ouwens, C. V. Dolan, E. A. Ehli, G. E. Davies, D. I. Boomsma, E. van Bergen. Intergenerational transmission of education and adhd: effects of parental genotypes.. Behavior Genetics,, 2020. [PubMed]
- P. A. Demange, J. J. Hottenga, A. Abdellaoui, E. M. Eilertsen, M. Malanchini, B. W. Domingue, E. Armstrong-Carter, E. L. de Zeeuw, K. Rimfeld, D. I. Boomsma. Estimating effects of parents’ cognitive and non-cognitive skills on offspring education using polygenic scores.. Nature communications,, 2022
- P. A. Demange, M. Malanchini, T. T. Mallard, P. Biroli, S. R. Cox, A. D. Grotzinger, E. M. Tucker-Drob, A. Abdellaoui, L. Arseneault, E. Van Bergen. Investigating the genetic architecture of noncognitive skills using gwas-by-subtraction.. Nature genetics,, 2021. [PubMed]
- S. Feng, Y. Han, J. J. Heckman, T. Kautz. Comparing the reliability and predictive power of child, teacher, and guardian reports of noncognitive skills.. Proceedings of the National Academy of Sciences,, 2022
- S. Franić, C. V. Dolan, D. Borsboom, C. E. van Beijsterveldt, D. I. Boomsma. Three-and-a-half-factor model? the genetic and environmental structure of the cbcl/6–18 internalizing grouping.. Behavior genetics,, 2014. [PubMed]
- K. P. Harden. The genetic lottery: Why DNA matters for social equality., 2021
- J. J. Heckman, T. Kautz. Hard evidence on soft skills.. Labour economics,, 2012. [PubMed]
- J. J. Heckman, Y. Rubinstein. The importance of noncognitive skills: Lessons from the ged testing program.. American economic review,, 2001
- J. J. Heckman, J. Stixrud, S. Urzua. The effects of cognitive and noncognitive abilities on labor market outcomes and social behavior.. Journal of Labor economics,, 2006
- B. M. Hicks, R. F. Krueger, W. G. Iacono, M. McGue, C. J. Patrick. Family transmission and heritability of externalizing disorders: a twin-family study.. Archives of general psychiatry,, 2004. [PubMed]
- E. S. Jami, A. R. Hammerschlag, H. F. Ip, A. G. Allegrini, B. Benyamin, R. Border, E. W. Diemer, C. Jiang, V. Karhunen, Y. Lu. Genome-wide association meta-analysis of childhood and adolescent internalizing symptoms.. Journal of the American Academy of Child & Adolescent Psychiatry,, 2022. [PubMed]
- R. Karlsson Linnér, T. T. Mallard, P. B. Barr, S. Sanchez-Roige, J. W. Madole, M. N. Driver, H. E. Poore, R. de Vlaming, A. D. Grotzinger, J. J. Tielbeek. Multivariate analysis of 1.5 million people identifies genetic associations with traits related to self-regulation and addiction.. Nature neuroscience,, 2021. [PubMed]
- M. C. Keller. Gene× environment interaction studies have not properly controlled for potential confounders: the problem and the (simple) solution.. Biological psychiatry,, 2014. [PubMed]
- K. S. Kendler, S. L. Lönn, H. H. Maes, P. Lichtenstein, J. Sundquist, K. Sundquist. A swedish population-based multivariate twin study of externalizing disorders.. Behavior genetics,, 2016. [PubMed]
- K. S. Kendler, J. Myers. The boundaries of the internalizing and externalizing genetic spectra in men and women.. Psychological Medicine,, 2014. [PubMed]
- J. J. Lee, R. Wedow, A. Okbay, E. Kong, O. Maghzian, M. Zacher, T. A. Nguyen-Viet, P. Bowers, J. Sidorenko, R. Karlsson Linnér. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals.. Nature genetics,, 2018. [PubMed]
- L. R. Lloyd-Jones, J. Zeng, J. Sidorenko, L. Yengo, G. Moser, K. E. Kemper, H. Wang, Z. Zheng, R. Magi, T. Esko. Improved polygenic prediction by bayesian multiple regression on summary statistics.. Nature communications,, 2019
- C. MacCann, Y. Jiang, L. E. Brown, K. S. Double, M. Bucich, A. Minbashian. Emotional intelligence predicts academic performance: A meta-analysis.. Psychological bulletin,, 2020. [PubMed]
- M. Malanchini, A. G. Allegrini, M. G. Nivard, P. Biroli, K. Rimfeld, R. Cheesman, S. von Stumm, P. A. Demange, E. van Bergen, A. D. Grotzinger. Genetic associations between non-cognitive skills and academic achievement over development.. Nature human behaviour,, 2024
- A. T. Marees, H. De Kluiver, S. Stringer, F. Vorspan, E. Curis, C. Marie-Claire, E. M. Derks. A tutorial on conducting genome-wide association studies: Quality control and statistical analysis.. International journal of methods in psychiatric research,, 2018. [PubMed]
- M. C. Mills, N. Barban, F. C. Tropf. An introduction to statistical genetic data analysis., 2020
- A. Nikstat, R. Riemann. On the etiology of internalizing and externalizing problem behavior: A twin-family study.. PLoS One,, 2020. [PubMed]
- I. M. Nolte. Metasubtract: An r-package to analytically produce leave-one-out meta-analysis gwas summary statistics.. Bioinformatics,, 2020. [PubMed]
- A. Okbay, Y. Wu, N. Wang, H. Jayashankar, M. Bennett, S. M. Nehzati, J. Sidorenko, H. Kweon, G. Goldman, T. Gjorgjieva. Polygenic prediction of educational attainment within and between families from genome-wide association analyses in 3 million individuals.. Nature genetics,, 2022. [PubMed]
- E. Smith-Woolley, S. Selzam, R. Plomin. Polygenic score for educational attainment captures dna variants shared between personality traits and educational achievement.. Journal of Personality and Social Psychology,, 2019. [PubMed]
- P. T. Tanksley, S. J. Brislin, J. Wertz, R. de Vlaming, N. S. Courchesne-Krak, T. T. Mallard, L. L. Raffington, R. Karlsson Linnér, P. Koellinger, A. A. Palmer. Do polygenic indices capture “direct” effects on child externalizing behavior problems? Within-family analyses in two longitudinal birth cohorts.. Clinical Psychological Science,, 2025. [PubMed]
- A. I. Young, S. M. Nehzati, S. Benonisdottir, A. Okbay, H. Jayashankar, C. Lee, D. Cesarini, D. J. Benjamin, P. Turley, A. Kong. Mendelian imputation of parental genotypes improves estimates of direct genetic effects.. Nature Genetics,, 2022. [PubMed]
