Copy the page URI to the clipboard
Christiansen, Colette E.; Arathimos, Ryan; Pain, Oliver; Molokhia, Mariam; Bell, Jordana T. and Lewis, Cathryn M.
(2023).
DOI: https://doi.org/10.1093/hmg/ddad093
Abstract
Type 2 diabetes (T2D) is a heterogeneous illness caused by genetic and environmental factors. Previous genome wide association studies (GWAS) have identified many genetic variants associated with T2D and found evidence of differing genetic profiles by age-at-onset. This study seeks to explore further the genetic and environmental drivers of T2D by analysing subgroups based on age-at-onset of diabetes and body mass index (BMI). In UK Biobank, 36 494 T2D cases were stratified into 3 subgroups and GWAS performed for all T2D cases and for each subgroup relative to 421 021 controls. Altogether, 18 SNPs significantly associated genome-wide with T2D in one or more subgroups also showed evidence of heterogeneity between the subgroups, (Cochrane’s Q p < 0.01) with 2 remaining significant after multiple testing (in CDKN2B and CYTIP). Combined risk scores, based on genetic profile, BMI and age, resulted in excellent diabetes prediction (AUC = 0.92). A modest improvement in prediction (AUC = 0.93) was seen when the contribution of genetic and environmental factors was evaluated separately for each subgroup. Increasing sample sizes of genetic studies enables us to stratify disease cases into subgroups which have sufficient power to highlight areas of genetic heterogeneity. Despite some evidence that optimising combined risk scores by subgroup improves prediction, larger sample sizes are likely needed for prediction when using a stratification approach.