Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

The combined impact of common and rare exonic variants in COVID-19 host genetics is currently insufficiently understood. Here, common and rare variants from whole-exome sequencing data of about 4000 SARS-CoV-2-positive individuals were used to define an interpretable machine-learning model for predicting COVID-19 severity. First, variants were converted into separate sets of Boolean features, depending on the absence or the presence of variants in each gene. An ensemble of LASSO logistic regression models was used to identify the most informative Boolean features with respect to the genetic bases of severity. The Boolean features selected by these logistic models were combined into an Integrated PolyGenic Score that offers a synthetic and interpretable index for describing the contribution of host genetics in COVID-19 severity, as demonstrated through testing in several independent cohorts. Selected features belong to ultra-rare, rare, low-frequency, and common variants, including those in linkage disequilibrium with known GWAS loci. Noteworthily, around one quarter of the selected genes are sex-specific. Pathway analysis of the selected genes associated with COVID-19 severity reflected the multi-organ nature of the disease. The proposed model might provide useful information for developing diagnostics and therapeutics, while also being able to guide bedside disease management.

Original publication

DOI

10.1007/s00439-021-02397-7

Type

Journal article

Journal

Human genetics

Publication Date

01/2022

Volume

141

Pages

147 - 173

Addresses

Department of Medical Biotechnologies, Med Biotech Hub and Competence Center, University of Siena, Siena, Italy.

Keywords

WES/WGS Working Group Within the HGI, GenOMICC Consortium, GEN-COVID Multicenter Study, Humans, Genetic Predisposition to Disease, Severity of Illness Index, Cohort Studies, Phenotype, Polymorphism, Single Nucleotide, Adult, Aged, Aged, 80 and over, Middle Aged, Quebec, Germany, Italy, Sweden, Female, Male, United Kingdom, Whole Exome Sequencing, COVID-19, SARS-CoV-2