Advertisement
Original Article| Volume 3, ISSUE 6, P751-757, June 2021

Evaluating Polygenic Risk Scores in “Lone” Atrial Fibrillation

Open AccessPublished:February 05, 2021DOI:https://doi.org/10.1016/j.cjco.2021.02.001

      Abstract

      Background

      Polygenic scores incorporating varying numbers of single nucleotide polymorphisms (SNPs) have been demonstrated to exert a prominent role in atrial fibrillation (AF). We sought to compare the relative discriminatory capacities of 2 previously validated polygenic scores in “lone” AF.

      Methods

      A total of 186 lone AF cases of European ancestry underwent SNP genotyping. A genome-wide polygenic score (GPS) and polygenic risk score (PRS) involving 6,730,541 and 1168 SNPs, respectively, were calculated for 186 cases and 423 controls of European ancestry from the 1000 Genomes (1KG) Project. The distribution of the polygenic scores was compared between the cases and controls and their discriminatory capacities were evaluated using receiver operating characteristic (ROC) curves.

      Results

      A total of 34.4% of patients with lone AF had GPS scores greater than the top 10th percentile of 1KG controls, corresponding to a 4.64-fold increased odds (95% confidence interval [CI], 2.99-7.18; P < 0.001) for AF. A PRS score in the top 10th percentile of 1KG controls was observed in 26.3% of cases, which equated to a 3.16-fold increased odds (95% CI, 2.01-4.98; P < 0.001) for AF. Comparison of C-statistics from ROC curves indicated improved discriminatory capacity of the GPS (0.76) relative to the PRS (0.70) (P = 0.002).

      Conclusions

      Our study evaluating 2 polygenic scores for AF suggests that the GPS, containing more than 6.7 million SNPs, exhibits an improved discriminatory capacity in lone AF compared with a PRS possessing 1168 SNPs. Our findings suggest that genetic risk scores for AF that maximally leverage genomic data may provide improved predictive power.

      Résumé

      Contexte

      Il a été démontré que des scores polygéniques intégrant un nombre variable de polymorphismes mononucléotidiques (PMN) jouent un rôle important en ce qui concerne la fibrillation auriculaire (FA). Nous avons comparé le potentiel discriminatoire relatif de deux scores polygéniques déjà validés dans la FA idiopathique.

      Méthodologie

      Au total, 186 sujets d’ascendance européenne atteints de FA idiopathique ont été soumis à un génotypage des PMN. Un score polygénique génomique (SPG) et un score de risque polygénique (SRP) comprenant respectivement 6 730 541 et 1 168 PMN ont été calculés pour les 186 sujets et pour 423 témoins d’ascendance européenne dont les données sont tirées du projet 1000 Genomes (1KG). Les distributions des scores polygéniques des sujets et des témoins ont été comparées, et leur potentiel discriminatoire a été évalué au moyen des courbes caractéristiques de la performance d’un test (courbes ROC, de l’anglais Receiver Operating Characteristic).

      Résultats

      Au total, 34,4 % des patients atteints de FA idiopathique avaient un SPG supérieur à celui des témoins du 10e centile supérieur du projet 1KG, ce qui représente une probabilité de FA 4,64 fois plus élevée (intervalle de confiance [IC] à 95 % : 2,99 à 7,18; p < 0,001). Un SRP situé dans le 10e centile supérieur des témoins du projet 1KG a été observé chez 26,3 % des patients atteints de FA, soit une probabilité de FA 3,16 fois plus élevée (IC à 95 % : 2,01 à 4,98; p < 0,001). Les résultats de la comparaison des statistiques C des courbes ROC indiquent que le SPG (0,76) a un potentiel discriminatoire supérieur à celui du SRP (0,70) (p = 0,002).

      Conclusions

      Les résultats de notre étude de deux scores polygéniques relatifs à la FA indiquent que le potentiel discriminatoire du SPG, qui comprend plus de 6,7 millions de PMN, pour prédire une FA idiopathique est supérieur à celui du SRP, qui comprend 1 168 PMN. Ces résultats indiquent que les scores de risque génétique de FA qui exploitent pleinement les données génomiques pourraient avoir un pouvoir prédictif supérieur.
      Rare and common genetic variants have been shown to affect the risk of developing atrial fibrillation (AF).
      • Choi S.H.
      • Jurgens S.J.
      • Weng L.C.
      • et al.
      Monogenic and polygenic contributions to atrial fibrillation risk: results from a national biobank.
      Notably, 22% of AF heritability has been suggested to be explained primarily by the cumulative effects of common variants or single nucleotide polymorphisms (SNPs), highlighting the significant polygenic burden of AF, with a minimal, yet important, contribution of rare variants with large effect size such as those found in the structural gene titin (TTN).
      • Choi S.H.
      • Jurgens S.J.
      • Weng L.C.
      • et al.
      Monogenic and polygenic contributions to atrial fibrillation risk: results from a national biobank.
      • Weng L.C.
      • Choi S.H.
      • Klarin D.
      • et al.
      Heritability of atrial fibrillation.
      • Ahlberg G.
      • Refsgaard L.
      • Lundegaard P.R.
      • et al.
      Rare truncating variants in the sarcomeric protein titin associate with familial and early-onset atrial fibrillation.
      To date, genome-wide association studies (GWAS) have identified more than 100 SNPs that significantly associate with AF susceptibility.
      • Roselli C.
      • Chaffin M.D.
      • Weng L.C.
      • et al.
      Multi-ethnic genome-wide association study for atrial fibrillation.
      ,
      • Nielsen J.B.
      • Thorolfsdottir R.B.
      • Fritsche L.G.
      • et al.
      Biobank-driven genomic discovery yields new insight into atrial fibrillation biology.
      In this context, polygenic scores that incorporate multiple small-effect SNPs and identify a proportion of subjects from the general population at an increased risk for AF, are viewed as harbouring great clinical utility, given the potential to enhance screening and prevention therapies.
      • Khera A.V.
      • Chaffin M.
      • Aragam K.G.
      • et al.
      Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations.
      Notably, however, three-quarters of AF genetic risk has been identified to be driven by common variants in regions that have yet to satisfy the stringent thresholds for GWAS statistical significance.
      • Weng L.C.
      • Choi S.H.
      • Klarin D.
      • et al.
      Heritability of atrial fibrillation.
      Recognizing that sizeable amounts of genomic data may not be adequately leveraged at present, contemporary genetic risk scores have begun to include loci that have yet to reach GWAS levels of significance, in an effort to better estimate AF susceptibility.
      • Choi S.H.
      • Jurgens S.J.
      • Weng L.C.
      • et al.
      Monogenic and polygenic contributions to atrial fibrillation risk: results from a national biobank.
      ,
      • Khera A.V.
      • Chaffin M.
      • Aragam K.G.
      • et al.
      Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations.
      ,
      • Weng L.C.
      • Preis S.R.
      • Hulme O.L.
      • et al.
      Genetic predisposition, clinical risk factor burden, and lifetime risk of atrial fibrillation.
      “Lone” AF represents a subtype of the arrhythmia that develops in the absence of identifiable clinical risk factors.
      • Fuster V.
      • Ryden L.E.
      • Cannom D.S.
      • et al.
      ACC/AHA/ESC 2006 guidelines for the management of patients with atrial fibrillation: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines and the European Society of Cardiology Committee for Practice Guidelines (Writing Committee to Revise the 2001 Guidelines for the Management of Patients With Atrial Fibrillation): developed in collaboration with the European Heart Rhythm Association and the Heart Rhythm Society.
      Although previously criticized because of its heterogeneous definitions and our improved ability to identify predisposing clinical risk factors, such cases are considered to harbour a greater genetic contribution relative to more common forms of the arrhythmia that develop in the setting of structural heart disease or other established risk factors.
      • Wyse D.G.
      • Van Gelder I.C.
      • Ellinor P.T.
      • et al.
      Lone atrial fibrillation: does it exist?.
      We previously explored the role of genetic risk scores in a lone AF cohort and a locally procured healthy control set.
      • Lazarte J.D.
      • McIntyre J.S.
      • Skanes A.D.
      • Gula A.C.
      • et al.
      The role of common genetic variation in “lone” atrial fibrillation.
      We found that genetic risk scores developed by both Weng et al. (2018; an ~1000 SNP polygenic risk score [PRS]) and Khera et al. (2018; ~6 million genome-wide polygenic risk score [GPS]), identified a significant portion of patients with lone AF with an elevated polygenic score compared with healthy controls.
      • Khera A.V.
      • Chaffin M.
      • Aragam K.G.
      • et al.
      Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations.
      ,
      • Weng L.C.
      • Preis S.R.
      • Hulme O.L.
      • et al.
      Genetic predisposition, clinical risk factor burden, and lifetime risk of atrial fibrillation.
      ,
      • Lazarte J.D.
      • McIntyre J.S.
      • Skanes A.D.
      • Gula A.C.
      • et al.
      The role of common genetic variation in “lone” atrial fibrillation.
      However, no difference was observed when comparing the discriminatory capacity of each score, perhaps secondary to inadequate statistical power. In an effort to address this limitation and attempt to discern which harbours the greatest predictive power—a concept that has yet to be evaluated in the AF literature—we sought to evaluate the behaviour of these 2 validated risk scores in lone AF, using a large publicly available dataset as a control cohort.

      Methods

      AF study cohort

      Patients referred for AF management at the London Health Sciences Centre, London, Ontario, Canada, and St. Paul’s Hospital, Vancouver, British Columbia, Canada, with AF, in the absence of known clinical risk factors, before 60 years of age, defined as lone AF, were recruited to the study. At least 1 episode of electrocardiographically documented AF, characterized by erratic atrial activity without distinct P waves and irregularly irregular QRS intervals lasting > 30 seconds, was required per patient. Exclusion criteria consisted of known risk factors for AF, including hypertension, coronary artery disease, left-ventricular ejection fraction < 50% or a history of clinical heart failure, moderate to severe valvular heart disease, hyperthyroidism, obstructive sleep apnea, and presence of inherited cardiomyopathy. All participants had a clinical history, physical examination, 12-lead electrocardiogram (ECG), and echocardiogram. A positive family history of AF was defined as presence of the arrhythmia in a first- or second-degree relative.

      Control cohort

      The control cohort was derived from the 1000 Genomes (1KG) Project, a publicly available multiancestry cohort of 1756 persons above the age of 18 who self-reported as healthy.
      • Genomes Project C.
      • Auton A.
      • Brooks L.D.
      • et al.
      A global reference for human genetic variation.
      SNP genotyping of the 1KG cohort was performed using the Illumina Omni 2.5 M DNA microarray (Illumina, San Diego, CA). Analyses were restricted to 423 persons of European (non-Finnish) ancestry following principal component analysis, described as follows.
      • Genomes Project C.
      • Auton A.
      • Brooks L.D.
      • et al.
      A global reference for human genetic variation.
      ,
      • Roslin N.M.L.W.
      • Paterson A.D.
      • Strug L.J.
      Quality control analysis of the 1000 Genomes Project Omni2.5 genotypes (Abstract/Program #576/F).

      DNA preparation and microarray genotyping of AF cases

      Genomic DNA for lone AF cases was isolated using the Puregene DNA Blood Kit (Gentra Systems, Qiagen Inc., Mississauga, ON). Microarray analysis was performed with Infinium Global Screening Array-24 v2.0 (Illumina) at Genome Québec, Montréal, Québec, Canada. GenomeStudio software (Illumina) was used to retrieve and export the microarray data to SNP & Variant Suite (SVS) v8.8.3 (Golden Helix Inc, Bozeman, MT). To improve genotyping accuracy, data points were filtered if they had a GenCall score cutoff < 0.15. Microarray data was further cleaned by filtering samples with a < 95% rate of autosomal SNP calls over the total number of SNP calls in the dataset to avoid inappropriate results from faulty genotyping calls (n = 2).
      • Miyagawa T.
      • Nishida N.
      • Ohashi J.
      • et al.
      Appropriate data cleaning methods for genome-wide association study.
      Using X chromosome heterozygosity, samples were removed if positive for sex discordance between clinical data and genotype information (n = 4). Linkage disequilibrium (LD) pruning was applied to all autosomes to prepare the data for identity by descent estimation analysis; samples were removed if estimated Pi-hat (cryptic relatedness) for a sample pair was > = 0.5 consistent with first-degree relatives. One father and son pair was detected, and the father was removed from further analysis. Deviation from Hardy-Weinberg equilibrium was not used as a method to filter SNPs, given that it was unclear if various assumptions were met for both the lone AF and 1KG cohorts, including random mating and sufficiently large population sizes.
      • Roslin N.M.L.W.
      • Paterson A.D.
      • Strug L.J.
      Quality control analysis of the 1000 Genomes Project Omni2.5 genotypes (Abstract/Program #576/F).

      Ancestry correction: European (non-Finnish) subgroup

      Ancestry inference using principal component analysis was performed on SNP & Variant Suite (SVS) v8.8.3 (Golden Helix Inc) for the lone AF cases and controls post-LD pruning using EIGENSTRAT.
      • Price A.L.
      • Patterson N.J.
      • Plenge R.M.
      • Weinblatt M.E.
      • Shadick N.A.
      • Reich D.
      Principal components analysis corrects for stratification in genome-wide association studies.
      Among 5 formulated eigenvalues, the top 3 explained the majority of the stratification (Supplemental Fig. S1). Around the 1KG European (non-Finnish) population cluster, a centroid was mathematically identified, and any case sample that fell outside the 1.5 interquartile range (IQR) was excluded from the analysis, as it was deemed outside the European (non-Finnish) population cluster. Among a total of 240 lone AF cases that had undergone DNA microarray analysis, 54 were excluded on this basis.

      Imputation

      Genotype imputation of DNA microarray data was performed for both cohorts to increase genomic coverage using the Michigan Imputation Server.
      • Das S.
      • Forer L.
      • Schonherr S.
      • et al.
      Next-generation genotype imputation service and methods.
      Imputation was performed using the Minimac4 1.2.4 imputation algorithm with Haplotype Reference Consortium r1.1 (cases) and 1KG Phase 3 v5 (1KG controls) reference set. Only SNPs that were genotyped or imputed with r2 > 0.3 were used for score calculation.

      Polygenic score calculation

      Two validated AF polygenic scores were calculated: a GPS developed by Khera et al. and a PRS developed by Weng et al. (both in 2018), hereafter referred to as "GPS" and "PRS," respectively.
      • Khera A.V.
      • Chaffin M.
      • Aragam K.G.
      • et al.
      Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations.
      ,
      • Weng L.C.
      • Preis S.R.
      • Hulme O.L.
      • et al.
      Genetic predisposition, clinical risk factor burden, and lifetime risk of atrial fibrillation.
      The GPS was derived by Khera and colleagues, using the LDPred algorithm and association data from a previous genome-wide association study for AF, using separate testing and validation datasets from the UK Biobank.
      • Khera A.V.
      • Chaffin M.
      • Aragam K.G.
      • et al.
      Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations.
      ,
      • Christophersen I.E.
      • Rienstra M.
      • Roselli C.
      • et al.
      Large-scale analyses of common and rare variants identify 12 new loci associated with atrial fibrillation.
      The PRS developed by Weng and colleagues also used association data from the same AF genome-wide association study, however, used pruning and thresholding at various tuning parameters for its derivation.
      • Weng L.C.
      • Preis S.R.
      • Hulme O.L.
      • et al.
      Genetic predisposition, clinical risk factor burden, and lifetime risk of atrial fibrillation.
      ,
      • Christophersen I.E.
      • Rienstra M.
      • Roselli C.
      • et al.
      Large-scale analyses of common and rare variants identify 12 new loci associated with atrial fibrillation.
      A total of 30 candidate scores were developed and tested within the UK Biobank dataset and the optimal one was identified on the basis of its goodness-of-fit in accordance with the Akaike’s Information Criterion.
      • Weng L.C.
      • Preis S.R.
      • Hulme O.L.
      • et al.
      Genetic predisposition, clinical risk factor burden, and lifetime risk of atrial fibrillation.
      Subjects were scored by counting genetic dosages of imputed variants using the --score option in PLINK2.0 and Wrapper Python script.
      • Khera A.V.
      • Chaffin M.
      • Zekavat S.M.
      • et al.
      Whole-genome sequencing to characterize monogenic and polygenic contributions in patients hospitalized with early-onset myocardial infarction.
      For each variant, the number of risk alleles present is multiplied by its respective weight, and the products for each variant are added to generate the final score. In total, 5,978,070 of 6,730,541 variants (88.82%) were available for the GPS and 872 of 1168 variants (74.66%) for the PRS in both cohorts.

      Statistical analysis

      GPS/PRS distributions were assessed for normality using the D'Agostino-Pearson omnibus K2 test. Odds ratios (ORs) were calculated by comparing proportions of subjects in the top 10, 5, and 1 percentiles of the GPS/PRS using 2-by-2 contingency tables with Fisher’s exact test. The performance of the GPS and PRS for discerning AF cases vs controls was assessed using receiver operating characteristic (ROC) curves with 1KG controls as the reference and compared in R version 3.5.1 (R Core Team, 2018) with pROC (version 1.7.2).
      • Robin X.
      • Turck N.
      • Hainard A.
      • et al.
      pROC: an open-source package for R and S+ to analyze and compare ROC curves.
      Impact of a high GPS/PRS score (in the top 10th percentile) on age at diagnosis (divided into quartiles: < 37, 37-45, 46-51, > 51) among lone AF cases was assessed using the χ2 test and the χ2 test for trend. Evaluation for a different likelihood of possessing a high GPS/PRS score (in the top 10th percentile) in lone AF cases by sex and by body mass index (BMI) > 30 kg/m2 was assessed using 2-by-2 contingency tables with Fisher’s exact test. Unless otherwise stated, all statistical analyses were conducted using GraphPad Prism 8 for Windows (version 8.3.1; GraphPad Software, San Diego California). Statistical significance was defined as P < 0.05.

      Results

      Characteristics of lone AF cases

      Demographic and clinical characteristics of 186 lone AF cases are described in Table 1. The mean age at AF diagnosis was 44.5 ± 9.8 years, and 157 (80.5%) patients were male. Twenty-two (11.3%) study participants had persistent AF at the time of diagnosis, whereas the remainder presented with paroxysmal AF. The mean left-atrial diameter on echocardiography at the time of presentation was 4.0 ± 0.6 cm.
      Table 1Clinical Characteristics of the “Lone” AF Cohort
      Clinical variable“Lone” AF cohort n = 186
      Age at diagnosis (years)44.26 ± 9.85
      Sex (male)151 (81.2%)
      Body mass index (kg/m2)
      Values for body mass index, family history of AF, PR-interval, and left-atrial diameter were missing for 1, 1, 2, and 2 cases, respectively.
      27.39 ± 3.92
      Type of AF at diagnosis
       Paroxysmal165 (89.7%)
       Persistent19 (10.3%)
      Family history of AF
      Values for body mass index, family history of AF, PR-interval, and left-atrial diameter were missing for 1, 1, 2, and 2 cases, respectively.
      57 (30.8%)
      ECG values
       PR-interval
      Values for body mass index, family history of AF, PR-interval, and left-atrial diameter were missing for 1, 1, 2, and 2 cases, respectively.
      166.73 ± 25.95
       QRS duration95.27 ± 14.74
       QTc420.60 ± 26.70
      Left-atrial diameter (cm)
      Values for body mass index, family history of AF, PR-interval, and left-atrial diameter were missing for 1, 1, 2, and 2 cases, respectively.
      3.96 ± 0.55
      Data are n (%) or mean ± standard deviation.
      AF, atrial fibrillation; ECG, electrocardiogram.
      Values for body mass index, family history of AF, PR-interval, and left-atrial diameter were missing for 1, 1, 2, and 2 cases, respectively.

      Polygenic scores

      The distributions of the GPS (A) and PRS (B) across the cases and 1KG controls are shown in Figure 1. The GPS had a Gaussian distribution across the 2 groups, whereas the PRS was skewed to higher polygenic scores in the 1KG controls failing the normality test (P = 0.02).
      Figure thumbnail gr1
      Figure 1Distribution of the GPS (A) and PRS (B) percentiles among “lone” AF cases vs 1KG controls. For each boxplot, the horizontal lines represent the following: middle line = the median; the top and bottom line = interquartile range; and the whiskers = the maximum and minimum values within each group. AF, atrial fibrillation; GPS, genome-wide polygenic risk score; PRS, polygenic risk score; 1KG, 1000 genomes.
      In total, 34.4% (64 of 186) of patients with lone AF were in the top 10th percentile of the GPS distribution. Presence of a GPS score within the top 10th percentile of 1KG controls was associated with 4.64-fold increased odds (95% confidence interval [CI], 2.99-7.18; P < 0.001) for AF (Fig. 1A, Table 2). The odds of a score within the top 5% and 1% of the GPS distribution within 1KG controls were 4.22-fold (95% CI, 2.40-7.57; P < 0.0001) and 3.76-fold (95% CI, 1.18-10.30; P = 0.02) more likely among lone AF cases, respectively (Table 2).
      Table 2Proportion of “lone” AF cases and odds of possessing a GPS/PRS in the Top 10, 5, and 1 Percentiles
      Percentile of the polygenic score corresponds to the distribution of the 1KG controls.
      High GPS definition“Lone” AF cases1KG controlsOdds ratio95% CIP value
      Top 10% of distribution64 (34.4%)43 (10.2%)4.642.99-7.18< 0.0001
      Top 5% of distribution35 (18.8%)22 (5.2%)4.222.40-7.57< 0.0001
      Top 1% of distribution8 (4.3%)5 (1.2%)3.761.18-10.300.02
      High PRS definition
      Top 10% of distribution49 (26.3%)43 (10.2%)3.162.00-4.96< 0.0001
      Top 5% of distribution27 (14.5%)22 (5.2%)3.101.71-5.490.0002
      Top 1% of distribution15 (8.1%)5 (1.2%)7.332.61-18.55< 0.0001
      Data are n (%).
      AF, atrial fibrillation; CI, confidence interval; GPS, genome-wide polygenic risk score containing 5,978,070 single nucleotide polymorphisms; PRS, polygenic risk score containing 872 single nucleotide polymorphisms; 1KG, 1000 genomes.
      Percentile of the polygenic score corresponds to the distribution of the 1KG controls.
      Inheriting a PRS score in the top 10th percentile was seen in 26.3% (49 of 186) of patients with lone AF, which conferred a 3.16-fold increased odds (95% CI, 2.01-4.98; P < 0.001) for AF relative to 1KG controls (Fig. 1B, Table 2). The odds of a PRS score within the top 5% and 1% distribution were 3.10-fold (95% CI, 1.71-5.49; P = 0.0002) and 7.33-fold (95% CI, 2.61-18.55; P < 0.0001) more likely among lone AF cases relative to 1KG controls, respectively (Table 2).

      Discriminative capacity of GPS and PRS

      The ability of a high polygenic score to differentiate between lone AF cases and 1KG controls was assessed using ROC curve analysis. The C-statistic for the GPS was 0.76 (95% CI, 0.72-0.80) in comparison with a value of 0.70 (95% CI, 0.65-0.75) for the PRS. The GPS was noted to be superior relative to the PRS in discriminating lone AF cases vs 1KG controls (P = 0.002) (Fig. 2).
      Figure thumbnail gr2
      Figure 2Receiver operating characteristic curves for the GPS (black line) and PRS (grey line) with the 1KG control distribution as the reference. The area under the curve for the GPS (75.9%; 95% CI, 71.9-79.9) was consistent with improved discriminatory capacity relative to the PRS (70.0%; 95% CI, 65.5-74.5; P = 0.002). AUC, area under the curve; CI, confidence interval; GPS, genome-wide polygenic risk score; PRS, polygenic risk score.

      Impact of age and sex on GPS and PRS

      The likelihood of a high polygenic score did not differ on the basis of age at diagnosis (P = 0.65 [GPS], P = 0.98 [PRS]) or sex (P = 0.17 [GPS], P = 0.56 [PRS]) or BMI (P = 0.55 [GPS], P = 0.30 [PRS]) in the lone AF cohort and no statistical trend was identified for age (P = 0.61 [GPS], P = 0.71 [PRS]).

      Discussion

      Our study evaluating the performance characteristics of a GPS containing ∼ 6 million SNPs and a PRS containing ∼ 1000 SNPs demonstrated a 6% improved discriminatory capacity of the GPS over the PRS for distinguishing lone AF cases from healthy controls. To our knowledge, this represents the first time a more comprehensive GPS has been found to exhibit superior predictive performance relative to a validated, but more parsimonious, polygenic score in the setting of AF. These findings highlight the value of maximizing the depth of genetic detail incorporated into polygenic scores designed to identify persons at increased risk of developing or possessing AF. Notably, high scores for both the GPS and PRS, defined as greater than the top 10th percentile for the control population, were present in upward of 25% of lone AF cases, highlighting their potential relevance to a large proportion of individuals affected by the arrhythmia.
      Genome-wide polygenic scores, capturing millions of common variants from the entire genome, have previously been shown to provide a superior capacity to smaller polygenic scores in discerning affected patients from healthy controls across various disease entities.
      • Khera A.V.
      • Chaffin M.
      • Aragam K.G.
      • et al.
      Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations.
      ,
      • Dron J.S.
      • Hegele R.A.
      The evolution of genetic-based risk scores for lipids and cardiovascular disease.
      For example, an ~ 6 million-SNP score outperformed genetic risk scores possessing 50 and 49 thousand SNPs in head-to-head comparisons for prediction of coronary artery disease risk.
      • Khera A.V.
      • Chaffin M.
      • Aragam K.G.
      • et al.
      Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations.
      This is a notable departure from the initial strategy within the genetics field to keep polygenic scores with as few carefully selected SNPs as possible. Indeed, the incrementally improved performance of the ~ 6 million SNP GPS relative to the ~ 1000 SNP PRS in our lone AF cohort is novel in the field but consistent with previous work in other disease entities.
      We previously investigated the role of these genetic risk scores in lone AF using a locally sourced control set (controls = 86).
      • Lazarte J.D.
      • McIntyre J.S.
      • Skanes A.D.
      • Gula A.C.
      • et al.
      The role of common genetic variation in “lone” atrial fibrillation.
      No difference in their discriminatory capacities for distinguishing lone AF cases from healthy controls was observed; however, the relatively small size of the control cohort may have resulted in limited the statistical power. Although use of publicly available datasets as control cohorts must be performed cautiously secondary to their generally stemming from different source populations, coupled with different genotyping platforms often being used, these drawbacks may be counterbalanced by their large size and the resulting improved statistical power provided.
      For the current analysis, we believed it was reasonable to use the 1KG cohort as a control dataset, given that we were evaluating previously validated genetic risk scores rather than deriving our own or attempting to identify novel loci, which should reduce the likelihood of false positive associations. Principal component analysis was used to restrict cases and controls to a uniform genetic ancestry, which should limit potential bias secondary to cohort-selection factors and population stratification. In addition, we applied several additional measures to further minimize biases in the analysis, including filtering out data points with low accuracy, removing low call SNPs before imputation, and only keeping imputed SNPs with high quality. Finally, we also compared the distribution of scores between the 1KG cohort and our locally sourced control set (derived from the same region as our cases and genotyped with the same technology) and identified no difference in the proportion of individuals in the top 10th percentile using 2-by-2 contingency tables with Fisher’s exact test (Supplemental Fig. S2).
      Beyond highlighting the improved utility of genome-wide polygenic scores containing millions of SNPs relative to smaller scale genetic risk scores for lone AF, their relevance to a large proportion of AF cases further alludes to their potential clinical utility. The rapidly expanding prevalence of AF, worldwide and in Canada, has led to a major impetus to try to curb incident cases. Although evidence for prevention of AF through upstream therapies has yet to be established with randomized trials, the use of Mendelian randomization studies has served to bolster the probable causal role of certain AF risk factors, including BMI and increased thyroid activity, suggesting that intervening on these factors may prevent AF.
      • Chatterjee N.A.
      • Giulianini F.
      • Geelhoed B.
      • et al.
      Genetic obesity and the risk of atrial fibrillation: causal estimates from Mendelian randomization.
      • Salem J.E.
      • Shoemaker M.B.
      • Bastarache L.
      • et al.
      Association of thyroid function genetic predictors with atrial fibrillation: a phenome-wide association study and inverse-variance weighted average meta-analysis.
      • Ellervik C.
      • Roselli C.
      • Christophersen I.E.
      • et al.
      Assessment of the relationship between genetic determinants of thyroid function and atrial fibrillation: a Mendelian randomization study.
      Genetic risk scores may enable targeted delivery of therapies to individuals at greatest likelihood to benefit: indeed, in particular to those with substantial genetic burdens.
      • Andrade J.G.
      • Aguilar M.
      • Atzema C.
      • et al.
      The 2020 Canadian Cardiovascular Society/Canadian Heart Rhythm Society comprehensive guidelines for the management of atrial fibrillation.
      Moreover, pairing polygenic scores with data from new "wearable" device technology could potentially maximize the clinical utility of both technologies and improve early detection of AF.
      • Perez M.V.
      • Mahaffey K.W.
      • Hedlin H.
      • et al.
      Large-scale assessment of a smartwatch to identify atrial fibrillation.
      Thus, identifying individuals from the general population who are at substantial increased risk of the arrhythmia before its onset may enable effective administration of primary prevention strategies that may allow for early intervention and potentially curb incidence of AF. Hence, in this context, polygenic scores may serve as valuable clinical tools.

      Limitations

      Although our study provides important insight into the value of maximizing the depth of genetic detail polygenic scores in lone AF, it has several limitations. Our study was restricted to European ancestry, partially necessitated by the PRS and GPS scores having been derived in this ancestry, coupled with the allele frequency and effect size of common variants being ancestry specific.
      • Khera A.V.
      • Chaffin M.
      • Aragam K.G.
      • et al.
      Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations.
      ,
      • Weng L.C.
      • Preis S.R.
      • Hulme O.L.
      • et al.
      Genetic predisposition, clinical risk factor burden, and lifetime risk of atrial fibrillation.
      In this context, our findings are not anticipated to be generalizable beyond cohorts of European ancestry, and hence additional polygenic scores will need to be developed for this purpose. Use of the 1KG dataset was pursued to provide a larger control group; however, we acknowledge the significant potential for bias secondary to different genotyping methods and population substructures. Because of these concerns, we performed principal component analysis to minimize the potential impact of ancestry and batch-effect differences and included the top 3 principal components in the outlier analysis to formulate our final cohort of 186 subjects. Although our study sample size was an important limiting factor for a contemporary investigation into the complex genetics of a common disease, the statistically significant findings for our primary hypotheses highlight that our statistical power was adequate, although insufficient power likely precluded meaningful assessment for interactions between clinical risk factors and the polygenic scores in relation to AF risk. Indeed, 1KG control participants self-reported as healthy, but it is conceivable that some had undetected AF; however, the likelihood of AF under ascertainment would not be anticipated to be affected by GPS/PRS values. The corresponding nondifferential misclassification of the outcome among controls would only serve to reduce our statistical power secondary to bias toward the null rather than resulting in spurious false positive associations. Finally, the failure of the normality test for the 1KG controls PRS score distribution may have potentially biased the discriminatory capacity of the PRS score toward the null. Although no significant difference in subjects in the top 10th percentile was encountered between 1KG and locally sourced controls (Supplemental Fig. S2), the skew toward higher scores in the 1KG controls may have diminished the PRS discriminatory capacity. Given these collective limitations, future replication in an independent lone AF cohort of European ancestry will be critical for validation of our current findings.

      Conclusions

      Our study findings suggest that genome-wide polygenic scores, capturing millions of common variants from the entire genome, provide a superior discriminatory capacity compared with smaller polygenic scores in lone AF. Given their relevance to a large proportion of lone AF cases, integration of genome-wide polygenic scores into clinical practice may facilitate identification of persons at risk of developing AF, potentially leading to improved care.

      Funding Sources

      J.L. is supported by the Canadian Institutes of Health Research (Doctoral Research Award) and the Schulich School of Medicine and Dentistry (Cobban Student Award in Heart and Stroke Research). R.A.H. is supported by the Jacob J. Wolfe Distinguished Medical Research Chair, the Edith Schulich Vinet Canada Research Chair in Human Genetics, the Martha G. Blackburn Chair in Cardiovascular Research, and operating grants from the Canadian Institutes of Health Research (Foundation Grant) and the Heart and Stroke Foundation of Ontario. J.D.R. is supported by the Marianne Barrie Philanthropic Fund, the Heart and Stroke Foundation of Canada, and the Canadian Cardiovascular Society Atrial Fibrillation Award.

      Disclosures

      R.A.H. has received consulting fees from Acasti, Aegerion, Akcea/Ionis, Amgen, HLS Therapeutics, and Sanofi. The other authors have no conflicts of interest to disclose.

      Supplementary Material

      References

        • Choi S.H.
        • Jurgens S.J.
        • Weng L.C.
        • et al.
        Monogenic and polygenic contributions to atrial fibrillation risk: results from a national biobank.
        Circ Res. 2020; 126: 200-209
        • Weng L.C.
        • Choi S.H.
        • Klarin D.
        • et al.
        Heritability of atrial fibrillation.
        Circ Cardiovasc Genet. 2017; 10
        • Ahlberg G.
        • Refsgaard L.
        • Lundegaard P.R.
        • et al.
        Rare truncating variants in the sarcomeric protein titin associate with familial and early-onset atrial fibrillation.
        Nat Commun. 2018; 9: 4316
        • Roselli C.
        • Chaffin M.D.
        • Weng L.C.
        • et al.
        Multi-ethnic genome-wide association study for atrial fibrillation.
        Nat Genet. 2018; 50: 1225-1233
        • Nielsen J.B.
        • Thorolfsdottir R.B.
        • Fritsche L.G.
        • et al.
        Biobank-driven genomic discovery yields new insight into atrial fibrillation biology.
        Nat Genet. 2018; 50: 1234-1239
        • Khera A.V.
        • Chaffin M.
        • Aragam K.G.
        • et al.
        Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations.
        Nat Genet. 2018; 50: 1219-1224
        • Weng L.C.
        • Preis S.R.
        • Hulme O.L.
        • et al.
        Genetic predisposition, clinical risk factor burden, and lifetime risk of atrial fibrillation.
        Circulation. 2018; 137: 1027-1038
        • Fuster V.
        • Ryden L.E.
        • Cannom D.S.
        • et al.
        ACC/AHA/ESC 2006 guidelines for the management of patients with atrial fibrillation: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines and the European Society of Cardiology Committee for Practice Guidelines (Writing Committee to Revise the 2001 Guidelines for the Management of Patients With Atrial Fibrillation): developed in collaboration with the European Heart Rhythm Association and the Heart Rhythm Society.
        Circulation. 2006; 114: e257-354
        • Wyse D.G.
        • Van Gelder I.C.
        • Ellinor P.T.
        • et al.
        Lone atrial fibrillation: does it exist?.
        J Am Coll Cardiol. 2014; 63: 1715-1723
        • Lazarte J.D.
        • McIntyre J.S.
        • Skanes A.D.
        • Gula A.C.
        • et al.
        The role of common genetic variation in “lone” atrial fibrillation.
        Circ Genom Precis Med. 2021; e003179
        • Genomes Project C.
        • Auton A.
        • Brooks L.D.
        • et al.
        A global reference for human genetic variation.
        Nature. 2015; 526: 68-74
        • Roslin N.M.L.W.
        • Paterson A.D.
        • Strug L.J.
        Quality control analysis of the 1000 Genomes Project Omni2.5 genotypes (Abstract/Program #576/F).
        66th Annual Meeting of The American Society of Human Genetics, Vancouver, British Columbia, Canada2016
        • Miyagawa T.
        • Nishida N.
        • Ohashi J.
        • et al.
        Appropriate data cleaning methods for genome-wide association study.
        J Hum Genet. 2008; 53: 886-893
        • Price A.L.
        • Patterson N.J.
        • Plenge R.M.
        • Weinblatt M.E.
        • Shadick N.A.
        • Reich D.
        Principal components analysis corrects for stratification in genome-wide association studies.
        Nat Genet. 2006; 38: 904-909
        • Das S.
        • Forer L.
        • Schonherr S.
        • et al.
        Next-generation genotype imputation service and methods.
        Nat Genet. 2016; 48: 1284-1287
        • Christophersen I.E.
        • Rienstra M.
        • Roselli C.
        • et al.
        Large-scale analyses of common and rare variants identify 12 new loci associated with atrial fibrillation.
        Nat Genet. 2017; 49: 946-952
        • Khera A.V.
        • Chaffin M.
        • Zekavat S.M.
        • et al.
        Whole-genome sequencing to characterize monogenic and polygenic contributions in patients hospitalized with early-onset myocardial infarction.
        Circulation. 2019; 139: 1593-1602
        • Robin X.
        • Turck N.
        • Hainard A.
        • et al.
        pROC: an open-source package for R and S+ to analyze and compare ROC curves.
        BMC Bioinformatics. 2011; 12: 77
        • Dron J.S.
        • Hegele R.A.
        The evolution of genetic-based risk scores for lipids and cardiovascular disease.
        Curr Opin Lipidol. 2019; 30: 71-81
        • Chatterjee N.A.
        • Giulianini F.
        • Geelhoed B.
        • et al.
        Genetic obesity and the risk of atrial fibrillation: causal estimates from Mendelian randomization.
        Circulation. 2017; 135: 741-754
        • Salem J.E.
        • Shoemaker M.B.
        • Bastarache L.
        • et al.
        Association of thyroid function genetic predictors with atrial fibrillation: a phenome-wide association study and inverse-variance weighted average meta-analysis.
        JAMA Cardiol. 2019; 4: 136-143
        • Ellervik C.
        • Roselli C.
        • Christophersen I.E.
        • et al.
        Assessment of the relationship between genetic determinants of thyroid function and atrial fibrillation: a Mendelian randomization study.
        JAMA Cardiol. 2019; 4: 144-152
        • Andrade J.G.
        • Aguilar M.
        • Atzema C.
        • et al.
        The 2020 Canadian Cardiovascular Society/Canadian Heart Rhythm Society comprehensive guidelines for the management of atrial fibrillation.
        Can J Cardiol. 2020; 36: 1847-1948
        • Perez M.V.
        • Mahaffey K.W.
        • Hedlin H.
        • et al.
        Large-scale assessment of a smartwatch to identify atrial fibrillation.
        N Engl J Med. 2019; 381: 1909-1917