Key Takeaways:
- Population-based genetic studies paired with electronic health records (EHRs) enhance cardiovascular research by providing access to large, diverse cohorts that reveal rare variants, gene-environment interactions, and relationships between genetic factors and disease outcomes.
- Genetic sequencing datasets, like those from NashBio and the BioVU cohort, facilitate expansion of research capabilities to identify therapeutic targets and predict treatment responses.
- Studies using these datasets have uncovered previously unrecognized atrial fibrillation phenotypes, supporting precision medicine approaches tailored to specific subtypes that may offer improved outcomes over traditional strategies.
Introduction
Cardiovascular disease remains the leading cause of mortality worldwide, yet our ability to predict, prevent, and treat these conditions is constrained by incomplete understanding of their underlying biological heterogeneity.1 The integration of human genetics into cardiovascular research represents a shift in how we approach these complex diseases.2 By leveraging naturally occurring genetic variation, researchers can establish causal relationships, identify disease subtypes, and pinpoint therapeutic targets with greater precision than conventional epidemiological methods allow. For example, this genetics-based approach has already yielded clinical innovations in the identification of decreased atherosclerotic cardiovascular disease risk with ANGPTL3 loss-of-function and its subsequent clinical trials with CRISPR-Cas9 gene editing.3 To pursue these discoveries at scale, population-based biobanks linking genomic data with longitudinal electronic health records (EHR) have become indispensable tools for modern cardiovascular research.4
Studying Population-Based Genetic Studies and Cardiovascular Disease
The integration of human genetic evidence into drug discovery pipelines has significantly improved the probability of clinical therapeutic success. Population-based genetic studies offer several distinct advantages for cardiovascular research.
Unraveling Atrial Fibrillation Heterogeneity Through Integrated Genomic and Phenotypic Analysis
Atrial fibrillation (AF) affects over 40 million individuals globally and is associated with significant morbidity, including a five-fold increased risk of stroke and elevated mortality.9 However, AF is remarkably heterogeneous in its presentation, progression, and response to therapy which has hampered efforts to develop universally effective treatments and risk stratification tools.10 Growing evidence indicates that AF encompasses mechanistically distinct subtypes shaped by complex gene-environment interactions.11 In particular, genetic susceptibility and inflammation appear to be major contributors to AF heterogeneity, though their specific roles in defining AF subgroups remain unclear.11
To clarify AF phenotypes and its genetic underpinnings, researchers leverage BioVU, one of the largest DNA biobanks linked to de-identified EHRs, comprising of over 300,000 samples from diverse patient populations (https://victr.vumc.org/biovu-description/). Researchers identified 23,271 AF patients and characterized them using 35 clinical features, including demographics, body mass index, CHA2DS2-VASc score, sleep apnea, presence of cardiac implantable electronic devices, left ventricular systolic dysfunction, thyroid disease, and valvular heart disease.11
The machine learning analyses revealed three phenotypic clusters characterized by progressively higher burdens of comorbidities, especially renal disease and coronary artery disease.11 Polygenic risk for AF was greatest in the cluster with the fewest comorbidities, whereas clinically assessed inflammatory biomarkers were most elevated in the high-comorbidity cluster.11 In addition, survival differed significantly among comorbidity clusters, with decreased survival in the high-comorbidity group.11
Key Takeaways from This Research:
- The BioVU cohort enabled robust statistical power to detect associations, while the unsupervised co-clustering machine learning algorithm avoided assumptions about which clinical features might be most relevant for classification.
- AF is comprised of multiple distinct subtypes with different genetic liabilities, inflammatory profiles, and clinical trajectories.
- Precision medicine approaches tailored to specific AF subtypes may improve outcomes compared to traditional strategies.
NashBio Genotyping Dataset: Expanding Cardiovascular Genomics
Recently, NashBio has generated whole genome sequencing (WGS) data from BioVU participants. WGS provides a level of genomic resolution that enables the characterization of rare variants, structural variations, and non-coding regulatory regions. These data allow researchers to identify natural loss-of-function variants, fine-map GWAS loci, and pinpoint causal genes that drive cardiovascular disease risk. When integrated with deep phenotyping, WGS data supports precise genotype-phenotype correlations that inform disease mechanisms, therapeutic targets, and modifiers of treatment response. This approach also enhances drug development by enabling genetically informed patient stratification and early identification of safety considerations. Ultimately, the datasets will accelerate translational cardiovascular research by linking genomic variation to actionable biological insights.
Conclusion
The integration of population-scale genomics with deep clinical phenotyping represents a shift in cardiovascular research, transforming our understanding of disease heterogeneity and opening new therapeutic opportunities. The VUMC atrial fibrillation study exemplifies how this approach can distill complex cardiovascular conditions into biologically and clinically meaningful subtypes, each with distinct genetic architectures and mechanistic underpinnings.11 For pharmaceutical industry and clinical researchers, these resources offer unprecedented opportunities to validate drug targets, discover novel therapeutic mechanisms, and stratify patients for precision medicine approaches.
References
- Palaniappan LP, Allen NB, Almarzooq ZI, et al. 2026 Heart Disease and Stroke Statistics: A Report of US and Global Data From the American Heart Association. Circulation. Published online January 21, 2026. doi:10.1161/CIR.0000000000001412
- Landstrom AP, Ferguson JF, James CA, et al. Genetic and Genomic Testing in Cardiovascular Disease: A Policy Statement From the American Heart Association. Circulation. 2025;152(24):e474-e489. doi:10.1161/CIR.0000000000001385
- Laffin LJ, Nicholls SJ, Scott RS, et al. Phase 1 Trial of CRISPR-Cas9 Gene Editing Targeting ANGPTL3. N Engl J Med. 2025;393(21):2119-2130. doi:10.1056/NEJMoa2511778
- Lewandowski AJ, Rutter MK, Collins R. Scientific and Clinical Impacts of UK Biobank in Cardiovascular Medicine. Circulation. 2024;150(24):1907-1909. doi:10.1161/CIRCULATIONAHA.124.072449
- Hall JL, Ryan JJ, Bray BE, et al. Merging Electronic Health Record Data and Genomics for Cardiovascular Research: A Science Advisory From the American Heart Association. Circ Cardiovasc Genet. 2016;9(2):193-202. doi:10.1161/HCG.0000000000000029
- Dewey FE, Gusarova V, O’Dushlaine C, et al. Inactivating Variants in ANGPTL4 and Risk of Coronary Artery Disease. N Engl J Med. 2016;374(12):1123-1133. doi:10.1056/NEJMoa1510926
- O’Sullivan JW, Raghavan S, Marquez-Luna C, et al. Polygenic Risk Scores for Cardiovascular Disease: A Scientific Statement From the American Heart Association. Circulation. 2022;146(8):e93-e118. doi:10.1161/CIR.0000000000001077
- Bell KJL, Loy C, Cust AE, Teixeira-Pinto A. Mendelian Randomization in Cardiovascular Research: Establishing Causality When There Are Unmeasured Confounders. Circ Cardiovasc Qual Outcomes. 2021;14(1):e005623. doi:10.1161/CIRCOUTCOMES.119.005623
- Kornej J, Börschel CS, Benjamin EJ, Schnabel RB. Epidemiology of Atrial Fibrillation in the 21st Century: Novel Methods and New Insights. Circ Res. 2020;127(1):4-20. doi:10.1161/CIRCRESAHA.120.316340
- Ang YS, Rajamani S, Haldar SM, Hüser J. A New Therapeutic Framework for Atrial Fibrillation Drug Development. Circ Res. 2020;127(1):184-201. doi:10.1161/CIRCRESAHA.120.316576
- Davogustto G, Zhao S, Li Y, et al. Unbiased Characterization of Atrial Fibrillation Phenotypic Architecture Provides Insight Into Genetic Liability and Clinically Relevant Outcomes. Circ Genom Precis Med. Published online January 28, 2026:e004853. doi:10.1161/CIRCGEN.124.004853
