2025 Hyper Recent •CC0 1.0 Universal

This work is dedicated to the public domain. No rights reserved.

Access Preprint From Server
July 2nd, 2025
Version: 1
National Institutes of Health
genomics
biorxiv

Haplotype-Resolved DNA Methylation at the APOE Locus identifies Allele-Specific Epigenetic Signatures Relevant to Alzheimer's Disease Risk

Genner, R. M.Open in Google Scholar•Meredith, M.Open in Google Scholar•Moller, A.Open in Google Scholar•Weller, C.Open in Google Scholar•Daida, K.Open in Google Scholar•Ayuketah, A.Open in Google Scholar•Jerez, P. A.Open in Google Scholar•Akeson, S.Open in Google Scholar•Malik, L.Open in Google Scholar•Baker, B.Open in Google Scholaret al.

The APOE gene encodes a key lipid transport protein and plays a central role in Alzheimer\'s disease (AD) pathogenesis. Three common APOE alleles, {epsilon}2 (rs7412(C>T), {epsilon}3 (reference), and {epsilon}4 (rs429358(T>C)), arise from two coding variants in exon 4 and confer distinct AD risk profiles, with {epsilon}4 increasing risk and {epsilon}2 providing protection. The {epsilon}3-linked APOE variant rs769455[T] has also been associated with elevated AD risk in individuals of African ancestry carrying both rs769455[T] and {epsilon}4 alleles. These single nucleotide variants (SNVs) reside in a cytosine-phosphate-guanine (CpG) island, which is a region with a higher frequency of CpG sites compared to the rest of the genome. CpG sites are subject to 5-methylcytosine (5mC) methylation by DNA methyltransferases which add a methyl group to the fifth carbon on the cytosine residue of a CpG site. The presence of SNVs can disrupt this process, making these regions prime targets for differential methylation; however, allele-specific methylation patterns in APOE remain poorly resolved due to technical limitations of conventional bisulfite and methylation array based methods, including degraded DNA quality, sparse CpG coverage, and lack of haplotype phasing. Here, we leverage high-accuracy long-read sequencing data to generate haplotype-resolved methylation profiles of the APOE locus in 332 postmortem brain samples from two ancestrally different cohorts. This includes 201 individuals of European ancestry from the North American Brain Expression Consortium (NABEC), comprising 402 haplotypes (48 {epsilon}2 and 58 {epsilon}4 alleles), and 131 individuals of African and African admixed ancestry from the Human Brain Core Collection (HBCC), comprising 262 haplotypes (25 {epsilon}2, 64 {epsilon}4, and 7 rs769455 alleles). A linear regression analysis identified 18 novel differentially methylated CpG sites (DMCs) associated with APOE {epsilon}2, {epsilon}4, and rs769455 within a gene cluster spanning TOMM40, APOE, APOC1, and APOC4-APOC2. This represents the most comprehensive haplotype-resolved methylation study of APOE in human brain tissue to date. Our results uncover distinct allele-specific methylation signatures and demonstrate the power of long-read sequencing for resolving epigenetic variation relevant to AD risk.

Similar Papers

biorxiv
Wed Jul 02 2025
Persistent Activation of Endothelial Cells is Linked to Thrombosis and Inflammation in Cerebral Cavernous Malformation Disease
BACKGROUND: Cerebral cavernous malformations (CCM) are neurovascular lesions that affect both children and adults, and morbidity often results from thrombosis, bleeding, and neurological dysfunction. Studies indicate that inflammation-related activation of endothelial cells contributes significantly to the worsening of CCM disease. This suggests that ongoing vascular inflammation and endothelial d...
Gallego-Gutierrez, H.
•
Frias-Anaya, E.
•
Bui, C.
•
Zhao, L.
...•
Lopez-Ramirez, M. A.
biorxiv
Wed Jul 02 2025
Illuminating the mystery of thylacine extinction: a role for relaxed selection and gene loss
Gene loss shapes lineage-specific traits but is often overlooked in species survival. In this study, we investigate the role of ancestral gene loss using the extinction icon - thylacine (Thylacinus cynocephalus). While studies of neutral genetic variation indicate a population decline before extinction, the impact of thylacine-specific ancestral gene losses remains unexplored. The availability of ...
Salve, B. G.
•
Vijay, N.
biorxiv
Wed Jul 02 2025
Integrative Transcriptomic and Machine Learning Approaches to decipher Mitochondrial Gene Regulation in severe Plasmodium vivax Malaria
Mitochondria in Plasmodium vivax are functionally vital despite possessing a highly reduced genome and differing substantially from the human organelle. Beyond their classical role in energy production, they dynamically coordinate processes like pyrimidine biosynthesis and heme metabolism, adapting their functions across the intra-erythrocytic development cycle (IDC). Their unique architecture and...
Roy, P.
•
Aggarwal, Y.
•
Kochar, S. K.
•
Kochar, D. K.
•
Das, A.
biorxiv
Wed Jul 02 2025
An optimised computational approach for the identification of somatic structural variants in cancer
Structural variants play a critical role in tumorigenesis. At present, these events are most commonly identified using short-read whole-genome sequencing data, and a number of computational tools are available for this purpose. Consensus approaches have been used to improve precision, but may reduce sensitivity. The optimal number and combination of callers remains unclear, in part due to the lack...
Waise, S.
•
Mensah, N.
•
Lesluyes, T.
•
Demeulemeester, J.
...•
Van Loo, P.
biorxiv
Wed Jul 02 2025
Language models learn to represent antigenic properties of human influenza A(H3) virus
Given that influenza vaccine effectiveness depends on a good antigenic match between the vaccine and circulating viruses, it is important to assess the antigenic properties of newly emerging variants continuously. With the increasing application of real-time pathogen genomic surveillance, a key question is if antigenic properties can reliably be predicted from influenza virus genomic information. ...
Durazzi, F.
•
Koopmans, M.
•
Fouchier, R. A. M.
•
Remondini, D.
biorxiv
Wed Jul 02 2025
Multi-omic identification of key transcriptional regulatory programs during endurance exercise training in rats
Transcription factors (TFs) play a key role in regulating gene expression. We conducted an integrated analysis of chromatin accessibility, DNA methylation, mRNA expression, protein abundance and phosphorylation across eight tissues in fifty rats of equally represented sexes following endurance exercise training (EET) to identify coordinated epigenomic and transcriptional changes and determine key ...
Smith, G. R.
•
Zhao, B.
•
Lindholm, M. E.
•
Raja, A.
...•
Sealfon, S. C.
biorxiv
Wed Jul 02 2025
spRefine Denoises and Imputes Spatial Transcriptomics with a Reference-Free Framework Powered by Genomic Language Model
The analysis of spatial transcriptomics is hindered by high noise levels and missing gene measurements, challenges that are further compounded by the higher cost of spatial data compared to traditional single-cell data. To overcome this challenge, we introduce spRefine, a deep learning framework that leverages genomic language models to jointly denoise and impute spatial transcriptomic data. Our r...
Liu, T.
•
Huang, T.
•
Jin, W.
•
Chu, T.
...•
Zhao, H.
biorxiv
Wed Jul 02 2025
microRNA-206 is a reproducibly sensitive and specific plasma biomarker of amyotrophic lateral sclerosis
Amyotrophic lateral sclerosis (ALS) is a devastating and fatal neurodegenerative disease with no current therapeutic to modify disease progression. Reliable biomarkers for ALS are essential for improving diagnosis and evaluating therapeutic efficacy. We combined small-RNA sequencing from a discovery cohort of ALS patients and healthy controls with sequencing data from a previously published ALS co...
Henderson, B. W.
•
Roberts, B. S.
•
Kolodziejczak, S.
•
Cohcran, M.
•
Myers, R. M.
biorxiv
Wed Jul 02 2025
Perplexity as a Metric for Isoform Diversity in the Human Transcriptome
Long-read sequencing (LRS) has revealed a far greater diversity of RNA isoforms than earlier technologies, increasing the critical need to determine which, and how many, isoforms per gene are biologically meaningful. To define the space of relevant isoforms from LRS, many existing analysis pipelines rely on arbitrary expression cutoffs, but a single threshold cannot accommodate the broad variabili...
Schertzer, M. D.
•
Park, S. H.
•
Su, J.
•
Sheynkman, G. M.
•
Knowles, D. A.