2025 Hyper Recent • CC0 1.0 Universal

This work is dedicated to the public domain. No rights reserved.

January 22nd, 2025
Version: 1
Gladstone Institutes
genomics
biorxiv

Pervasive and programmed nucleosome distortion patterns on single mammalian chromatin fibers

Yang, M. G. • Richter, H. J. • Wang, S. • McNally, C. P. • Harris, N. • Dhillon, S. • Maresca, M. • de Wit, E. • Willenbring, H. • Maher, J. et al.

We present a genome-scale method to map the single-molecule co-occupancy of structurally distinct nucleosomes, subnucleosomes, and other protein-DNA interactions via long-read, high-resolution adenine methyltransferase footprinting. Iteratively Defined Lengths of Inaccessibility (IDLI) classifies nucleosomes, on the basis of shared patterns of intranucleosomal accessibility, into: (i) minimally accessible chromatosomes; (ii) octasomes with stereotyped DNA accessibility from superhelical locations (SHLs) ±1 through ±7; (iii) highly accessible unwrapped nucleosomes; and (iv) subnucleosomal species, such as hexasomes, tetrasomes, and other short DNA protections. Applying IDLI to mouse embryonic stem cell (mESC) chromatin, we discover widespread nucleosomal distortion on individual mammalian chromatin fibers, with >85% of nucleosomes surveyed displaying some degree of intranucleosomally accessible DNA. We observe epigenomic-domain-specific patterns of distorted nucleosome co-occupancy and positioning, including at enhancers, promoters, and mouse satellite repeat sequences. Nucleosome distortion is programmed by the presence of bound transcription factors (TFs) at cognate motifs: occupied TF binding sites are differentially decorated by distorted nucleosomes compared to unbound sites, and degradation experiments establish direct roles for TFs in structuring binding-site-proximal nucleosomes. Finally, we apply IDLI to primary mouse hepatocytes and observe evidence for pervasive nucleosomal distortion in vivo. Further genetic experiments reveal a role for the hepatocyte master regulator FOXA2 in directly impacting nucleosome distortion at hepatocyte-specific regulatory elements in vivo. Our work suggests extreme, but regulated, plasticity in nucleosomal DNA accessibility at the single-molecule level. Further, our study offers an essential new framework to model transcription factor binding, nucleosome remodeling, and cell-type-specific gene regulation across biological contexts.
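
As a rough illustration of the kind of signal that footprinting analyses start from, the sketch below extracts runs of inaccessible positions from per-position methylation calls on a single molecule. It is not the IDLI algorithm, which the abstract does not specify. The function name `inaccessible_runs`, the 0/1 encoding (1 = methylated and accessible, 0 = unmethylated and protected), and the toy molecule are all assumptions made for this example, and it simplifies by treating every position as an informative call.

```python
from itertools import groupby


def inaccessible_runs(calls):
    """Return (start, length) for each run of protected (0) positions.

    `calls` is a hypothetical per-position encoding along one molecule:
    1 = methylated (accessible), 0 = unmethylated (protected).
    """
    runs = []
    pos = 0
    for value, group in groupby(calls):
        length = sum(1 for _ in group)
        if value == 0:
            # Record where the protected stretch begins and how long it is.
            runs.append((pos, length))
        pos += length
    return runs


# Toy molecule, invented for illustration only.
molecule = [1, 1, 0, 0, 0, 0, 1, 0, 0, 1, 1, 1, 0, 0, 0]
print(inaccessible_runs(molecule))  # [(2, 4), (7, 2), (12, 3)]
```

On real data, the resulting run lengths, together with any accessible positions inside a longer protected stretch, would be the raw material for deciding whether a footprint looks like a chromatosome, an octasome, an unwrapped nucleosome, or a subnucleosomal species. The cutoffs for such a classification are not given in the abstract and are deliberately left out here.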
