2025 Hyper Recent •CC0 1.0 Universal

This work is dedicated to the public domain. No rights reserved.

Access Preprint From Server
July 3rd, 2025
Version: 1
Centre for Genomic Regulation (CRG)
genomics
biorxiv

Decoding cnidarian cell type gene regulation

Elek, A.Open in Google Scholar•Iglesias, M.Open in Google Scholar•Mahieu, L.Open in Google Scholar•Zolotarov, G.Open in Google Scholar•Grau-Bove, X.Open in Google Scholar•Aerts, S.Open in Google Scholar•Sebe-Pedros, A.Open in Google Scholar

Animal cell types are defined by differential access to genomic information, a process orchestrated by the combinatorial activity of transcription factors that bind to cis-regulatory elements (CREs) to control gene expression. However, the regulatory logic and specific gene networks that define cell identities remain poorly resolved across the animal tree of life. As early-branching metazoans, cnidarians can offer insights into the early evolution of cell type-specific genome regulation. Here, we profiled chromatin accessibility in 60,000 cells from whole adults and gastrula-stage embryos of the sea anemone Nematostella vectensis. We identified 112,728 CREs and quantified their activity across cell types, revealing pervasive combinatorial enhancer usage and distinct promoter architectures. To decode the underlying regulatory grammar, we trained sequence-based models predicting CRE accessibility and used these models to infer ontogenetic relationships among cell types. By integrating sequence motifs, transcription factor expression, and CRE accessibility, we systematically reconstructed the gene regulatory networks that define cnidarian cell types. Our results reveal the regulatory complexity underlying cell differentiation in a morphologically simple animal and highlight conserved principles in animal gene regulation. This work provides a foundation for comparative regulatory genomics to understand the evolutionary emergence of animal cell type diversity.

Similar Papers

biorxiv
Fri Jul 04 2025
A single cell multi-omic analysis identifies molecular and gene-regulatory mechanisms dysregulated in the developing Down syndrome neocortex
Down syndrome is the most common genetic cause of intellectual disability, presenting with cognitive, learning, memory, and language deficits. The cellular and molecular mechanisms driving this disorder remain unclear, limited by a lack of systematic studies in the developing human brain. Here, we leveraged single-nucleus multi-omics to profile the mid-gestation neocortex in a cohort of 26 donors....
Vuong, C. K.
•
Weber, A.
•
Seong, P.
•
Matoba, N. K.
...•
de la Torre-Ubieta, L.
biorxiv
Thu Jul 03 2025
Divergence between transcriptomes and chromatin accessibility during differentiation from a bipotential progenitor cell population to erythroblasts and megakaryocytes
Changes in gene expression drive differentiation along distinct cell lineages, and these shifts in gene expression are associated with alterations in chromatin accessibility and modifications reflecting activation or repression. We used deep sequencing of polyA+ RNA to map the transcriptomes of the megakaryocyte-erythroid progenitor (MEP) and cells of its two daughter lineages, erythroblasts (ERY)...
Mishra, T.
•
Giardine, B. M.
•
Morrissey, C. S.
•
Keller, C. A.
...•
Hardison, R. C.
biorxiv
Thu Jul 03 2025
An Enhancement of Extrachromosomal Circular DNA Enrichment and Amplification to Address the Extreme Low Overlap Between Replicates
Extrachromosomal circular DNA (eccDNA) of chromosomal origin is commonly present in all eukaryotic organisms and tissue tested so far. EccDNA populations exhibit immense diversity and a characteristically low degree of overlap between samples, suggesting low inherence of eccDNA between cells or a deficiency the methods by which eccDNA is detected. This study revisits the Circle-seq approach for en...
Burnham, C. M.
•
Kurilung, A.
•
Wanchai, V.
•
Regenberg, B.
...•
Nookaew, I.
biorxiv
Thu Jul 03 2025
Integrative Transcriptomic and Machine Learning Approaches to decipher Mitochondrial Gene Regulation in severe Plasmodium vivax Malaria
Mitochondria in Plasmodium vivax are functionally vital despite possessing a highly reduced genome and differing substantially from the human organelle. Beyond their classical role in energy production, they dynamically coordinate processes like pyrimidine biosynthesis and heme metabolism, adapting their functions across the intra-erythrocytic development cycle (IDC). Their unique architecture and...
Roy, P.
•
Aggarwal, Y.
•
Kochar, S. K.
•
Kochar, D. K.
•
Das, A.
biorxiv
Thu Jul 03 2025
NewtCap: an efficient target capture approach to boost genomic studies in Salamandridae (True Salamanders and Newts)
Salamanders have large and complex genomes, hampering whole genome sequencing. However, reduced representation sequencing provides a feasible alternative to obtain genome-wide data. We present NewtCap: a sequence capture bait set that targets c.7k coding regions across the genomes of all true salamanders and newts (the family Salamandridae, also known as salamandrids). We test the efficacy of New...
de Visser, M. C.
•
France, J.
•
McCartney-Melstad, E.
•
Bucciarelli, G. M.
...•
Wielstra, B.
biorxiv
Thu Jul 03 2025
Chromatin accessibility variation provides insights into missing regulation underlying immune-mediated diseases
Most genetic loci associated with complex traits and diseases through genome-wide association studies (GWAS) are noncoding, suggesting that the causal variants likely have gene regulatory effects. However, only a small number of loci have been linked to expression quantitative trait loci (eQTLs) detected currently. To better understand the potential reasons for many trait-associated loci lacking e...
Jeong, R.
•
Bulyk, M. L.
biorxiv
Wed Jul 02 2025
microRNA-206 is a reproducibly sensitive and specific plasma biomarker of amyotrophic lateral sclerosis
Amyotrophic lateral sclerosis (ALS) is a devastating and fatal neurodegenerative disease with no current therapeutic to modify disease progression. Reliable biomarkers for ALS are essential for improving diagnosis and evaluating therapeutic efficacy. We combined small-RNA sequencing from a discovery cohort of ALS patients and healthy controls with sequencing data from a previously published ALS co...
Henderson, B. W.
•
Roberts, B. S.
•
Kolodziejczak, S.
•
Cohcran, M.
•
Myers, R. M.
biorxiv
Wed Jul 02 2025
An optimised computational approach for the identification of somatic structural variants in cancer
Structural variants play a critical role in tumorigenesis. At present, these events are most commonly identified using short-read whole-genome sequencing data, and a number of computational tools are available for this purpose. Consensus approaches have been used to improve precision, but may reduce sensitivity. The optimal number and combination of callers remains unclear, in part due to the lack...
Waise, S.
•
Mensah, N.
•
Lesluyes, T.
•
Demeulemeester, J.
...•
Van Loo, P.
biorxiv
Wed Jul 02 2025
Perplexity as a Metric for Isoform Diversity in the Human Transcriptome
Long-read sequencing (LRS) has revealed a far greater diversity of RNA isoforms than earlier technologies, increasing the critical need to determine which, and how many, isoforms per gene are biologically meaningful. To define the space of relevant isoforms from LRS, many existing analysis pipelines rely on arbitrary expression cutoffs, but a single threshold cannot accommodate the broad variabili...
Schertzer, M. D.
•
Park, S. H.
•
Su, J.
•
Sheynkman, G. M.
•
Knowles, D. A.