January 21st, 2025
Version: 1
University of South Florida
genomics
biorxiv

Proximal relationships of moonlighting Proteins in Escherichia coli: a mathematical genomic perspective

Moonlighting proteins in Escherichia coli(E.coli) perform multiple independent functions without altering their primary amino acid sequence, challenging the one gene-one enzyme hypothesis. Bacterial proteins serve various functions, including host cell adhesion, extracellular matrix interaction, and immune modulation, while also supporting essential physiological processes within the bacteria. Identifying these proteins in pathogens and tracking their genetic changes is crucial for understanding bacterial survival and virulence. A quantitative understanding of these proteins is pivotal as it enables the identification of specific patterns and relationships between amino acid composition, protein stability, and functional versatility. This study quantitatively analyzes fifty E. coli moonlighting proteins, focusing on their structural and functional features. Key findings include variability in amino acid composition, with alanine predominating, and a preference for non-polar residues, which may enhance protein stability. Quantitative features analyses identified seven distinct proximal sets, reflecting the proteins\' spatial arrangements of amino acids, structural diversity, and functional roles in processes such as metabolism, stress response, and gene regulation. These results deepen our understanding of the multifunctionality of E. coli moonlighting proteins, indicating their adaptability and implications for bacterial survival and pathogenicity.

Similar Papers

biorxiv
Sat Apr 05 2025
PanSpace: Fast and Scalable Indexing for Massive Bacterial Databases
Motivation: Species identification is a crucial task in fields such as agriculture, food processing, and healthcare. The rapid expansion of genomics databases, especially with the growing focus on investigating new bacterial genomes in clinical microbiology, has surpassed the capabilities of conventional tools like BLAST for basic search and query procedures. A major bottleneck in microbiome studi...
Avila Cartes, J. E.
Ciccolella, S.
Denti, L.
Dandinasivara, R.
...
Schonhuth, A.
biorxiv
Fri Apr 04 2025
Spatial transcriptomics AI agent charts hPSC-pancreas maturation in vivo
Spatial transcriptomics has revolutionized our understanding of tissue organization by simultaneously capturing gene expression and spatial localization within intact tissues. However, analyzing these increasingly complex datasets requires specialized expertise across computational biology, statistics, and biological context. To address this challenge, we introduce the Spatial Transcriptomics AI A...
Lin, Z.
Wang, W.
Marin-Llobet, A.
Li, Q.
...
Liu, J.
biorxiv
Fri Apr 04 2025
Neural stem cell epigenomes and fate bias are temporally coordinated during corticogenesis
The cerebral cortex orchestrates complex cognitive functions, yet how its distinct temporal lineages are molecularly patterned during development remains unresolved. Here, we integrate single-cell transcriptomics and chromatin accessibility, together with genome-wide profiling of DNA methylation and 3D chromosomal contact across mouse corticogenesis (E13-E18) to elucidate cell fate transitions. Us...
Shapira, Y.
Noack, F.
Vangelisti, S.
Chong, F.
...
Bonev, B.
biorxiv
Fri Apr 04 2025
BRCA1 is a molecular correlate of cell proliferation in human brain development and in Group 3 and 4 medulloblastoma
The role of the BRCA1-mediated DNA damage repair pathway in regulating human brain development remains unknown, although it has been studied in mouse development. We report evidence for breast cancer type 1 susceptibility protein (BRCA1) being a molecular correlate of proliferation in human neural progenitor cells and in medulloblastoma (MB), a malignant pediatric hindbrain cancer whose cells rese...
Cheong, I.
Sun, X.
Lau, L.
Kishore, N.
...
Pai, S.
biorxiv
Fri Apr 04 2025
Adaptively integrated sequencing and assembly of near-complete genomes
Recent advances in long-read sequencing (LRS) and assembly algorithms have made it possible to create highly complete genome assemblies for humans, animals, plants and other eukaryotes. However, there is a need for ongoing development to improve accessibility and affordability of the required data, increase the range of usable sample types, and reliably resolve the most challenging, repetitive gen...
Gamaarachchi, H.
Stevanovski, I.
Hammond, J. M.
Reis, A. L. M.
...
Deveson, I. W.
biorxiv
Fri Apr 04 2025
Compensatory evolution following deleterious episodes of GC-biased gene conversion in rodents
GC-biased gene conversion (gBGC) is a widespread evolutionary force associated with meiotic recombination that favours the accumulation of deleterious AT to GC substitutions in proteins, moving them away from their fitness optimum. In many mammals recombination hotspots have a rapid turnover, leading to episodic gBGC, with the accumulation of deleterious mutations stopping when the recombination h...
Riffis, M.
Saclier, N.
Galtier, N.
biorxiv
Thu Apr 03 2025
CREsted: modeling genomic and synthetic cell type-specific enhancers across tissues and species
Sequence-based deep learning models have become the state of the art for the analysis of the genomic regulatory code. Particularly for transcriptional enhancers, deep learning models excel at deciphering sequence features and grammar that underlie their spatiotemporal activity. To enable end-to-end enhancer modeling and design, we developed a software and modeling package, called CREsted. It combi...
Kempynck, N.
De Winter, S.
Blaauw, C. H.
Konstantakos, V.
...
Aerts, S.
biorxiv
Thu Apr 03 2025
Neisseria gonorrhoeae LIN codes: a Robust, Multi-Resolution Lineage Nomenclature
Investigation of the bacterial pathogen Neisseria gonorrhoeae is complicated by extensive horizontal gene transfer: a process which disrupts phylogenetic signals and impedes our understanding of population structure. The ability to identify N. gonorrhoeae lineages in a consistent manner is essential for surveillance of this increasingly antimicrobial resistant organism, and to facilitate efficient...
Unitt, A.
Krisna, M. A.
Parfitt, K. M.
Jolley, K. A.
...
Harrison, O. B.
biorxiv
Thu Apr 03 2025
Trypanosoma cruzi Has 32 Chromosomes: A Telomere-to-Telomere Assembly Defines Its Karyotype.
Trypanosoma cruzi, the causative agent of Chagas disease, exhibits remarkable genomic plasticity and possesses an expanded genome rich in multigene families. However, its precise chromosomal composition has remained elusive due to challenges in resolving repetitive regions. Using PacBio HiFi long-read sequencing, we assembled the complete nuclear genome of the Dm28c strain into 32 telomere-to-telo...
Greif, G.
Diaz-Viraque, F.
Chiribao, M. L.
Sanz-Rodriguez, C.
Robello, C.
biorxiv
Thu Apr 03 2025
Expression spectrum of TE-derived transcripts in human adult tissues
Abstract Transposable elements (TEs) are vital components of eukaryotic genomes and have played a critical role in genome evolution. Although most TEs are silenced in the mammalian genome, increasing evidence suggests that certain TEs are actively involved in gene regulation during early developmental stages. However, the extent to which human TEs drive gene transcription in adult tissues remains ...
Miao, B.
Zhang, B.
WU, T. P.
Luo, X.
...
Yang, Y.