2025 Hyper Recent • CC0 1.0 Universal

This work is dedicated to the public domain. No rights reserved.

May 20th, 2025
Version: 2
Nuffield Department of Population Health, University of Oxford
genomics
biorxiv

Neisseria gonorrhoeae LIN codes: a Robust, Multi-Resolution Lineage Nomenclature

Unitt, A. • Krisna, M. A. • Parfitt, K. M. • Jolley, K. A. • Maiden, M. C. J. • Harrison, O. B.

Investigation of the bacterial pathogen Neisseria gonorrhoeae is complicated by extensive horizontal gene transfer: a process that disrupts phylogenetic signals and impedes our understanding of population structure. The ability to consistently identify N. gonorrhoeae lineages is important for surveillance of this increasingly antimicrobial-resistant organism, facilitating efficient communication regarding its epidemiology; however, all current typing systems fail to reflect N. gonorrhoeae strain taxonomy in a reliable and stable manner. Here, an N. gonorrhoeae genomic lineage nomenclature, based on the barcoding system of Life Identification Number (LIN) codes, was developed using a refined 1430 core gene MLST (cgMLST). This hierarchical LIN code nomenclature conveys lineage information at multiple levels of resolution within one code, enabling it to provide immediate context for an isolate's ancestry and to relate to familiar, previously used typing schemes such as Ng cgMLST v1, 7-locus MLST, or NG-STAR clonal complex (CC). Clustering with LIN codes accurately reflects gonococcal diversity and population structure, providing insight into associations between genotype and phenotype for traits such as antibiotic resistance. These codes are automatically assigned and publicly accessible via the pubmlst.org/organisms/neisseria-spp database.
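The idea of a single code carrying lineage information at several resolutions can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that a LIN code is written as underscore-separated integers read from the broadest level to the finest, and that two isolates belong to the same group at a given level when their codes share a prefix of that length. The isolate names, the example codes, and the five-position length are hypothetical, and the abstract does not state the cgMLST thresholds that define each position.

```python
from __future__ import annotations

from collections import defaultdict


def parse_lin(code: str) -> tuple[int, ...]:
    """Split a LIN code string such as '0_0_3_1_0' into integer positions.

    The underscore-separated layout is an assumption for illustration.
    """
    return tuple(int(part) for part in code.split("_"))


def shared_prefix_length(a: tuple[int, ...], b: tuple[int, ...]) -> int:
    """Count the leading positions at which two parsed codes agree."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def group_by_prefix(codes: dict[str, str], depth: int) -> dict[tuple[int, ...], list[str]]:
    """Group isolates by the first `depth` positions of their LIN codes."""
    groups: dict[tuple[int, ...], list[str]] = defaultdict(list)
    for isolate, code in codes.items():
        groups[parse_lin(code)[:depth]].append(isolate)
    return dict(groups)


# Hypothetical isolates and codes, invented for this example only.
isolates = {
    "isolate_A": "0_0_3_1_0",
    "isolate_B": "0_0_3_2_0",
    "isolate_C": "0_1_0_0_0",
}

# A coarse grouping (depth 2) and a finer one (depth 4) from the same codes.
coarse = group_by_prefix(isolates, depth=2)
fine = group_by_prefix(isolates, depth=4)
```

In this toy data, isolate_A and isolate_B fall into the same group at depth 2 but separate at depth 4, which is the sense in which one code can be read at several levels of resolution.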

Similar Papers

biorxiv
Fri May 23 2025
A catalog of ancient proxies for modern genetic variants
The ability to observe the genomes of past human populations using ancient DNA provides an extraordinary perspective on many fundamental questions in human genetics, including understanding the evolutionary history of variants that underlie human disease and other phenotypes. However, ancient DNA is often damaged and degraded, yielding low coverage of most nucleotides. Further, many publicly avail...
Brand, C. M. • Capra, J. A.
biorxiv
Fri May 23 2025
Designing DNA With Tunable Regulatory Activity Using Score-Entropy Discrete Diffusion
Designing regulatory DNA sequences with precise, cell-type-specific activity is critical for applications in medicine and biotechnology, but remains challenging due to the vast combinatorial space and complex regulatory grammar governing gene expression. Recent deep generative models, including genomic language models and diffusion-based approaches, offer new tools for sequence design, yet lack ...
Sarkar, A. • Kang, Y. • Somia, N. • Mantilla, P. • ... • Koo, P.
biorxiv
Fri May 23 2025
The CLAMP GA-binding transcription factor regulates heat stress-induced transcriptional repression by associating with 3D chromatin loops
To survive exposure to heat stress (HS), organisms activate stress response genes and repress constitutive gene expression, thereby preventing the accumulation of potentially toxic RNA and protein products. Although many studies have elucidated the mechanisms that drive HS-induced activation of stress response genes across species, little is known about the mechanisms that repress constitutively e...
Aguilera, J. • Duan, J. • Cortez, K. • Lee, R. • ... • Larschan, E.
biorxiv
Thu May 22 2025
Functional genomic analysis of non-canonical DNA regulatory elements of the aryl hydrocarbon receptor
The aryl hydrocarbon receptor (AHR) is a ligand-dependent transcription factor that is activated by environmental toxicants, like halogenated and polycyclic aromatic hydrocarbons, and then binds to DNA and regulates gene expression. AHR is involved in various physiological processes, including liver and immune system function, cell cycle regulation, oncogenesis, and metabolism. In the canonical pa...
Shahriar, S. • Patel, T. D. • Nakka, M. • Grimm, S. L. • ... • Gorelick, D. A.
biorxiv
Thu May 22 2025
A complete reference genome assembly and annotation of the Black Redstart (Phoenicurus ochruros)
The Black Redstart (Phoenicurus ochruros) is one of the most widely distributed species, occupying diverse habitats and exhibiting remarkable altitudinal migration, making it a suitable model for studying altitudinal migration and high-altitude adaptation. In this study, we present the first reference genome of Phoenicurus ochruros, generated using PacBio HiFi long-read sequencing. The nuclear genom...
Ghimire, P. • Wang, N. • Lamichhaney, S.
biorxiv
Thu May 22 2025
Draft genome and transcriptomic sequence data of three invasive insect species
Cydalima perspectalis (the box tree moth), Leptoglossus occidentalis (the western conifer seed bug), and Tecia solanivora (the Guatemalan tuber moth) are three economically harmful invasive insect species. This study presents their genomic and transcriptomic sequences, generated through whole-genome sequencing, RNA-seq transcriptomic data, and Hi-C sequencing. The resulting genome assemblies exhib...
Lombaert, E. • Klopp, C. • Blin, A. • Annonay, G. • ... • Deleury, E.
biorxiv
Thu May 22 2025
Phenotypic tolerance for rDNA copy number variation within the natural range of C. elegans
The genes for ribosomal RNA (rRNA) are encoded by ribosomal DNA (rDNA), whose structure is notable for being present in arrays of tens to thousands of tandemly repeated copies in eukaryotic genomes. The exact number of rDNA copies per genome is highly variable within a species, with differences between individuals measuring in potentially hundreds of copies and megabases of DNA. The extent to whic...
Hall, A. N. • Morton, E. • Walters, R. • Cuperus, J. T. • Queitsch, C.
biorxiv
Wed May 21 2025
Inferring domestic goat demographic history through ancient genome imputation
Goats were among the earliest managed animals, making them a natural model to explore the genetic consequences of domestication. However, a challenge in ancient genomic analysis is the relatively low genome coverage for most samples, limiting analysis to pseudohaploid genotypes. Genotype imputation offers potential to alleviate this limitation by improving information content and accuracy in low c...
Erven, J. A. M. • Etourneau, A. • Mashkour, M. • Neupane, M. • ... • Daly, K. G.
biorxiv
Wed May 21 2025
Fusion transcription factor dosage controls cell state in rhabdomyosarcoma
In the fusion-positive subset of rhabdomyosarcoma, the PAX3::FOXO1 oncoprotein is the most common fusion driver. We previously established a human myoblast system for inducible expression of PAX3::FOXO1. In the current study, we modulate PAX3::FOXO1 protein expression to understand the epigenetic and phenotypic functions at different PAX3::FOXO1 levels. Proliferative and oncogenic outcomes depend ...
Hoffman, R. A. • Wang, M. • Sunkel, B. D. • Nguyen, T. H. • ... • Stanton, B. Z.
biorxiv
Wed May 21 2025
The coordination between CTCF, cohesin and TFs impacts nucleosome repositioning and chromatin insulation to define state specific 3D chromatin folding
CTCF-mediated chromatin folding plays a key role in gene regulation; however, the mechanisms controlling chromatin organization across cell states are not fully elucidated. Comprehensive analyses reveal that CTCF binding stability and cohesin overlap in mice and humans are regulated by species-specific differences in CTCF binding site (CBS) accessibility and enrichment of motifs corresponding to e...
Do, C. • Jiang, G. • Cova, G. • Zappile, P. • ... • Skok, J. A.