2025 Hyper Recent •CC0 1.0 Universal

This work is dedicated to the public domain. No rights reserved.

Access Preprint From Server
July 2nd, 2025
Version: 3
Institute for Human Genetics, University Medical Center of the Johannes Gutenberg University Mainz, Germany
bioinformatics
biorxiv

Direct RNA sequencing (RNA004) allows for improved transcriptome assessment and near real-time tracking of methylation for medical applications

Hewel, C.Open in Google Scholar•Wierczeiko, A.Open in Google Scholar•Miedema, J.Open in Google Scholar•Hofmann, F.Open in Google Scholar•Weissbach, S.Open in Google Scholar•Dietrich, V.Open in Google Scholar•Friedrich, J.Open in Google Scholar•Holthoefer, L.Open in Google Scholar•Haug, V.Open in Google Scholar•Muendnich, S.Open in Google Scholaret al.

Direct RNA sequencing (DRS) is a nanopore-based technique for analyzing RNA in its native form, promising breakthroughs in diagnostics and biomarker development. Coupled to RNA002 sequencing chemistry, its clinical implementation has been challenging due to low throughput, low accuracy, and lack of large-scale RNA-modification models. In this study, we evaluate the improvements achieved by pairing the latest RNA004 chemistry with novel modified-base-calling models for pseudouridine and N6-methyladenosine using diverse RNA samples from cell lines, synthetic oligos, and human blood. Finally, we present the first clinical application of DRS by confirming the loss of RNA methylation in a patient carrying truncating mutations in the methyltransferase METTL5. Conclusively, the combined use of RNA004 chemistry with the base-calling models significantly improved the throughput, accuracy, and site-specific detection of modifications. From this perspective, we offer an outlook on the potential suitability of DRS for use in routine diagnostics, the first comprehensive benchmark of human peripheral blood, and quality assessments of RNA therapeutics.

Similar Papers

biorxiv
Wed Jul 02 2025
Uncovering smooth structures in single-cell data with PCS-guided neighbor embeddings
Single-cell sequencing is revolutionizing biology by enabling detailed investigations of cell-state transitions. Many biological processes unfold along continuous trajectories, yet it remains challenging to extract smooth, low-dimensional representations from inherently noisy, high-dimensional single-cell data. Neighbor embedding (NE) algorithms, such as t-SNE and UMAP, are widely used to embed hi...
Ma, R.
•
Li, X.
•
Hu, J.
•
Yu, B.
biorxiv
Wed Jul 02 2025
Benchmarking Large Language Models for Pathogen-Disease Classification in Post-Acute Infection Syndromes
Post-Acute Infection Syndromes (PAIS) are medical conditions that persist following acute infections from pathogens such as SARS-CoV-2, Epstein-Barr virus, and Influenza virus. Despite growing global awareness of PAIS and the exponential increase in biomedical literature, only a small fraction of this literature pertains specifically to PAIS, making the identification of pathogen-disease associati...
Khalid, S. M.
•
Woelker, T.
•
Molano, L.-A. G.
•
Graf, S.
•
Keller, A.
biorxiv
Wed Jul 02 2025
Streamlined genomes, not horizontal gene transfer, mark bacterial transitions to unfamiliar environments
Bacterial colonization of unfamiliar environments constitutes a drastic evolutionary transition, likely altering the strength and direction of selection. This process is often assumed to increase rates of horizontal gene transfer (HGT), a major driver of bacterial adaptation. Larger genomes are thought to facilitate such adaptations by providing broader functional repertoires and more integration ...
Mishra, S.
•
Lercher, M. J.
biorxiv
Wed Jul 02 2025
Confidence: A Web App for Cross-Platform Differential Gene Expression Analysis, Gene Scoring, and Enrichment Analysis
RNA sequencing (RNA-seq) is used to quantify transcript levels through measurement of nucleotide sequences. To evaluate statistically significant changes in gene expression, transcript counts between samples are compared using differential expression analysis methods. However, three of the most pressing challenges in transcriptomics analyses are: 1) analytical packages produce a distinct number of...
Shastry, A.
•
Ott, B.
•
Paterson, A.
•
Simpson, M.
...•
Hindmarch, C. C. T.
biorxiv
Wed Jul 02 2025
Inferring metabolite states from spatial transcriptomes using multiple graph neural network
Metabolism serves as the pivotal interface connecting genotype and phenotype in various contexts, such as cancer reprogramming and immune metabolic reprogramming. Compared to the transcriptome, the development of the single-cell metabolome faces significant challenges. While various methods exist for predicting metabolite levels from transcriptome, their efficacy remains limited. We developed an e...
Jiaxu, L.
•
Daosheng, A.
•
Sun, W.
biorxiv
Wed Jul 02 2025
A systematic assessment of phylogenomic approaches for microbial species tree reconstruction
A key challenge in microbial phylogenomics is that microbial gene families are often affected by extensive horizontal gene transfer (HGT). As a result, most existing methods for microbial phylogenomics can only make use of a small subset of the gene families present in the microbial genomes under consideration, potentially biasing their results and affecting their accuracy. To address this challen...
Weiner, S.
•
Feng, Y.
•
Gogarten, J. P.
•
Bansal, M. S.
biorxiv
Wed Jul 02 2025
MOTL: enhancing multi-omics matrix factorization with transfer learning
Joint matrix factorization is popular for extracting lower dimensional representations of multi-omics data but loses effectiveness with limited samples. Addressing this limitation, we introduce MOTL (Multi-Omics Transfer Learning), a framework that enhances MOFA (Multi-Omics Factor Analysis) by inferring latent factors for small multi-omics target datasets with respect to those inferred from a lar...
Hirst, D.
•
Terezol, M.
•
Cantini, L.
•
Villoutreix, P.
...•
Baudot, A.
biorxiv
Wed Jul 02 2025
hoodscanR: profiling single-cell neighborhoods in spatial transcriptomics data
Understanding complex cellular niches and neighborhoods have provided new insights into tissue biology. Thus, accurate neighborhood identification is crucial, yet existing methodologies often struggle to detect informative neighborhoods and generate cell-specific neighborhood profiles. To address these limitations, we developed hoodscanR, a Bioconductor package designed for neighborhood identifica...
Liu, N.
•
Martin, J.
•
Bhuva, D. D.
•
Chen, J.
...•
Davis, M. J.
biorxiv
Wed Jul 02 2025
XtractPAV: An Automated Pipeline for Identifying Presence-Absence Variations Across Multiple Genomes
Motivation: Presence-absence variations (PAVs) significantly influence phenotypic diversity across and within species by modulating functional modules involved in stress responsiveness, adaptation, and developmental processes. This modulation ultimately contributes to genetic diversity at both inter- and intra-species levels. However, existing tools for detecting PAVs offer limitations in achievin...
Ahmad, R. S.
•
Sadaqat, M.
•
Tahir ul Qamar, M.
biorxiv
Wed Jul 02 2025
Machine learning-based structural classification of lytic polysaccharide monooxygenases
Lytic polysaccharide monooxygenase (LPMO) is a copper-dependent redox enzyme and according to CAZy is classified either as cellulolytic or chitinolytic. According to CAZy, there are eight families of LPMO namely AA9, AA10, AA11, AA13, AA14, AA15, AA16, and AA17, where AA stands for Auxiliary Activity. Previously, using the sequence information machine learning-based functional annotation was succe...
Manikandan, A.
•
Yennamalli, R. M.