Hyper Recent

Mon Jun 30 2025

STORIES: learning cell fate landscapes from spatial transcriptomics

In dynamic biological processes such as development, spatial transcriptomics is revolutionizing the study of the mechanisms underlying spatial organization within tissues. Inferring cell fate trajectories from spatial transcriptomics profiled at several time points has thus emerged as a critical goal, requiring novel computational methods. Wasserstein gradient flow learning is a promising framewor...

Mon Jun 30 2025

scHDeepInsight: A Hierarchical Deep Learning Framework for Precise Immune Cell Annotation in Single-Cell RNA-seq Data

Immune cell classification from single-cell RNA sequencing (scRNA-seq) presents significant challenges due to complex hierarchical relationships among cell types. We introduce scHDeepInsight, a deep learning framework that extends our previous scDeepInsight model by integrating a biologically-informed classification architecture with an adaptive hierarchical focal loss. The framework leverages our...

Boroevich, K. A.

Mon Jun 30 2025

Identifying Optimal Machine Learning Approaches for Microbiome-Metabolomics Integration with Stable Feature Selection

Microbiome research has been limited by methodological inconsistencies. Taxonomy-based profiling presents challenges such as data sparsity, variable taxonomic resolution, and the reliance on DNA-based profiling, which provides limited functional insight. Multi-omics integration has emerged as a promising approach to link microbiome composition with function. However, the lack of standardized metho...

Mon Jun 30 2025

FEDRANN: effective long-read overlap detection based on dimensionality reduction and approximate nearest neighbors

Overlap detection is a key step in de novo genome assembly pipelines based on the Overlap-Layout-Consensus (OLC) paradigm. However, existing methods for overlap detection either rely on heuristic seed-and-extension strategies or locality-sensitive hashing (LSH), both of which struggle to handle repetitive genomic regions and the computational burden of large-scale datasets. Here, we present FEDRAN...

Mon Jun 30 2025

CpGeneAge: multi-omics aging clocks associated with Nf-κB signaling pathway in aging

Aging clocks have emerged as the primary tools for measuring biological aging and have been developed for a wide range of single-omic measurements. Epigenetic aging clocks showed high accuracy in age prediction, however, their biological interpretation is still a challenging task. Transcriptomics aging clocks provide better interpretability but worse age prediction accuracy. To exploit the benefit...

Mon Jun 30 2025

Molecular characterization of unique multi-domain harbouring fungal rhodopsin for establishing their novel opto-synthetic biological usages

Organisms employ light as an external stimulus for regulating cellular functions. The light-sensitive photoreceptors detect light at varying wavelengths, activating signaling cascades and triggering a range of physiological responses. Rhodopsin is a transmembrane heptahelical protein that functions as an ion channel, or a pump, and sensory receptor, respectively. It consists of a light-sensing chr...

Mon Jun 30 2025

Quantitative analysis of genetic interactions in human cells from genome-wide CRISPR-Cas9 screens

Genetic interaction (GI) networks in model organisms have revealed how combinations of genome variants can impact phenotypes and underscored the value of GI maps for functional genomics. To advance efforts toward a reference human GI network, we developed the quantitative Genetic Interaction (qGI) score, a method for precise GI measurement from genome-wide CRISPR-Cas9 screens in isogenic human cel...

Mon Jun 30 2025

A Systematic Benchmark of High-Accuracy PacBio Long-Read RNA Sequencing for Transcript-Level Quantification

PacBio long-read RNA sequencing resolves transcripts with greater clarity than short-read technologies, yet its quantitative performance remains under-evaluated at scale. Here, we benchmark the high-throughput PacBio Kinnex platform against Illumina short-read RNA-seq using matched, deeply sequenced datasets across a time course of endothelial cell differentiation. Compared to Illumina, Kinnex ach...

Mehlferber, M. M.

Sheynkman, G. M.

Mon Jun 30 2025

Genomic Touchstone: Benchmarking Genomic Language Models in the Context of the Central Dogma

The emergence of genomic language models (gLMs) has revolutionized the analysis of genomic sequences, enabling robust capture of biologically meaningful patterns from DNA sequences for an improved understanding of human genome-wide regulatory programs, variant pathogenicity and therapeutic discovery. Given that DNA serves as the foundational blueprint within the central dogma, the ultimate evaluat...

Mon Jun 30 2025

Cell type-specific functions of nucleic acid-binding proteins revealed by deep learning on co-expression networks

Nucleic acid-binding proteins (NABPs) exhibit cell type-specific regulatory functions, but their target genes and biological roles remain incompletely characterized due to the limitations of current experimental approaches. Here, we present a deep learning framework that integrates gene co-expression correlations to predict NABP regulatory targets and infer their functions across diverse cellular ...