2025 Hyper Recent •CC0 1.0 Universal

This work is dedicated to the public domain. No rights reserved.

Access Preprint From Server
July 5th, 2025
Version: 1
University of Toronto
bioinformatics
biorxiv

OmniPert: A Deep Learning Foundation Model for Predicting Responses to Genetic and Chemical Perturbations in Single Cancer Cells

Taj, F.Open in Google Scholar•Stein, L. D.Open in Google Scholar

In cancer, intra- and inter-patient heterogeneity presents a significant challenge for therapeutic management, as patients with apparently similar profiles often exhibit divergent responses to the same therapies. This heterogeneity is primarily attributed to genetic and molecular variations among individuals and their tumors. Understanding the impact of these differences on treatment outcomes is widely believed to be a key step for developing effective precision medicine strategies. However, the complexity of most biological pathways makes it difficult to predict the effect of genetic variation on cells and tissues, let alone predict a patient's response to therapy. As a result, high-throughput genetic and chemical perturbation screens have emerged as valuable tools for precision medicine-related tasks, such as disease modeling, target discovery, cellular programming, and pathway reconstruction. This approach is fundamentally limited, however, because the number of possible combinations of cell types, cell states, perturbation targets, and perturbation types is huge and cannot be exhaustively tested experimentally. This calls for computational approaches that can simulate such experiments in silico, guiding in vitro experiments towards perturbations that are more likely to produce the desired effect. Here we describe OmniPert, a novel generative AI tool, which utilizes a deep learning, transformer-based architecture to model the effects of genetic and chemical perturbations on single-cell transcriptomes. Trained on millions of diverse cellular profiles, this approach allows for more granular analysis of cellular responses, thereby facilitating downstream applications in cell-specific gene-gene and gene-drug interaction networks, biomarker and drug target discovery, drug repurposing, and in silico perturbation reverse-engineering. In the context of oncology, OmniPert promises to facilitate the discovery of novel cell type- and state-specific targets, ultimately contributing to more effective and personalized cancer treatments.

Similar Papers

biorxiv
Sat Jul 05 2025
Regulation Flow Analysis discovers molecular mechanisms of action from large knowledge databases
Drug development is a long and expensive process, with only a small fraction of potential drugs being finally approved. The challenge of drug development is rooted in our limited understanding of biological systems and the disease processes that drugs are trying to modulate. We propose a novel method, called Regulation Flow Analysis (RFA), which is based on the principles of biological regulation,...
Roca, C. P.
•
Sysoev, O.
•
Eyre, E.
•
Galan, S.
...•
Mangion, J.
biorxiv
Sat Jul 05 2025
Mechanistic modeling and machine learning identifies optimum radiotherapy schedules to prevent treatment-induced metastasis
Lung cancer patients often experience increased metastasis formation after radiotherapy. However, it is incompletely understood whether radiation affects the migratory behavior of tumor cells and how altered radiotherapy schedules might mitigate this risk. To address these questions, we performed live-cell microscopy experiments to profile changes in cell migration during radiation across 12 cance...
Graser, C.
•
Zhou, Z.
•
Schürch, M.
•
Moorhead, G.
...•
Michor, F.
biorxiv
Sat Jul 05 2025
A Computational Workflow for Structure-Guided Design of Potent and Selective Kinase Peptide Substrates
Kinases are pivotal cell signaling regulators and prominent drug targets. Short peptide substrates are widely used in kinase activity assays essential for investigating kinase biology and drug discovery. However, designing substrates with high activity and specificity remains challenging. Here, we present Subtimizer (substrate optimizer), a streamlined computational pipeline for structure-guided k...
Yekeen, A. A.
•
Meyer, C. J.
•
McCoy, M.
•
Posner, B.
•
Westover, K. D.
biorxiv
Sat Jul 05 2025
Gain of Function p53 mutant R273H confers distinct methylation profiles and consequent partial or full EMT states to colon tumour
p53 is the second most frequently mutated gene in colorectal cancer. While different p53 mutations have been correlated with metastasis, the distinct phenotypes exhibited by site-specific mutations of p53 are not well elucidated. Here, we analyse transcriptomic and methylation data from TCGA-COAD cohort to understand the epigenetic impact of three most prevalent hotspot mutations of p53 (R175H, R2...
Rani, H.
•
Subhadarshini, S.
•
Jolly, M. K.
•
Mahadevan, V.
biorxiv
Sat Jul 05 2025
OMIDIENT: Multiomics Integration for Cancer by Dirichlet Auto-Encoder Networks
To achieve a more comprehensive understanding of cancer, novel computational methods are required for the integrative analysis of data from different molecular layers, such as genomics, transcriptomics, and epigenomics. Here, we present a novel multi-omics integrative method that performs unsupervised representation learning, referred to as OMIDIENT: multiOMics Integration for cancer by DIrichlet ...
Safinianaini, N.
•
Valimaki, N.
•
Bresson, R.
•
Gorbonos, A.
...•
Marttinen, P.
biorxiv
Sat Jul 05 2025
Fold-Conditioned De Novo Binder Design via AlphaFold2-Multimer Hallucination.
De novo protein binder design has been revolutionized by deep learning methods, yet controlling binder topology remains a challenge. We introduce a fold-conditioned AlphaFold2-Multimer hallucination framework - FoldCraft - guided by a contact map similarity loss, enabling precise generation of binders with user-defined structural folds. This single loss function enforces fold-specific geometry whi...
Rustamov, K. R.
•
Baev, A. Y.
biorxiv
Sat Jul 05 2025
Modelling punctuated similarity
Inter-subject, pairwise similarity models provide a methodological resource for flexibly measuring complex, non-linear relationships between brain and behavior. Similarity models, however, can extend beyond brain behavior relationships and can be readily applied to any data where they may be useful. The work presented in this paper introduces a new way of modelling similarity, termed punctuated si...
Crockford, S. K.
biorxiv
Sat Jul 05 2025
GatorST: A Versatile Contrastive Meta-Learning Framework for Spatial Transcriptomic Data Analysis
Introduction: Recent advances in spatial transcriptomics (ST) technologies have revolutionized our understanding of cellular functions by providing gene expression profiles with rich spatial context. Effectively learning spatial representations is crucial for downstream analyses and requires robust integration of spatial information with transcriptomic data. While existing methods have shown promi...
Wang, S.
•
Liu, Y.
•
Zhang, Z.
•
Song, Q.
•
Bian, J.
biorxiv
Sat Jul 05 2025
PULPO: Pipeline of understanding large-scale patterns of oncogenomic signatures
PULPO v1.0 is a novel, fully automated pipeline designed for the preprocess and extraction of mutational signatures from raw Optical Genome Mapping (OGM) data. Built using Snakemake and executed within an isolated, Conda-managed environment, PULPO transforms complex cytogenetic alterations, captured at ultra-high resolution, into Catalogue of somatic mutations in Cancer (COSMIC)-based mutational s...
Portasany-Rodriguez, M.
•
Soria-Alcaide, G.
•
G.Sanchez, E.
•
Ivanova, M.
...•
Garcia-Martinez, J.