2025 Hyper Recent •CC0 1.0 Universal

This work is dedicated to the public domain. No rights reserved.

Access Preprint From Server
July 3rd, 2025
Version: 1
Dana-Farber Cancer Institute
bioinformatics
biorxiv

PROFET Predicts Continuous Gene Expression Dynamics from scRNA-seq Data to Elucidate Heterogeneity of Cancer Treatment Responses

Cheng, Y.-C.Open in Google Scholar•Gu, H.Open in Google Scholar•McDonald, T. O.Open in Google Scholar•Wu, W.Open in Google Scholar•Tripathi, S.Open in Google Scholar•Guarducci, C.Open in Google Scholar•Russo, D.Open in Google Scholar•Abravanel, D. L.Open in Google Scholar•Bailey, M.Open in Google Scholar•Wang, Y.Open in Google Scholaret al.

Single-cell RNA sequencing captures static snapshots of gene expression but lacks the ability to track continuous gene expression dynamics over time. To overcome this limitation, we developed PROFET (Particle-based Reconstruction Of generative Force-matched Expression Trajectories), a computational framework that reconstructs continuous, nonlinear single-cell gene expression trajectories from sparsely sampled scRNA-seq data. PROFET first generates particle flows between time-stamped samples using a novel Lipschitz-regularized gradient flow approach and then learns a global vector field for trajectory reconstruction using neural force-matching. The framework was developed using synthetic data simulating cell state transitions and subsequently validated on both mouse and human in vitro datasets. We then deployed PROFET to investigate heterogeneity in treatment responses to palbociclib, a CDK4/6 inhibitor, in hormone receptor positive breast cancer. By comparing newly generated scRNA-seq data from a palbociclib-resistant breast cancer cell line with published patient-derived datasets, we identified a subpopulation of patient cells exhibiting profound phenotypic shifts in response to treatment, along with surface markers uniquely enriched in those cells. By recovering temporal information from static snapshots, PROFET enables inference of continuous single-cell expression trajectories, providing a powerful tool for dissecting the heterogeneity of cell state transitions in treatment responses.

Similar Papers

biorxiv
Fri Jul 04 2025
Know your RNA-Seq data in depth: a case study using data from early life stress in mouse
Next-generation sequencing (NGS) is a technology that enables rapid and high-throughput sequencing of entire genomes, transcriptomes or specific DNA/RNA populations. RNA-Seq is an NGS-based method that specifically targets the transcriptome and can be applied to bulk tissue or single cells. NGS produces large volumes of partial sequences (reads), which must be aligned, assembled and analyzed to ex...
Lindlof, A.
biorxiv
Thu Jul 03 2025
Spliformer-v2 predicts multi-tissue RNA splicing and reveals functional genomic links with neurodegenerative diseases
Precise regulation of pre-mRNA splicing underpins molecular diversity and is linked to aging and disease. Genetic variants are key drivers of RNA mis-splicing, yet how they induce tissue-specific splicing remains largely unclear. Here, we introduce Spliformer-v2, a deep learning model based on SegmentNT architecture to predict multi-tissue RNA splicing. Spliformer-v2 is trained on paired genome/tr...
Tang, X.
•
Lei, H.
•
Guo, J.
•
Shen, Y.
•
Zhang, M.
biorxiv
Thu Jul 03 2025
Amino acid exchangeability and surface accessibility underpin the effects of single substitutions
Deep mutational scans have measured the effects of many mutations on many different proteins. Here we use a collection of such scans to perform a statistical meta-analysis of the effects of single amino acid substitutions. Specifically, we model the relative deleteriousness of each substitution in each deep mutational scan with respect to the identities of the wildtype and mutant residues, and the...
Alpay, B. A.
•
Nanda, P.
•
Nagy, E.
•
Desai, M. M.
biorxiv
Thu Jul 03 2025
In Silico Investigation Reveals a Potential Functional Role for Human Microbiome in Chronic Obstructive Pulmonary Disease
Chronic Obstructive Pulmonary Disease (COPD) is a progressive enervating lung disease characterized by chronic inflammation, airway inhibition and unrecoverable structural damage to the lungs. While traditionally associated with environmental factors similar as cigarette smoke and air pollution as well as genetic factors, recent revelations has increasingly indicative of the role of microbiomes in...
Jana, N.
•
Dhara, O.
•
Bhattacharya, S. S.
biorxiv
Thu Jul 03 2025
Allosteric Site Prediction Using Protein Language Models and Orthosteric Conditioning
Allosteric modulators as therapeutics offer many advantages over orthosteric modulators, including improved selectivity and tunability. However, identifying and characterising allosteric sites remains a major challenge both experimentally and computationally. Accurate prediction of allosteric binding sites is critical to facilitate allosteric drug discovery. Here, we evaluate three strategies to p...
Eccleston, R. C.
•
Furnham, N.
biorxiv
Thu Jul 03 2025
tugMedi: simulator of cancer-cell evolution for personalized medicine based on the genomic data of patients
Cancer comprehensive genomic profiling tests are increasingly used, but drug response rates remain limited. Simulations forecasting cancer progression could aid targeted therapies; however, existing simulations focus mainly on basic biology. We present tugMedi, a cancer-cell evolution simulator designed for cancer genome medicine. By integrating patient-specific genomic and imaging data, tugMedi r...
Nagornov, I.
•
Furukawa, E.
•
Nagai, M.
•
Yagishita, S.
...•
Kato, M.
biorxiv
Thu Jul 03 2025
Hybrid Generative Model: Bridging Machine Learning and Biophysics to Expand RNA Functional Diversity
Functional RNAs perform diverse catalytic roles, yet natural sequences represent only a narrow subset of what is possible. Rediscovering such activities requires exploring functional sequence diversity beyond natural RNAs. We introduce a Bayesian hybrid generative model that combines a coevolutionary likelihood with an RNA secondary structure prior. This approach disentangles folding constraints f...
Opuu, V.
biorxiv
Thu Jul 03 2025
CLONEID: A Framework for Longitudinal Integration of Phenotypic and Genotypic Data to Monitor and Steer Subclonal Dynamics
Understanding how genetic and phenotypic diversity emerges and evolves within cancer cell populations is a fundamental challenge in cancer biology. CLONEID is a novel framework designed to organize and analyze clone-specific measures as structured time-series data. By integrating and monitoring genotypic and phenotypic experimental data over time, CLONEID facilitates hypothesis-driven and hypothes...
Veith, T.
•
Beck, R. J.
•
Tagal, V.
•
Li, T.
...•
Andor, N.
biorxiv
Thu Jul 03 2025
Foundation Model Attributions Reveal Shared Inflammatory Program Across Diseases
Determining a gene's functional significance within a specific cellular context has long been a challenge. We introduce a framework for quantifying gene importance by leveraging attributions learned by foundation models (FMs) trained on large corpora of single-cell RNA-sequencing (scRNA-seq) datasets. Attribution scores robustly quantify gene importance across datasets, emphasizing key genes in re...
Gold, M. P.
•
Reyes, M.
•
Diamant, N.
•
Kuo, T.
...•
Biancalani, T.
biorxiv
Thu Jul 03 2025
Human protein interactome structure prediction at scale with Boltz-2
In humans, protein-protein interactions mediate numerous biological processes and are central to both normal physiology and disease. Extensive research efforts have aimed to elucidate the human protein interactome, and comprehensive databases now catalog these interactions at scale. However, structural coverage of the human protein interactome is limited and remains challenging to resolve through ...
Ille, A. M.
•
Markosian, C.
•
Burley, S. K.
•
Pasqualini, R.
•
Arap, W.