2025 Hyper Recent •CC0 1.0 Universal

This work is dedicated to the public domain. No rights reserved.

Access Preprint From Server
January 22nd, 2025
Version: 1
University of Oxford
cancer biology
biorxiv

araCNA: Somatic copy number profiling using long-range sequence models

Visscher, E.Open in Google Scholar•Yau, C.Open in Google Scholar

Somatic copy number alterations (CNAs) are hallmarks of cancer. Current algorithms that call CNAs from whole genome sequenced (WGS) data have not exploited deep learning methods owing to computational scaling limitations. Here, we present a novel deep-learning approach, araCNA, trained only on simulated data that can accurately predict CNAs in real WGS cancer genomes. araCNA uses novel transformer alternatives (e.g Mamba) to handle genomic-scale sequence lengths ($\\sim$1M) and learn long-range interactions. Results are extremely accurate on simulated data, and this zero-shot approach is on par with existing methods when applied to 50 WGS samples from the cancer genome atlas. Notably, our approach requires only a tumour sample and not a matched normal sample, has fewer markers of overfitting, and performs inference in only a few minutes. araCNA demonstrates how domain knowledge can be used to simulate training sets that harness the power of modern machine learning in biological applications.

Similar Papers

biorxiv
Thu Jul 03 2025
Cell enlargement causes mitotic errors and aneuploidy in cells that evade senescence after CDK4/6 inhibition
CDK4/6 inhibitors (CDK4/6i) arrest the cell cycle in G1 leading to cellular overgrowth and p53-dependent senescence. They are used to treat metastatic HR+/HER2- breast cancer, but resistance is common, and this has been associated with TP53 loss and senescence evasion. We show here that enlarged CDK4/6i-treated cells that evade senescence mis-segregate chromosomes due to defective chromosomal alig...
Pareri, A. U.
•
Losito, M.
•
Foijer, F.
•
Saurin, A. T.
biorxiv
Thu Jul 03 2025
Multi-gradient Permutation Survival Analysis Identifies Mitosis and Immune Signatures Steadily Associated with Cancer Patient Prognosis
The inconsistency of the association between genes and cancer prognosis is often attributed to many variables that contribute to patient survival. Whether there exist the Genes Steadily Associated with Prognosis (GEARs) and what their functions are remain largely elusive. Here we developed a novel method named "Multi-gradient Permutation Survival Analysis " (MEMORY) to screen the GEARs by using RN...
Cai, X.
•
Ye, Y.
•
Liu, X.
•
Fang, Z.
...•
Ji, H.
biorxiv
Thu Jul 03 2025
Cell Settling, Migration, and Stochastic Cancer Expression Suggest Potassium Flux Per H+- Sensitive KCN Members Associated with Oncogenes and Histone/DNA Modulators Potentially Initiates pH-Reversal
Attraction of glioblastoma cells to potassium was suspected when cells clustered around dying cells and they migrated towards serum (high [K+]) and a positive potassium gradient. Potassium channel proteins (KCN family, 90 members) mediating altered transmembrane flux may provide K+ that releases H+ bound to inner membranes in cancer cells for cytosolic proton transfer (possibly Grotthuss) to extru...
Beckner, M.
biorxiv
Thu Jul 03 2025
Bedaquiline Amplifies Proteasome Inhibitor Efficacy and Overcomes Resistance in Multiple Myeloma
Proteasome inhibitors (PIs) are cornerstone therapies for multiple myeloma (MM), yet resistance remains a major barrier to durable responses. To identify druggable vulnerabilities that enhance PI efficacy, we performed a small-molecule chemical screen in the presence of carfilzomib (CFZ). We identified bedaquiline (BDQ), an FDA-approved antimycobacterial agent, as a potent synergistic partner. BDQ...
Cumerlato, M.
•
Maccagno, M.
•
Labrador, M.
•
Luciano, B.
...•
PIVA, R.
biorxiv
Thu Jul 03 2025
CIT tumor lines: A novel series of immunogenic cutaneous squamous cell carcinoma cell lines derived from chemical carcinogenesis
Immunotherapy is now widely used to treat advanced-stage skin cancer, but it is effective for only approximately half of skin cancer patients, including both patients with melanoma and cutaneous squamous cell carcinoma (cSCC). To overcome current barriers, preclinical mouse models that faithfully recapitulate the tumor genetics, mutation burdens, and neoantigen patterns of specific human tumor typ...
Letchworth, R.
•
Tanaka, M.
•
Barnes, A. A.
•
Lum, L.
...•
Reeves, M.
biorxiv
Thu Jul 03 2025
Interplay of YEATS2 and GCDH regulates histone crotonylation and drives EMT in head and neck cancer
The regulation of gene expression is an integral cellular process orchestrated by epigenetic marks like histone modifications. Perturbations in the activity or abundance of epigenetic factors can lead to tumorigenesis. Remarkably, several metabolites influence the epigenetic landscape in cells. Here, we investigated the interplay between a highly expressed epigenetic factor, YEATS2, and a metaboli...
Pant, D.
•
KAKANI, P.
•
Joshi, R.
•
Sabu, A.
...•
Shukla, S.
biorxiv
Wed Jul 02 2025
High Throughput Screening Identifies Small Molecules that Synergize with MRTX1133 Against Acquired Resistant KRAS G12D Mutated CRC
Colorectal cancer (CRC) remains a significant clinical challenge, with a 5-year survival rate of 10%. Over half of all CRCs harbor mutations in the KRAS gene, leading to poor response to standard therapy. This underscores the crucial need for novel therapeutics targeting KRAS and overcoming the growing barrier of resistance. To address these critical challenges, we conducted a high-throughput scre...
Thielen, N.
•
Wei, N.
•
Nagai, E.
•
Chu, E.
...•
Kuang, C.
biorxiv
Wed Jul 02 2025
KLF5 controls subtype-independent highly interactive enhancers in pancreatic cancer to regulate cell survival
Pancreatic ductal adenocarcinoma (PDAC) remains a highly lethal cancer with a 5-year survival rate of 13%. Despite recent molecular stratification of tumors into distinct classical and basal-like cell states, most tumors are heterogeneous contain of both subtypes. Therefore, therapeutic approaches targeting one subtype may not be suitable for PDAC therapy. Here, we integrated chromatin accessibili...
Ekstrom, T. L.
•
Saha, B. J.
•
Thiel, J.
•
Dai, Z.
...•
Johnsen, S. A.
biorxiv
Wed Jul 02 2025
cIAP1 inhibitor of apoptosis is a tumor suppressor in Ewing sarcoma
Ewing sarcoma (EwS) is a highly aggressive pediatric malignancy driven by EWSR1::ETS fusion oncoproteins -primarily EWSR1::FLI1- which deregulate genes essential for differentiation, proliferation, and cell survival. To uncover key downstream targets of this fusion involved in cell differentiation, we combined transcriptomic profiling of EwS cell lines following EWSR1::FLI1 inhibition with gene on...
Cidre-Aranaz, F.
•
Geyer, F. H.
•
Hoelting, T. L.
•
Faehling, T.
...•
Gruenewald, T. G. P.
biorxiv
Wed Jul 02 2025
Exosomal cargo genes as biomarkers and potential mediators of exosome-driven cell communication in Pleural Mesothelioma
Pleural mesothelioma (PM) is a rare yet aggressive and heterogeneous cancer type with very poor survival rates. Due to its long latency period and nonspecific symptoms, the disease is usually detected at advanced stages, limiting available treatment options and leading to poor survival rates. So far, the disease can only be confirmed through invasive thoracoscopic biopsy. Moreover, the proposed ci...
Kraft, A.
•
Toenz, A.
•
Schlaepfer, F.
•
Ronner, M.
...•
Meerang, M.