April 15th, 2025
Version: 2
Institute for Computational Biomedicine, Faculty of Medicine, Heidelberg University and Heidelberg University Hospital, Heidelberg, Germany
bioinformatics
biorxiv

Comparison and Optimization of Cellular Neighbor Preference Methods for Quantitative Tissue Analysis

Studying the spatial distribution of cell types in tissues is essential for understanding their function in health and disease. A widely used spatial feature for quantifying tissue organization is the pairwise neighbor preference (NEP) of cell types, commonly referred to as co-occurrence or colocalization. Various methods to infer NEP have proved their utility in spatial omics studies, but despite their broad usage, no clear guidelines exist for selecting one method over the other. In this paper, we deconstruct frequently used methods into their underlying analysis steps and evaluate their optimal combination. We studied the methods on two aspects: (1) their discriminatory power to distinguish different tissue architectures and (2) their ability to recover the directionality of NEPs. We compared existing as well as our in-house developed method (conditional z-score (COZI)) and compared their performance using in silico tissue simulations and demonstrated its biological applicability in a myocardial infarction dataset. Overall, our study serves as a comprehensive guide for users and method developers in spatial omics analysis and offers a novel approach (COZI), which outperforms existing methods, to performing NEP analysis.

Similar Papers

biorxiv
Fri Apr 25 2025
BindCraft: one-shot design of functional protein binders
Protein-protein interactions (PPIs) are at the core of all key biological processes. However, the complexity of the structural features that determine PPIs makes their design challenging. We present BindCraft, an open-source and automated pipeline for de novo protein binder design with experimental success rates of 10-100%. BindCraft leverages the weights of AlphaFold2 to generate binders with nan...
Pacesa, M.
Nickel, L.
Schellhaas, C.
Schmidt, J.
...
Correia, B. E.
biorxiv
Fri Apr 25 2025
Back to Basics: Spectrum and Peptide Sequence are Sufficient for Top-tier Mass Spectrometry Proteomics Identification
The original mass spectrometry search engines used simple algorithms for peptide identification. Recent tools improved accuracy by adding several extra components such as fragment ion intensities or retention times prediction and training target-decoy classifiers on-the-fly, leading to sometimes inconsistent results. Our study explores the impact of replacing those extra components with a deep-lea...
Burq, M.
Stepec, D.
Restrepo, J.
Zbontar, J.
...
Cimermancic, P.
biorxiv
Fri Apr 25 2025
A benchmark for large language models in bioinformatics
The rapid advancements in artificial intelligence, particularly in Large Language Models (LLMs) such as GPT-4, Gemini, and LLaMA, have opened new avenues for computational biology and bioinformatics. We report the development of BioLLMBench, a novel framework designed to evaluate LLMs in bioinformatics tasks. This study assessed GPT-4, Gemini, and LLaMA through 2,160 experimental runs, focusing on...
Sarwal, V.
Andreoletti, G.
Munteanu, V.
Suhodolschi, A.
...
Mangul, S.
biorxiv
Fri Apr 25 2025
Shiny-Calorie: A context-aware application for indirect calorimetry data analysis and visualization using R
Indirect calorimetry is a cornerstone technique for metabolic phenotyping of animal models in preclinical research, with well-established experimental protocols and platforms. However, a flexible, extensible, and user-friendly software suite that enables standardized integration of data and metadata from diverse metabolic phenotyping platforms - followed by unified statistical analysis and visuali...
Grein, S.
Elschner, T.
Kardinal, R.
Bruder, J.
...
Klingenspor, M.
biorxiv
Fri Apr 25 2025
MiT4SL: multi-omics triplet representation learning for cancer cell line-adapted prediction of synthetic lethality
Synthetic lethality (SL) offers a promising approach for targeted cancer therapies. Current SL prediction models heavily rely on extensive labeled data for specific cell lines to accurately identify SL pairs. However, a major limitation is the scarcity of SL labels across most cell lines, which makes it challenging to predict SL pairs for target cell lines with limited or even no available labels ...
Tao, S.
Feng, Y.
Yang, Y.
Wu, M.
Zheng, J.
biorxiv
Thu Apr 24 2025
One-Hot News: Drug Synergy Models Take a Shortcut
Combinatorial drug therapy holds great promise for tackling complex diseases, but the vast number of possible drug combinations makes exhaustive experimental testing infeasible. Computational models have been developed to guide experimental screens by assigning synergy scores to drug pair-cell line combinations, where they take input structural and chemical information on drugs and molecular featu...
Candir, E. B.
Kuru, H. I.
Rattray, M.
Cicek, A. E.
Tastan, O.
biorxiv
Thu Apr 24 2025
Ontology-aware DNA methylation classification with a curated atlas ofhuman tissues and cell types
DNA methylation (DNAm) is a core gene regulatory mechanism that captures cellular responses to short- and long- term stimuli such as environmental exposures, aging, and cellular differentiation. Although DNAm has proven valuable as a baseline biomarker for aging by enabling robust characterization of disease-associated methylation shifts associated with age, its potential to reveal analogous shift...
Kim, M.
Dannenfelser, R.
Cui, Y.
Allen, G.
Yao, V.
biorxiv
Thu Apr 24 2025
SCassist: An AI Based Workflow Assistant for Single-Cell Analysis
Single-cell RNA sequencing (scRNA-seq) data analysis often involves complex iterative workflow, requiring significant expertise and time. To navigate this complexity, we have developed SCassist, an R package that leverages the power of the large language models (LLM\'s) to guide and enhance scRNA-seq analysis. SCassist integrates LLM\'s into key workflow steps, to analyze user data and provide rel...
Nagarajan, V.
Shi, G.
Arunkumar, S.
Liu, C.
...
Caspi, R.
biorxiv
Thu Apr 24 2025
Inferring the cell type-specific functional roles of nucleic acid-binding proteins using deep learning on co-expression networks
The regulatory target genes and functional roles of nucleic acid-binding proteins (NABPs), including DNA- and RNA-binding proteins, can vary in a cell- or tissue-specific manner. However, experimentally identifying NABP-binding sites and functionally characterizing these proteins remain technically challenging. Deep learning methods have been developed to predict gene expression levels across dive...
Osato, N.
biorxiv
Thu Apr 24 2025
Optimizing drug synergy prediction through categorical embeddings in Deep Neural Networks
Cancer treatments often lose effectiveness as tumors develop resistance to single-agent therapies. Combination treatments can overcome this limitation, but the overwhelming combinatorial space of drug-dose interactions makes exhaustive experimental testing impractical. Data-driven methods, such as Machine and Deep Learning, have emerged as promising tools to predict synergistic drug combinations. ...
Gonzalez Lastre, M.
Gonzalez de Prado Salas, P.
Guantes, R.