Hyper Recent

Thu May 08 2025

FlashFold: a standalone command-line tool for accelerated protein structure and stoichiometry prediction

ABSTARCTAlphaFold has revolutionized the decades-old issue of precisely predicting protein structures. However, its high accuracy relies on a computationally intensive step that involves searching vast databases for homologous sequences as the query protein of interest. Additionally, predicting the quaternary structure of protein complexes requires prior knowledge of subunit counts, a prerequisite...

Thu May 08 2025

Not All Saliva Samples Are Equal: The Role of Cellular Heterogeneity in DNA methylation and Epigenetic Age Analyses with Biological and Psychosocial Factors

Saliva is widely used in biomedical population research, including epigenetic analyses to investigate gene-environment interplay and identify biomarkers. Its minimally invasive collection procedure makes it ideal for studies in pediatric populations. Saliva is a heterogenous tissue composed of immune and buccal epithelial cells (BEC). Amongst the many epigenetic marks, DNA methylation (DNAm) is th...

Thu May 08 2025

AI-powered integration of multi-source data for TAA discovery to accelerate ADC and TCE drug development (I): TAA Target Identification and Prioritization

The advancement of T-cell engagers (TCEs) and antibody-drug conjugates (ADCs) has been hindered by fragmented data landscapes. This paper, the first in a series, introduces an AI-driven framework specifically for tumor-associated antigen (TAA) target identification and prioritization, a critical initial step in TCE and ADC development. Our framework integrates diverse datasets, including multi-omi...

Thu May 08 2025

Surforama: interactive exploration of volumetric data by leveraging 3D surfaces

Motivation: Visualization and annotation of segmented surfaces is of paramount importance for studying membrane proteins in their native cellular environment by cryogenic electron tomography (cryo-ET). Yet, analyzing membrane proteins and their organization is challenging due to their small sizes and the need to consider local context constrained to the membrane surface. Results: To interactively ...

Yamauchi, K. A.

Righetto, R. D.

Thu May 08 2025

INLAomics for Scalable and Interpretable Spatial Multiomic Data Integration

Integrating spatial transcriptomics with antibody-based proteomics enables the investigation of biological regulation within intact tissue architecture. However, current approaches for spatial multi-omics integration often depend on dimensionality reduction or autoencoders, which disregard spatial context, limit interpretability, and face challenges with scalability. To address these limitations, ...

Thu May 08 2025

Predicting Molecular Taste: Multi-Label and Multi-Class Classification

Predicting the taste of chemical compounds is a complex task and has been a challenge for decades. This study explores the application of machine learning to predict taste profiles of chemical compounds using the ChemTastesDB dataset, comprising 2,944 tastants categorized into 44 taste labels and 9 taste classes. Addressing the challenges of label imbalance and correlation, the dataset was preproc...

Thu May 08 2025

A novel machine learning-based algorithm for eQTL identification reveals complex pleiotropic effects in the MHC region

Expression quantitative trait loci (eQTLs) are regulatory variants that affect the expression level of their target genes and have significant impact on disease biology. However, eQTL mapping has been done mostly in one tissue at a time, despite the known prevalence of correlations among tissues. Multivariate analyses incorporating multiple phenotypes are available, but they emphasize linear combi...

Thu May 08 2025

Deep learning inference of miRNA expression from bulk and single-cell mRNA expression

Understanding the activity of miRNA in individual cells presents a challenge due to the limitations of single-cell technologies in capturing miRNAs. To tackle this obstacle, we introduce two deep learning models: Cross-Modality (CM) and Single-Modality (SM). These models utilize encoder-decoder architectures to predict miRNA expression at the bulk and single-cell levels from mRNA data. We compared...

Thu May 08 2025

GeneFix-AI: AI-Powered CRISPR-Cas9 System for Real-Time Detection and Correction of Mutations in Non-Human Species

The evolution of genome engineering technologies has transformed biomedical research, enabling precise and efficient modification of genetic material Doudna and Charpentier, 2014. Among these, CRISPR-Cas9 stands out as a revolutionary gene-editing tool, though it often requires extensive expertise and technical knowledge Cong et al., 2013; J. G. Doench et al., 2016. We propose GeneFix-AI, an Artif...

Thu May 08 2025

ORANGE: A Machine Learning Approach for Modeling Tissue-Specific Aging from Transcriptomic Data

Despite aging being a fundamental biological process which profoundly influences health and disease, the interplay between tissue-specific aging and mortality remains underexplored. This study applies machine learning on GTEx transcriptomic data to model tissue-specific biological ages across 12 different types of tissues and introduces an age-gap metric to quantify deviations from the chronologic...

Samee, M. A. H.